Stack Overflow Snippets in GitHub Projects
Supplementary Material (EMSE Journal Paper)
Usage and Attribution of Stack Overflow Code Snippets in GitHub Projects — Supplementary Material.
Sebastian Baltes.
http://doi.org/10.5281/zenodo.1148069
The dataset is licensed under the Creative Commons Attribution Share-Alike 4.0 International License.
Supplementary Material (ICSE Extended Abstract)
-
Preliminary Study: We provide the survey codebook, the raw response data, as well as the R script used for analysis: ZIP
-
Programming Language Ranking: We provide instructions to recreate the ranking as well as the ranking itself: ZIP
-
Code Clone Analysis: We provide all scripts and data for the code clone analysis in one package: ZIP
-
Quantitative Analysis I-III: We provide all scripts and data for all quantitative analyses one package: Readme, ZIP-1-1, ZIP-1-2, ZIP-1-3, ZIP-1-4, ZIP-2-1, ZIP-2-2, ZIP-2-3, ZIP-2-4, ZIP-3
-
Qualitative Analysis: We provide the raw data and our coding of the Stack Overflow references in one package: ZIP
-
Other Sources: Stack Exchange data dump, GHTorrent data dump, GitHub BigQuery data set, and GHTorrent BigQuery data set.
License
The scripts and data we created as well as the data from the surveys are licensed under CC BY 4.0.
For data retrieved from the BigQuery GitHub data set, see the GitHub Terms of Service. All content retrieved from Stack Overflow, including content from the BigQuery Stack Overflow data set, is licensed under CC BY-SA 3.0, see also the Stack Exchange Network Terms of Service. GHTorrent is distributed under a dual licensing scheme (see GHTorrent FAQ and CCPlus).
Copyright Notice
The documents distributed on this website have been provided by the contributing authors by means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author’s copyright and the provided license. Not CC licensed works may not be reposted without the explicit permission of the copyright holder.