GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent dataset for analysis.
Collection of utility classes and methods used across different projects related to SOTorrent.
R scripts used to retrieve samples of SO posts, to compare the results of the metrics evaluation, and to conduct analyses using the SOTorrent dataset.
Comparator app to validate connections of ground truth and computed similarity.
Visualization of edit and comment events in Stack Overflow threads.
Repository for Maven deployment.
Implementation of various string similarity metrics.
Extracts the version history of text and code blocks from the official Stack Overflow data dump.
Shows code clones on Stack Overflow.
Comparision of different string similarity metrics for reconstructing the history Stack Overflow posts.
Tool to create manually validated Stack Overflow post histories.