SOTorrent
Popular repositories
- db-scripts Public
SQL and Bash scripts to import the official Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from th…
- posthistory-extractor Public
Extracts the version history of text and code blocks from the official Stack Overflow data dump.
- string-similarity Public
Implementation of various string similarity metrics.
- metric-evaluation Public
Comparison of different string similarity metrics for reconstructing the history of Stack Overflow posts.
Repositories
- posthistory-extractor Public
Extracts the version history of text and code blocks from the official Stack Overflow data dump.
- db-scripts Public
SQL and Bash scripts to import the official Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent data set for analysis.
- posthistory-comparator-gt-cs Public
Comparator app to validate connections between the ground truth and the computed similarity.
- metric-evaluation Public
Comparison of different string similarity metrics for reconstructing the history of Stack Overflow posts.
- preprocessing-pipeline Public
Preprocessing pipeline to extract and normalize text/code blocks from Stack Exchange forum posts and comments.