Skip to content
@sotorrent

SOTorrent

Reconstructing and analyzing the evolution of Stack Overflow posts.

Popular repositories Loading

  1. db-scripts db-scripts Public

    SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from th…

    Shell 14 7

  2. posthistory-extractor posthistory-extractor Public

    Extracts the version history of text and code blocks from the official Stack Overflow data dump.

    Java 2 2

  3. string-similarity string-similarity Public

    Implementation of various string similarity metrics.

    Java 2 3

  4. r-scripts r-scripts Public

    R scripts used to retrieve samples of SO posts, to compare the results of the metrics evaluation, and to conduct analyses using the SOTorrent dataset.

    R 2 2

  5. metric-evaluation metric-evaluation Public

    Comparision of different string similarity metrics for reconstructing the history Stack Overflow posts.

    Java 1 2

  6. util util Public

    Collection of utility classes and methods used across different projects related to SOTorrent.

    Java 1 3

Repositories

Showing 10 of 16 repositories
  • posthistory-comparator-gt-cs Public

    Comparator app to validate connections of ground truth and computed similarity.

    sotorrent/posthistory-comparator-gt-cs’s past year of commit activity
    Java 0 0 0 0 Updated May 7, 2024
  • metric-evaluation Public

    Comparision of different string similarity metrics for reconstructing the history Stack Overflow posts.

    sotorrent/metric-evaluation’s past year of commit activity
    Java 1 Apache-2.0 2 0 0 Updated May 7, 2024
  • string-similarity Public

    Implementation of various string similarity metrics.

    sotorrent/string-similarity’s past year of commit activity
    Java 2 Apache-2.0 3 0 1 Updated May 7, 2024
  • util Public

    Collection of utility classes and methods used across different projects related to SOTorrent.

    sotorrent/util’s past year of commit activity
    Java 1 Apache-2.0 3 0 0 Updated Jun 15, 2023
  • preprocessing-pipeline Public

    Preprocessing pipeline to extract and normalize text/code blocks from Stack Exchange forum posts and comments.

    sotorrent/preprocessing-pipeline’s past year of commit activity
    Python 0 Apache-2.0 3 0 0 Updated Dec 20, 2022
  • posthistory-extractor Public

    Extracts the version history of text and code blocks from the official Stack Overflow data dump.

    sotorrent/posthistory-extractor’s past year of commit activity
    Java 2 Apache-2.0 2 2 0 Updated Jun 27, 2022
  • so-edit-viz Public

    Visualization of edit and comment events in Stack Overflow threads.

    sotorrent/so-edit-viz’s past year of commit activity
    JavaScript 1 Apache-2.0 1 0 0 Updated Apr 21, 2022
  • so-clones Public

    Shows code clones on Stack Overflow.

    sotorrent/so-clones’s past year of commit activity
    HTML 1 Apache-2.0 2 0 0 Updated Apr 21, 2022
  • db-scripts Public

    SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent dataset for analysis.

    sotorrent/db-scripts’s past year of commit activity
    Shell 14 Apache-2.0 7 0 0 Updated Apr 7, 2022
  • pipeline Public

    SOTorrent pipeline running on Google Cloud

    sotorrent/pipeline’s past year of commit activity
    Python 1 Apache-2.0 2 0 0 Updated Jun 30, 2021

Top languages

Loading…

Most used topics

Loading…