@WZBSocialScienceCenter

Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center

Repository with scripts and tools used and developed at the WZB Berlin Social Science Center. See also https://datascience.blog.wzb.eu/.

Pinned repositories

  1. pdftabextract

    A set of tools for extracting tables from PDF files, helping with data mining on (OCR-processed) scanned documents.

    Python 1.3k 181

  2. pdf2xml-viewer

    A simple viewer and inspection tool for text boxes in PDF documents

    HTML 43 8

  3. tmtoolkit

    Text Mining and Topic Modeling Toolkit for Python with parallel processing power

    Python 25 3

  4. otreeutils

    A package with common oTree utilities that allow easier creation of surveys, understanding questions, timeout warnings and more.

    Python 6 1

  5. geovoronoi

    A package to create and plot Voronoi regions within geographic boundaries

    Python 1 1

  6. germalemma

    A lemmatizer for German language text

    Python 24 4

  • Text Mining and Topic Modeling Toolkit for Python with parallel processing power

    Python 25 3 Apache-2.0 Updated Sep 17, 2018
  • A lemmatizer for German language text

    Python 24 4 Apache-2.0 Updated Sep 17, 2018
  • A set of tools for extracting tables from PDF files, helping with data mining on (OCR-processed) scanned documents.

    Python 1,284 181 Apache-2.0 Updated Sep 10, 2018
  • Documents for R tutorial given at WZB accompanying the lecture "Studying Social Stratification with Big Data" (Hipp, Ulbricht) in winter semester 2018

    HTML Updated Sep 10, 2018
  • wzbsocialsciencecenter.github.io landing page.

    Updated Sep 10, 2018
  • Companion code for the article "oTree: Writing short and efficient code for experiments with dynamically determined data quantity" published in a Special Issue for JBEF. Illustrative example implementation for a simple stylized market simulation relying on "custom data models".

    Python Apache-2.0 Updated Jul 31, 2018
  • Python module for reading municipality directory data (Gemeindeverzeichnis) from the German Federal Statistical Office (Statistisches Bundesamt) as a pandas DataFrame

    Python Updated Jul 16, 2018
  • A package to create and plot Voronoi regions within geographic boundaries

    Python 1 1 Apache-2.0 Updated May 23, 2018
  • An example topic model for debates from the 18th German Bundestag

    Jupyter Notebook 1 Apache-2.0 Updated May 17, 2018
  • A package with common oTree utilities that allow easier creation of surveys, understanding questions, timeout warnings and more.

    Python 6 1 Apache-2.0 Updated Apr 25, 2018
  • A simple viewer and inspection tool for text boxes in PDF documents

    HTML 43 8 Apache-2.0 Updated Jan 24, 2018
  • pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.

    Python 20 Updated Jan 24, 2018
  • lda

    Forked from lda-project/lda

    Topic modeling with latent Dirichlet allocation using Gibbs sampling

    Python 251 MPL-2.0 Updated Sep 21, 2017
  • d3.js extension for interactive balloon plots

    HTML 1 1 Apache-2.0 Updated May 19, 2017
  • Easily add tabs to django admin forms

    Python 23 BSD-3-Clause Updated May 12, 2017
  • Styling individual cells in Excel output files created with pandas.

    Python 3 4 Updated May 12, 2017
  • Python 36 Updated Nov 9, 2016
  • Example project showing how to use custom models in oTree for recording complex decisions in experiments

    Python 6 Apache-2.0 Updated Nov 1, 2016