Text Mining and Topic Modeling Toolkit for Python with parallel processing power
A lemmatizer for German language text
A set of tools for extracting tables from PDF files, intended to help with data mining on (OCR-processed) scanned documents.
Documents for the R tutorial given at WZB accompanying the lecture "Studying Social Stratification with Big Data" (Hipp, Ulbricht) in winter semester 2018
wzbsocialsciencecenter.github.io landing page.
Companion code for the article "oTree: Writing short and efficient code for experiments with dynamically determined data quantity" published in a Special Issue for JBEF. Illustrative example implementation for a simple stylized market simulation relying on "custom data models".
Python module for reading municipality directory (Gemeindeverzeichnis) data from the German Federal Statistical Office (Statistisches Bundesamt) as a pandas DataFrame
A package to create and plot Voronoi regions within geographic boundaries
An example topic model for debates from the 18th German Bundestag
A package with common oTree utilities that allow easier creation of surveys, understanding questions, timeout warnings and more.
A simple viewer and inspection tool for text boxes in PDF documents
pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.
Topic modeling with latent Dirichlet allocation using Gibbs sampling
d3.js extension for interactive balloon plots
Easily add tabs to Django admin forms
Styling individual cells in Excel output files created with pandas.
Example project showing how to use custom models in oTree for recording complex decisions in experiments