-
Georgia State University
- Atlanta, GA
Highlights
- Pro
Stars
An open-source visual programming environment for battle-testing prompts to LLMs.
Example scripts for the pushshift dump files
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Systematic analysis of the responses of GPT-3 to different categories of statements and the potential vulnerabilities to simple prompting changes. We analyze what confuses GPT-3: how the model resp…
Create behavioral experiments in a browser using JavaScript
Neural Networks: Zero to Hero
Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Package for Statistically significant linguistic change
A small repo showing how to easily use BERT (or other transformers) for inference
Schedule and Syllabus for Human-Centered Machine learning.
A reading list of up-to-date papers on NLP for Social Good.
Tools for accessing/processing Reddit data and constructing networks based on this data. (Not an API crawler.)
Collection of tools for building diachronic/historical word vectors
Code and data for inducing domain-specific sentiment lexicons.
Socially-primed LSTM model to predict intercommunity conflict on Reddit.
Archive a reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushshift API.
This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017
Patent analysis using the Google Patents Public Datasets on BigQuery