@BatsResearch

Bats Research

We are a machine learning research group at Brown University. We work on improving the processes by which humans teach and instruct computers.

Pinned

  1. bonito Public

    A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

    Python 746 49

  2. alfred Public

    A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

    Python 52 6

  3. csp Public

    Learning to compose soft prompts for compositional zero-shot learning.

    Python 88 5

  4. zsl-kg Public

    Framework for zero-shot learning with knowledge graphs.

    Python 113 8

Repositories

Showing 10 of 29 repositories
  • Python 0 0 0 0 Updated Mar 10, 2025
  • bonito Public

    A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

    Python 746 BSD-3-Clause 49 5 1 Updated Feb 28, 2025
  • menghini-neurips23-code Public

    Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.

    Python 48 3 1 0 Updated Nov 8, 2024
  • planetarium Public

    Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL.

    Python 48 BSD-3-Clause 4 1 0 Updated Oct 16, 2024
  • alfred Public

    A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

    Python 52 BSD-3-Clause 6 1 0 Updated Oct 10, 2024
  • cross-lingual-detox Public

    Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024.

    Jupyter Notebook 17 BSD-3-Clause 0 0 0 Updated Oct 4, 2024
  • LexC-Gen-Data-Archive Public

    Data Repository for LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

    1 1 0 0 Updated Oct 3, 2024
  • LexC-Gen Public

    Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

    Python 15 4 0 0 Updated Oct 3, 2024
  • Python 20 1 1 0 Updated Jul 16, 2024
  • nplm Public

    A weak supervision framework for (partial) labeling functions.

    Python 16 BSD-3-Clause 3 0 0 Updated Jul 15, 2024
