    Repositories list

    • Hugo Blox WING Website pilot
      TeX
      MIT License
      19000 Updated Nov 2, 2024
    • HTML
      MIT License
      0000 Updated Nov 1, 2024
    • [Preprint' 24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
      Python
      2200 Updated Aug 24, 2024
    • [Preprint' 24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
      Python
      2100 Updated Aug 22, 2024
    • The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024
      Python
      3000 Updated Aug 7, 2024
    • Python
      0000 Updated Jul 2, 2024
    • ELCo
      Public
      The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
      Python
      01100 Updated May 11, 2024
    • Python
      1000 Updated Apr 28, 2024
    • Sealing
      Public
      [NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image--Text Models"
      Python
      MIT License
      2100 Updated Mar 31, 2024
    • Python
      3000 Updated Mar 12, 2024
    • Item Tokenization: the future for the recommender systems
      Python
      1100 Updated Mar 11, 2024
    • SciAssist
      Public
      Python
      Other
      419102 Updated Feb 17, 2024
    • nnose
      Public
      Codebase for NNOSE: Nearest Neighbor Occupational Skill Extraction
      Python
      MIT License
      2000 Updated Jan 28, 2024
    • This is the distribution point for the NUS SMS Corpus as described and updated from This is a corpus of SMS (Short Message Service) messages collected for research at the Department of Computer Science at the National University of Singapore. This dataset consists of 67,093 SMS messages taken from the corpus on Mar 9, 2015. The messages largely …
      462200 Updated Jan 20, 2024
    • This repository contains codes and models for the paper: Exploring Question-Specific Rewards for Generating Deep Questions (COLING 2020).
      Python
      MIT License
      10000 Updated Jan 20, 2024
    • This repository contains code and models for the paper: Semantic Graphs for Generating Deep Questions (ACL 2020).
      Python
      MIT License
      336400 Updated Jan 20, 2024
    • Molweni
      Public
      Other
      10100 Updated Jan 20, 2024
    • Summarization Papers
      TeX
      143600 Updated Jan 20, 2024
    • Library for processing MOOC data dumps. Currently limited to Coursera data.
      Perl
      GNU General Public License v3.0
      3201 Updated Jan 20, 2024
    • Data for Automatic Keyphrase Extraction Task
      97100 Updated Jan 20, 2024
    • FANG
      Public
      Python
      22100 Updated Jan 20, 2024
    • QMSum
      Public
      Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"
      Jupyter Notebook
      MIT License
      20100 Updated Jan 20, 2024
    • sciwing
      Public
      SciWING is a modern toolkit for scientific document processing from WING-NUS
      Python
      MIT License
      15610 Updated Jan 20, 2024
    • AdvFM
      Public
      Adversarial Deep Factorization Machine
      Python
      1100 Updated Jan 20, 2024
    • SciTab
      Public
      The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"
      MIT License
      2000 Updated Jan 20, 2024
    • This is the official repository for "CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation"
      Jupyter Notebook
      1000 Updated Jan 20, 2024
    • This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (to appear on Findings of EMNLP 2023).
      Python
      2000 Updated Jan 20, 2024
    • QACheck
      Public
      About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"
      Python
      Apache License 2.0
      4100 Updated Jan 20, 2024
    • Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"
      Python
      MIT License
      3000 Updated Jan 20, 2024
    • Python
      MIT License
      6100 Updated Jan 20, 2024