GUNDAM is a data management system that prioritizes data using language models.
-
Updated
Jul 27, 2023 - Python
GUNDAM is a data management system that prioritizes data using language models.
DSIR large-scale data selection framework for language model training
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
This repository contains all (Python 3) code and libraries required for the 2022-2023 Notre Dame Rocketry Team (NDRT) Apogee Control System (ACS). It also contains sensor/actuator example code and flight data.
Base-call error-filtering and read preprocessing pipeline for fastq libraries
This software anonymises data inside text files and CSV-like files. It removes various sorts of personally identifiable information. Each removed part is replaced with a suitable generic text, depending on the type of removed data. Currently English and Russian languages are supported. Russian works both with Cyrillic and Latin characters.
A powerful tool that allows users to query JSON data using SQL-like syntax. Effortlessly search, filter, and manipulate your JSON data with familiar SQL queries.
A Python script to filter and extract information from GTF files based on chromosome names, designed to be easily accessible for biologists without extensive programming experience.
Data exploration project introduced by Udacity Data Analysis Nanodegree
A Python script that filters C3D files containing motion capture data and converts them into CSV file format.
This repository contains a Python script that allows you to filter data in an Excel file using Streamlit, a web application framework for Python. The script utilizes the pandas library for data manipulation.
Filter DE genes based on log2Folchange, FDR value or both
An intuitive GUI-based Python application allowing a user to easily extract data from a file based on specific keywords to generate a focused output file.
Add a description, image, and links to the data-filtering topic page so that developers can more easily learn about it.
To associate your repository with the data-filtering topic, visit your repo's landing page and select "manage topics."