Curate better data for LLMs
-
Updated
Mar 19, 2024 - Python
Curate better data for LLMs
Code for the CVPR 2019 paper : Spectral Metric for Dataset Complexity Assessment
Using fuzzy cognitive maps for multivariate data forecasting in Python 3.8.
目标检测助手(ObjectDetectionAssistant):分析数据集各类数量分布,BBox大小分布, BBox比率分布, BBox中心点分布;数据增强
Analysis of video quality datasets via design of minimalistic video quality models
A Machine Learning app created with Streamlit (https://www.streamlit.io/).
Python scripts for analyzing the 'Top 250 IMDb TV Shows' dataset. Tasks include EDA, building a TV show recommendation system, sentiment analysis, IMDb rating prediction, and TV show clustering.
Git for the Programming and scripting project 2020
A simple project that creates a dataset of News Headlines with Primary Category, Secondary Category, Date, Day, Month,Year, Sentiment, SentimentPolarity, Emotion and Url. All News Headlines are scraped from punch newspaper and sorted into a csv file.
Create a two python scripts to analyze company financial records and election data
Add a description, image, and links to the dataset-analysis topic page so that developers can more easily learn about it.
To associate your repository with the dataset-analysis topic, visit your repo's landing page and select "manage topics."