Lists (2)
Sort Name ascending (A-Z)
Stars
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Top2Vec learns jointly embedded topic, document and word vectors.
A unified, comprehensive and efficient recommendation library
A Python scikit for building and analyzing recommender systems
Python wrapper for LibRec and other recommendation frameworks.
A library of metrics for evaluating recommender systems
DSPy: The framework for programming—not prompting—language models
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Practice your pandas skills!
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks
Header-only C++/python library for fast approximate nearest neighbors
Unofficial Galaxy Buds Manager for Windows, macOS, Linux, and Android
Python Data Science Handbook: full text in Jupyter Notebooks
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
davified / clean-code-ml
Forked from zedr/clean-code-python🛁 Clean Code concepts adapted for machine learning and data science. Now a free video series 😎 https://bit.ly/2yGDyqT
Python library for interactive topic model visualization. Port of the R LDAvis package.
Visualize and compare datasets, target values and associations, with one line of code.
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Basic and advanced MLflow examples for many ML flavors
Open source platform for the machine learning lifecycle