LC_Pill_Checker 💊🔍

NOTE: This project is an ongoing experiment, and certain features are not yet optimized for retrieval. Data cleaning, validation datasets, and experiments with different embedding models, vector stores, and dimensionality reduction are needed to improve it.

Problem:

There is currently no publicly available pill-identification software in the UK that identifies tablets/capsules and their manufacturers based on their appearance. Furthermore, there are no publicly available, up-to-date images of medicines that could be used to train image models. This project is a Jupyter notebook that works with text descriptions of tablets and capsules as an alternative.

Usage

Prepare the dependencies and imports, obtain an API key from OpenAI, and then follow the steps in the notebook.

This template is currently set up to receive a text document containing information on the tablet/capsule, the drug description, and the manufacturer, with each entry on its own line. Adjust this as necessary for your data.
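As a rough illustration of the expected input, the sketch below splits a text document into one entry per non-empty line. The function name `split_entries`, the sample drugs, and the manufacturers are all made up for this example, and the layout of fields within a line is an assumption; the notebook may load its data differently.

```python
def split_entries(text: str) -> list[str]:
    """Return one entry per non-empty line, with surrounding whitespace removed."""
    return [line.strip() for line in text.splitlines() if line.strip()]


# Hypothetical sample data: appearance, drug description, manufacturer on each line.
SAMPLE = """\
White round tablet marked 'A1'; Exampledrug 10 mg tablets; Example Pharma Ltd

Blue and white capsule marked 'B2'; Samplecillin 250 mg capsules; Sample Labs Ltd
"""

entries = split_entries(SAMPLE)
for entry in entries:
    print(entry)
```

Blank lines are skipped so that stray empty lines in the source file do not become empty entries.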

What I learned while making this ✍️

What retrieval-augmented generation (RAG) is and how it can be used to improve the performance of large language models when responding to queries about specific information.
The various steps involved in the RAG architecture, such as text embeddings (e.g. TF-IDF) and vector stores (e.g. FAISS, ChromaDB, LanceDB).
The basics of the LangChain framework 🦜️ and the various tools/databases that are available to carry out RAG.
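To make the retrieval step concrete, here is a minimal sketch of TF-IDF retrieval written with only the Python standard library. It is not the notebook's pipeline: it stands in for the embedding model and vector store with plain dictionaries, and the function names and sample entries are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vector(tokens, idf):
    """Build an L2-normalised TF-IDF vector as a {term: weight} dict."""
    counts = Counter(tokens)
    vec = {t: c * idf[t] for t, c in counts.items() if t in idf}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def build_index(docs):
    """Compute smoothed IDF weights and one TF-IDF vector per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return idf, [tfidf_vector(toks, idf) for toks in tokenized]


def retrieve(query, docs, idf, vectors, k=1):
    """Return the k documents with the highest cosine similarity to the query."""
    q = tfidf_vector(tokenize(query), idf)
    scores = [sum(w * v.get(t, 0.0) for t, w in q.items()) for v in vectors]
    ranked = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    return [(docs[i], scores[i]) for i in ranked[:k]]


# Hypothetical entries, one per line as in the input text document.
DOCS = [
    "white round tablet marked A1, Exampledrug 10 mg, Example Pharma Ltd",
    "blue and white capsule marked B2, Samplecillin 250 mg, Sample Labs Ltd",
    "yellow oval tablet marked C3, Placebol 5 mg, Demo Generics Ltd",
]

idf, vectors = build_index(DOCS)
top = retrieve("blue capsule B2", DOCS, idf, vectors, k=1)
print(top[0][0])
```

In a full RAG setup, the retrieved entries would then be added to the prompt sent to the language model, so that the answer is grounded in the stored descriptions rather than in the model's general knowledge.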

Things to work on 🛠️🔬

Explore different embedding models and vector stores
Explore the use of dimensionality reduction (UMAP) to improve model performance
Explore the use of corrective RAG to improve accuracy

DISCLAIMER: The information above is provided for private study and/or personal use only, and is not intended to be a substitute for a health care provider’s consultation or advice. The information above does not constitute legal or technical advice.

Pauullamm/LC_Pill_Checker
