Skip to content

asapsav/scMagic

Repository files navigation

scMagic

AI co-pilot for scRNA-seq analysis. Serves as valuable tool that can handle computational and data-driven tasks, leaving researchers more time for conceptual work.

This repository is, I cannot stress this enough, very experimental. Do not use it as a dependency.

Who?

  • Savelii K - B.S. Physics MIPT, M.S CS Duke, xMcKinsey
  • Evgenii K - M.S. Physics MIPT, PhD Harvard Medical School

What?

v0.3 vision

  1. Upload scRNA-seq data, research background, research questions, things to try and things to be aware of.
  2. LLM agent-based AI runs the analysis on the cloud, then emails you when the report is ready. Manifesto:
  • usefull AI <=> 1) reliable 2) result-interpretable

Why?

To advance AI alingment we want to learn how to solve complex planning and design problems. scRNA-seq analyis is one of those problems, that this team happens to have exposure to.

Roadmap

  • Make “hello world” Streamlit langchain demo running
  • Make it running on a demo sc data but do random things (silence “upload dataset button”)
  • Implement the loop: 1) generate next step given last step + the "book" 2) write code for the next step and run the repl 3) write observation, 1) generate next ..
  • Make “hello world” Streamlit langchain demo running with Chroma DB (sepparately, write down why we need and dont need vector db rn)
  • Add functionality “easy feedback”
  • OD on cheap coffee and twix chocoalte bars and come up with a new general problem-solver architecture
  • Make a completely open source version using Llama v2 functions calling API by Ed
  • Expand agents tools to include scRNA-seq foundational models
  • Expand agents tools to include writing and excecuding Python code (REPL)
  • Expand agents tools to include wrtie and excecude R code
  • Expand agetns tools to include read and write "ToT" = tree of though
  • Expand agents tools to include planning tool
  • Expand agents tools to predict complexity of the question (incorporate system 1 and 2 thinking)
  • Expand agents tools to include "setting up dev environments"
  • Incorporate "exploration" or balance "horisontal vs vertical research strategy"
  • Redundancy (Shuttle computers)
  • 1) Optimise chain
  • 2) Optimise prompts
  • 3) Optimise models (PERF / RL)
  • 4) Attempt cost saving meaures (smaller models, smaller prompts, smaller chain loops, remove unnessesary chain loops (sepparate startup lol))

Some relevant resources for agents to use

Prototypes

About

We aim to create a useful co-pilot for scRNA analysis

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages