Skip to content
View qaicodes's full-sized avatar
πŸ‘‹
πŸ‘‹

Block or report qaicodes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
qaicodes/README.md


πŸ‘‹ About Me

πŸ”Ή Lead Data Scientist with expertise in LLMs, RAG, Vector Databases, and Cloud AI
πŸ”Ή Passionate about optimizing AI for performance, cost-efficiency, and scalability
πŸ”Ή Contributor to Open-Source AI Projects & DevOps Automation
πŸ”Ή Experienced in FastAPI, LangChain, Pinecone, Weaviate, Docker, AWS Lambda, Kubernetes
πŸ”Ή Exploring Multi-Cloud AI Deployments & Enterprise-Grade LLM Systems

πŸ–₯️ Portfolio: qaicodes.com
πŸ”— LinkedIn: linkedin.com/in/qurratulain
✍ Medium: medium.com/@qaicodes
πŸ’» GitHub: github.com/qaicodes


πŸ’‘ Featured AI Projects

πŸ”Ή Scalable RAG Chatbot (LangChain + OpenAI + Pinecone)

πŸ”— GitHub | πŸ–₯️ Live Demo
πŸ“Œ Tech Stack: Python, FastAPI, OpenAI API, LangChain, FAISS, Pinecone, Weaviate
πŸ“Œ Highlights:
βœ… Reduced response time from 2s β†’ 400ms via optimized vector retrieval
βœ… Hybrid search (BM25 + dense embeddings) for improved search relevance
βœ… Deployed on AWS Lambda with API Gateway for serverless scalability


πŸ”Ή Enterprise Knowledge Graph for AI Retrieval

πŸ”— GitHub | πŸ“– Blog Post
πŸ“Œ Tech Stack: GraphDB, Neo4j, LangChain, Weaviate, OpenAI, FastAPI
πŸ“Œ Highlights:
βœ… Enhanced RAG workflows using graph-based contextual retrieval
βœ… Optimized LLM context window usage, reducing token costs by 40%
βœ… GraphQL-powered API for fast & flexible knowledge retrieval


πŸ“ˆ AI & ML Achievements

πŸš€ Reduced AI model latency from 2s β†’ 400ms in production RAG pipelines
πŸš€ Optimized retrieval workflows, improving search accuracy by 40%
πŸš€ Lowered AI deployment costs by 35% via serverless cloud architecture
πŸš€ Built multi-modal AI systems integrating NLP, OCR, and RAG pipelines
πŸš€ Contributed to vector search optimizations for open-source LangChain modules


πŸ› οΈ AI Tools & Libraries I Contribute To

πŸ”Ή LangChain – Developed custom retriever & function calling modules
πŸ”Ή Weaviate & Pinecone – Fine-tuned hybrid search performance & cost efficiency
πŸ”Ή FastAPI – Created high-performance ML serving APIs
πŸ”Ή Neo4j GraphDB – Integrated knowledge graphs into RAG pipelines
πŸ”Ή AWS Lambda AI Deployments – Optimized serverless inference scaling


πŸ“– Favorite AI Resources & Learning Paths

πŸ“– Designing Large-Scale AI Systems – Chip Huyen
πŸ“– Deep Learning for LLMs – Andrej Karpathy’s Lecture Series
πŸ“– Retrieval-Augmented Generation for AI Applications – DataCamp
πŸ“– MLOps & Scalable AI Deployment – DeepLearning.AI


πŸ“« Let's Connect!

πŸš€ Open to remote AI engineering roles & collaborations!

Pinned Loading

  1. Retrieval-Augmented-Generation-App Retrieval-Augmented-Generation-App Public

    Jupyter Notebook

  2. bert-text-classification-gradio bert-text-classification-gradio Public

    Jupyter Notebook

  3. llama-squad llama-squad Public

    Forked from teticio/llama-squad

    Train Llama 2 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.

    Python

  4. local-llm-document-qa local-llm-document-qa Public

    Python

  5. scanned-pdf-to-images-converter scanned-pdf-to-images-converter Public

    Jupyter Notebook