I'm a Data Scientist who loves working with data, combining analytical thinking with technical skills. I've been doing this for over 4 years and am currently pursuing a Master's in Computer Science at New York University with a specialization in AI/ML. I'm into deep learning, machine learning, computer vision, and explainable AI, using tools like TensorFlow, PyTorch, and OpenCV. I'm all about turning complicated data into useful insights that support data-driven decision making.
Quick Note: I make mistakes but quickly learn from them (think of me as an RL agent xD). Additionally, I get things done!
- Relevant Coursework: Design & Analysis of Algorithms, Machine Learning, Big Data, Deep Learning, Computer Vision, Foundation of Entrepreneurship (Stern), Cloud Computing, Software Engineering, Human Computer Interaction
- Relevant Coursework: Design & Analysis of Algorithms, SQL (Database Management Systems), Artificial Intelligence, Machine Learning, Operating Systems
- Developing Inquis-AI, an AI-powered content management system, leveraging Flask, Python, Pinecone vector database, and Firebase for backend scalability and integration.
- Engineering personalized AI assistants utilizing RAG (Retrieval-Augmented Generation) and Llama 2, reducing repetitive data input and improving context management by 40%, streamlining academic and corporate workflows.
- Enhancing collaboration and real-time data synchronization with a vector-based content management system, allowing efficient file processing, instant content retrieval, and seamless data sharing across teams.
- Streamlined consent form processing for clinical trials by integrating advanced NLP and LLM techniques, reducing setup time by 45% and costs by 30%, while improving participant onboarding efficiency.
- Developed a custom algorithm to detect document language and identify key paragraphs, optimizing document structure analysis and significantly enhancing the overall document processing workflow.
- Automated form field mapping using Claude Sonnet via AWS Bedrock, reducing annotation time from 15 minutes to under 2 minutes and improving processing accuracy.
- Used the Apriori algorithm for purchase analysis, discovering key product associations and patterns for upselling and cross-selling, potentially leading to a 15% increase in average revenue per customer.
- Implemented an end-to-end custom LLM system on top of OpenAI’s GPT-3.5 for answering questions based on purchase analysis. Used vector embeddings and Neo4j graph networks for faster queries.
- Developed an ensemble of Customer Lifetime Value (CLV) and churn prediction models, leading to data-driven customer segmentation & risk profiling with 94% accuracy.
- Designed a robust & scalable architecture to identify & classify 15 defect categories on stoppers using an XceptionNet deep learning model, achieving an accuracy of ~92%.
- Deployed the defect detection deep learning model in production using Docker and Azure. Used MLFlow for version control and monitoring model performance.
- Created Power BI reports to visually represent & communicate daily classification results & insights to stakeholders, improving identification of production line issues by 37%.
- Designed and set up ROS-Gazebo pipelines to train a TurtleBot3 Waffle Pi for autonomous navigation in the manufacturing plant using the DQN algorithm, achieving a 95% success rate.
- Reduced TurtleBot3 training time by approximately 15% by integrating Human Intervention Learning, enabling the robot to learn directly from human experience stored in the buffer.
- Trained and deployed particulate classification models for structured and unstructured data using Azure ML and Azure Function Apps, achieving an overall accuracy of ~97%.
- Replaced the manual classification process used by the Lab Analysis team with the new models, resulting in improved accuracy and efficiency in particulate classification.
Feel free to reach out for collaborations or just a friendly chat:
📧 Email: arsalan.anwar@nyu.edu
🌐 While you're here, don't forget to check out my repositories below!