Skip to content
View arsalananwar11's full-sized avatar
💻
Looking for full-time opportunities
💻
Looking for full-time opportunities

Highlights

  • Pro

Block or report arsalananwar11

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arsalananwar11/README.md

🚀 About Me

Hey! I'm Arsalan Anwar!

I'm a Data Scientist who loves playing with data using smart thinking and tech skills. I've been doing this stuff for over 4 years now and am currently pursuing Masters in Computer Science at New York University with a specialization in AI/ML. I'm into tech like deep learning, machine learning and computer vision, using tools like TensorFlow, PyTorch, OpenCV and Explainable AI. I'm all about turning complicated data into useful insights in order to support data-driven decision making.

Quick Note: I make mistakes but quickly learn from it (think of me as an RL agent xD). Additionally, I get things done!


✨ GitHub Stats Arsalan Anwar's Github Stats


🎓 Education

New York University

M.S. in Computer Science | Sep 2023 - Present
  • Relevant Coursework: Design & Analysis of Algorithms, Machine Learning, Big Data, Deep Learning, Computer Vision, Foundation of Entrepreneurship (Stern), Cloud Computing, Software Engineering, Human Computer Interaction

Bangalore Institute of Technology

B.Tech in Computer Science | Aug 2016 - Aug 2020
  • Relevant Coursework: Design & Analysis of Algorithms, SQL (Database Management Systems), Artificial Intelligence, Machine Learning, Operating Systems

💻 Skills

  • Languages: Python SQL C# Java C C++

  • Frameworks/ Libraries: TensorFlow PyTorch Spark Flask NumPy Pandas OpenCV MLOps ROS

  • Tools/ Technologies: Azure DevOps Postman Power BI Docker Snowflake GitHub

  • Operating Systems: Linux Windows Mac OS


💼 Work Experience

Data Scientist (Research Assistant), New York University

New York City, US | Jan 2024 - Present
  • Developing Inquis-AI, an AI-powered content management system, leveraging Flask, Python, Pinecone vector database, and Firebase for backend scalability and integration.
  • Engineering personalized AI assistants utilizing RAG (Retrieval-Augmented Generation) and Llama 2, reducing repetitive data input and improving context management by 40%, streamlining academic and corporate workflows.
  • Enhancing collaboration and real-time data synchronization with a vector-based content management system, allowing efficient file processing, instant content retrieval, and seamless data sharing across teams.

Data Science Intern, Medidata

New York City, US | May 2024 - Aug 2024
  • Streamlined consent form processing for clinical trials by integrating advanced NLP and LLM techniques, reducing setup time by 45% and costs by 30%, while improving participant onboarding efficiency.
  • Developed a custom algorithm to detect document language and identify key paragraphs, optimizing document structure analysis and significantly enhancing the overall document processing workflow.
  • Automated form field mapping using Claude AI Sonnet via AWS Bedrock, reducing annotation time from 15 minutes to under 2 minutes, boosting processing speed and accuracy.

Data Scientist, Course5 Intelligence

Bangalore, IN | Feb 2023 - Aug 2023
  • Utilized Apriori algorithm for purchase analysis, discovering key product associations/patterns for upselling and cross-selling, potentially leading to a 15% increase in avg. revenue per customer.
  • Implemented an end-to-end custom LLM system on top of OpenAI’s GPT-3.5 for answering questions based on purchase analysis. Used vector embeddings and Neo4j graph networks for faster queries.
  • Developed an ensemble of Customer Lifetime Value (CLV) and churn prediction models, leading to data-driven customer segmentation & risk profiling with 94% accuracy.

Data Scientist, West Pharmaceutical Services

Bangalore, IN | Aug 2022 - Feb 2023
  • Designed a robust & scalable architecture to identify & classify 15 defect categories on stoppers using XceptionNet deep learning model, achieving an accuracy of ~92%.
  • Deployed the defect detection deep learning model in production using Docker and Azure. Used MLFlow for version control and monitoring model performance.
  • Created Power BI reports to visually represent & communicate daily classification results & insights to stakeholders, resulting in a 37% better identification of production line issues.

Associate Data Scientist, West Pharmaceutical Services

Bangalore, IN | Jul 2020 - Jul 2022
  • Designed and set up ROS-Gazebo pipelines to train Turtlebot3 Waffle Pi for autonomous navigation in the manufacturing plant using DQN algorithm, achieving a 95% success rate.
  • Reduced training time for Turtlebot3 by approximately 15% by integrating Human Intervention Learning, enabling the robot to learn directly from human experience stored in the buffer.

Graduate Software Trainee, West Pharmaceutical Services

Bangalore, IN | Jan 2020 - Jun 2020
  • Trained and deployed Particulate classification models for structured and unstructured data using Azure ML and Azure Function Apps. Achieved an overall accuracy of ~97%.
  • Replaced the manual classification process used by the Lab Analysis team with the new models, resulting in improved accuracy and efficiency in particulate classification.

📮 Connect with Me

Feel free to reach out for collaborations or just a friendly chat:

Linkedin | 📧 Email: arsalan.anwar@nyu.edu


🌐 While you're here, don't forget to check out my repositories below!

Pinned Loading

  1. Automated-Industrial-Inspection-System-using-Computer-Vision Automated-Industrial-Inspection-System-using-Computer-Vision Public

    Automated Industrial Inspection System using Computer Vision to inspect manufactured parts and enhance quality control by using computer vision techniques. This system aims to boost inspection accu…

    Python 1

  2. Image-Captioning-using-Encoder-Decoder-Models Image-Captioning-using-Encoder-Decoder-Models Public

    Forked from tkobil/image-captioning-using-encoder-decoder-models

    Image Captioning Benchmarking using Encoder Decoder Models

    Jupyter Notebook 1

  3. Autonomous-Robot-Navigation-Using-DRL Autonomous-Robot-Navigation-Using-DRL Public

    Forked from rcampbell95/turtlebot3_ddpg

    Python 1

  4. Safeguarding-NYC-Analyzing-Crime-Patterns-using-Big-Data Safeguarding-NYC-Analyzing-Crime-Patterns-using-Big-Data Public

    Objective: To provide an in-depth and integrated analysis of crime data from multiple sources in New York City. This study aims to identify patterns, intensities, and distributions of crimes, with …

    Jupyter Notebook 6

  5. Face-Mask-Detection Face-Mask-Detection Public

    Forked from balajisrinivas/Face-Mask-Detection

    Python