Skip to content
View sohan-bot's full-sized avatar
🏠
Working for companies
🏠
Working for companies

Block or report sohan-bot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sohan-bot/README.md

Hi there, I'm Sohan Phadikar! πŸ‘‹

Data Analyst | Python & SQL Expert | Statistical Modeling

LinkedIn Gmail

πŸ‘¨β€πŸ’» About Me

I am a results-oriented Data Analyst with 2+ years of experience in delivering advanced analytics solutions and business value. I specialize in building end-to-end analytics workflows, KPI dashboards, and performance-optimized data pipelines.

  • πŸ”­ I’m currently working on: Advanced predictive modeling and end-to-end data engineering projects.
  • πŸ’Ό Experience: Operational Analyst at Samatrix.io (Jun 2023 – Jan 2026).
  • πŸ“ˆ Key Achievement: Analyzed customer journey data to identify drop-offs, improving conversion rates by 12%.
  • πŸŽ“ Education: Master of Computer Applications (MCA) from JECRC University (2023).

πŸ› οΈ Tech Stack

Category Technologies
Languages Python SQL R
Data Engineering & Analytics PySpark Databricks Delta Lake ETL
Data Science Pandas NumPy Scikit-Learn SciPy
Deep Learning TensorFlow Keras PyTorch
Visualization & BI Power BI Tableau Plotly
Tools & Platforms Git Excel VS Code MySQL

πŸš€ Featured Projects

Project Description Tech Stack
Global Development Trends Analyzed global trends using GNP, population density, and development tiers. Designed scalable aggregation using MySQL CTEs & Window Functions. SQL Pandas Seaborn
End-to-End Analytics Workflow Architected a scalable loading process using SQLAlchemy to migrate processed datasets into a centralized MySQL database. Python SQLAlchemy ETL
Weather Forecasting (ARIMA) Performed time-series forecasting and seasonal trend analysis to predict weather parameters using statistical modeling. ARIMA Time-Series Python
From Spark to Inferno: COVID-19 Analysis Traced the pandemic from initial sparks to global hotspots. Uncovered regional disparities and survival trends using time-series analysis. Pandas PySpark Seaborn
Databricks Lakehouse – Gold Layer Analytics Designed and documented an end-to-end Lakehouse architecture using the Medallion pattern (Bronze β†’ Silver β†’ Gold). Built business-ready dimension and fact tables with clear data lineage, star schema modeling, and Databricks-style documentation for BI and analytics consumption. Databricks PySpark SQL Delta Lake Data Modeling

πŸ“Š GitHub Stats

stats graph languages graph
streak graph

"Data is the new oil." β€” Clive Humby

Pinned Loading

  1. powerbi-dashboard-portfolio powerbi-dashboard-portfolio Public

  2. web-article-analysis web-article-analysis Public

    Jupyter Notebook

  3. Global-Development-Demographics-Analysis Global-Development-Demographics-Analysis Public

    Analyze global development indicators using SQL for data extraction and Pandas for analytical modeling and visualization.

    Jupyter Notebook

  4. End-to-End-Data-Analysis-Project End-to-End-Data-Analysis-Project Public

    A complete data analytics project using Python and SQL. The process involves downloading a dataset via the Kaggle API, cleaning the data with Pandas, and loading it into SQL Server. Finally, using …

    Jupyter Notebook

  5. Covid-Data-Analysis Covid-Data-Analysis Public

    The Velocity of Vulnerability: Global Pandemic Trends (2020-2021)

    Jupyter Notebook

  6. DataWareHousing_project DataWareHousing_project Public

    TSQL