👋 Hello there! I'm Kshitij, a Data Engineer in US.
🔧 I spend most of my time tinkering with data pipelines, ensuring smooth data flow and efficient processing.
🏢 Currently, I'm pursuing my Masters in Business Analytics and Information Management from @Purdue University. I have been a former member of the Data Engineering team at @AmericanExpress, where I contributed to building robust big data pipeline with financial risk data in an on-prem setup
Big data ecosystems always intrigue me. From designing ingestion frameworks for both batch and real-time workloads to crafting insightful visualizations for strategic decision-making, I've had the privilege of working across various levels of organizational data hierarchy.
-
Big Data: Hadoop HDFS, Tez, Hive, Spark, Microsoft BI Stack (SSAS, SSIS, SSMS), SQL, Teradata, Jethro, Sqoop, batch processing
-
Python: Data Structures, Pandas, Numpy, Algorithms
-
Data Visualization Tools: Microsoft Excel (pivot tables & charts), Tableau (Desktop & Server), Power BI
-
Cloud Computing/Analytics: Amazon Web Services (S3, Athena, RDS, DynamoDB), Google Cloud Platform (BigQuery)
-
Web Scraping: Beautiful Soup, Selenium, Scrapy, Splash
-
Other Tools/Skills: R, MATLAB, Shell, Git, JIRA, HTML, CSS, JavaScript, MySQL, JSON, Crontab, PostgreSQL