I'm a passionate data engineer from India! Here are some things about me:
I have expertise in the following:
- Programming Languages: Python, Java, R, SQL π
- Big Data Technologies: Spark, PySpark, Scala, Datalake, Teradata, Hive, DB, etc. π
- Python Libraries: pandas, NumPy, PySpark, BeautifulSoup, etc. π
- Web Scraping: BS4, codecs, glob, sqlite3 shutil, lxml, json, etc. πΈοΈ
- Web Frameworks: React and Flask, Jinja3 with Python π»
- Cloud Services: AWS, GCP, Azure βοΈ
- Cloud Computing: Databricks π
- BI Tools: Tableau, Power BI, Plotly π
- Operating Systems: Windows, Linux, and macOS π»
I have accomplished the following credentials:
- π Check out my DataCamp Portfolio for some of my data science and analytics projects
- π Check out my Codecademy for some of my Full stack web developer
- π Check out my credentials in learning journey:
Here are some of my notable projects:
With 1+ years of experience in data ingestion, curation, wrangling, cleaning, lineage, and analytics/discovery using tools like Python, SQL, Spark, Teradata, and Tableau. In this project, I have worked on data discovery methods for marketing campaigns using descriptive and predictive analytics to identify customer segments, preferences and behavior patterns. Also, creating datasets with traits for customer targeting using machine learning techniques such as clustering, classification, and recommendation systems.
I have scraped data from a job portal using Python libraries such as BeautifulSoup and requests and cleaned and preprocessed data to get insights and trends such as popular job titles, skills, and job locations. I also visualized the data using Python libraries such as Plotly and Pandas.
I have built a machine learning model to predict whether the breast cancer diagnosis is benign or malignant using the Breast Cancer Wisconsin (Diagnostic) Dataset. I preprocessed data using tools such as Pandas and NumPy, built and evaluated several models such as Logistic Regression, Random Forest, and Support Vector Machines using Scikit-learn, and fine-tuned the best model using GridSearchCV.
If you are interested in chat with me, you can find me on LinkedIn or email me at damodhar918@outlook.com.
Thanks for checking out my profile! π