Professional Aspiration:
- Passionate about leveraging computational methods to advance healthcare through biological data analysis
Educational Background:
- Master of Engineering in Computational Mathematics Science and Engineering(CMSE), providing a robust foundation in advanced computational techniques.
- Bachelor of Science in Biosystems Engineering with Concentration in Biomedical Engineering & Minor in CMSE
Technical Skills:
- Proficient in Python and R, utilized extensively in data science projects and software engineering tasks to analyze complex datasets and build predictive models.
- Git, Unix/Linux, FastQC, samtools, STAR, High Power Performing Computing.
- Machine Learning: Linear and Logitstic Regression, SVM, Decision Trees and Bayes Classifiers
- Deep Learning: TensorFlow, Neural Networks, Transformers -- BERT
- Dimensional Reduction: Principal Component Analysis, Multi-dimensional Scaling,
- Data analysis & Visualization: Pandas, Polars, tidyverse, ggplot2, matplotlib, and seaborn)
Mathematical Expertise:
- Solid understanding of Linear Algebra, Numerical Linear Algebra, Probability, and Statistics, essential for developing sophisticated algorithms and data analysis tools.
This project implements a method for scrapping youtube search result using selenium. Selenium is a perform document that allows for automation of website by the use of webdriver.
-
Selenium:
Automating website scrolling to get a large source html to parse. The html page is parse using
XPATH
. We first inspected the html source page and got the respective xpath for a video renderer. We then use the corresponding xpath to find the videoId for the youtube video. -
Youtube Data api:
To get the video statistics and video description, we used Youtube Data Api to access each video.
-
Hydra:
We define the project configuration using hydra. With hydra we can store file path with credentials and website headers, root url and country of search in yaml file.
- Rust language
- Exploring polars and alternative for pandas in python.
- Understand datafusion