- 👋 Hi, I’m @bayjan
- 👀 I’m interested in using machine learning algorithms to find patterns from data, especially, biomedical multi-omics data.
- I’m interested in comparative genomics analysis and do irregular DevOps tasks as well.
- 🌱 I’m currently learning a deep learning framework and its application to several data types.
- 📫 How to reach me: at GitLab https://gitlab.com/bayjan, at twitter @bayjan and at Google Scholar https://scholar.google.com/citations?user=UOg0jLgAAAAJ&hl=en
Bioinformatician turned Data Scientist
I am a data scientist with a background in bioinformatics and expertise in machine learning, statistics, and programming in Python and R.
Skills
Programming
Python (Biopython)
R
SQL (MySQL, PostgreSQL)
Bash scripting
Grid/Cloud computing
NoSQL (MongoDB)
Probabilistic programming (Stan/PyStan)
Essential bioinformatics skills
Comparative genomics tools
Orthology prediction
NGS data analysis (assembly, SNP/InDel analysis, annotation)
Phylodynamic analysis (e.g. BEAST tool)
Galaxy (workflow management and tool integration framework)
Analysis of large metagenomics data sets (Qiime)
MLST profiling (e.g.: SeqSphere)
Knowledge in statistics and machine learning
Linear models
Multivariate statistics
Machine learning algorithms (e.g. Random Forest, SVM) and libraries (e.g. scikit-learn, WEKA, H2GO)
Deep learning with PyTorch (fast.ai)
Other relevant skills
Linux
HPC
Snakemake
Docker
Git
Elasticsearch
Web programming (mostly using Python)
Apache Spark
Kubernetes