• Edgar Javier Hernandez, Ph.D.: edgarjavi@gmail.com
BMI Introduction to Programming (Python)
This course offers an introduction to an extensive array of methods for mathematical data analysis with emphasis on three major topics (probability analysis, statistical inference, and the basic concepts of statistical pattern recognition through machine learning), with a clear emphasis in the biomedical field. In here we will cover basic probability concepts such as recognizing the importance of the analysis of random events in real life applications, using probabilities axioms and rules. This course will present descriptive and inferential data methods for predictive analysis on samples and populations. This introductory course lays the foundation for more advance classes offered at the Biomedical Informatics Department. As an additional component of this class will be the extensive use of the statistical software R which is one of the most used statistical packages in most fields.
• Introduction to Probability and Statistics Using R (2010 - https://cran.r-project.org/web/packages/IPSUR/vignettes/IPSUR.pdf ), Kerns, Jay.
• Think Bayes (2012 - http://www.greenteapress.com/thinkbayes/thinkbayes.pdf ), Downey, Allen. (both freely available online).
• R for data Science (2017), Grolemund & Wickham - http://r4ds.had.co.nz/index.html
• Also a suggested text book is
o Think Stats: Exploratory Data Analysis in Python (2014 - http://greenteapress.com/thinkstats2/thinkstats2.pdf ) Downey, Allen.
o Introduction to Biostatistics 2nd ed. (2009): Sokal & Rohld
The University of Utah's subscription to the Safari Technical Books Online has a large selection of books that can be read online. For example. • Introduction to Probability (2015) Blizstein & Hwang • Probability and Statistics for Computer Scientists (2015) Baron • Probability and Statistics (2015) Rukmangadachari & Reddy
-
- By the end of this course, you will understand basic probability concepts, use probability rules, distinguish between discrete and continuous variables and the methods for their analysis, and solve problems related to random events.
-
- By the end of the course, you will be able to describe the main methods for data exploration and analysis. You will use descriptive statistics to understand the nature of your data, and you will use hypothesis testing to predict the behavior of variables from samples and populations.
-
- By the end of the course, you will understand the concepts of machine learning and pattern recognition, such as cluster recognition, data classification, dimensionality reduction, and temporal patterns.
-
- By the end of the course, you will have experience with of the most widely use statistical software in the sciences, R. You will be able to use many of the statistical and graphical packages available for this software.
- Introduction to the course
-
- Working with R
-
- Probability fundamentals
-
- Exploratory Data Analysis
-
- Bayes’ rule
-
- Probability functions
-
- Probability Distributions
-
- Hypothesis testing
-
- Visualization tools for hypothesis testing
-
- Linear regression and correlation
-
- Discrete estimation, contingency tables, exact test
-
- Clustering
-
- Boostrap
-
- Survival Analysis
-
- Bayesian inference
-
- Information Theory
-
- Introduction to Machine learning
-
- Case studies