Skip to content

erictleung/data-science-resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Readings and Resources

Check Resources

Review and resource materials from around the internet for data science, with applications in bioinformatics and computational biology, that I've found useful.

Table of Contents

Learning to learn

Resources and tips on how to self-learn and learn with others

Statistics and Probability

Statistics is the study of the collection, analysis, interpretation, presentation, and organization of data.

Wikipedia

General Resources

R Friendly Resources

Specific Topics

Interactive Articles

Data-related

General mathematics

Resources generally related to learning and understand mathematical foundations

Linear Algebra

Linear algebra is the branch of mathematics concerning vector spaces and linear mappings between such spaces.

Wikipedia

Network Science

Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, cognitive and semantic networks, and social networks, considering distinct elements or actors represented by nodes (or vertices) and the connections between the elements or actors as links (or edges).

Wikipedia

Algorithms and Data Structures

In mathematics and computer science, an algorithm is a self-contained step-by-step set of operations to be performed.

Wikipedia

In computer science, a data structure is a particular way of organizing and storing data in a computer so that it can be accessed and modified efficiently.

Wikipedia

Programming

Computer programming (often shortened to programming) is a process that leads from an original formulation of a computing problem to executable computer programs.

Wikipedia

Statistical Methods and Machine Learning

Machine learning is the subfield of computer science that "gives computers the ability to learn without being explicitly programmed".

Wikipedia

Computational Biology

Computational biology involves the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavior, and social systems.

Wikipedia

Domain Knowledge

Domain knowledge is valid knowledge used to refer to an area of human endeavour, an autonomous computer activity, or other specialized discipline

Wikipedia

Data Visualization and Making Figures

Data visualization or data visualisation is viewed by many disciplines as a modern equivalent of visual communication. It involves the creation and study of the visual representation of data, meaning "information that has been abstracted in some schematic form, including attributes or variables for the units of information".

Wikipedia

Should-Read Bioinformatics Papers

Computational biology and bioinformatics papers to cover the breadth of the field.

General knowledge mapping and exploration tools

  • Inciteful - Tools to help you accelerate your research

Software Engineering

Software engineering is the application of engineering to the development of software in a systematic method.

Wikipedia

Reproducible Science

Reproducibility is the ability to get the same research results using the raw data and computer programs provided by the researchers.

Wikipedia

People Skills and Communication

People skills are patterns of behavior and behavioral interactions. Among people, it is an umbrella term for skills under three related set of abilities: personal effectiveness, interaction skills, and intercession skills.

Wikipedia

Other Lists

Useful lists on their own that may intersect other topics above.

License

CC0