-
Notifications
You must be signed in to change notification settings - Fork 45
/
index.jupyter
169 lines (133 loc) · 7.51 KB
/
index.jupyter
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
nbformat 4
nbformat_minor 2
markdown
# Statistics in Python
[back to main page](../index.ipynb)
markdown
pandas: http://pandas.pydata.org/
http://statpages.org/
TODO: explain "power" and "effect"
markdown
## Freely Available Raw Data
If you need some data to try out your statistics skills, maybe one of the
following helps ...
Andrew J. Tatem, Carlos A. Guerra, Peter M. Atkinson, Simon I. Hay
[Athletics: Momentous sprint at the 2156 Olympics?](http://dx.doi.org/10.1038/431525a)
Nature Volume 431, Issue 525, 2004.
[supplementary information](http://www.nature.com/nature/journal/v431/n7008/suppinfo/431525a.html)
Bicycle data from Seattle:
http://www.seattlebikeblog.com/2014/06/09/a-statistical-analysis-of-biking-on-the-fremont-bridge-part-1-overview/
https://jakevdp.github.io/blog/2014/06/10/is-seattle-really-seeing-an-uptick-in-cycling/
https://jakevdp.github.io/blog/2015/07/23/learning-seattles-work-habits-from-bicycle-counts/
markdown
## Tutorials etc.
Getting started with Pandas:
http://efavdb.com/pandas-tips-and-tricks/
Analysing Weed Pricing across US:
https://github.com/amitkaps/weed
Probability, Paradox, and the Reasonable Person Principle:
http://nbviewer.ipython.org/url/norvig.com/ipython/Probability.ipynb
Machine Learning for Hackers:
http://slendermeans.org/category/will-it-python.html
CS109 Data Science (video lecture):
http://cs109.org/
spurious correlations:
http://www.tylervigen.com/
NIST/SEMATECH e-Handbook of Statistical Methods:
http://www.itl.nist.gov/div898/handbook/
Implementing a Principal Component Analysis (PCA) in Python step by step:
http://sebastianraschka.com/Articles/2014_pca_step_by_step.html
What Educated Citizens Should Know About Statistics and Probability (Jessica Utts, The American Statistician Volume 57, Issue 2, 2003):
http://www.tandfonline.com/doi/abs/10.1198/0003130031630
Things in Pandas I Wish I'd Known Earlier:
http://nbviewer.ipython.org/github/rasbt/python_reference/blob/master/tutorials/things_in_pandas.ipynb
PCA:
http://nbviewer.ipython.org/github/rasbt/pattern_classification/blob/master/dimensionality_reduction/projection/principal_component_analysis.ipynb
Probabilistic Programming & Bayesian Methods for Hackers:
http://camdavidsonpilon.github.io/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/
A Concrete Introduction to Probability (using Python):
http://nbviewer.jupyter.org/url/norvig.com/ipython/Probability.ipynb
Probability, Paradox, and the Reasonable Person Principle:
http://nbviewer.jupyter.org/url/norvig.com/ipython/ProbabilityParadox.ipynb
Modern Pandas Series:
[Modern Pandas](http://tomaugspurger.github.io/modern-1.html),
[Method Chaining](http://tomaugspurger.github.io/method-chaining.html),
[Indexes](http://tomaugspurger.github.io/modern-3-indexes.html),
[Performance](http://tomaugspurger.github.io/modern-4-performance.html),
[Tidy Data](http://tomaugspurger.github.io/modern-5-tidy.html),
[Visualization](http://tomaugspurger.github.io/modern-6-visualization.html),
Larger Data (forthcoming)
[Distance Metrics for Fun and Profit](http://www.benfrederickson.com/distance-metrics/),
[Finding Similar Music using Matrix Factorization](http://www.benfrederickson.com/matrix-factorization/)
[Frequentism and Bayesianism: A Practical Introduction](http://jakevdp.github.io/blog/2014/03/11/frequentism-and-bayesianism-a-practical-intro/),
[When Results Differ](http://jakevdp.github.io/blog/2014/06/06/frequentism-and-bayesianism-2-when-results-differ/),
[Confidence, Credibility, and why Frequentism and Science do not Mix](http://jakevdp.github.io/blog/2014/06/12/frequentism-and-bayesianism-3-confidence-credibility/),
[How to be a Bayesian in Python](http://jakevdp.github.io/blog/2014/06/14/frequentism-and-bayesianism-4-bayesian-in-python/),
[Model Selection](http://jakevdp.github.io/blog/2015/08/07/frequentism-and-bayesianism-5-model-selection/)
Easier data analysis in Python with pandas (video series):
http://www.dataschool.io/easier-data-analysis-with-pandas/)
Think Stats, Think Bayes (free PDF books):
http://greenteapress.com/thinkstats2/index.html, http://greenteapress.com/wp/think-bayes/
Computational and Inferential Thinking: The Foundations of Data Science
http://www.inferentialthinking.com/
Points of Significance
http://www.nature.com/collections/qghhqm/pointsofsignificance
https://github.com/KIPAC/StatisticalMethods
Introduction to Bayesian Inference:
https://www.datascience.com/blog/introduction-to-bayesian-inference-learn-data-science-tutorials
EN 605.448 Data Science, at the Johns Hopkins University, Whiting School of Engineering:
http://nbviewer.jupyter.org/github/actsasgeek/en605448/blob/master/notebooks/010.1%20-%20Introduction.ipynb
https://github.com/actsasgeek/en605448
PROBABILITY THEORY: THE LOGIC OF SCIENCE by E. T. Jaynes (1994) http://omega.albany.edu:8008/JaynesBook.html
markdown
## Case Studies
TODO: Try to analyze (and probably reproduce) some existing studies.
Comparison of violins:
http://www.pnas.org/content/109/3/760
http://www.pnas.org/content/early/2014/04/03/1323367111
http://phenomena.nationalgeographic.com/2014/04/07/stradivarius-violins-arent-better-than-new-ones-round-two/
http://josephcurtinstudios.com/article/the-indianapolis-experiment/
http://www.violinist.com/blog/laurie/20121/13039/
http://www.npr.org/blogs/deceptivecadence/2012/01/02/144482863/double-blind-violin-test-can-you-pick-the-strad
http://www.artsjournal.com/slippeddisc/2012/01/exclusive-how-i-blind-tested-old-violins-against-new.html
http://www.thestrad.com/latest/editorschoice/from-the-archive-classic-and-modern-violins-compared
http://www.thestrad.com/video/new-vs-old-follow-up-to-the-indianapolis-blind-testing-experiment
http://diepresse.com/home/meinung/marginalien/1587780/Bei-Stradivari-hat-vor-allem-der-Mythos-Klang
http://abcnews.go.com/Technology/wireStory/blind-test-soloists-violins-23227307
http://en.wikipedia.org/wiki/Player_preferences_among_new_and_old_violins
markdown
## Other Stuff
https://github.com/grrrr/krippendorff-alpha
Matlab® routines for analyzing psychophysical data: http://www.palamedestoolbox.org/index.html
markdown
<p xmlns:dct="http://purl.org/dc/terms/">
<a rel="license"
href="http://creativecommons.org/publicdomain/zero/1.0/">
<img src="http://i.creativecommons.org/p/zero/1.0/88x31.png" style="border-style: none;" alt="CC0" />
</a>
<br />
To the extent possible under law,
<span rel="dct:publisher" resource="[_:publisher]">the person who associated CC0</span>
with this work has waived all copyright and related or neighboring
rights to this work.
</p>
metadata
{
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.6.7"
}
}