# Reading goals for Hughes and Hase

Below are the goals for each reading assignment in Hughes and Hase.

* [Errors in the physical sciences](#Errors-in-the-physical-sciences)
* [Random error in measurements](#Random-error-in-measurements)
* [Uncertainties as probabilities](#Uncertainties-as-probabilities)
* [Error propagation](#Error-propagation)
* [Data visualization and reduction](#Data-visualization-and-reduction)
* [Least-squares fitting of complex functions](#Least-squares-fitting-of-complex-functions)
* [Computer minimization and the error matrix](#Computer-minimization-and-the-error-matrix)
* [Hypothesis testing - how good are our models?](#Hypothesis-testing---how-good-are-our-models?)
* [Topics for further study](#Topics-for-further-study)
* [Exercises](#Exercises)


## Errors in the physical sciences 
1.  Be able to explain what the following terms mean and be able to
    provide representative examples:
    1.  An accurate measurement;
    2.  A precise measurement;
    3.  Random errors/uncertainty;
    4.  Systematic errors/uncertainty;
    5.  Mistakes.

## Random error in measurements 
1.  Be able to use the mean and standard deviation to characterize a
    statistical sample. Specifically, be able to:
    1.  Estimate the mean and standard deviation from a histogram;
    2.  Sketch a distribution with a given mean and standard deviation;
    3.  Calculate the mean for a set of measurements;
    4.  Calculate the standard deviation for a set of measurements.
2.  Know the functional form for a Gaussian distribution, be able to
    sketch it for a given mean and standard deviation, and be able to
    estimate the mean and standard deviation from the plot of a Gaussian
    distribution.
3.  Be able to discuss the relationships between a sample and its parent
    distribution.
4.  Be able to discuss the difference between the standard deviation and
    the standard deviation in the mean (aka the standard error), and be
    able to estimate both from a histogram of measurements.
5.  Be able to report uncertainties correctly for a given measurement.
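
As a minimal sketch of the calculations named in the goals above, the following computes the mean, the standard deviation, and the standard error of a small sample. The function name `sample_statistics` and the timing data are hypothetical, chosen only for illustration.

```python
import math

def sample_statistics(measurements):
    """Return (mean, standard deviation, standard error) of a sample."""
    n = len(measurements)
    mean = sum(measurements) / n
    # Sample standard deviation with the N - 1 (Bessel) correction.
    variance = sum((x - mean) ** 2 for x in measurements) / (n - 1)
    std_dev = math.sqrt(variance)
    # Standard error: the uncertainty in the mean, smaller by sqrt(N).
    std_err = std_dev / math.sqrt(n)
    return mean, std_dev, std_err

# Hypothetical repeated timing measurements, in seconds.
times = [9.8, 10.1, 10.0, 9.9, 10.2]
mean, std_dev, std_err = sample_statistics(times)
print(f"mean = {mean:.2f} s, sigma = {std_dev:.2f} s, alpha = {std_err:.2f} s")
```

The standard deviation describes the spread of individual measurements, while the standard error describes how well the mean is known, so only the latter shrinks as more data are collected.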

## Uncertainties as probabilities 
1.  Be able to explain what a probability distribution function
    $P_{DF}(x)$ represents and why Eqs. (3.1)-(3.6) follow from its
    definition.
2.  Be able to recall and use Eqs. (3.1)-(3.3) to perform simple
    probability calculations for an arbitrary $P_{DF}(x)$, including:
    1.  check that $P_{DF}(x)$ is properly normalized, and identify the
        correct normalization factor if it is not;
    2.  evaluate the expectation value of a function $f(x)$; and
    3.  evaluate the expectation value of the mean and the variance.
3.  Be able to recall the definitions (3.7) and (3.8) of the Gaussian
    probability distribution function and the error function,
    respectively, and know how to use the error function in simple
    probability calculations like the one given in Sec. 3.2.2.
4.  Be aware of the rules described in Sec. 3.3.2 for rejecting
    outliers, be able to follow a well-defined procedure for doing so,
    and be able to suggest alternatives to throwing away data points.
5.  Be able to describe the basic properties of a Poisson distribution
    $P(N; \bar{N})$,
    including:
    1.  its functional form;
    2.  the kind of experimental data that will be described by it;
    3.  the expectation values of its mean and variance; and
    4.  the Gaussian probability distribution that approximates it for
        $\bar{N}\rightarrow\infty$.
6.  Be able to sketch a Poisson distribution for a given mean and
    standard deviation, and be able to estimate the mean and standard
    deviation from the plot of a Poisson distribution.
7.  Be able to state the central limit theorem and recognize how it is
    used to justify the assumption of Gaussian errors in many
    experiments.
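
Two of the calculations in this list can be sketched numerically: using the error function to find the probability that a Gaussian variable falls in an interval, and checking that the mean and variance of a Poisson distribution are both equal to $\bar{N}$. The function names and the value $\bar{N} = 4$ are assumptions made for illustration, and the formulas are the standard ones rather than a transcription of the textbook equations.

```python
import math

def gaussian_probability(x1, x2, mean, sigma):
    """Probability that a Gaussian variable lies between x1 and x2."""
    z1 = (x1 - mean) / (sigma * math.sqrt(2))
    z2 = (x2 - mean) / (sigma * math.sqrt(2))
    return 0.5 * (math.erf(z2) - math.erf(z1))

def poisson(n, mean):
    """Poisson probability of counting n events when the mean is `mean`."""
    return math.exp(-mean) * mean ** n / math.factorial(n)

# Fraction of a Gaussian within one standard deviation of the mean.
within_one_sigma = gaussian_probability(-1.0, 1.0, mean=0.0, sigma=1.0)

# Mean and variance of a Poisson distribution, summed over enough terms
# that the neglected tail is negligible for this (hypothetical) mean.
n_bar = 4.0
counts = range(60)
p_mean = sum(n * poisson(n, n_bar) for n in counts)
p_var = sum((n - p_mean) ** 2 * poisson(n, n_bar) for n in counts)
print(f"P(|x - mean| < sigma) = {within_one_sigma:.4f}")
print(f"Poisson mean = {p_mean:.4f}, variance = {p_var:.4f}")
```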

## Error propagation

1.  Be able to explain why Eq. (4.7) gives the approximate uncertainty
    in the single-variable function $Z(A)$ when there is uncertainty
    in the argument $A$, and be able to discuss the limitations of this
    approximation.
2.  Be able to derive all of the results in Table 4.1 and use Eq. (4.7)
    in concrete examples.
3.  Be able to explain why the component uncertainties $\alpha_Z^A$,
    $\alpha_Z^B$, $\alpha_Z^C$,..., add in quadrature to give $\alpha_Z$
    in Eq. (4.10), and recognize (for now) that this expression is
    restricted to independent, uncorrelated variables (Ch. 7 discusses
    the reason for this restriction in more depth).
4.  Be able to explain why Eq. (4.16) gives the approximate uncertainty
    in the multivariable function $Z\left(A, B, C,\ldots\right)$ when
    there is uncertainty in the arguments $A$, $B$, $C$,..., and be able
    to discuss the limitations of this approximation—here, as in (3)
    above, it is enough for you to recognize that the variables must be
    independent and uncorrelated.
5.  Be able to derive all of the results in Table 4.2 and use Eq. (4.16)
    in concrete examples.
6.  Be able to use error propagation methods to identify the dominant
    uncertainty in an experiment.
7.  Be able to find the weighted mean and its standard error
    $\alpha_{CE}$ for a set of numbers $\left\{x_i\right\}$ with
    uncertainties $\left\{\alpha_i\right\}$.
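
The quadrature rule and the weighted mean can be sketched as follows, assuming independent, uncorrelated variables as in the goals above. The helper names `propagate` and `weighted_mean` and the example numbers are hypothetical. The partial derivatives are estimated by finite differences, which is one convenient choice and not necessarily the method used in the text.

```python
import math

def propagate(func, values, uncertainties, rel_step=1e-6):
    """Propagate independent uncertainties through func(*values).

    Each contribution is |dZ/dA_i| * alpha_i, with the partial derivative
    estimated by a central finite difference; contributions add in quadrature.
    """
    total = 0.0
    for i, (value, alpha) in enumerate(zip(values, uncertainties)):
        step = rel_step * (abs(value) or 1.0)
        up = list(values)
        down = list(values)
        up[i] = value + step
        down[i] = value - step
        derivative = (func(*up) - func(*down)) / (2 * step)
        total += (derivative * alpha) ** 2
    return math.sqrt(total)

def weighted_mean(xs, alphas):
    """Weighted mean and its standard error, with weights 1 / alpha_i**2."""
    weights = [1.0 / a ** 2 for a in alphas]
    mean = sum(w * x for w, x in zip(weights, xs)) / sum(weights)
    return mean, 1.0 / math.sqrt(sum(weights))

# Hypothetical example: Z = A * B with A = 2.0 +/- 0.1 and B = 3.0 +/- 0.2.
alpha_z = propagate(lambda a, b: a * b, [2.0, 3.0], [0.1, 0.2])

# Hypothetical example: combine two measurements of the same quantity.
x_ce, alpha_ce = weighted_mean([10.0, 10.4], [0.1, 0.2])
print(f"alpha_Z = {alpha_z:.3f}, x_CE = {x_ce:.3f} +/- {alpha_ce:.3f}")
```

In the product example the two contributions are 0.3 and 0.4, so the second variable dominates the uncertainty; comparing the individual terms in this way is how the dominant uncertainty in an experiment is identified.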

## Data visualization and reduction

1.  Be able to recall the "Guidelines for plotting data" in Sec. 5.1,
    and apply them to your own graphs.
2.  Be able to compute appropriate error bars for data in a graph.
3.  Be able to assess the quality of a fit from the fraction of data
    points that lie within one standard error bar from the fitted curve.
4.  Recognize that a least-squares fit to a line can be computed from
    the data using Eqs. (5.1)-(5.6).
5.  Be able to explain the meaning and significance of the following
    terms:
    1.  interpolate;
    2.  extrapolate;
    3.  aliasing;
    4.  residual;
    5.  method of least squares; and
    6.  goodness-of-fit parameter.
6.  Be able to explain why $P\left(m,c\right)$ in (5.8) is maximized
    when $\chi^2$ in (5.9) is minimized, and discuss how this provides a
    rationale for using $\chi^2$ to determine optimal fit parameters.
7.  Know how to graph data to help identify systematic errors.
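
A sketch of an unweighted straight-line fit is given below. It uses the standard closed-form least-squares expressions for the slope, the intercept, and their uncertainties, which should correspond to the equations cited above but have not been checked against the text term by term. The function name `fit_line` and the data are hypothetical.

```python
import math

def fit_line(xs, ys):
    """Unweighted least-squares fit of y = m*x + c.

    Returns (m, c, alpha_m, alpha_c), where the common uncertainty of the
    points is estimated from the scatter of the residuals.
    """
    n = len(xs)
    sx = sum(xs)
    sy = sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    delta = n * sxx - sx ** 2
    m = (n * sxy - sx * sy) / delta
    c = (sxx * sy - sx * sxy) / delta
    residuals = [y - (m * x + c) for x, y in zip(xs, ys)]
    # Common uncertainty: N - 2 because two parameters were fitted.
    alpha_cu = math.sqrt(sum(r * r for r in residuals) / (n - 2))
    alpha_m = alpha_cu * math.sqrt(n / delta)
    alpha_c = alpha_cu * math.sqrt(sxx / delta)
    return m, c, alpha_m, alpha_c

# Hypothetical data scattered about the line y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.0]
m, c, alpha_m, alpha_c = fit_line(xs, ys)
print(f"m = {m:.2f} +/- {alpha_m:.2f}, c = {c:.2f} +/- {alpha_c:.2f}")
```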

## Least-squares fitting of complex functions

1.  Be able to recall and use the $\chi^2$ goodness-of-fit parameter (6.1)
    to fit a model to data, including:
    1.  determine the best fit parameters by constructing and minimizing
        $\chi^2$;
    2.  account for non-uniform uncertainties, for example for
        measurements drawn from a Poisson distribution; and
    3.  recognize the need for the weights in Equations (6.3)-(6.7) for
        a linear fit with non-uniform uncertainties.
2.  Be able to identify measurement strategies that will minimize the
    uncertainty of a particular fit parameter, for example the slope or
    the intercept in a linear fit.
3.  Recognize the need to use normalized residuals to evaluate fits with
    non-uniform uncertainties.
4.  Recognize the distinction between linear and nonlinear fits;
    specifically, that:
    1.  model functions that are linear in the parameters possess
        closed-form solutions like Equations (6.3)-(6.7) with a single
        set of optimal parameters;
    2.  model functions that are nonlinear in the parameters require
        numerical optimization techniques that are not guaranteed to
        have a single global solution;
    3.  nonlinear fit algorithms require an initial guess for the model
        parameters, while linear fits do not; and
    4.  the initial parameter guess may influence the solution found in
        a nonlinear fit.
5.  Be able to construct contours of the $\chi^2$ function in a
    two-parameter fit, and use them to estimate parameter uncertainties.
6.  Recognize the value and practical challenge of combining results
    from different measurements, each fit with different models, but
    using the same underlying fit parameters, e.g., fitting the complex
    function (6.15) to measurements of the real and imaginary part of a
    frequency response.
7.  Be able to test for correlations in fit residuals using lag plots
    and the Durbin-Watson statistic, Eq. (6.16); be able to explain why
    such correlations may indicate a poor fit of the model to the
    experimental data.
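
The residual diagnostics mentioned in this list can be sketched as below. The Durbin-Watson statistic is written in its standard form, the sum of squared differences of successive residuals divided by the sum of squared residuals, which is assumed here to match Eq. (6.16). The function names and the residual values are hypothetical.

```python
def normalized_residuals(ys, model_ys, alphas):
    """Residuals divided by the uncertainty of each data point."""
    return [(y - f) / a for y, f, a in zip(ys, model_ys, alphas)]

def chi_squared(ys, model_ys, alphas):
    """Sum of squared normalized residuals."""
    return sum(r * r for r in normalized_residuals(ys, model_ys, alphas))

def durbin_watson(residuals):
    """Durbin-Watson statistic: near 2 for uncorrelated residuals,
    toward 0 for positive and toward 4 for negative correlation."""
    numerator = sum((residuals[i] - residuals[i - 1]) ** 2
                    for i in range(1, len(residuals)))
    return numerator / sum(r * r for r in residuals)

# Hypothetical normalized residuals. A slow drift suggests the model is
# missing a trend; alternating signs show anti-correlation.
drifting = [-1.5, -1.0, -0.4, 0.3, 0.9, 1.4]
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
print(durbin_watson(drifting), durbin_watson(alternating))
```

A lag plot shows the same information graphically: plotting each residual against its predecessor gives a structureless cloud when the residuals are uncorrelated and a tilted band when they are not.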

## Computer minimization and the error matrix

1.  Be able to explain qualitatively how data analysis computer programs
    fit a model to data by minimizing the $\chi^2$ goodness-of-fit
    parameter as a function of the model parameters. Specifically,
    1.  recognize that the terms *grid search*, *gradient-descent*,
        *Newton's method*, *Gauss-Newton*, and *Levenberg-Marquardt*
        refer to different algorithms for minimizing a function;
    2.  be able to use matrix notation to expand $\chi^2$ to second order
        around a particular point in parameter space, as shown in (7.6); and
    3.  be able to write the gradient vector, Hessian matrix, and
        Jacobian matrix for a function of multiple variables, and
        explain how they appear in the context of computer minimization
        routines.
2.  Be able to describe how the curvature matrix relates to the error
    surface near $\chi^2_{\min}$, and how it can be used to estimate the
    parameter values at which $\chi^2=\chi_{\text{min}}^2+1$.
3.  Be able to describe the meaning and significance of the covariance
    matrix and the correlation matrix.
4.  Recognize that the covariance matrix can be estimated from a fit by
    inverting the curvature matrix at the minimum of the error surface.
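
The relationship between the curvature matrix, the covariance matrix, and the $\chi^2_{\min}+1$ contour can be sketched for a weighted straight-line fit. The curvature matrix is taken here to be half the Hessian of $\chi^2$ with respect to the parameters, which is a common convention and is assumed to match the text. All names and data are hypothetical.

```python
import math

def line_fit_error_matrix(xs, ys, alphas):
    """Weighted fit of y = m*x + c via the curvature matrix.

    Returns the best-fit (m, c) and the 2x2 covariance (error) matrix,
    ordered as (m, c).
    """
    w = [1.0 / a ** 2 for a in alphas]
    # Curvature matrix: half the Hessian of chi-squared in (m, c).
    a_mm = sum(wi * x * x for wi, x in zip(w, xs))
    a_mc = sum(wi * x for wi, x in zip(w, xs))
    a_cc = sum(w)
    b_m = sum(wi * x * y for wi, x, y in zip(w, xs, ys))
    b_c = sum(wi * y for wi, y in zip(w, ys))
    # Invert the 2x2 curvature matrix to get the covariance matrix.
    det = a_mm * a_cc - a_mc ** 2
    cov = [[a_cc / det, -a_mc / det], [-a_mc / det, a_mm / det]]
    m = cov[0][0] * b_m + cov[0][1] * b_c
    c = cov[1][0] * b_m + cov[1][1] * b_c
    return (m, c), cov

def chi_squared(m, c, xs, ys, alphas):
    """Chi-squared of the line y = m*x + c for the given data."""
    return sum(((y - m * x - c) / a) ** 2 for x, y, a in zip(xs, ys, alphas))

# Hypothetical data with a common uncertainty of 0.2 on every point.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.0]
alphas = [0.2] * 5
(m, c), cov = line_fit_error_matrix(xs, ys, alphas)
alpha_m = math.sqrt(cov[0][0])
alpha_c = math.sqrt(cov[1][1])
correlation = cov[0][1] / (alpha_m * alpha_c)

# Step m by one standard error and re-optimize c at fixed m.
# For a model linear in its parameters chi-squared rises by exactly 1.
chi2_min = chi_squared(m, c, xs, ys, alphas)
m_shifted = m + alpha_m
c_reopt = (sum((y - m_shifted * x) / a ** 2 for x, y, a in zip(xs, ys, alphas))
           / sum(1.0 / a ** 2 for a in alphas))
delta_chi2 = chi_squared(m_shifted, c_reopt, xs, ys, alphas) - chi2_min
print(f"alpha_m = {alpha_m:.4f}, alpha_c = {alpha_c:.4f}, rho = {correlation:.3f}")
print(f"delta chi-squared = {delta_chi2:.6f}")
```

The strongly negative correlation between slope and intercept in this example arises because all of the $x$ values lie on one side of the origin, so raising the slope must be compensated by lowering the intercept.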

## Hypothesis testing - how good are our models?

1.  Be able to explain how to use the $\chi^2$ statistic to test whether a
    set of measurements is consistent with a theoretical model, and why
    it is useful in this context; also, be able to apply the $\chi^2$ test
    to assess the agreement between an experiment and a theoretical
    description of it. Specifically,
    1.  be able to state the null hypothesis that is being tested with
        the $\chi^2$ test: that the model function $f(\theta; x)$ with the
        best-fit parameters
        $\mathbf{\theta} = \left\{\theta_1,\ldots,\theta_{\mathcal{N} }\right\}$
        and independent variables $\mathbf{x} = \{x_1,\ldots,x_N\}$ is
        consistent with measurements $\mathbf{y} = \{y_1,\ldots,y_N\}$
        that have independently known Gaussian uncertainties
        $\left\{\alpha_1,\ldots,\alpha_N\right\}$ for each measurement;
    2.  be able to explain what the term "degrees of freedom" means in
        the context of a statistical estimate;
    3.  be able to calculate the number of degrees of freedom in a fit;
    4.  given $N$ data points with known uncertainties and a model with
        $\mathcal{N}$ parameters that are optimized to yield a
        goodness-of-fit statistic $\chi_\mathrm{min}^2$, be able to
        assess the quality of the fit, both qualitatively (by comparing
        $\chi_\mathrm{min}^2$ with the number of degrees of freedom) and
        quantitatively (by computing $P(\chi_\mathrm{min}^2;\nu)$).
2.  Be able to describe a variety of methods, including the $\chi^2$ test,
    to assess the quality of a fit (see Sec. 8.5.3).
3.  Recognize that the confidence limits for a parameter estimate will
    be given by the Student's *t*-distribution when the uncertainty is
    not known independently, and that this is usually the implicit
    assumption of curve-fitting software; also, be able to explain why
    the Student's *t*-distribution will approach the Gaussian distribution
    when the number of degrees of freedom is sufficiently large.
4.  Recognize that curve-fitting software often scales fit parameter
    uncertainties automatically, and that this can lead to erroneous
    assessments of both the fit quality and the parameter uncertainties
    (see the two **health warnings** in Sec. 8.9); also, recognize the
    limited context in which scaling uncertainties is appropriate.
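
The quantitative $\chi^2$ test can be sketched as follows, taking $P(\chi_\mathrm{min}^2;\nu)$ to mean the probability of obtaining a value of $\chi^2$ at least as large as the one observed. The function `chi2_probability` is a hypothetical helper built on a simple series for the incomplete gamma function; it is adequate for moderate values but loses precision when the probability is extremely small. In practice a library routine such as `scipy.stats.chi2.sf` would normally be used instead. The fit results in the example are invented.

```python
import math

def chi2_probability(chi2, dof):
    """Probability of obtaining a chi-squared at least as large as `chi2`
    for `dof` degrees of freedom (upper tail of the distribution)."""
    a = 0.5 * dof
    x = 0.5 * chi2
    if x <= 0.0:
        return 1.0
    # Series for the regularized lower incomplete gamma function P(a, x).
    term = 1.0 / a
    total = term
    n = 0
    while abs(term) > 1e-15 * abs(total):
        n += 1
        term *= x / (a + n)
        total += term
    lower = total * math.exp(-x + a * math.log(x) - math.lgamma(a))
    return max(0.0, 1.0 - lower)

# Hypothetical fit: 12 data points, 2 fitted parameters.
n_points, n_params = 12, 2
chi2_min = 11.3
dof = n_points - n_params           # degrees of freedom, nu = N - N_params
reduced_chi2 = chi2_min / dof       # qualitative check: should be near 1
p_value = chi2_probability(chi2_min, dof)
print(f"nu = {dof}, reduced chi2 = {reduced_chi2:.2f}, P = {p_value:.2f}")
```

A probability close to zero suggests that the model or the stated uncertainties are wrong, while a probability very close to one suggests that the uncertainties have been overestimated.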

## Topics for further study

1.  Be able to describe a few alternatives to the conventional
    least-squares method for curve fitting, and discuss situations for
    which they may be useful; for example,
    1.  Eqs. (9.1) and (9.2) for a fit to a straight line when there are
        uncertainties in both variables;
    2.  Eq. (9.3) for a fit to a more general function when there are
        uncertainties in both variables; and
    3.  Eq. (9.6) for a fit that optimizes the orthogonal distances
        instead of the differences in the dependent variable.
2.  Recognize that simulated annealing and genetic algorithms are
    techniques for finding global minima, and can be useful for complex
    minimization problems.
3.  Recognize the value of using computer simulations to evaluate
    statistical hypotheses; in particular,
    1.  Monte Carlo simulations for estimating statistical quantities
        from a known model; and
    2.  bootstrap methods for estimating statistical quantities directly
        from data, without the need for a model.
4.  Recognize that the terms *frequentist* and *Bayesian* refer to
    different approaches to statistical inference problems, and that
    both approaches involve the principle of maximum likelihood (see
    Sec. 5.3).
5.  Be aware of the [Guide to the Expression of Uncertainty in
    Measurement](https://www.bipm.org/en/publications/guides/gum.html),
    or GUM. Note that a significant revision of the GUM is under way:
    the current version classifies uncertainties into two categories,
    "Type A" and "Type B", and uses a frequentist approach for Type A
    and a Bayesian approach for Type B. The revised version will use a
    Bayesian framework for everything.
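
The two simulation techniques named in this list can be sketched together. A Monte Carlo step generates data from a known model, and a bootstrap step then estimates the standard error of a statistic by resampling those data with replacement. The function name, the Gaussian model, the sample size, and the number of resamples are all assumptions chosen for illustration.

```python
import math
import random
import statistics

def bootstrap_standard_error(data, estimator, n_resamples=2000, seed=0):
    """Estimate the standard error of `estimator(data)` by resampling
    the data with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(data, k=len(data))
        estimates.append(estimator(resample))
    return statistics.stdev(estimates)

# Monte Carlo step: simulate hypothetical measurements from a known model,
# here a Gaussian with mean 10 and standard deviation 2.
rng = random.Random(1)
data = [rng.gauss(10.0, 2.0) for _ in range(50)]

# Bootstrap step: estimate uncertainties from the data alone.
boot_err_mean = bootstrap_standard_error(data, statistics.mean)
boot_err_median = bootstrap_standard_error(data, statistics.median)
analytic_err_mean = statistics.stdev(data) / math.sqrt(len(data))
print(f"standard error of mean: bootstrap {boot_err_mean:.3f}, "
      f"analytic {analytic_err_mean:.3f}")
print(f"standard error of median (bootstrap only): {boot_err_median:.3f}")
```

For the mean, the bootstrap estimate can be compared with the familiar analytic standard error. For the median there is no equally simple formula, which is the kind of situation in which resampling is most useful.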

## Exercises

[Selected solutions to exercises in Hughes and
Hase](https://wiki.its.sfu.ca/departments/phys-students/index.php/Selected_solutions_to_exercises_in_Hughes_and_Hase)