Skip to content

Latest commit

 

History

History
109 lines (104 loc) · 6.32 KB

list_of_questions_mathematics.md

File metadata and controls

109 lines (104 loc) · 6.32 KB

Mathematics - List of questions

Linear Algebra

  1. What is broadcasting in connection to Linear Algebra?
  2. What are scalars, vectors, matrices, and tensors?
  3. What is Hadamard product of two matrices?
  4. What is an inverse matrix?
  5. If inverse of a matrix exists, how to calculate it?
  6. What is the determinant of a square matrix? How is it calculated (Laplace expansion)? What is the connection of determinant to eigenvalues?
  7. Discuss span and linear dependence.
  8. What is Ax = b? When does Ax =b has a unique solution?
  9. In Ax = b, what happens when A is fat or tall?
  10. When does inverse of A exist?
  11. What is a norm? What is L1, L2 and L infinity norm?
  12. What are the conditions a norm has to satisfy?
  13. Why is squared of L2 norm preferred in ML than just L2 norm?
  14. When L1 norm is preferred over L2 norm?
  15. Can the number of nonzero elements in a vector be defined as L0 norm? If no, why?
  16. What is Frobenius norm?
  17. What is a diagonal matrix? (D_i,j = 0 for i != 0)
  18. Why is multiplication by diagonal matrix computationally cheap? How is the multiplication different for square vs. non-square diagonal matrix?
  19. At what conditions does the inverse of a diagonal matrix exist? (square and all diagonal elements non-zero)
  20. What is a symmetrix matrix? (same as its transpose)
  21. What is a unit vector?
  22. When are two vectors x and y orthogonal? (x.T * y = 0)
  23. At R^n what is the maximum possible number of orthogonal vectors with non-zero norm?
  24. When are two vectors x and y orthonormal? (x.T * y = 0 and both have unit norm)
  25. What is an orthogonal matrix? Why is computationally preferred? (a square matrix whose rows are mutually orthonormal and columns are mutually orthonormal.)
  26. What is eigendecomposition, eigenvectors and eigenvalues?
  27. How to find eigen values of a matrix?
  28. Write the eigendecomposition formula for a matrix. If the matrix is real symmetric, how will this change?
  29. Is the eigendecomposition guaranteed to be unique? If not, then how do we represent it?
  30. What are positive definite, negative definite, positive semi definite and negative semi definite matrices?
  31. What is SVD? Why do we use it? Why not just use ED?
  32. Given a matrix A, how will you calculate its SVD?
  33. What are singular values, left singulars and right singulars?
  34. What is the connection of SVD of A with functions of A?
  35. Why are singular values always non-negative?
  36. What is the Moore Penrose pseudo inverse and how to calculate it?
  37. If we do Moore Penrose pseudo inverse on Ax = b, what solution is provided is A is fat? Moreover, what solution is provided if A is tall?
  38. Which matrices can be decomposed by ED? (Any NxN square matrix with N linearly independent eigenvectors)
  39. Which matrices can be decomposed by SVD? (Any matrix; V is either conjugate transpose or normal transpose depending on whether A is complex or real)
  40. What is the trace of a matrix?
  41. How to write Frobenius norm of a matrix A in terms of trace?
  42. Why is trace of a multiplication of matrices invariant to cyclic permutations?
  43. What is the trace of a scalar?
  44. Write the frobenius norm of a matrix in terms of trace?

Numerical Optimization

  1. What is underflow and overflow?
  2. How to tackle the problem of underflow or overflow for softmax function or log softmax function?
  3. What is poor conditioning?
  4. What is the condition number?
  5. What are grad, div and curl?
  6. What are critical or stationary points in multi-dimensions?
  7. Why should you do gradient descent when you want to minimize a function?
  8. What is line search?
  9. What is hill climbing?
  10. What is a Jacobian matrix?
  11. What is curvature?
  12. What is a Hessian matrix?

Basics of Probability and Informaion Theory

  1. Compare "Frequentist probability" vs. "Bayesian probability"?
  2. What is a random variable?
  3. What is a probability distribution?
  4. What is a probability mass function?
  5. What is a probability density function?
  6. What is a joint probability distribution?
  7. What are the conditions for a function to be a probability mass function?
  8. What are the conditions for a function to be a probability density function?
  9. What is a marginal probability? Given the joint probability function, how will you calculate it?
  10. What is conditional probability? Given the joint probability function, how will you calculate it?
  11. State the Chain rule of conditional probabilities.
  12. What are the conditions for independence and conditional independence of two random variables?
  13. What are expectation, variance and covariance?
  14. Compare covariance and independence.
  15. What is the covariance for a vector of random variables?
  16. What is a Bernoulli distribution? Calculate the expectation and variance of a random variable that follows Bernoulli distribution?
  17. What is a multinoulli distribution?
  18. What is a normal distribution?
  19. Why is the normal distribution a default choice for a prior over a set of real numbers?
  20. What is the central limit theorem?
  21. What are exponential and Laplace distribution?
  22. What are Dirac distribution and Empirical distribution?
  23. What is mixture of distributions?
  24. Name two common examples of mixture of distributions? (Empirical and Gaussian Mixture)
  25. Is Gaussian mixture model a universal approximator of densities?
  26. Write the formulae for logistic and softplus function.
  27. Write the formulae for Bayes rule.
  28. What do you mean by measure zero and almost everywhere?
  29. If two random variables are related in a deterministic way, how are the PDFs related?
  30. Define self-information. What are its units?
  31. What are Shannon entropy and differential entropy?
  32. What is Kullback-Leibler (KL) divergence?
  33. Can KL divergence be used as a distance measure?
  34. Define cross-entropy.
  35. What are structured probabilistic models or graphical models?
  36. In the context of structured probabilistic models, what are directed and undirected models? How are they represented? What are cliques in undirected structured probabilistic models?

Confidence interval

  1. What is population mean and sample mean?
  2. What is population standard deviation and sample standard deviation?
  3. Why population s.d. has N degrees of freedom while sample s.d. has N-1 degrees of freedom? In other words, why 1/N inside root for pop. s.d. and 1/(N-1) inside root for sample s.d.? (Here)
  4. What is the formula for calculating the s.d. of the sample mean?
  5. What is confidence interval?
  6. What is standard error?