Modern Data Scientists

Mathematics is very important in the field of data science as concepts within mathematics aid in identifying patterns and assist in creating algorithms. The understanding of various notions of Statistics and Probability Theory are key for the implementation of such algorithms in data science.

Beyond the basics of calculus, linear algebra, and probability, there is a certain kind of mathematical thinking that comes up pretty often when you’re trying to understand data. It involves quantifying something you want to measure, then understanding how the quantification works in mathematical terms. The interesting part is not usually doing the math, but figuring out what math to do.

Most of the mathematics required for Data Science lie within the realms of statistics and algebra,

Statistics, in particular, is at the very foundation of Data Science, and is the collection of tools which helps us separate significance from randomness. Algebra is quite often at the heart of the analysis itself. The further quantitative skills facilitate intuition, which is essential in analytics.

Data-scientist should have a knowldge about one or more of this topics :

  • Linear algebra
  • Discrete math
  • Differential equations
  • Theory of statistics
  • Numerical analysis : numerical linear algebra and quadrature
  • Abstract algebra
  • Number theory
  • Real analysis
  • Complex analysis
  • Intermediate analysis
  • Probability and Statistics
  • Linear Algebra
  • Matrix Theory
  • Calculus
  • Set theory

? Here are some of the Useful resources to improve your Math skills & Data Science Expertise-


1) The Elements of Statistical Learning(Springer Series)
2) Introduction to Linear Algebra by Gilbert Strang.
3) Naked Statistics by Charles Wheelan.
4) An Introduction to Statistical Learning: with Applications in R.
5) Pattern Recognition and Machine Learning by Christopher M. Bishop.
6) Pattern Classification ((A Wiley-Interscience publication).
7) Introduction to Statistical Learning
8) Introduction to Bayesian Statistics

Must Know Algorithms for Data Scientist

Principal Component Analysis(PCA)/SVD
Least Squares and Polynomial Fitting
Constrained Linear Regression 
K means Clustering
Logistic Regression
SVM (Support Vector Machines)
Feedforward Neural Networks
Convolutional Neural Networks (Convnets)
Recurrent Neural Networks (RNNs)
Conditional Random Fields (CRFs)
Decision Trees
TD Algorithms

Similar Posts