Alert: This list is free of crash courses and shorcuts!
This is a curated list of books and MOOCs for anybody wants to have solid knowledge in AI and Data Science in 2024 seriously without shortcuts.
I assume that you have successfully passed at least undergraduate-level basic courses in computer science (e.g. have a Bachelor’s degree in CS) to go with the following materials smoothly.
Note: I spent some time working in data analysis recently so expect a lot of statistics content.
Scientific Computing
Stats/Math
Allen Downey’s books introduce statistics for programmers. I found them good in applying the theories of stats.
- Practical Stats for Data Scientists by Bruce & Bruce
- Think Stats by Allen Downey
- Think Bayes by Allen Downey
- Statistical Rethinking by Richard McElreath: Advanced course in Bayesian data analysis, causality, and statistical modelling. It is taught by a non-mathematician in an intuitive way, explaining advanced and interdependent topics with a lot of analogies. The author offers the course in two formats, a book and lectures.
- Causal inference in statistics By Judea Pearl
- All of Statistics: Comprehensive reference
- Mathematics for Machine Learning: Assuming you have studied college calculus and algebra, this book is a great compilation for the math you need for ML
- Bayesian Data Analysis course at Aalto (CS-E5710) by Aki Vehtari: Advanced material
Machine Learning
- Andrew Ng’s ML Specialization on Coursera: Your baby steps in ML
- An Introduction to Statistical Learning: Essential ML book. Full of examples and applications.
- Stanford CS229: Good lectures on the theoretical ML
- Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Practical approach to learn ML. Good after having the theory.
- The Elements of Statistical Learning: Advanced material
Deep Learning
- Understanding Deep Learning: I’m lucky to be in 2024 and read this book! It is a good shelter in the concurrent deep learning storms!
NLP/LLMs
- Introduction to Information Retrieval: Good starting point in NLP. This classic is authored by Christopher Manning et al.
- CMU Advanced NLP 2024 by Graham Neubig: Lectures are on YouTube and exercises on Github
- Andrej Karpathy Tutorials for building/hacking LLMs from scratch!
- Speech and Language Processing by Jurafsky: Comprehensive theoretical reference
Computer Vision
- Deep Learning for Vision Systems by Mohamed Elgendy
Reinforcement Learning
- Reinforcement Learning: An Introduction by Sutton and Barto: Essential textbook. Can be used as a theoretical reference
- Reinforcement learning by Phil Winder: supplemental material contains helpful code. Can be a good starting point, balanced between theory and practice.
- Deep Reinforcement Learning (CS 285) at UC Berkeley: Intensive course by a top voice in RL (lectures available on YouTube)
MLOps - SWE for Data Intensive Apps
Misc.
- Storytelling with Data: A Data Visualization Guide for Business Professionals
- Storytelling with Data: Let’s Practice
- MIT 14.310x Data Analysis for Social Scientists