Dive into Deep Learning¶
Dive into Deep Learning
An interactive deep learning book with code, math, and
discussions
Provides Deep Java Library(DJL) implementations
Adopted at 175 universities from 40 countries
Each section is an executable Jupyter notebook
You can modify the code and tune hyperparameters to get instant feedback to accumulate practical experiences in deep learning.
Mathematics + Figures + Code
We offer an interactive learning experience with mathematics, figures, code, text, and discussions, where concepts and techniques are illustrated and implemented with experiments on real data sets.
Active community support
Join DJL's slack channel to discuss the book!
D2L as a textbook or a reference book
Carnegie Mellon University
Duke University
Emory University
Fudan University
Gazi Üniversitesi
Georgia Institute of Technology
Habib University
Harbin Institute of Technology
Hasso-Plattner-Institut
Hangzhou Dianzi University
Hiroshima University
Hong Kong University of Science and Technology
Huazhong University of Science and Technology
Imperial College London
Indian Institute of Technology Bombay
Indian Institute of Technology Kanpur
Indian Institute of Technology Kharagpur
Indian Institute of Technology Mandi
Indian Institute of Technology Ropar
Indraprastha Institute of Information Technology, Delhi
Institut Supérieur De L'electronique Et Du Numérique
İstanbul Teknik Üniversitesi
King Abdullah University of Science and Technology
Kyungpook National University
Lancaster University
Massachusetts Institute of Technology
McGill University
Monash University
National Chung Hsing University
National Institute of Technical Teachers Training&Research
National Institute of Technology, Warangal
National Taiwan University
National United University
National University of Singapore
Nazarbayev University
Northeastern University
Ohio University
Peking University
Pontificia Universidad Católica de Chile
Portland State University
Rutgers, The State University of New Jersey
Sapienza Università di Roma
Shanghai Jiao Tong University
Shanghai University of Finance and Economics
Sogang University
Stanford University
Stevens Institute of Technology
Technische Universiteit Delft
Texas A&M University
The University of Texas at Austin
Tsinghua University
Universidad Carlos III de Madrid
Universidad Nacional de Colombia Sede Manizales
Universidade Federal de Minas Gerais
Università degli Studi di Brescia
Università degli Studi di Catania
Universität Heidelberg
Universitatea de Vest din Timișoara
University of Arkansas
University of California, Berkeley
University of California, Los Angeles
University of California, San Diego
University of California, Santa Barbara
University of Illinois at Urbana-Champaign
University of Maryland
University of Minnesota, Twin Cities
University of New Hampshire
University of North Carolina at Chapel Hill
University of Pennsylvania
University of Science and Technology of China
University of Technology Sydney
University of Washington
University of Waterloo
Univerzita Komenského v Bratislave
Vietnamese-German University
Yunnan University
Universitat Politècnica de Catalunya
Zhejiang University
Duke University
Emory University
Fudan University
Gazi Üniversitesi
Georgia Institute of Technology
Habib University
Harbin Institute of Technology
Hasso-Plattner-Institut
Hangzhou Dianzi University
Hiroshima University
Hong Kong University of Science and Technology
Huazhong University of Science and Technology
Imperial College London
Indian Institute of Technology Bombay
Indian Institute of Technology Kanpur
Indian Institute of Technology Kharagpur
Indian Institute of Technology Mandi
Indian Institute of Technology Ropar
Indraprastha Institute of Information Technology, Delhi
Institut Supérieur De L'electronique Et Du Numérique
İstanbul Teknik Üniversitesi
King Abdullah University of Science and Technology
Kyungpook National University
Lancaster University
Massachusetts Institute of Technology
McGill University
Monash University
National Chung Hsing University
National Institute of Technical Teachers Training&Research
National Institute of Technology, Warangal
National Taiwan University
National United University
National University of Singapore
Nazarbayev University
Northeastern University
Ohio University
Peking University
Pontificia Universidad Católica de Chile
Portland State University
Rutgers, The State University of New Jersey
Sapienza Università di Roma
Shanghai Jiao Tong University
Shanghai University of Finance and Economics
Sogang University
Stanford University
Stevens Institute of Technology
Technische Universiteit Delft
Texas A&M University
The University of Texas at Austin
Tsinghua University
Universidad Carlos III de Madrid
Universidad Nacional de Colombia Sede Manizales
Universidade Federal de Minas Gerais
Università degli Studi di Brescia
Università degli Studi di Catania
Universität Heidelberg
Universitatea de Vest din Timișoara
University of Arkansas
University of California, Berkeley
University of California, Los Angeles
University of California, San Diego
University of California, Santa Barbara
University of Illinois at Urbana-Champaign
University of Maryland
University of Minnesota, Twin Cities
University of New Hampshire
University of North Carolina at Chapel Hill
University of Pennsylvania
University of Science and Technology of China
University of Technology Sydney
University of Washington
University of Waterloo
Univerzita Komenského v Bratislave
Vietnamese-German University
Yunnan University
Universitat Politècnica de Catalunya
Zhejiang University
If you use D2L to teach (or plan to) and would like to receive a hardcopy, please contact us at d2lbook.en@gmail.com.
BibTeX entry for citing the book
@book{zhang2020dive,
title={Dive into Deep Learning},
author={Aston Zhang and Zachary C. Lipton and Mu Li and Alexander J. Smola},
note={\url{https://d2l.ai}},
year={2020}
}
Table of contents
- 1. Introduction
- 2. Preliminaries
- 3. Linear Neural Networks
- 4. Multilayer Perceptrons
- 4.1. Multilayer Perceptrons
- 4.2. Implementation of Multilayer Perceptron from Scratch
- 4.3. Concise Implementation of Multilayer Perceptron
- 4.4. Model Selection, Underfitting and Overfitting
- 4.5. Weight Decay
- 4.6. Dropout
- 4.7. Forward Propagation, Backward Propagation, and Computational Graphs
- 4.8. Numerical Stability and Initialization
- 4.9. Considering the Environment
- 5. Deep Learning Computation
- 6. Convolutional Neural Networks
- 7. Modern Convolutional Neural Networks
- 8. Recurrent Neural Networks
- 9. Modern Recurrent Neural Networks
- 10. Attention Mechanisms
- 11. Optimization Algorithms
- 12. Computational Performance
- 13. Computer Vision
- 14. Natural Language Processing: Pretraining