skip to content
The Hundred-Page Machine Learning Book
User Tools
Register
Log In
Site Tools
Search
Tools
Edit this page
Old revisions
Backlinks
Recent Changes
Media Manager
Sitemap
Register
Log In
>
Recent Changes
Media Manager
Sitemap
recurrent_neural_network
Recurrent Neural Network
Recommended Reading
An extended version of Chapter 6 with RNN unfolding and bidirectional RNN
The Unreasonable Effectiveness of Recurrent Neural Networks
by Andrej Karpathy (2015)
Recurrent Neural Networks and LSTM
by Niklas Donges (2018)
Understanding LSTM Networks
by Christopher Olah (2015)
Introduction to RNNs
by Denny Britz (2015)
Implementing a RNN with Python, Numpy and Theano
by Denny Britz (2015)
Backpropagation Through Time and Vanishing Gradients
by Denny Britz (2015)
Implementing a GRU/LSTM RNN with Python and Theano
by Denny Britz (2015)
Simplified Minimal Gated Unit Variations for Recurrent Neural Networks
by Joel Heck and Fathi Salem (2017)
Attention Is All You Need
by Vaswani et al. (2017), a state-of-the-art sequence-to-sequence model, plus an
illustrated guide
plus an
annotated paper with code
.
Training Compute-Optimal Large Language Models
by Hoffmann et al. (2022), (the Chinchilla paper).
Understanding Large Language Models
by Sebastian Raschka.
Edit
Page Tools
Edit this page
Old revisions
Backlinks
Back to top