by Wojtek Czarnecki
Designed for educational purposes. Please do not distribute without permission.
Questions/Correspondence: [email protected]
- basic (vanilla RNN) implementation
- observing exploding/vanishing gradients (a minimal NumPy sketch of these two items follows this list)
- interpretability by plotting and analysing the activations of a network (see the plotting sketch below):
  - identifying interpretable neurons
  - identifying neuron-gate interactions
  - identifying hidden state dynamics through time
- training an LSTM on a character-level language modelling task
- comparing the training of an LSTM and an RNN, playing with architectures
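As a taste of the first two items, the sketch below (plain NumPy; all names and sizes are illustrative choices of mine, not the notebook's actual code) implements a single vanilla RNN step and then backpropagates a gradient through time, showing how its norm vanishes or explodes depending on the scale of the recurrent weights:

```python
import numpy as np

rng = np.random.default_rng(0)
n_input, n_hidden, T = 8, 64, 100

def rnn_step(x, h, W_xh, W_hh, b):
    # One vanilla RNN step: h_t = tanh(x_t W_xh + h_{t-1} W_hh + b)
    return np.tanh(x @ W_xh + h @ W_hh + b)

for scale in (0.5, 1.0, 2.0):  # rough spectral radius of W_hh
    W_xh = rng.normal(0.0, 0.1, (n_input, n_hidden))
    W_hh = rng.normal(0.0, scale / np.sqrt(n_hidden), (n_hidden, n_hidden))
    b = np.zeros(n_hidden)

    # Forward pass, keeping hidden states for backprop through time.
    hs = [np.zeros(n_hidden)]
    for _ in range(T):
        hs.append(rnn_step(rng.normal(size=n_input), hs[-1], W_xh, W_hh, b))

    # Push a unit gradient back from the last hidden state:
    # dL/dh_{t-1} = (dL/dh_t * (1 - h_t**2)) @ W_hh.T
    g = np.ones(n_hidden)
    for t in range(T, 0, -1):
        g = (g * (1.0 - hs[t] ** 2)) @ W_hh.T
    print(f"scale={scale}: |dL/dh_0| after {T} steps = {np.linalg.norm(g):.2e}")
```

With small recurrent weights the gradient shrinks geometrically towards zero, and with large ones it grows; this is exactly the behaviour section 2 asks you to observe in a trained network.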
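The plotting referred to above can be as simple as a heatmap of hidden activations over time. A minimal matplotlib sketch, with random placeholder data standing in for the states a trained network would produce:

```python
import numpy as np
import matplotlib.pyplot as plt

T, n_hidden = 80, 32
# Placeholder [time, units] activations; in the notebook these would come
# from running a trained network over an input sequence.
hidden_states = np.tanh(np.random.randn(T, n_hidden))

# Interpretable neurons often show up as rows that switch on/off in sync
# with input events (e.g. opening/closing quotes or brackets).
plt.imshow(hidden_states.T, aspect="auto", cmap="RdBu", vmin=-1, vmax=1)
plt.xlabel("time step")
plt.ylabel("hidden unit")
plt.colorbar(label="activation")
plt.show()
```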
The first three sections are almost independent, so one can switch between them without any code dependencies (apart from being unable to use the vanilla RNN in section 4 if it was not implemented in section 1).
Cells whose titles include "starting point" require filling in some code gaps; all remaining cells are complete (but feel free to play with them if you want!).
Please pay attention to the questions after each section. Answering them is crucial to making sure one understands the various modes of RNN operation.
The language model exercises are based on the Sonnet LSTM example.
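For orientation, here is a rough sketch of what such a model looks like when assembled from Sonnet pieces; it assumes the Sonnet 2 API (snt.LSTM, snt.Embed, snt.dynamic_unroll, snt.BatchApply) and illustrative sizes, and is not the example's actual code:

```python
import sonnet as snt
import tensorflow as tf

vocab_size, embed_dim, hidden_size = 128, 64, 256
seq_len, batch_size = 50, 32

embed = snt.Embed(vocab_size, embed_dim)   # character ids -> vectors
core = snt.LSTM(hidden_size)               # recurrent core
head = snt.Linear(vocab_size)              # logits over the next character

# Stand-in batch of character ids, time-major: [seq_len, batch].
chars = tf.random.uniform([seq_len, batch_size], maxval=vocab_size,
                          dtype=tf.int32)

state = core.initial_state(batch_size)
outputs, final_state = snt.dynamic_unroll(core, embed(chars), state)
logits = snt.BatchApply(head)(outputs)     # [seq_len, batch, vocab_size]
```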