MiniRNN

Next-character prediction generator based on Andrej Karpathy's Mini char RNN. This implementation follows Andrej's NumPy version but is implemented entirely in PyTorch. It contains a from-scratch RNN built on PyTorch tensors, with the goal of adding more advanced models already available in torch.nn.

Update:

A PyTorch LSTM architecture is now available for parallel training. Model complexity can be increased by adding more layers and by increasing the hidden layer size.
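
As a rough illustration of what "more layers" and a larger hidden size mean for an LSTM built on torch.nn, here is a minimal sketch. The class name CharLSTM and the parameters vocab_size, hidden_size and num_layers are assumptions made for this example and are not necessarily the names used in this repository's models.

```python
import torch
import torch.nn as nn


class CharLSTM(nn.Module):
    """Hypothetical character-level LSTM; not the repository's actual model class."""

    def __init__(self, vocab_size, hidden_size=128, num_layers=2):
        super().__init__()
        # Map character indices to dense vectors.
        self.embed = nn.Embedding(vocab_size, hidden_size)
        # num_layers stacks LSTM layers; hidden_size sets the width of each layer.
        self.lstm = nn.LSTM(hidden_size, hidden_size,
                            num_layers=num_layers, batch_first=True)
        # Project each hidden state to scores over the vocabulary.
        self.head = nn.Linear(hidden_size, vocab_size)

    def forward(self, x, state=None):
        out, state = self.lstm(self.embed(x), state)
        return self.head(out), state


# A batch of 8 chunks, each 25 characters long, drawn from a 65-character vocabulary.
model = CharLSTM(vocab_size=65, hidden_size=64, num_layers=3)
batch = torch.randint(0, 65, (8, 25))
logits, (h_n, c_n) = model(batch)
```

Here logits holds one score vector per character position in every chunk, and h_n holds the final hidden state of each stacked layer.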

Differences

The biggest difference from Andrej's implementation is parallelism. Since RNNs are slow to train because of their sequential nature, parallelism is implemented in the following way:

The text is divided into chunks, each containing sequence_length characters
The RNN is trained in parallel on each chunk separately
This makes training several times faster, but it also treats the chunks as completely independent (no context is shared from one chunk to the next), which can slightly hurt training performance.
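
The chunking step described above can be sketched in plain Python. This is only an illustration under stated assumptions: the function name split_into_chunks and the choice of non-overlapping chunks are hypothetical and may differ from what train.py actually does.

```python
def split_into_chunks(text, sequence_length):
    """Split text into independent (input, target) chunks of character indices.

    Hypothetical helper for illustration; not taken from the repository.
    """
    vocab = sorted(set(text))
    char_to_ix = {ch: i for i, ch in enumerate(vocab)}
    encoded = [char_to_ix[ch] for ch in text]

    inputs, targets = [], []
    # Step by sequence_length so chunks do not overlap. The range stops early
    # enough that every chunk has a next character available as its last target.
    for start in range(0, len(encoded) - sequence_length, sequence_length):
        inputs.append(encoded[start:start + sequence_length])
        # The target is the input shifted by one character (next-character prediction).
        targets.append(encoded[start + 1:start + sequence_length + 1])
    return inputs, targets, vocab


inputs, targets, vocab = split_into_chunks("hello world, hello rnn", sequence_length=5)
```

Each row of inputs is one chunk, so the rows can be stacked into a single batch and processed in one forward pass, which is where the speed-up comes from.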

Usage

Run train.py from the command line, or import train.run and call it as a function. Play around with the arguments to find the best trade-off between speed and performance.
