in this repository you will find different tools to process text data
crawls texts from the newsfeed of a specific site
a statistical spelling corrector which uses a dictionary and propability to correct misspellins
creats word embeddings using word2vec on wikipedia dump and allows you to further train the model on domain specific texts
uses stopwords and bigrams to reduce dimension extracts topics from text including visualisation
interactiv visualisation of categorial texts
uses a Recurrent neural network to learn and predict entiment from twitter data
takes german romantic short stories and generates new ones using a Recurrent neural network