NLP

This repository contains a collection of tools for processing text data.

Facebook crawler

Crawls texts from the news feed of a specific site.

spelling correction

A statistical spelling corrector that uses a dictionary and word probabilities to correct misspellings.
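
The idea of combining a dictionary with probabilities can be sketched roughly as follows. This is a minimal illustration in the style of the well-known edit-distance approach, not necessarily what `spelling.py` does: the function names (`build_counts`, `edits1`, `correct`) and the restriction to the letters a-z are assumptions made for the example, and German text would also need umlauts in the alphabet.

```python
import re
from collections import Counter

# Assumption: lower-case ASCII alphabet only; extend for other languages.
LETTERS = "abcdefghijklmnopqrstuvwxyz"


def build_counts(corpus):
    """Count word frequencies; the counts act as both dictionary and probability estimate."""
    return Counter(re.findall(r"[a-z]+", corpus.lower()))


def edits1(word):
    """Return all strings that are one edit (delete, swap, replace, insert) away from word."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [left + right[1] + right[0] + right[2:]
                  for left, right in splits if len(right) > 1]
    replaces = [left + c + right[1:] for left, right in splits if right for c in LETTERS]
    inserts = [left + c + right for left, right in splits for c in LETTERS]
    return set(deletes + transposes + replaces + inserts)


def correct(word, counts):
    """Return the most frequent known word within one edit, or the word itself."""
    if word in counts:
        return word
    candidates = [w for w in edits1(word) if w in counts]
    if not candidates:
        return word  # nothing known nearby, leave the word unchanged
    return max(candidates, key=counts.get)


counts = build_counts("the spelling of a word is corrected with spelling probability")
print(correct("speling", counts))
```

A word that is already in the dictionary is returned unchanged, and a word with no known neighbour within one edit is left as it is rather than guessed at.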

wiki_word2vec_domain

Creates word embeddings by training word2vec on a Wikipedia dump and lets you continue training the model on domain-specific texts.

Topic Model (LDA)

Uses stopword removal and bigrams to reduce dimensionality, then extracts topics from text and visualises them.
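
The preprocessing step before topic extraction might look something like the sketch below. It only covers stopword removal and bigram merging, not the LDA model itself. The stopword list, the `min_count` threshold and all function names are illustrative assumptions, not taken from the notebook.

```python
from collections import Counter

# Assumption: a tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = {"the", "a", "of", "and", "is", "in", "to"}


def remove_stopwords(tokens, stopwords=STOPWORDS):
    """Drop tokens that carry little topical information."""
    return [t for t in tokens if t not in stopwords]


def frequent_bigrams(docs, min_count=2):
    """Collect adjacent token pairs that occur at least min_count times in the corpus."""
    counts = Counter()
    for doc in docs:
        counts.update(zip(doc, doc[1:]))
    return {pair for pair, n in counts.items() if n >= min_count}


def merge_bigrams(doc, bigrams):
    """Replace each frequent pair with a single joined token such as 'neural_network'."""
    merged, i = [], 0
    while i < len(doc):
        if i + 1 < len(doc) and (doc[i], doc[i + 1]) in bigrams:
            merged.append(doc[i] + "_" + doc[i + 1])
            i += 2
        else:
            merged.append(doc[i])
            i += 1
    return merged


def preprocess(raw_docs, min_count=2):
    """Lower-case and split on whitespace, remove stopwords, then merge frequent bigrams."""
    docs = [remove_stopwords(text.lower().split()) for text in raw_docs]
    bigrams = frequent_bigrams(docs, min_count)
    return [merge_bigrams(doc, bigrams) for doc in docs]


docs = preprocess([
    "the neural network is deep",
    "a neural network learns",
    "deep learning of the brain",
])
print(docs)
```

The resulting token lists could then be turned into a bag-of-words corpus and passed to whichever LDA implementation is used.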

bokeh_cloud

Interactive visualisation of categorical texts.

Sentiment RNN

Uses a recurrent neural network to learn and predict sentiment from Twitter data.

Groschenromangenerator RNN

Takes German romantic short stories and generates new ones using a recurrent neural network.

Files (19 commits)

GPT_2_Text_Generation_AI_Ethics.ipynb
Groschenromangenerator.ipynb
README.md
SDG_SetFit.ipynb
Sentiment Twitter RNN.ipynb
Topic Model (LDA).ipynb
bokeh_cloud.py
facebook_crawler.ipynb
policy_qa_haystack.ipynb
sdg_bert_classification_domain_adaptation.ipynb
spelling correction.ipynb
spelling.py
w2v_ft_train.ipynb
wiki_word2vec_domain.ipynb

ELehmann91/NLP
