nlp document classfication datasets
three small datasets: places vs orgs
people vs ogrs
places vs people
letter_dict is a a doctionary for the selected words in all documents.
dict_int is the map between key storeds in text data files and each word in letter_dict.