Create a synonym word list for internet slangs #12

Open
Aniq55 opened this issue Feb 1, 2018 · 3 comments
Comments

@Aniq55
Member

Aniq55 commented Feb 1, 2018

Slang terms like

  • fam
  • lit
  • lol
  • lmao

etc. do not exist in the nltk library.
We need a list of such words and words similar to them. For example, if someone searches for "laugh", they should get the meme that has "lol" written on it.
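
One way to picture such a list is a small reverse index from plain words to slang terms. This is only a sketch: the `SLANG_SYNONYMS` entries and the function names are made up for illustration and are not taken from the project.

```python
from collections import defaultdict

# Hypothetical, hand-written mapping: slang term -> plain words it should match.
# The entries are illustrative only, not a curated list.
SLANG_SYNONYMS = {
    "lol": ["laugh", "laughing", "funny"],
    "lmao": ["laugh", "laughing"],
    "fam": ["family", "friend"],
    "lit": ["exciting", "amazing"],
}


def build_reverse_index(slang_synonyms):
    """Invert the mapping so a plain word points at the slang terms tied to it."""
    index = defaultdict(set)
    for slang, words in slang_synonyms.items():
        for word in words:
            index[word].add(slang)
    return index


def expand_query(query, index):
    """Return the normalized query word followed by any slang terms mapped to it."""
    word = query.lower().strip()
    return [word] + sorted(index.get(word, ()))


index = build_reverse_index(SLANG_SYNONYMS)
print(expand_query("laugh", index))
```

With this shape, a search for "laugh" would also try "lmao" and "lol" against the meme text.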
@soumam007
Contributor

http://onlineslangdictionary.com/meaning-definition-of/<internet_slang_you_want_to_search>

I wish I knew of some JSON data for this, but the useful thing I found about this site is that the full forms all appear in the same format, so they can be scraped.
We have to scrape the words between the double quotes (" ").

Check these for a demonstration:

http://onlineslangdictionary.com/meaning-definition-of/rofl
http://onlineslangdictionary.com/meaning-definition-of/lmao
http://onlineslangdictionary.com/meaning-definition-of/lol
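
A rough sketch of the "take what is between the quotes" step, without the network part. The `sample` string is a made-up stand-in, not copied from the site, and the real page markup may need a different pattern. `definition_url` and `extract_quoted` are hypothetical helper names.

```python
import re

BASE_URL = "http://onlineslangdictionary.com/meaning-definition-of/"


def definition_url(slang):
    """Build the lookup URL for a slang term, following the URL pattern above."""
    return BASE_URL + slang.lower()


def extract_quoted(text):
    """Return the first phrase between straight or curly double quotes, or None."""
    match = re.search(r'["\u201c]([^"\u201d]+)["\u201d]', text)
    return match.group(1).strip() if match else None


# Hypothetical text, assumed to resemble what the definition page contains.
sample = 'acronym for "rolling on the floor laughing".'
print(definition_url("rofl"))
print(extract_quoted(sample))
```

Fetching the page itself (for example with `urllib.request`) is left out here, since the exact HTML around the quoted phrase is an assumption.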

Now, if the length of the 'keywords' list is 0, we can use the above method to try getting keywords, since wordnet may not have 'lmao' but it must have 'laughing', 'my', 'ass', 'off' as individual words.
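
The fallback could look roughly like this. Both `lookup_keywords` and `expand_slang` are hypothetical callables passed in from outside; in the project they would presumably wrap the wordnet lookup and the scraper, but here they are replaced by small stand-ins so the sketch runs on its own.

```python
def keywords_with_fallback(term, lookup_keywords, expand_slang):
    """Look the term up directly; if nothing comes back, expand it to its
    full form and look up each word of the full form instead."""
    keywords = list(lookup_keywords(term))
    if len(keywords) == 0:
        full_form = expand_slang(term)
        if full_form:
            for word in full_form.split():
                # Keep the bare word if the lookup knows nothing about it.
                keywords.extend(lookup_keywords(word) or [word])
    return keywords


# Stand-ins for illustration only (not the real wordnet or scraper).
KNOWN_WORDS = {"laughing", "my", "ass", "off"}
FULL_FORMS = {"lmao": "laughing my ass off"}


def fake_lookup(word):
    return [word] if word in KNOWN_WORDS else []


def fake_expand(term):
    return FULL_FORMS.get(term)


print(keywords_with_fallback("lmao", fake_lookup, fake_expand))
```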

This is my school of thought.

@soumam007
Contributor

screenshot from 2018-02-06 14-09-26
screenshot from 2018-02-06 14-09-57
screenshot from 2018-02-06 14-10-30
screenshot from 2018-02-06 14-13-04

So I did my part of the work, i.e. fetching the full form. The list of words that you can see in the console is basically the list of keywords on the basis of which the meme is searched.

Since the search algorithm is not up to that level yet, maybe only one or two words match:
for rofl, the words "on the floor" are in the image
for lmao, the words "my ass" are in the image
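
To make the partial matching concrete, here is a small sketch that counts how many keywords show up in a meme's text. It is not the project's actual search code; `match_score` and the example meme text are assumptions for illustration.

```python
def match_score(keywords, meme_text):
    """Count how many distinct keywords appear as whole words in the meme text."""
    meme_words = set(meme_text.lower().split())
    unique_keywords = {kw.lower() for kw in keywords}
    return sum(1 for kw in unique_keywords if kw in meme_words)


# Hypothetical meme text; keywords are the words of the full form of "rofl".
rofl_keywords = ["rolling", "on", "the", "floor", "laughing"]
print(match_score(rofl_keywords, "Cat on the floor"))
```

Sorting memes by this score would put the ones sharing the most words with the expanded slang first, even when the full phrase is not present.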

@Aniq55
Member Author

Aniq55 commented Feb 6, 2018

Looks good, send the PR

@Aniq55 Aniq55 removed the CMoC label Feb 6, 2018
@Aniq55 Aniq55 changed the title Create a synonym word list for internet slags Create a synonym word list for internet slangs Feb 23, 2018