RAKE

A Python implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm as described in: Rose, S., Engel, D., Cramer, N., & Cowley, W. (2010). Automatic Keyword Extraction from Individual Documents. In M. W. Berry & J. Kogan (Eds.), Text Mining: Theory and Applications: John Wiley & Sons.

The source code is released under the MIT License.

Usage

Import rake and operator.

import rake
import operator

Initialize RAKE with a stopword list and keyword parameters.

rake_object = rake.Rake("SmartStoplist.txt", 5, 3, 4)

This creates a RAKE object that extracts keywords where:

Each word has at least 5 characters
Each phrase has at most 3 words
Each keyword appears in the text at least 4 times
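
To make the three thresholds concrete, here is a minimal sketch of how such filters could be applied to a candidate phrase. The helper name passes_filters and its parameter names are hypothetical and are not taken from rake.py; the frequency check is a naive substring count used only for illustration.

```python
def passes_filters(phrase, text, min_chars=5, max_words=3, min_freq=4):
    """Hypothetical helper: True if `phrase` meets all three thresholds."""
    words = phrase.lower().split()
    # Phrase length limit: at most `max_words` words.
    if not words or len(words) > max_words:
        return False
    # Word length limit: every word needs at least `min_chars` characters.
    if any(len(word) < min_chars for word in words):
        return False
    # Frequency limit: naive substring count (an illustrative assumption).
    return text.lower().count(phrase.lower()) >= min_freq


sample = "forest products " * 4 + "food security"
print(passes_filters("forest products", sample))
print(passes_filters("food security", sample))
```

With the values 5, 3, 4 used above, "forest products" is kept, while "food security" is rejected because "food" has only four characters.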

Store the text you want to process in a variable, process with RAKE, and print to the screen.

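# Read the sample document; the file is closed when the block exits.
with open("data/docs/fao_test/w2167e.txt", "r") as sample_file:
    text = sample_file.read()

# Extract (phrase, score) pairs and print them.
keywords = rake_object.run(text)
print("Keywords: %s" % keywords)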

You should get an output like this:

Keywords: [('household food security', 7.711414565826329), ('indigenous groups living', 7.4), ('national forest programmes', 7.249539170506913), ('wood forest products', 6.844777265745007)...
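
The usage imports operator without showing where it is used. One plausible use is ranking the (phrase, score) pairs, sketched below. The list reuses the four pairs from the sample output in shuffled order; the variable names ranked and top_two are illustrative.

```python
import operator

# (phrase, score) pairs copied from the sample output, in shuffled order.
keywords = [
    ("national forest programmes", 7.249539170506913),
    ("household food security", 7.711414565826329),
    ("wood forest products", 6.844777265745007),
    ("indigenous groups living", 7.4),
]

# Sort by score (tuple index 1), highest first, and keep the top two.
ranked = sorted(keywords, key=operator.itemgetter(1), reverse=True)
top_two = ranked[:2]

for phrase, score in top_two:
    print("%.2f  %s" % (score, phrase))
```

If rake_object.run already returns the pairs sorted by score, as the sample output suggests, the sort is redundant and only the slice is needed.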

This fork uses MySQLdb to read messages from a MySQL database, strips HTML tags and other debris from them, and writes the cleaned messages to a file for processing.
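
The cleaning step could look roughly like the sketch below. It is not the fork's actual code: it assumes Python 3, uses only the standard library's html.parser, and leaves out the database read, so the messages are passed in as plain strings. The names strip_tags and write_cleaned are hypothetical.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(message):
    """Return `message` without HTML tags and with whitespace collapsed."""
    parser = _TextExtractor()
    parser.feed(message)
    parser.close()
    # Join text nodes with a space so adjacent blocks do not run together,
    # then collapse any repeated whitespace.
    return " ".join(" ".join(parser.parts).split())


def write_cleaned(messages, path):
    """Write one cleaned message per line to `path`."""
    with open(path, "w", encoding="utf-8") as out:
        for message in messages:
            out.write(strip_tags(message) + "\n")


print(strip_tags("<p>Forest <b>products</b> &amp; food</p>"))
```

Joining text nodes with a space keeps words from neighbouring elements apart, which matters for keyword extraction, at the cost of splitting a word that is only partly wrapped in a tag.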
