Skip to content
This repository has been archived by the owner on Apr 27, 2018. It is now read-only.

Automatic topic issues classifier #151

Open
lintool opened this issue Oct 30, 2015 · 1 comment
Open

Automatic topic issues classifier #151

lintool opened this issue Oct 30, 2015 · 1 comment
Labels

Comments

@lintool
Copy link
Owner

lintool commented Oct 30, 2015

@ianmilligan1 Do you know about this? http://www.policyagendas.org/page/topic-codebook

Do you think it'd be useful to build a topic classifier and integrate it in Warcbase?

The classifier would take in a blob or text and spit out a label (issue area in the codebook above), which you could then filter and group by. So, for example, you might compare (via a word cloud?), the platforms of different parties on defense, over time.

The codebook is a bit US-centric though, but dunno if it matters...

@ianmilligan1
Copy link
Collaborator

This looks great - I think this'd be useful, and these topics look like they'd complement some of our collections really well: the CPP collection we've got at http://webarchives.ca, and then the Canadian/American collections we'll have available for the hackathon. I don't think the US-centric nature is that big a deal, tho we can take a closer look.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants