This is a framework for the Multir Pipeline that handles corpus representation, distant supervision, feature generation, model training, and extraction
Some of the source code comes from Raphael Hoffman's Multir project http://www.cs.washington.edu/ai/raphaelh/mr/index.html