We use usenet.txt to generate initial categories. The output is tumblrssUsenetMap.json. Later it's copied to resources (atm also renamed to usenetMap.json) This is where we can modify the categories.
Input usenetMap.json output usenetTumblrTree.json.
Modifies the usenetTumblrTree.json to a fixed depth tree. The output is threeLevelTree.json.
UsenetAddRreddit: threeLevelTree.json - > usenetTumblrReddit.json Consumes the original and the fixed depth json and adds reddit newssources. The output is usenetTumblrReddit.json.