Partitions

The goal is to divide N projects up into K clusters (based on their shared typical words) so that the the minimum number of shared words between projects in different clusters is maximized. K is calculated using the following formula: K = Math.floor(Math.sqrt(N)).

Notes:

Partitions will relocate projects (already placed in a cluster) into other clusters as needed.
Unclustered projects will remain singleton clusters.

Running Partitions

To run it (logging enabled -- printed to screen)


$ ./vip p -f path/to/corpus.json -t path/to/out-folder -v

To run it (logging enabled -- printed to file named projects.json)


$ ./vip p -f path/to/corpus.json -t path/to/out-folder -v -o projects.json

Output

Please take a look at the projects.json file. This file is an example of a json file produced by the 'Partitions' project.

Additional Resources

Partitions also supports the following algorithms:

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
lib		lib
project		project
src/main		src/main
.gitignore		.gitignore
README.md		README.md
big.txt		big.txt
build.sbt		build.sbt
output.txt		output.txt
partitions.jar		partitions.jar
projects.json		projects.json
vip		vip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Partitions

Running Partitions

Output

Additional Resources

About

Releases

Packages

Languages

aas-integration/partitions

Folders and files

Latest commit

History

Repository files navigation

Partitions

Running Partitions

Output

Additional Resources

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages