My first undergraduate research project was one in which I implemented the KMP string matching algorithm to generate a large dataset for a combinatorial game theory problem, and it was also my first experience using HPC resources. The problem was trivially parallel, but not every thread was guaranteed to find a solution on a given run, and those that failed had to be run again. Manually handling this on every run became cumbersome, and the bulk of the project quickly became about writing a command line interface that made file management and cluster submission much more maintainable.
My advisor had me study and implement a string matching algorithm in C so that I could detect the length of repeating sequences in infinitely long strings. To detect the repeating sequence, I had to use an array that was at least as large as the sequence itself. The challenge was that the length of the sequence was indeterminate, so I had no way of knowing how long an array to use. The algorithm's execution time was proportional to the array size, so it was to my benefit to use the smallest array possible. In response, I had to get creative and implement a “guess and check” batching strategy to maintain a reasonable walltime: I would run the sequence detection with a small array and then re-run the cases that weren’t detected with a larger array. After struggling to manually queue runs with the right inputs several times a day, I took matters into my own hands and implemented a command line tool that managed data files and job submissions to the cluster. This was my first experience in both HPC and research, but I was able to use my programming and problem-solving skills to create a reproducible and scalable scientific dataset.
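The core of that detection can be sketched with the KMP failure function: for the first n characters of the sequence, the shortest candidate period is n minus the last failure value, and a repeat is only “detected” when that period divides n evenly. The snippet below is a minimal, hypothetical reconstruction of the idea, not the project's actual code; the example string, buffer handling, and doubling window sizes are stand-ins for illustration.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* KMP failure function: pi[i] is the length of the longest proper prefix
 * of s[0..i] that is also a suffix of it. The shortest candidate period
 * of the first n characters is then n - pi[n-1]. */
static size_t shortest_period(const char *s, size_t n)
{
    size_t *pi = malloc(n * sizeof *pi);
    size_t period;

    pi[0] = 0;
    for (size_t i = 1; i < n; i++) {
        size_t k = pi[i - 1];
        while (k > 0 && s[i] != s[k])
            k = pi[k - 1];
        if (s[i] == s[k])
            k++;
        pi[i] = k;
    }
    period = n - pi[n - 1];
    free(pi);
    return period;
}

int main(void)
{
    /* Stand-in for the (conceptually infinite) sequence; in the real
     * project this would come from the game-theory computation. */
    const char *seq = "abcdeabcdeabcdeabcdeabcde";
    size_t total = strlen(seq);

    /* "Guess and check": start with a small window and enlarge it until
     * a whole number of repeats fits, mirroring the re-run batching. */
    size_t n = 4;
    for (;;) {
        if (n > total)
            n = total;
        size_t p = shortest_period(seq, n);
        if (p < n && n % p == 0) {
            printf("window %zu: detected period %zu\n", n, p);
            return 0;
        }
        printf("window %zu: no full period, retrying with a larger array\n", n);
        if (n == total)
            break;
        n *= 2;
    }
    return 1;
}
```

The doubling window mirrors the batching strategy described above: a failed detection at one array size simply becomes a job resubmitted with a larger one.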