Extraneous Information in repolist.yml #19

frobnitzem · 2021-07-09T16:02:36Z

Reposcanner works one repository at a time, so its "currently scanned repository" and output files should be named after the repository URL. Right now, project names and IDs are stored inside ManagerRoutineTask, which must do a lot of work to get nameOrID and repoNameOrURL.

These names and IDs are bad for reproducibility, since they depend on how we categorize the repository rather than on the repository itself.

To fix this, repolist.yml should contain only a list of URLs. The prepareTasks function in manager.py should then just collect all the URLs to scan.
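A rough sketch of what this could look like, assuming the URL list has already been read from repolist.yml into a plain Python list. The names `output_stem_for` and `collect_urls` are hypothetical and are not part of Reposcanner's current code; the naming scheme (host plus path joined by underscores) is just one possible choice.

```python
from typing import List
from urllib.parse import urlparse


def output_stem_for(url: str) -> str:
    """Derive an output-file stem from the repository URL alone (hypothetical helper)."""
    parsed = urlparse(url)
    path = parsed.path.strip("/")
    if path.endswith(".git"):
        path = path[: -len(".git")]
    # Join host and path segments so the same URL always yields the same name.
    return "_".join([parsed.netloc] + path.split("/"))


def collect_urls(repolist: List[str]) -> List[str]:
    """Return the URLs to scan, de-duplicated, in first-seen order (hypothetical helper)."""
    seen = set()
    urls = []
    for url in repolist:
        url = url.strip()
        if url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

With something like this, the name of every output file depends only on the URL, so two people scanning the same list would get identically named results regardless of how they categorize the projects.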
