Commit

Add empty directories with README files, update docs
bfhealy committed Apr 10, 2024
1 parent 0f88432 commit 9bc50c0
Showing 15 changed files with 17 additions and 3 deletions.
6 changes: 3 additions & 3 deletions doc/scripts.md
Original file line number Diff line number Diff line change
@@ -4,9 +4,9 @@ The `hpc_files` directory in the `scope-ml` repository contains scripts, files a

Note that data files are not included in the `hpc_files` directory. The main files necessary to run the scripts detailed below are listed here and available on [Zenodo](https://zenodo.org/doi/10.5281/zenodo.8410825):
- `trained_models_dnn` and `trained_models_xgb`: download on Zenodo, unzip, and place directories into `models_dnn` and `models_xgb` directories, respectively
-- `training_set.parquet`: download on Zenodo and place into a directory called `fritzDownload`
+- `training_set.parquet`: download on Zenodo and place into the directory called `fritzDownload`

-Note also that most included scripts and directories can also be generated from scratch using the following SCoPe scripts: `train-algorithm-slurm`, `generate-features-slurm`, `run-inference-slurm`, and `combine-preds-slurm`. The directories generated by these scripts generally are populated with two subdirectories: `logs` to contain slurm logs, and `slurm` to contain slurm scripts. **Since GitHub does not track empty directories, `logs` will have to be created (on the same level as `slurm`) in each of the example HPC directories.**
+Note also that most included scripts and directories can also be generated from scratch using the following SCoPe scripts: `train-algorithm-slurm`, `generate-features-slurm`, `run-inference-slurm`, and `combine-preds-slurm`. The directories generated by these scripts generally are populated with two subdirectories: `logs` to contain slurm logs, and `slurm` to contain slurm scripts.

## Configuration

@@ -30,7 +30,7 @@ These two directories are generated when running `train-algorithm-slurm`. The `s
### Output: trained models in `models_dnn` and `models_xgb`
Trained models are saved in these two directories. The `--group` name passed to the training code will determine the subdirectory where the models are saved. Within this, each classifier gets its own subdirectory that includes the model files, diagnostic plots, and feature importance data (XGB only).

-**To run inference with the latest trained models, download `trained_dnn_models.zip` and `trained_xgb_models.zip` from Zenodo and unzip them within a corresponding `models_dnn` or `models_xgb` directory.**
+**To run inference with the latest trained models, download `trained_dnn_models.zip` and `trained_xgb_models.zip` from Zenodo and unzip them within the corresponding `models_dnn` or `models_xgb` directory.**

## Generating Features

1 change: 1 addition & 0 deletions hpc_files/combine_preds/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/dnn_inference/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/dnn_training/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/fritzDownload/README
@@ -0,0 +1 @@
Place training_set.parquet (from Zenodo) here
1 change: 1 addition & 0 deletions hpc_files/generated_features_GCN_sources/fg_sources/README
@@ -0,0 +1 @@
ZTF ID lists from gcn_cronjob.py go here
1 change: 1 addition & 0 deletions hpc_files/generated_features_GCN_sources/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/generated_features_delta/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/generated_features_new/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/generated_features_underMS/logs/README
@@ -0,0 +1 @@
Slurm logs go here
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
Batched files containing ZTF IDs (from data wrangling notebook) go here
1 change: 1 addition & 0 deletions hpc_files/models_dnn/README
@@ -0,0 +1 @@
Place unzipped trained_dnn_models directory (from Zenodo) here
1 change: 1 addition & 0 deletions hpc_files/models_xgb/README
@@ -0,0 +1 @@
Place unzipped trained_xgb_models directory (from Zenodo) here
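
The two model READMEs above ask for the unzipped Zenodo archives to be placed in `models_dnn` and `models_xgb`. A minimal Python sketch of that step follows, assuming the archives have already been downloaded locally. The function name `unpack_models` and the `downloads` location are hypothetical; the archive names are the ones given in `doc/scripts.md`.

```python
import zipfile
from pathlib import Path


def unpack_models(archive: Path, target_dir: Path) -> list[str]:
    """Extract a downloaded model archive into target_dir and return its member names."""
    target_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target_dir)
        return zf.namelist()


if __name__ == "__main__":
    downloads = Path("downloads")  # hypothetical location of the Zenodo downloads
    hpc_files = Path("hpc_files")
    for archive_name, target in [
        ("trained_dnn_models.zip", "models_dnn"),
        ("trained_xgb_models.zip", "models_xgb"),
    ]:
        archive = downloads / archive_name
        if archive.exists():
            unpack_models(archive, hpc_files / target)
        else:
            print(f"{archive} not found; download it from Zenodo first")
```
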
1 change: 1 addition & 0 deletions hpc_files/xgb_inference/logs/README
@@ -0,0 +1 @@
Slurm logs go here
1 change: 1 addition & 0 deletions hpc_files/xgb_training/logs/README
@@ -0,0 +1 @@
Slurm logs go here
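
Git tracks files rather than directories, which is why this commit adds a one-line README to each otherwise empty directory instead of asking users to create `logs` by hand. The same idea can be sketched in Python as below. This is an illustration only, not necessarily how the READMEs in this commit were created, and the helper name `add_placeholder_readmes` is hypothetical.

```python
from pathlib import Path


def add_placeholder_readmes(root: Path, message: str = "Slurm logs go here") -> list[Path]:
    """Write a README into every empty directory under root; return the files created."""
    # Collect the empty directories first so that writing files does not affect the scan.
    empty_dirs = [d for d in root.rglob("*") if d.is_dir() and not any(d.iterdir())]
    created = []
    for directory in sorted(empty_dirs):
        readme = directory / "README"
        readme.write_text(message + "\n")
        created.append(readme)
    return created
```

Calling `add_placeholder_readmes(Path("hpc_files"))` would leave populated directories untouched and only add a README where a directory has no contents at all.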
