Fix structuring of CONTRIBUTING docs
DaStoll committed Nov 14, 2024
1 parent 43617c7 commit 4e9efee
Showing 1 changed file (CONTRIBUTING.md) with 22 additions and 70 deletions.
There are three required steps and one optional:

1. Optional: Install miniconda and create a virtual environment
2. Install poetry
3. Install the neps package using poetry
4. Activate pre-commit for the repository

For instructions see below.

### 1. Optional: Install miniconda and create a virtual environment

To manage python versions, install e.g. miniconda and create a virtual environment with

```bash
conda create -n neps python=3.10
conda activate neps
```

### 2. Install poetry

First, install poetry, e.g., via

Then consider adding

```bash
export PATH="$HOME/.local/bin:$PATH"
```

to your `.zshrc` / `.bashrc`, or alternatively simply run the export manually.

### 3. Install the neps Package Using poetry

Clone the repository, e.g.,

Then, inside the main directory of neps, run

```bash
poetry install
```

This will install the neps package as well as additional dev dependencies.

### 4. Activate pre-commit for the repository

With the python environment used to install the neps package, run in the main directory of neps

```bash
pre-commit install
```

Additionally, you can set up the `ruff` plugin for the editor of your choice, e.g.
[VSCode](https://marketplace.visualstudio.com/items?itemName=charliermarsh.ruff),
[PyCharm](https://plugins.jetbrains.com/plugin/20574-ruff).


We have set up checks and tests at several points in the development flow:

- At every commit we automatically run a suite of [pre-commit](https://pre-commit.com/) hooks that perform static code analysis, autoformatting, and sanity checks.
  This is set up during our [installation process](https://automl.github.io/neps/co).
- At every push all integration tests and regression tests are run automatically using [github actions](https://github.com/automl/neps/actions).
  The tests correspond directly to examples in [neps_examples](https://github.com/automl/neps/tree/master/neps_examples) and only check for crash-causing errors.

## Checks and tests

### Linting (Ruff)

For linting and checking code quality we use `ruff`. You can install it locally and use it as follows:

```bash
pip install ruff
ruff check neps
```

The configuration of `ruff` is in the `pyproject.toml` file, and we refer you to the `ruff` documentation.
There you can find the documentation for all of the rules employed.

## Type Checking (Mypy)
### Type Checking (Mypy)
For type checking we use `mypy`. You can install it locally and use it as follows:

```bash
pip install mypy
mypy neps
```

If you are using concepts or types defined from NePS, there is probably a good reason for a mypy error.
If you have issues regarding typing, please feel free to reach out for help to `@eddiebergman`.


### Examples and Integration Tests

We use some examples in [neps_examples](https://github.com/automl/neps/tree/master/neps_examples) as integration tests, which we run from the main directory via

```bash
pytest
```

If tests fail for you on the master, please raise an issue on github, preferably with some information on the error,
traceback and the environment in which you are running, i.e. python version, OS, etc.

### Regression Tests

Regression tests are run on each push to the repository to ensure that the performance of the optimizers does not degrade.

Currently, regression runs are recorded on JAHS-Bench-201 data for two tasks, `cifar10` and `fashion_mnist`, and only for the optimizers `random_search`, `bayesian_optimization`, and `mf_bayesian_optimization`.
This information is stored in `tests/regression_runner.py` as two lists: `TASKS` and `OPTIMIZERS`.
The recorded results are stored as a JSON dictionary in the `tests/losses.json` file.
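
For orientation, the two lists described above can be sketched as follows (values copied from this paragraph; the authoritative definitions live in `tests/regression_runner.py`):

```python
# Values as stated in the text above; the real lists live in
# tests/regression_runner.py.
TASKS = ["cifar10", "fashion_mnist"]
OPTIMIZERS = ["random_search", "bayesian_optimization", "mf_bayesian_optimization"]

# Each (optimizer, task) combination is later checked separately.
combinations = [(optimizer, task) for optimizer in OPTIMIZERS for task in TASKS]
print(len(combinations))  # 6 combinations
```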

#### Adding new optimizer algorithms

Once a new algorithm is added to the NePS library, we first need to record the performance of the algorithm for 100 optimization runs.

- If the algorithm expects a standard loss function (pipeline) and accepts fidelity hyperparameters in the pipeline space, then recording results only requires adding the optimizer name to the `OPTIMIZERS` list in `tests/regression_runner.py` and running `tests/regression_runner.py`.

- In case your algorithm requires a custom pipeline and/or pipeline space, you can modify the `runner.run_pipeline` and `runner.pipeline_space` attributes of the `RegressionRunner` after initialization (around line `#322` in `tests/regression_runner.py`).

You can verify the optimizer is recorded by rerunning `regression_runner.py`.
From then on, the regression tests will also run on your new optimizer on every push.

#### Regression test metrics

For each regression test the algorithm is run 10 times to sample its performance; these runs are then statistically compared to the 100 recorded runs. We use the following three boolean metrics to define the performance of the algorithm on any task:

1. [Kolmogorov-Smirnov test for goodness of fit](https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.kstest.html) - `pvalue` >= 10%
1. Absolute median distance - bounded within 92.5% confidence range of the expected median distance
1. Median improvement - Median improvement over the recorded median
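
The first metric can be sketched with scipy's two-sample Kolmogorov-Smirnov test (the loss values below are synthetic stand-ins; the real runner compares the 10 fresh runs against the 100 recorded ones):

```python
import random

from scipy.stats import ks_2samp

random.seed(0)
recorded = [random.gauss(2.3, 0.1) for _ in range(100)]  # stand-in for the 100 recorded losses
sampled = [random.gauss(2.3, 0.1) for _ in range(10)]    # stand-in for the 10 fresh runs

# Two-sample Kolmogorov-Smirnov test: are both samples plausibly drawn
# from the same distribution?
result = ks_2samp(sampled, recorded)
ks_pass = bool(result.pvalue >= 0.10)  # the >= 10% threshold from the list above
```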

Test metrics are run for each `(optimizer, task)` combination separately and then collected.
The collected metrics are then further combined into two metrics:

1. Task pass - either both the `Kolmogorov-Smirnov test` and the `Absolute median distance` test pass, or just `Median improvement` does
1. Test aggregate - `Sum_over_tasks(Kolmogorov-Smirnov test + Absolute median distance + 2 * Median improvement)`

Finally, a test for an optimizer only passes when `Task pass` is true for at least one of the tasks and `Test aggregate` is higher than 1 + `number of tasks`.
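
Put together, the pass rules above can be sketched as follows (the per-task boolean outcomes here are illustrative, not real results):

```python
# Illustrative per-task metric outcomes for one optimizer on the two tasks
# named above; in the real tests these come from the metric computations.
metrics = {
    "cifar10":       {"ks": True,  "abs_median": True,  "median_improvement": False},
    "fashion_mnist": {"ks": False, "abs_median": True,  "median_improvement": True},
}

def task_pass(m: dict) -> bool:
    # Either both KS and absolute-median-distance pass, or median improvement does.
    return (m["ks"] and m["abs_median"]) or m["median_improvement"]

test_aggregate = sum(
    m["ks"] + m["abs_median"] + 2 * m["median_improvement"] for m in metrics.values()
)

num_tasks = len(metrics)
optimizer_passes = any(task_pass(m) for m in metrics.values()) and test_aggregate > 1 + num_tasks
print(optimizer_passes)  # True for these illustrative values
```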

#### On regression test failures

Regression tests are stochastic by nature, so they might fail occasionally even if the algorithm's performance didn't degrade.
In the case of a regression test failure, try running it again first; if the problem persists, you can contact [Danny Stoll](mailto:[email protected]) or [Samir](mailto:[email protected]).
You can also run tests locally by running:

```bash
poetry run pytest -m regression_all
```

### Disabling and Skipping Checks etc.

#### Pre-commit: How to not run hooks?

To commit without running `pre-commit` use `git commit --no-verify -m <COMMIT MESSAGE>`.
There are two options:

```python
code = "foo"  # type: ignore
```
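
Mypy also lets you scope the ignore to a specific error code, which keeps the suppression narrow; a small self-contained illustration (the function and names are made up for this example):

```python
def parse_port(raw: str) -> int:
    return int(raw)

# Deliberate annotation mismatch below; the bracketed error code silences
# only mypy's "assignment" error on this line, nothing else.
port: str = parse_port("8080")  # type: ignore[assignment]
print(port)  # prints 8080
```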

### Managing Dependencies

To manage dependencies and for package distribution we use [poetry](https://python-poetry.org/docs/) (replaces pip).

#### Add dependencies

To install a dependency use

```bash
poetry add <dependency>
```

and commit the updated `pyproject.toml` to git.

For more advanced dependency management see examples in `pyproject.toml` or have a look at the [poetry documentation](https://python-poetry.org/).

#### Install dependencies added by others

When other contributors added dependencies to `pyproject.toml`, you can install them via

```bash
poetry lock
poetry install
```

## Documentation

We use [MkDocs](https://www.mkdocs.org/getting-started/), more specifically [Material for MkDocs](https://squidfunk.github.io/mkdocs-material/) for documentation.
To support documentation for multiple versions, we use the plugin [mike](https://github.com/jimporter/mike).
To publish the documentation run

```bash
mike deploy 0.5.1 latest -p
```

## Releasing a New Version

There are four steps to releasing a new version of neps:

0. Understand Semantic Versioning
1. Update the Package Version and CITATION.cff
2. Commit with a Version Tag
3. Update Documentation
4. Publish on PyPI

### 0. Understand Semantic Versioning

We follow the [semantic versioning](https://semver.org) scheme.

### 1. Update the Package Version and CITATION.cff

```bash
poetry version v0.9.0
```

and manually change the version specified in `CITATION.cff`.

### 2. Commit with a Version Tag

First commit and test

Then push, including tags:

```bash
git push --tags
git push
```

### 3. Update Documentation

First check if the documentation has any issues via

Afterwards, publish it via

```bash
mike deploy 0.9.0 latest -up
```

### 4. Publish on PyPI

To publish to PyPI:

1. Get publishing rights, e.g., asking Danny or Neeratyoy.
2. Be careful, once on PyPI we cannot change things.
3. Run

```bash
poetry publish --build
```
