make evaluator script executable with command line arguments PDDL/SAS files for testing #82

ClemensBuechner · 2022-06-29T15:42:18Z

No description provided.

FlorianPommerening · 2024-08-13T10:13:07Z

We discussed this and would like evaluators to take the path to a pickled state as input during normal operation of the search, rather than having the search import a function from the evaluator script and call it. This will make it possible for the search to detect cases where the evaluator runs out of resources, and it will bring the grid and local environments closer together. The original goal of the issue was to also allow PDDL/SAS files as input for testing. We will handle these as optional inputs in convenience functions provided by the modules.
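To illustrate why a script interface helps the search notice resource problems, here is a minimal sketch of the search side. It is not the machetli implementation: the function name `run_evaluator_script`, the outcome strings, and the numeric exit code values are all assumptions made for this example. It uses only the standard library.

```python
import pickle
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical exit codes; the values machetli actually uses may differ.
EXIT_CODE_IMPROVING = 42
EXIT_CODE_NONIMPROVING = 33


def run_evaluator_script(script, state, time_limit=60):
    """Pickle `state`, run `script` on the pickle, and classify the outcome."""
    with tempfile.TemporaryDirectory() as tmp:
        state_path = Path(tmp) / "state.pickle"
        with open(state_path, "wb") as f:
            pickle.dump(state, f)
        try:
            # The evaluator is a separate process, so a hang or a crash in it
            # cannot take the search down with it.
            result = subprocess.run(
                [sys.executable, str(script), str(state_path)],
                timeout=time_limit,
            )
        except subprocess.TimeoutExpired:
            return "out of resources"
    if result.returncode == EXIT_CODE_IMPROVING:
        return "improving"
    if result.returncode == EXIT_CODE_NONIMPROVING:
        return "non-improving"
    # Any other exit code (crash, killed by a limit, ...) is visible here.
    return "failed"
```

With an imported function, a crash or timeout inside the evaluator happens in the search's own process; with a subprocess, it shows up as a distinct return code or a timeout that the search can handle explicitly.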

We discussed a few options on Discord and ended up with the following design. An evaluator without convenience functions could look like this:

#!/usr/bin/env python3

import sys

import machetli
from machetli import pddl, tools, EXIT_CODE_IMPROVING, EXIT_CODE_NONIMPROVING

def load_state(argv):
    # Normal operation: argv[1] is the path to a pickled state.
    try:
        return machetli.unpickle(argv[1])
    except Exception:
        # Testing: argv[1] and argv[2] are PDDL domain and problem files.
        return pddl.create_state(argv[1], argv[2])

def evaluate_state(state):
    # Write the state to temporary PDDL files and evaluate those.
    with pddl.temporary_files(state) as (domain_filename, problem_filename):
        return evaluate(domain_filename, problem_filename)

def evaluate(domain_filename, problem_filename):
    command = ["./bugged-planner/plan", f"{domain_filename}", f"{problem_filename}"]
    run = tools.Run(command, time_limit=20, memory_limit=3000)
    stdout, stderr, returncode = run.start()
    return "Wrong task encoding" in stdout

if __name__ == "__main__":
    state = load_state(sys.argv)
    # Report the result to the search through the exit code.
    if evaluate_state(state):
        sys.exit(EXIT_CODE_IMPROVING)
    else:
        sys.exit(EXIT_CODE_NONIMPROVING)

With convenience functions, we could boil it down to this:

from machetli import pddl, tools

def evaluate(domain_filename, problem_filename):
    command = ["./bugged-planner/plan", f"{domain_filename}", f"{problem_filename}"]
    run = tools.Run(command, time_limit=20, memory_limit=3000)
    stdout, stderr, returncode = run.start()
    return "Wrong task encoding" in stdout

if __name__ == "__main__":
    pddl.run_evaluator(evaluate)
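As a rough idea of what such a convenience function might do internally, here is a generic, standard-library-only sketch. It is an assumption-laden illustration, not the actual `pddl.run_evaluator`: the `load_from_files` parameter, the `argv` parameter, the exit code values, and the choice to dispatch on the number of arguments are all hypothetical, and unlike the design above it passes the state (not temporary file names) to `evaluate`.

```python
import pickle
import sys

# Hypothetical exit codes; the values machetli actually uses may differ.
EXIT_CODE_IMPROVING = 42
EXIT_CODE_NONIMPROVING = 33


def run_evaluator(evaluate, load_from_files, argv=None):
    """Load a state from the command line, evaluate it, exit with the result.

    One argument:  path to a pickled state (normal operation of the search).
    Two arguments: input files for manual testing, turned into a state by
                   the caller-supplied `load_from_files(first, second)`.
    """
    argv = sys.argv[1:] if argv is None else argv
    if len(argv) == 1:
        with open(argv[0], "rb") as f:
            state = pickle.load(f)
    elif len(argv) == 2:
        state = load_from_files(argv[0], argv[1])
    else:
        # A string argument makes sys.exit print it and exit with status 1.
        sys.exit("usage: evaluator STATE_PICKLE | evaluator FILE1 FILE2")
    if evaluate(state):
        sys.exit(EXIT_CODE_IMPROVING)
    sys.exit(EXIT_CODE_NONIMPROVING)
```

Dispatching on the argument count avoids the broad try/except of the longer version, at the cost of not supporting input formats that need a different number of files.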

FlorianPommerening · 2024-08-13T13:23:16Z

As discussed in #90, we want to work on #53 first. We might do this issue together with #53 if that makes things easier.

FlorianPommerening · 2024-08-16T11:54:39Z

Pull request #94 deals with this. There are still a couple of TODOs and we have to update the documentation.

ClemensBuechner added the enhancement label Jun 29, 2022

FlorianPommerening mentioned this issue Oct 27, 2023

Create eval directory and batch directories with executable evaluator script in all Environments #90

Open

FlorianPommerening mentioned this issue Aug 14, 2024

write experiment data to designated data directory and/or eval directories similar to Lab #53

Closed

ClemensBuechner linked a pull request Aug 21, 2024 that will close this issue

switch to using evaluator as script. #94

Merged

ClemensBuechner closed this as completed Aug 21, 2024

ClemensBuechner mentioned this issue Aug 21, 2024

when running python -m machetli.evaluator on the grid we get a warning about undefined behavior. Use a __main__.py file instead #79

Closed
