This repository contains the source code associated with the paper published at ICLR'22 (OpenReview):
Kensen Shi, Hanjun Dai, Kevin Ellis, and Charles Sutton. CrossBeam: Learning to Search in Bottom-Up Program Synthesis. International Conference on Learning Representations (ICLR), 2022.
In this research project, we train a neural model to learn a hands-on search policy for bottom-up program synthesis, in an effort to tame the search space blowup.
To cite this work, you can use the following BibTeX entry:
```
@inproceedings{shi2022crossbeam,
    title={{CrossBeam}: Learning to Search in Bottom-Up Program Synthesis},
    author={Kensen Shi and Hanjun Dai and Kevin Ellis and Charles Sutton},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=qhC8mr2LEKq},
}
```
For dependencies, first install PyTorch with CUDA and `torch-scatter`, for example with the following commands:

```
pip3 install torch==1.10.2+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
pip3 install torch-scatter -f https://data.pyg.org/whl/torch-1.10.0+cu113.html
```
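To confirm the installation worked, a quick sanity check is to import both packages and print their versions. This assumes you installed the CUDA builds above; `torch.cuda.is_available()` will print `False` on a machine without a visible GPU:

```
python3 -c "import torch; print(torch.__version__, torch.cuda.is_available())"
python3 -c "import torch_scatter; print(torch_scatter.__version__)"
```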
Then run the following from the root `crossbeam/` directory:

```
pip3 install -e .
```

You can also append the `--user` option if you don't want to install globally.
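As a quick check that the editable install succeeded (assuming the package is importable under the name `crossbeam`), you can print where Python resolves it from:

```
python3 -c "import crossbeam; print(crossbeam.__file__)"
```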
From the root `crossbeam/` directory, run:

```
pytest
```
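A couple of standard `pytest` variations can be handy while iterating; the keyword below is only an example and should be replaced with a substring of the test names you care about:

```
pytest -q            # terse output
pytest -k bustle     # run only tests whose names match the keyword
```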
The synthetic training data is only needed for re-training models and is not included in this repo. The following steps will generate the synthetic training data:
```
./crossbeam/datasets/run_bustle_gen.sh
./crossbeam/datasets/run_logic_gen.sh
./crossbeam/datasets/run_logic_clean_and_shard.sh
```
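If you want data generation to stop at the first failure, one option is a small wrapper around the commands above (run from the repository root); this is just a convenience sketch, not part of the repo:

```
#!/bin/bash
set -e  # exit immediately if any generation step fails
./crossbeam/datasets/run_bustle_gen.sh
./crossbeam/datasets/run_logic_gen.sh
./crossbeam/datasets/run_logic_clean_and_shard.sh
```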
Test datasets are in `crossbeam/data/`.
Trained models are included in this repo in `trained_models/`. If you wish to re-train the models, first generate training data with the commands above, and then follow the steps below.
Navigate to the `crossbeam/experiment/bustle` directory, make any necessary edits to `bustle_main_dist.sh` (such as the data folder and the number of GPUs to use), and run the script to train a model:

```
cd crossbeam/experiment/bustle
./bustle_main_dist.sh
```
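If you need to restrict training to particular GPUs, one common approach is to set `CUDA_VISIBLE_DEVICES` when launching the script. This assumes the script's PyTorch processes respect that variable (standard PyTorch behavior), and the GPU indices below are just an example:

```
# Make only GPUs 0 and 1 visible to the training processes.
CUDA_VISIBLE_DEVICES=0,1 ./bustle_main_dist.sh
```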
The default hyperparameters should mirror the settings in the paper.
As in the case above, run the following script:

```
cd crossbeam/experiment/logic_synthesis
./dist_run_logic.sh
```
You can set the `usegreattransformer` argument to either `1` or `0`, to use the Transformer encoder or a simple MLP, respectively.
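As a sketch, the two configurations might be launched as below; check `dist_run_logic.sh` to see how `usegreattransformer` is actually consumed (it may be a variable set inside the script rather than one passed on the environment):

```
usegreattransformer=1 ./dist_run_logic.sh   # Transformer encoder
usegreattransformer=0 ./dist_run_logic.sh   # simple MLP
```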
From the root `crossbeam/` directory, run:

```
./iclr2022/bustle_experiments.sh
./iclr2022/logic_experiments.sh
```
These commands will save results as JSON files in `iclr2022/bustle_results/` and `iclr2022/logic_results/`.
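To take a first look at the results, you can pretty-print each JSON file; the glob below is an assumption about the output layout, and the JSON schema itself is not documented here:

```
# Pretty-print the first lines of every results file.
for f in iclr2022/bustle_results/*.json iclr2022/logic_results/*.json; do
  echo "== $f"
  python3 -m json.tool "$f" | head -n 20
done
```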
This is not an official Google product.