LSTM-Based Spatial Encoding: Explainable Path Planning for Time-Variant Multi-Agent Systems

This repository contains the source code for the LSTM-based multi-agent path planning method described in Long Short-Term Memory for Spatial Encoding in Multi-Agent Path Planning (Schlichting et al., 2022). The code can be used as a starting point for other multi-agent environments (by exchanging the gym environment), for following the ideas outlined in the paper regarding safety guarantees, or for your own experiments. The implementation is based on OpenAI's Gym and PyTorch.

The paper can be found using this link. If you use this work for your own research, please cite it as:

@article{schlichting2022long,
  title={Long Short-Term Memory for Spatial Encoding in Multi-Agent Path Planning},
  author={Schlichting, Marc R and Notter, Stefan and Fichter, Walter},
  journal={Journal of Guidance, Control, and Dynamics},
  volume={45},
  number={5},
  pages={952--961},
  year={2022},
  publisher={American Institute of Aeronautics and Astronautics}
}

Installation

The setup described here has mainly been tested on Linux-based systems. A Python installation (as of November 2022, we recommend Python 3.9.13) is required. The following Python packages are required (pip is recommended for installation): numpy (==1.23.4), scipy (==1.9.3), gym (basic installation only, ==0.23.1), and PyTorch (see here for installation notes; CUDA is not used in the implementation so far, ==1.13.0). Depending on the platform, additional packages such as the Visual Studio Build Tools (possibly required for gym) need to be installed. Make sure that all packages are properly installed before proceeding to the next step. To simplify the installation, the following command installs all necessary packages:

pip3 install -r requirements.txt
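
The version pins listed above suggest a requirements.txt along the following lines. This is only a sketch reconstructed from the versions named in this section; the file shipped with the repository is authoritative.

numpy==1.23.4
scipy==1.9.3
gym==0.23.1
torch==1.13.0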

Before training the policy, the custom environment (located in the drone-sim2d folder) must be installed. For this purpose, navigate into that folder:

cd drone-sim2d

Now use pip to install the custom environment:

pip3 install -e .

If no error occurs, the installation has completed successfully.
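
A quick way to verify the editable install is to create the environment once from Python. The snippet below is a minimal sketch; the environment ID "drone-sim2d-v0" is an assumption and may differ from the ID actually registered by the package (check the register() call in the drone-sim2d package for the real name).

import gym

# Hypothetical environment ID -- replace with the ID registered by drone-sim2d.
env = gym.make("drone-sim2d-v0")
obs = env.reset()
print(type(obs))  # confirms the environment can be constructed and reset
env.close()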

Training

All parameters can be changed within the main function of the training.py file. The overall structure of a typical PPO implementation has been adapted to work with multi-agent environments. The training can be started with:

python3 training.py
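
To illustrate the kind of settings that live in the main function of training.py, the following sketch shows a typical PPO hyperparameter block. All variable names and values here are placeholders and do not necessarily match the actual code; consult training.py for the real parameters.

# Hypothetical parameter block -- names and values are placeholders only.
env_name = "drone-sim2d-v0"   # ID of the custom environment (assumed)
max_episodes = 50000          # total number of training episodes
update_timestep = 4000        # steps collected before each PPO update
lr = 3e-4                     # learning rate
gamma = 0.99                  # discount factor
eps_clip = 0.2                # PPO clipping parameter
log_interval = 20             # episodes between log entries
save_interval = 500           # episodes between model checkpoints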

The logs folder contains the log files for each run. For each run, a unique timestamp is created that is used for all log files and model names. Within the logs folder, two files are created per run: a parameter file that contains the relevant parameters of the run, and a second file that contains the average episode length as well as the average reward per log interval. The models are saved to the models folder after a specified number of training episodes (as defined at the beginning of the main function).
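
The reward log can be inspected with a few lines of Python, for example to plot the learning curve. The file name and column layout below are assumptions (the actual name contains the run's timestamp); adjust them to match the files in your logs folder.

import numpy as np
import matplotlib.pyplot as plt

# Hypothetical file name and column order -- check your logs folder.
log = np.loadtxt("logs/20221101-120000_log.txt")
episodes, avg_length, avg_reward = log[:, 0], log[:, 1], log[:, 2]

plt.plot(episodes, avg_reward)
plt.xlabel("episode")
plt.ylabel("average reward per log interval")
plt.show()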

Evaluation

For inference, we need to install the evaluation version of the simulator. Navigate into the drone-sim2d-eval directory and execute

pip3 install -e .

To specify the origins and targets of the vehicles, please modify the positions_destinations.txt file. The file uses the following format:

vehicle_1_x_origin vehicle_1_y_origin vehicle_1_x_target vehicle_1_y_target 
vehicle_2_x_origin vehicle_2_y_origin vehicle_2_x_target vehicle_2_y_target 
...
vehicle_n_x_origin vehicle_n_y_origin vehicle_n_x_target vehicle_n_y_target 
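
As a concrete example, a scenario in which two vehicles swap positions along the x-axis could look like this. The coordinates are purely illustrative; the units and valid coordinate ranges depend on the environment configuration.

-50.0 0.0 50.0 0.0
50.0 0.0 -50.0 0.0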

Once the scenario has been specified in positions_destinations.txt, the evaluation script can be run with

python3 evaluation.py

The resulting trajectories are saved to the trajectories directory by default. We also provide a small helper script, plot_trajectories.py, which plots a simple ground track of the simulated trajectories. For the evaluation case, all vehicles follow the same previously trained policy (the policy to be used is specified in evaluation.py). For an easier start, we provide one fully trained policy in the models directory.
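
For a quick look at the results, a ground track can also be plotted by hand along the following lines. The assumption that each trajectory file is a plain-text array with x in the first column and y in the second is ours; plot_trajectories.py is the script that matches the actual file format.

import glob
import numpy as np
import matplotlib.pyplot as plt

# Assumed layout: one plain-text file per vehicle, columns [x, y].
for path in glob.glob("trajectories/*.txt"):
    traj = np.loadtxt(path)
    plt.plot(traj[:, 0], traj[:, 1], label=path)

plt.xlabel("x")
plt.ylabel("y")
plt.axis("equal")
plt.legend()
plt.show()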
