Graph-Bandit

This repository contains the implementation of graph bandit algorithms and the corresponding numerical experiments. The code is written in Python.

Python packages required for running the experiments:

For running the Python notebooks: jupyter notebook or jupyter lab.
Graph-related utilities: networkx.
Plotting utilities: matplotlib, seaborn.
For showing the progress bar during the experiments: tqdm.
For saving and loading experiment data: pickle (part of the Python standard library, so no separate installation is needed).
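
Before opening the notebooks, it can help to confirm that the packages above are importable. The following is a minimal sketch using only the standard library; the helper name `missing_packages` is hypothetical and not part of this repository. Jupyter is left out of the check because it is launched as an application rather than imported by the experiments.

```python
import importlib.util

# Import names of the packages listed above (Jupyter is intentionally omitted).
REQUIRED = ["networkx", "matplotlib", "seaborn", "tqdm", "pickle"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Any name that is reported as missing can then be installed with the package manager of your choice.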

Quick Start: Run the 'Robotic Application.ipynb' notebook directly to see the network used in our robotic application and the regret of our proposed algorithm.

Contents of the Python files

graph_bandit.py: the class definition of graph bandit environment, which includes a class method that trains a Q-learning agent.
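
To give a rough idea of what a graph bandit environment does, here is a toy sketch. It is not the class defined in graph_bandit.py: the class name `ToyGraphBandit`, its constructor arguments, and the `step` method are all hypothetical. It assumes Gaussian rewards and that the agent may only move to a node listed as a neighbor of its current node (list a node among its own neighbors to allow staying in place). A plain dictionary is used for the adjacency structure to keep the sketch dependency-free.

```python
import random


class ToyGraphBandit:
    """Toy environment: the agent walks on a graph and is rewarded by the node it enters."""

    def __init__(self, adjacency, means, start, noise_std=1.0, seed=None):
        self.adjacency = adjacency    # dict: node -> list of reachable neighbors
        self.means = means            # dict: node -> mean reward (unknown to the agent)
        self.state = start            # node the agent currently occupies
        self.noise_std = noise_std    # standard deviation of the reward noise
        self.rng = random.Random(seed)

    def step(self, next_node):
        """Move to `next_node` and return a noisy reward sampled at that node."""
        if next_node not in self.adjacency[self.state]:
            raise ValueError(f"{next_node} is not a neighbor of {self.state}")
        self.state = next_node
        return self.rng.gauss(self.means[next_node], self.noise_std)


if __name__ == "__main__":
    # A three-node line graph with self-loops.
    adjacency = {0: [0, 1], 1: [0, 1, 2], 2: [1, 2]}
    means = {0: 0.0, 1: 0.5, 2: 1.0}
    env = ToyGraphBandit(adjacency, means, start=0, seed=0)
    print(env.step(1), env.state)
```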

agents.py: contains the agent implementing our proposed algorithm (under the name doubling_agent), as well as the local Thompson Sampling and UCB agents.
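
As an illustration of the kind of rule a local UCB benchmark could use, the sketch below picks the next node among the current node's neighbors by the standard UCB1 index. This is an assumption about what "local" means here, not the code in agents.py; the function name `local_ucb_choice` and its arguments are hypothetical.

```python
import math


def local_ucb_choice(neighbors, counts, means, t):
    """Return the neighbor with the largest UCB1 index at time step t (t >= 1).

    counts[node] is the number of samples collected at the node and
    means[node] is the empirical mean reward observed there.
    """
    def index(node):
        if counts[node] == 0:
            return math.inf  # unvisited nodes are always tried first
        return means[node] + math.sqrt(2.0 * math.log(t) / counts[node])

    return max(neighbors, key=index)


if __name__ == "__main__":
    counts = {0: 10, 1: 10, 2: 0}
    means = {0: 0.2, 1: 0.8, 2: 0.0}
    print(local_ucb_choice([0, 1], counts, means, t=20))
    print(local_ucb_choice([0, 1, 2], counts, means, t=20))
```

Because the choice is restricted to neighbors, such an agent never plans a multi-step route toward a distant node.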

core.py: contains a function that visits all nodes at least once (used in initialization), and the train_agent() function.
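
One simple way to visit every node at least once while moving only along edges is a depth-first walk with backtracking. The sketch below illustrates that idea; it is not necessarily how core.py does it. The function name `covering_walk` is hypothetical, and the graph is assumed to be undirected and connected.

```python
def covering_walk(adjacency, start):
    """Return a walk from `start` that moves only along edges and visits every reachable node.

    `adjacency` maps each node to a list of its neighbors (undirected graph assumed,
    so that every forward step can be undone by stepping back along the same edge).
    """
    visited = {start}
    walk = [start]

    def dfs(node):
        for neighbor in adjacency[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                walk.append(neighbor)  # step forward to the new node
                dfs(neighbor)
                walk.append(node)      # step back along the same edge

    dfs(start)
    return walk


if __name__ == "__main__":
    adjacency = {0: [1], 1: [0, 2, 3], 2: [1], 3: [1]}
    print(covering_walk(adjacency, 0))
```

The walk uses each tree edge twice, so its length is at most twice the number of nodes. For very large graphs, an iterative version would avoid Python's recursion limit.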

known_rewards_helper_functions.py: the shortest path algorithm for off-line planning.
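
For intuition, a shortest path in terms of hop count can be computed with breadth-first search, as sketched below. This is only an illustration under the assumption that every edge has the same cost; the algorithm in known_rewards_helper_functions.py may use a different cost. The function name `shortest_path` is hypothetical.

```python
from collections import deque


def shortest_path(adjacency, source, target):
    """Return a list of nodes on a fewest-hops path from source to target, or None."""
    parent = {source: None}  # also serves as the set of discovered nodes
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            # Follow parent pointers back to the source, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for neighbor in adjacency[node]:
            if neighbor not in parent:
                parent[neighbor] = node
                queue.append(neighbor)
    return None  # target is not reachable from source


if __name__ == "__main__":
    adjacency = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}
    print(shortest_path(adjacency, 0, 4))
```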

graph_bandit_helper_tools.py: contains a graph generator, a graph drawing utility, and a wrapper for training a Q-learning agent.

Contents of the Python notebooks

Main.ipynb: contains the experiments comparing our proposed algorithm with various benchmarks on various graphs.

Main Plotting.ipynb: plotting utilities for the results obtained from Main.ipynb.

Sensitivity analysis.ipynb: experiments showing how the performance of our algorithm depends on the graph parameters $|S|$, $D$, and $\Delta$.

Robotic Application.ipynb: contains the synthetic robotic application of providing Internet access to rural/suburban areas using a UAV.

Direct SP.ipynb: additional experiments comparing two ways of reaching the destination node within one iteration, as described in the technical appendix.

Direct SP Plotting.ipynb: the plotting notebook for Direct SP.ipynb.

