Omok (Gomoku) RL agent

Warning

This repository is still under active development.

Description

Training an Omok AI with OpenAI Gym and Stable-Baselines3

This project is built on OpenAI Gym and Stable-Baselines3 and provides a framework in which two agents compete against and learn from each other, improving their strategies over time.

Overview

Omok, also known as Gomoku, is a strategy board game traditionally played with Go pieces (black and white stones) on a Go board. The objective is to be the first to place five of one's own pieces in a row, either horizontally, vertically, or diagonally.

In this project, I have developed a training environment where two AI agents compete against each other in the game of Omok. The agents are trained using reinforcement learning techniques provided by Stable-Baselines3, integrated within an OpenAI Gym environment. To accelerate learning, one agent is periodically cloned from the other, ensuring that both agents evolve competitive strategies against increasingly skilled opponents.

Features

Custom Omok Environment

An OpenAI Gym-based environment for the Omok game.
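A minimal sketch of what such an environment might look like is shown below. The class name, board encoding, and reward convention are illustrative assumptions, not the repository's actual code.

```python
import gym
import numpy as np
from gym import spaces


class OmokEnv(gym.Env):
    """Illustrative 15x15 Omok environment (not the repository's actual implementation)."""

    def __init__(self, board_size: int = 15):
        super().__init__()
        self.board_size = board_size
        # One flat action per board cell.
        self.action_space = spaces.Discrete(board_size * board_size)
        # 0 = empty, 1 = current player, -1 = opponent.
        self.observation_space = spaces.Box(
            low=-1, high=1, shape=(board_size, board_size), dtype=np.int8
        )
        self.board = np.zeros((board_size, board_size), dtype=np.int8)

    def reset(self):
        self.board[:] = 0
        return self.board.copy()

    def step(self, action):
        row, col = divmod(action, self.board_size)
        if self.board[row, col] != 0:
            # Illegal move: penalize and end the episode (one possible convention).
            return self.board.copy(), -1.0, True, {"illegal": True}
        self.board[row, col] = 1
        done = self._five_in_a_row(row, col)
        reward = 1.0 if done else 0.0
        # Flip the board so the next player always sees its own stones as +1.
        self.board *= -1
        return self.board.copy(), reward, done, {}

    def _five_in_a_row(self, row, col):
        # Check the four lines through the last move for five in a row.
        player = self.board[row, col]
        for dr, dc in ((0, 1), (1, 0), (1, 1), (1, -1)):
            count = 1
            for sign in (1, -1):
                r, c = row + sign * dr, col + sign * dc
                while (0 <= r < self.board_size and 0 <= c < self.board_size
                       and self.board[r, c] == player):
                    count += 1
                    r += sign * dr
                    c += sign * dc
            if count >= 5:
                return True
        return False
```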

Reinforcement Learning Agents

Agents are trained using reinforcement learning algorithms from Stable-Baselines3.
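Training a single agent could look roughly like this; PPO is used only as an example algorithm, and `OmokEnv` refers to the sketch above, so the repository's actual training script may differ.

```python
from stable_baselines3 import PPO

env = OmokEnv()                        # environment sketched above
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=100_000)
model.save("omok_ppo")                 # illustrative file name
```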

Competition-Driven Learning

Agents learn by competing against a clone of themselves.

Periodic Cloning: One agent is periodically cloned from the other to maintain a challenging learning environment and prevent stagnation.
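A rough sketch of how periodic cloning can be done with Stable-Baselines3 is shown below. It assumes the environment lets a separate, frozen opponent model choose the second player's moves; the names, interval, and loop structure are illustrative.

```python
from stable_baselines3 import PPO

# Assumption: `env` is an Omok environment in which the frozen `opponent`
# model answers the second player's moves (e.g. via opponent.predict(obs)).
env = OmokEnv()

learner = PPO("MlpPolicy", env, verbose=0)
opponent = PPO("MlpPolicy", env, verbose=0)

CLONE_EVERY = 50_000   # timesteps between clones (illustrative value)
ROUNDS = 10

for _ in range(ROUNDS):
    # Train the learner against the currently frozen opponent.
    learner.learn(total_timesteps=CLONE_EVERY, reset_num_timesteps=False)
    # Clone: the opponent adopts the learner's weights, so the learner
    # keeps facing an increasingly skilled version of itself.
    opponent.set_parameters(learner.get_parameters())
```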

Demo

omok_demo.mov

Milestones

  • Implement the Omok Game
  • Wrap the Omok Game in a Gym Environment
  • Implement Learning Agents that compete
  • Mask unauthorized actions
  • Enhance performance with algorithm adjustments
  • Implement various Omok rules
  • Develop an API for the game
  • Create a website for playing against the Omok AI

Known Problems

  • As the number of training episodes increases, both agents tend to place their pieces in the same positions repeatedly, leading to repetitive game outcomes in subsequent episodes.
  • The agent sometimes tries to place a piece in an invalid position (e.g., on an already occupied cell); see the action-masking sketch below.
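One possible mitigation for the second issue (and for the action-masking milestone above) is to mask out occupied cells, for example with `MaskablePPO` from sb3-contrib. The sketch below assumes the board representation from the environment sketch above and is not the repository's actual code.

```python
from sb3_contrib import MaskablePPO
from sb3_contrib.common.wrappers import ActionMasker


def empty_cell_mask(env):
    # Only empty cells are legal placements.
    return env.board.flatten() == 0


masked_env = ActionMasker(OmokEnv(), empty_cell_mask)   # OmokEnv from the sketch above
model = MaskablePPO("MlpPolicy", masked_env, verbose=1)
model.learn(total_timesteps=100_000)
```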