MARL PPO Suite 🚀

Welcome to the MARL PPO Suite! This repository contains clean and documented implementations of Proximal Policy Optimization (PPO)-based algorithms designed for cooperative multi-agent reinforcement learning, particularly in StarCraft II Multi-Agent Challenge (SMAC) environments.

Introduction

In recent years, multi-agent reinforcement learning has gained significant attention. The MARL PPO Suite aims to provide a comprehensive toolkit for researchers and practitioners in this field. Our focus is on implementing efficient algorithms that can tackle complex tasks in cooperative environments.

You can find the latest releases of this project here.

Features

Clean Code: Each implementation follows best practices for clarity and maintainability.
Documentation: Thorough documentation helps users understand the algorithms and their applications.
Multiple Architectures: Supports both MLP (Multi-Layer Perceptron) and RNN (Recurrent Neural Network) architectures, including GRU (Gated Recurrent Unit).
Normalization Techniques: Various normalization strategies are implemented to improve training stability and performance.
Focus on SMAC: Tailored for environments like SMAC, allowing easy experimentation and evaluation.

Installation

To get started with the MARL PPO Suite, clone the repository and install the required dependencies.

git clone https://github.com/xujiuqing2023/marl-ppo-suite.git
cd marl-ppo-suite
pip install -r requirements.txt

Make sure you have Python 3.6 or higher installed on your system.

Usage

To use the MARL PPO Suite, you can run the provided training scripts. Here’s a simple example:

python train.py --config configs/mappo_config.yaml

Adjust the configuration file as needed for your specific use case. For more details, check the documentation in the docs folder.

Algorithms

The MARL PPO Suite includes several algorithms based on PPO:

MAPPO: Multi-Agent Proximal Policy Optimization, which allows agents to learn in a shared environment.
MLP-based MAPPO: Uses a simple feedforward neural network for agent policy representation.
RNN-based MAPPO: Utilizes recurrent networks to handle partial observability in environments.

Each algorithm is designed to work seamlessly with SMAC environments.

Normalization Techniques

Normalization can significantly impact the training process. The MARL PPO Suite offers several techniques, including:

Standardization: Adjusts the input features to have a mean of zero and a standard deviation of one.
Min-Max Scaling: Scales the features to a specific range, typically [0, 1].
Batch Normalization: Normalizes activations in a mini-batch, stabilizing the learning process.

You can choose the normalization technique that best fits your problem.

Examples

To illustrate the capabilities of the MARL PPO Suite, we provide several examples in the examples directory. These include:

Training agents in a basic SMAC scenario.
Evaluating performance metrics.
Visualizing training progress.

Feel free to modify these examples to suit your needs.

Contributing

We welcome contributions to the MARL PPO Suite! If you would like to contribute, please follow these steps:

Fork the repository.
Create a new branch for your feature or bug fix.
Make your changes and commit them with clear messages.
Push your branch to your forked repository.
Create a pull request detailing your changes.

We appreciate your interest in improving the MARL PPO Suite!

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Contact

For questions or feedback, feel free to reach out via GitHub issues or contact the repository maintainer:

Maintainer: xujiuqing2023

Stay updated with the latest releases by visiting our Releases section.

Acknowledgments

We thank the contributors to the open-source community for their invaluable resources and tools that made this project possible. Special thanks to the developers of the SMAC environments for providing a challenging platform for multi-agent reinforcement learning.

Explore the MARL PPO Suite and dive into the world of multi-agent reinforcement learning!

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
algos		algos
buffers		buffers
networks		networks
runners		runners
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MARL PPO Suite 🚀

Table of Contents

Introduction

Features

Installation

Usage

Algorithms

Normalization Techniques

Examples

Contributing

License

Contact

Acknowledgments

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

License

xujiuqing2023/marl-ppo-suite

Folders and files

Latest commit

History

Repository files navigation

MARL PPO Suite 🚀

Table of Contents

Introduction

Features

Installation

Usage

Algorithms

Normalization Techniques

Examples

Contributing

License

Contact

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages