Skip to content

Latest commit

 

History

History
26 lines (22 loc) · 512 Bytes

README.md

File metadata and controls

26 lines (22 loc) · 512 Bytes

Mahjong

The objective of this project is to build a reasonable AI agent to play Mahjong with human players.

The enviroment of Mahjong is built by RLCard.

Requirements

  • PyTorch
  • RLCard

Usage

Train PPO Agent

python main.py --log_dir 'experiments/mahjong_ppo_result/'

Play Demo with trained agent

python demo.py

Todo

  • PPO Agent
  • Demo
  • Distributed PPO Agent
  • MAPPO Agent
  • SAC Agent
  • Add other types of Mahjong games