Block Blast Game + Reinforcement Learning AI Agent

Block Blast is a Tetris-inspired puzzle game played on an 8×8 grid. At each turn, the player (or agent) is presented with three randomly generated block shapes and must choose one to place anywhere on the board. Whenever an entire row or column is filled, it clears and awards points; clearing multiple lines in succession activates a combo multiplier for even higher scores.
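As a sketch of the clearing rule (a minimal illustration on a NumPy 0/1 board; the actual engine in blockblast_game/game_state.py may implement this differently):

    import numpy as np

    def clear_full_lines(board: np.ndarray):
        """Zero out every full row and column of an 8x8 0/1 board."""
        # Detect full rows/columns BEFORE clearing anything, so a simultaneous
        # row + column clear is counted and removed correctly.
        full_rows = board.all(axis=1)
        full_cols = board.all(axis=0)
        cleared = int(full_rows.sum() + full_cols.sum())
        board = board.copy()
        board[full_rows, :] = 0
        board[:, full_cols] = 0
        return board, cleared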

Under the hood, the game engine guarantees that at least one of the three available shapes can always be placed, ensuring no unwinnable states arise prematurely. A Pygame-based interface lets humans play and test strategies interactively, while a custom OpenAI Gym environment exposes the game’s state, action space, and reward function for Reinforcement Learning.
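For example, a minimal interaction loop with the environment might look like this (the class name BlockBlastEnv is an assumption; check blockblast_game/game_env.py for the actual name):

    from blockblast_game.game_env import BlockBlastEnv  # assumed class name

    env = BlockBlastEnv()
    obs = env.reset()
    done = False
    while not done:
        action = env.action_space.sample()           # random-policy baseline
        obs, reward, done, info = env.step(action)   # classic Gym step API
    env.close()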

This repository implements several RL agents—including a random baseline, Deep Q-Network (DQN), Proximal Policy Optimization (PPO), and a masked-action PPO variant—to train and evaluate performance. Comprehensive scripts are provided to train agents, log metrics, visualize results, and compare different approaches under consistent conditions.

(Demo GIF: BlockBlast gameplay)



🚀 Features

  • Custom Gym Env: fine-tuned observation space, action space, and reward function.
  • Action Masking: Prevents invalid moves for PPO & DQN.
  • Multiple Agents:
    • Random (baseline)
    • PPO (with/without masking)
    • DQN
  • Visualization: TensorBoard logs, matplotlib plots of performance.
  • Human Interface: Pygame-based play via keyboard & mouse.

📂 Project Structure

BlockBlast-Game-AI-Agent/
├── agent_comparison/      # Simulation & comparison scripts
│   ├── simulate_playing.py
│   └── results/           # CSVs & plots
├── agents/                # RL agents & models
│   ├── models/            # Saved checkpoints
│   ├── dqn_agent.py
│   ├── dqn_masked_agent.py
│   ├── ppo_agent.py
│   ├── ppo_masked_agent.py
│   └── random_agent.py
├── blockblast_game/       # Game environment & assets
│   ├── game_env.py
│   ├── game_renderer.py
│   ├── game_state.py
│   └── Assets/            # Sprites & images
├── human_play/            # Human‐play wrapper
│   └── human_play.py
├── requirements.txt
└── README.md

⚙️ Requirements

Python 3 plus the packages listed in requirements.txt, including Pygame (human interface) and OpenAI Gym (RL environment).

🔧 Installation

  1. Clone the repo

    git clone https://github.com/RisticDjordje/BlockBlast-Game-AI-Agent.git
    cd BlockBlast-Game-AI-Agent
  2. Create & activate a virtual environment

    python3 -m venv venv
    source venv/bin/activate    # macOS/Linux
    venv\Scripts\activate       # Windows
  3. Install dependencies

    pip install -r requirements.txt

💻 Usage

Run all commands from the project root; because of the sibling-folder layout, imports will fail if scripts are launched from a subdirectory.

Human Play

python -m human_play.human_play
  • E, R, T: Select piece
  • Mouse hover: Preview placement
  • Left-click / SPACE: Place piece
  • ESC: Restart

Training Agents

Make sure you’re in the project root (so blockblast_game/ and agents/ are on your PYTHONPATH) and your virtualenv is activated.

DQN

  • Standard DQN
    python -m agents.dqn_agent
  • Masked DQN
    python -m agents.dqn_masked_agent

PPO

  • Standard PPO
    python -m agents.ppo_agent
  • Masked PPO
    python -m agents.ppo_masked_agent

Each script exposes configuration flags at the top of the file (e.g. do_train, do_visualize, total_timesteps, continue_training); edit those directly, or wire in an argparse wrapper if you'd like CLI control, as sketched below.
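For instance, a minimal argparse wrapper could look like this (illustrative only; the flag names mirror the in-file variables):

    import argparse

    parser = argparse.ArgumentParser(description="Train a Block Blast RL agent")
    parser.add_argument("--train", action="store_true",
                        help="run training (do_train)")
    parser.add_argument("--visualize", action="store_true",
                        help="visualize results after training (do_visualize)")
    parser.add_argument("--total-timesteps", type=int, default=500_000,
                        help="training budget (total_timesteps)")
    parser.add_argument("--continue-training", action="store_true",
                        help="resume from the last checkpoint (continue_training)")
    args = parser.parse_args()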


Comparing Agents

  1. Simulate games

    python -m agent_comparison.simulate_playing

    → CSV in agent_comparison/results/

  2. Visualize results

    python -m agent_comparison.visualize_results

    → Performance plots in agent_comparison/results/
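As a rough illustration of what step 2 produces (the filename and CSV schema below are assumptions; see simulate_playing.py for the actual output format):

    import pandas as pd
    import matplotlib.pyplot as plt

    # Assumed layout: one column of per-episode scores per agent.
    df = pd.read_csv("agent_comparison/results/simulation_scores.csv")
    df.mean().plot(kind="bar", ylabel="Average score", title="Agent comparison")
    plt.tight_layout()
    plt.savefig("agent_comparison/results/avg_scores.png")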


🏗 Reinforcement Learning Environment

  • Observation:
    • 8×8 grid (0/1)
    • 3× 5×5 piece matrices
    • Current score & combo count
  • Action Space: 3 shapes × 64 board positions → 192 discrete actions (see the decoding sketch after this list)
  • Reward Design (one example configuration; many variants were tried):
    • Valid placement: +0.2
    • Invalid placement: –2.5 (the penalty escalates with repeated invalid moves)
    • Line clear: +1.5 per line
    • Game over: –20
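The flat action index decodes into a (shape, row, column) triple; a sketch (the repository's encoding may order the factors differently):

    def decode_action(action: int):
        """Map a flat index in [0, 192) to (shape_idx, row, col)."""
        shape_idx, cell = divmod(action, 64)  # 3 shapes x 64 board cells
        row, col = divmod(cell, 8)            # 8x8 grid
        return shape_idx, row, col

    assert decode_action(191) == (2, 7, 7)    # last shape, bottom-right cell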

🤖 Algorithms

  1. Random Agent
  2. PPO (Clipped Policy Gradient)
  3. DQN (Value-based with Replay & Target Network)
  4. Masked PPO (invalid‐action masking)

See inline docstrings for hyperparameters and architecture details.
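If the agents are built on Stable-Baselines3, invalid-action masking is typically implemented with sb3-contrib's MaskablePPO. A minimal sketch (BlockBlastEnv and is_valid_placement are assumed names; the repository's agent code may differ):

    import numpy as np
    from sb3_contrib import MaskablePPO
    from sb3_contrib.common.wrappers import ActionMasker
    from blockblast_game.game_env import BlockBlastEnv  # assumed class name

    def mask_fn(env):
        # Boolean vector over all 192 actions; True marks a legal placement.
        # is_valid_placement() is a hypothetical helper on the env.
        return np.array([env.is_valid_placement(a) for a in range(192)],
                        dtype=bool)

    env = ActionMasker(BlockBlastEnv(), mask_fn)
    model = MaskablePPO("MlpPolicy", env, verbose=1)
    model.learn(total_timesteps=500_000)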


🤝 Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. I welcome improvements, bug fixes, new features, and documentation enhancements! Feel free to start a discussion by opening an issue if you have questions or want to propose larger changes.

How to Contribute

  1. Fork the repository to your own GitHub account.

  2. Clone your fork locally:

    git clone https://github.com/YourUserName/BlockBlast-Game-AI-Agent.git
    cd BlockBlast-Game-AI-Agent
  3. Create a new branch for your work with a descriptive name (feature, bugfix, etc.):

    git checkout -b feature/YourFeatureName
  4. Make your changes in your branch.

  5. Commit with clear, descriptive messages:

    • Use imperative mood (e.g., “Add”, “Fix”, “Update”).

    Commit message style examples:

    feat(game): add action masking for PPO agent
    fix(renderer): correct sprite alignment in preview mode
    docs(readme): update contributing section with commit guidelines
  6. Push your branch to GitHub:

    git push origin feature/YourFeatureName
  7. Open a Pull Request against the main branch of this repository. In your PR description, include:

    • A summary of your changes.
    • Any related issue numbers.
    • Screenshots or GIFs when relevant (before-and-after visuals help reviewers).

We appreciate every contribution—big or small. Thank you for helping make Block Blast even better!


📊 Results & Analysis

  • DQN and PPO: both struggle with the game's large action space (192 discrete actions); I have not been able to get either agent to learn, on its own, not to place pieces in invalid positions.
  • Action masking solves this: masked PPO avoids invalid moves entirely and earns higher average rewards than a baseline that plays randomly among the valid moves.

🔭 Future Work


🙏 Credits

  • Game Assets: Kefrov’s Blast
  • Everything else, including game logic, agent training, and analysis, is original work.

📄 License

This project is licensed under the MIT License.
