Skip to content
#

multi-agent-rl

Here are 21 public repositories matching this topic...

A deep reinforcement learning system for optimizing bridge maintenance decisions across municipal infrastructure fleets, implementing cross-subsidy budget sharing and cooperative multi-agent learning.

  • Updated Dec 5, 2025
  • Python

Dark Zero Point Genesis: PPO Latent World Models Under Thermodynamic Scarcity 256 agents. 128D Latent Manifolds. Zero supervision. Agents utilize PPO-clipped surrogate objectives. Survival = Predictive Error Coding (PEC) × Energy Efficiency across a 50/15 Seasonal Cycle.

  • Updated Feb 26, 2026
  • Python

Deterministic hex-grid soccer environment with two adversarial agents. Implements Q-Learning, Minimax-Q (via LP), and Belief-Q with online belief updates; trains in SE2G/SE6G to reduce state space and evaluates behaviors in the full environment with comprehensive visualizations.

  • Updated Sep 28, 2024
  • Python

🎯 Key Features 1. Flexible Game Configuration Adjustable grid size (3×3 up to 10×10) Customizable win condition (e.g., 5-in-a-row on a 7×7 board) 2. Two Competing RL Agents Agent 1 (Blue X) vs Agent 2 (Red O) Each has independent Q-Learning parameters Watch them evolve different strategies over time

  • Updated Dec 13, 2025
  • Python

Hexapawn Game Engine Proper 3×3 board with pawn movement Strategic RL Agents Minimax with Alpha-Beta Pruning (depth configurable 1-7) Q-Learning with temporal difference updates Experience replay for efficient learning Epsilon-greedy exploration with decay Multi-level decision hierarchy (immediate threats → strategic planning)

  • Updated Dec 14, 2025
  • Python

Research-grade Reinforcement Learning framework for single-agent and multi-agent warehouse navigation using Deep Q-Networks (DQN), PyTorch, replay buffer, target networks, logging, and full test suite. Built for PhD-level RL and autonomous systems research.

  • Updated Dec 11, 2025
  • Python

Multi-Equipment CBM system using QR-DQN with advanced probability distribution analysis. Coordinated maintenance decision-making for 4 industrial equipment units with realistic anomaly rates (1.9-2.2%), comprehensive risk analysis (VaR/CVaR), and 51-quantile distribution visualization.

  • Updated Dec 21, 2025
  • Python

Pure RL Agents: I implemented Q-Learning agents that learn through Self-Play. They play against each other to get smarter without human help! Symmetry Optimization: To make them "genius" faster, I added logic so they understand that a board mirrored left-to-right is the same situation. This cuts the learning time in half!

  • Updated Dec 13, 2025
  • Python

A specialized Reinforcement Learning (RL) project focused on multi-task mastery across 10 distinct gaming environments. General-Gamer-AI-Lite implements a lightweight multi-task agent designed to learn shared representations and transfer knowledge between varied game mechanics, from classic arcade challenges to strategic grid worlds.

  • Updated Jan 26, 2026
  • Python

Multi-Equipment CBM (Condition-Based Maintenance) optimization using Deep Q-Learning with cost leveling and scenario comparison. Advanced RL system with QR-DQN, N-step learning, and parallel environments for HVAC equipment predictive maintenance.

  • Updated Dec 25, 2025
  • Python

Improve this page

Add a description, image, and links to the multi-agent-rl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-agent-rl topic, visit your repo's landing page and select "manage topics."

Learn more