Block or Report
Block or report Zealoter
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.
An elegant PyTorch deep reinforcement learning library.
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
Code for "Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent", IJCAI 2024 (Oral)
source code for AAMAS 2023 Imperfect-information Card Game Competition
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
🖥 Control your display's brightness & volume on your Mac as if it was a native Apple Display. Use Apple Keyboard keys or custom shortcuts. Shows the native macOS OSDs.
A python3 port of https://github.com/worldveil/deuces , a pure python poker hand evaluator
real Transformer TeraFLOPS on various GPUs
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Torch modules that wrap blackbox combinatorial solvers according to the method presented in "Differentiating Blackbox Combinatorial Solvers"
codes for the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning"
Code for the paper 'An Efficient Graph Convolutional Network Technique for the Travelling Salesman Problem' (INFORMS Annual Meeting Session 2019)
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解
2048 environment for Reinforcement Learning and DQN algorithm
Simple A3C implementation with pytorch + multiprocessing
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning