research
Slay the Spire Watcher Solver
Jan 2026 - Apr 2026WIP
Reinforcement learning agent targeting 100% win rate on Slay the Spire Watcher at Ascension 20 (top players achieve ~90%).
An RL agent for Slay the Spire with an ambitious goal: beat every single seed at the highest difficulty (Ascension 20). Top human players achieve ~90% win rates, but no unlosable seeds have been proven - the question is whether ML can find a path to 100%.
Thesis
The game has no proven unlosable seeds. If a model can learn to always win, it either proves perfect play exists or the model found strategies humans haven't discovered.
Technical Approach
Strategic Features
Engineering high-level strategic features rather than raw card data:
- Kill probability calculations per enemy
- Stance-specific mechanics (Wrath doubles damage)
- Block efficiency and damage predictions
Learning Architecture
- Behavioral cloning on official game run data
- Neural network for decision evaluation
- Modular system for training and inference
Status
Currently paused - will resume when time permits.
PythonPyTorchRLBehavioral Cloning