Highlights
- Pro
Stars
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
⚡ Flashbax: Accelerated Replay Buffers in JAX
Play, learn, solve, and analyze No-Limit Texas Hold Em. Implementation follows from Monte Carlo counter-factual regret minimization over with hierarchical K-means imperfect recall abstractions.
JAxtar is a project with a JAX-native implementation of parallelizeable A* & Q* solver for neural heuristic search research.
Repository with QPSICE models dedicated to Power Electronics
Fast and modular sklearn replacement for generalized linear models
Egiob / jumanji
Forked from instadeepai/jumanji🕹️ A suite of diverse and challenging RL environments in JAX
Egiob / mctx
Forked from google-deepmind/mctxMonte Carlo tree search in JAX
Egiob / open_spiel
Forked from google-deepmind/open_spielOpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Egiob / pgx
Forked from sotetsuk/pgx🎲 Vectorized RL game environments in JAX
A Jax-based library for designing and training small transformers.
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Bringing Old Photo Back to Life (CVPR 2020 oral)
ALIEN is a CUDA-powered artificial life simulation program.
🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Vectorization techniques for fast population-based training.
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Book Paris tennis court (include CAPTCHA bypass)
Accelerated Quality-Diversity
instadeepai / QDax
Forked from adaptive-intelligent-robotics/QDaxAccelerated Quality-Diversity
victorcampos7 / edl
Forked from salesforce/sibling-rivalryCode for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
pytorch implement for the paper Few-Shot Adversarial Domain Adaptation