API Endpoint

Leaderboard

Loading leaderboard...

README

Poker

Description

Poker is an environment for evaluating agents on strategic decision-making in Texas Hold'em Poker, testing betting strategies, bluffing, and probabilistic reasoning. This environment wraps the Poker implementation from TextArena, a framework for text-based game environments.

Capabilities

Testing probabilistic reasoning and hand strength evaluation
Evaluating strategic betting, bluffing, and risk management
Assessing opponent modeling and behavioral adaptation
Testing multi-player game dynamics (2, 3, or 4 players)

Compute Requirements

Poker does not require a sandbox. It has minimal compute requirements.

License

MIT.

Tasks

There are two splits: train (240 tasks) and test (240 tasks). Each split contains 20 tasks across each of 4 variants, with each variant tested at 3 player counts (2, 3, 4 players):

Poker-v0-small
Poker-v0
Poker-v0-long
Poker-v0-extreme

Each task is seeded for reproducibility.

Reward Structure

This is a sparse reward environment. Rewards are mapped from TextArena's native range of {-1, 0, 1} to {0.0, 0.5, 1.0} via (raw + 1) / 2.

We do not use LLM graders for this environment; reward is determined programmatically.

Data

Game state is generated procedurally by the TextArena engine using seeded randomness. No external data files are required.

Tools

Agents are given five tools:

fold(): Fold your hand and give up the current pot.
call(): Call (match) the current bet.
check(): Check (pass without betting). Only valid when no bet is active.
bet(amount): Open betting with the specified chip amount.
raise_bet(amount): Raise the current bet by the specified chip amount.

Time Horizon

Poker is a multi-turn environment.

Environment Difficulty

Hard. Texas Hold'em requires sophisticated probabilistic reasoning, opponent modeling, strategic betting, and bluffing. Success depends on balancing aggression with risk management across multiple betting rounds.

Other Environment Requirements

This environment requires an OpenAI API key (passed via secrets) to power the LLM opponents.

Safety

Agents in Poker interact only with a card game simulation and have no access to external systems, the internet, or sensitive data. The environment does not present safety risks.

Citations

@software{textarena2024,
  author    = {Guertler, Leon and Banting, Wilfried and Pignatelli, Eduardo},
  title     = {TextArena},
  year      = {2024},
  publisher = {GitHub},
  url       = {https://github.com/LeonGuertler/TextArena}
}

Repository

Source repository

EnvCommons/poker

Clone Repository

Tools

Tools available in the environment

No tools available for this environment, it probably hasn't been indexed yet.

Compute Configuration

Resource allocation for this environment.

Component	Configuration
Environment Server	1 vCPU / 4 GB RAM
Sandbox Machine	Not configured

Estimated Cost

Pay per second of active session usage. Billing starts when your session begins and stops when it ends.

Component	Cost / second
Environment	$0.0000320
Sandbox	Not configured
Total	$0.0000320

Examples

5-minute session$0.0096

1-hour session$0.1152

Poker

GeneralReasoning/Poker

Poker

Description

Capabilities

Compute Requirements

License

Tasks

Reward Structure

Data

Tools

Time Horizon

Environment Difficulty

Other Environment Requirements

Safety

Citations

Repository

Clone Repository

Tools

Compute Configuration

Estimated Cost

Examples