A real-world reinforcement learning environment where AI agents learn to manage email inboxes.
| Rank | Name | Task | Score | Time |
|---|
Label 10 emails as urgent, normal, or spam.
Label 20 emails AND decide: reply, archive, or delete.
30 emails including tricky phishing attempts. Label, act, summarize.
| Method | Endpoint | Description |
|---|---|---|
| POST | /reset | Start new episode |
| POST | /step | Submit answers โ get reward + feedback |
| GET | /state | Current environment state |
| GET | /tasks | List all tasks |
| GET | /health | Health check |
| GET | /stats | Environment statistics |
| GET | /leaderboard | Top scores |