Gaming · harness

GamingAgent (lmgame-Bench)

A framework for building LLM/VLM gaming agents, together with lmgame-Bench, a benchmark that evaluates models by having them actually play games such as Sokoban and Mario.

Connects to: OpenAI, Anthropic, Gemini · Python · MIT · 940★

Use it with an AI agent

Loadbay is an MCP server, so an agent can search the catalog and find this harness. For example, to register it with Claude Code:

claude mcp add --transport http loadbay https://loadbay.xyz/api/mcp