-
Notifications
You must be signed in to change notification settings - Fork 25
Issues
is:issue state:open
is:issue state:open
Issue creation is restricted in this repository
Search results
- Status: Open.#236 In TIGER-AI-Lab/ClawBench;
- Status: Open.#222 In TIGER-AI-Lab/ClawBench;
- Status: Open.#219 In TIGER-AI-Lab/ClawBench;
- Status: Open.#217 In TIGER-AI-Lab/ClawBench;
publicity: ClawBench launch story / data points pitched to LMSYS, Anthropic, OpenAI, Google evaluator teams
enhancementNew feature or requestNew feature or requestStatus: Open.#191 In TIGER-AI-Lab/ClawBench;publicity: submit ClawBench results to Papers With Code (web-agent / browser-agent category)
enhancementNew feature or requestNew feature or requestStatus: Open.#192 In TIGER-AI-Lab/ClawBench;publicity: @clawbench_live X/Twitter bot — auto-posts when a new model lands on the leaderboard
enhancementNew feature or requestNew feature or requestStatus: Open.#193 In TIGER-AI-Lab/ClawBench;feat: adapter — run OSWorld (OS-level agent eval, macOS/Linux/Windows) under the ClawBench harness
enhancementNew feature or requestNew feature or requestStatus: Open.#187 In TIGER-AI-Lab/ClawBench;feat: adapter — run AssistantBench (realistic time-consuming live-web tasks) under the ClawBench harness
enhancementNew feature or requestNew feature or requestStatus: Open.#188 In TIGER-AI-Lab/ClawBench;feat: adapter — run Online-Mind2Web (live-web evolution of Mind2Web) under the ClawBench harness
enhancementNew feature or requestNew feature or requestStatus: Open.#189 In TIGER-AI-Lab/ClawBench;feat: adapter — run WebVoyager (live-website, screenshot+LLM-judge browser agent) under the ClawBench harness
enhancementNew feature or requestNew feature or requestStatus: Open.#190 In TIGER-AI-Lab/ClawBench;publicity: embeddable leaderboard widget — iframe at /embed/leaderboard for blogs + benchmarks aggregators
enhancementNew feature or requestNew feature or requestStatus: Open.#186 In TIGER-AI-Lab/ClawBench;