GitHub / polymathbenchmark / polymathbenchmark.github.io
A Challenging Multi-Modal Mathematical Reasoning Benchmark
Stars: 0
Forks: 0
Open issues: 0
License: mit
Language: JavaScript
Size: 2.01 MB
Dependencies parsed at: Pending
Created at: 8 months ago
Updated at: about 1 month ago
Pushed at: about 1 month ago
Last synced at: about 1 month ago
Topics: benchmark, claude-3-5-sonnet, gemini-vision-pro, gpt-4, llama3, llm, multimodal, openai-o1, qwen2-vl, vision
Loading...