⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

PythonMMMU-Benchmark/MMMU

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

68.5/100

★ 548Forks: 49

View on GitHub →Homepage →

Loading report...

Similar Projects

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python★ 13.2K

chronos-forecasting

Chronos: Pretrained Models for Time Series Forecasting

Python★ 4.9K

prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Python★ 3.0K

mlx-tune

Bringing the Unsloth experience to Mac users via Apple's MLX framework

Python★ 720

← Back to List