Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonMMMU-Benchmark/MMMU

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

68.5/100
548Forks: 49
View on GitHubHomepage →
Loading report...

Similar Projects

litgpt

88

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python13.2K

chronos-forecasting

84

Chronos: Pretrained Models for Time Series Forecasting

Python4.9K

prompttools

79

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Python3.0K

mlx-tune

75

Bringing the Unsloth experience to Mac users via Apple's MLX framework

Python720
Back to List