Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonbytedance/Sa2VA

Sa2VA

Official Repo For Pixel-LLM Codebase

70.0/100
1.6KForks: 112
View on GitHub
Loading report...

Similar Projects

cambrian

51

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python2.0K

JarvisArt

64

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python772

4KAgent

56

[NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any image to perfect-4K!

Python753

vllm-mlx

60

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

Python531
Back to List