Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonAIDC-AI/Ovis

Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

52.4/100
1.5KForks: 84
View on GitHubHomepage →
Loading report...

Similar Projects

MaxKB

90

🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。

Python21.2K

MobileAgent

64

Mobile-Agent: The Powerful GUI Agent Family

Python8.8K

VLM-R1

59

Solve Visual Understanding with Reinforced VLMs

Python6.0K

pixeltable

87

Declarative and Incremental Backend for Multimodal AI Applications

Python1.6K
Back to List