⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

PythonFunAudioLLM/SenseVoice

SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

81.8/100

★ 8.5KForks: 778

View on GitHub →Homepage →

Loading report...

Similar Projects

FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Python★ 17.6K

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python★ 161.5K

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python★ 21.3K

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.

Python★ 5.8K

← Back to List