Notice: This resource is provided by a third-party author. Review the code manually or with AI tools before use to ensure security and compatibility.
Python · FunAudioLLM/SenseVoice

SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

81.8/100
8.5K · Forks: 778

Similar Projects

FunASR

91

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Python · 17.6K

transformers

98

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python · 161.5K

peft

91

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python · 21.3K

FunClip

80

Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.

Python · 5.8K