Fun-ASR-Nano Web Demo. The original code comes from https://www.modelscope.cn/studios/FunAudioLLM/Fun-ASR-Nano.git
This project includes install and run scripts for open-source TTS projects.
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters. This version adds MPS support (Apple Silicon M-series Macs) to the original project and provides a visual interface built with Gradio.
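GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. This version only adds MPS support (Apple Silicon M-series Macs) to the original project and changes the torch version; all other functionality is unchanged.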
IndexTTS2 is an Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System. We do not change the code; we only remove the .gitattributes file so that the repository can be cloned quickly.
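SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text. This version only adds CPU and MPS support (Apple Silicon M-series Macs) to the original project and adds a progress bar for the generation process; all other functionality is unchanged.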
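Several of the projects above mention added MPS and CPU support. The actual patches are not shown here, so the following is only a minimal sketch of how such a device fallback is commonly written in PyTorch. The helper names `pick_device` and `detect_device` are hypothetical and do not come from any of these repositories, and the CUDA, then MPS, then CPU preference order is an assumption.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Return a device string, preferring CUDA, then MPS, then CPU.

    Hypothetical helper: the preference order is an assumption,
    not something taken from the projects listed above.
    """
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def detect_device() -> str:
    """Ask PyTorch which backends are usable; fall back to CPU if torch is missing."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # Older torch builds may not expose torch.backends.mps at all.
    mps_backend = getattr(torch.backends, "mps", None)
    mps_ok = mps_backend is not None and mps_backend.is_available()
    return pick_device(torch.cuda.is_available(), mps_ok)


if __name__ == "__main__":
    print(f"Selected device: {detect_device()}")
```

A model would then typically be moved to the chosen device with something like `model.to(detect_device())`. Keeping the decision logic in `pick_device` separate from the backend queries makes the fallback order easy to check without a GPU.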