Text to Speech Item

By crabboss 2024 年 6 月 24 日

此处我们总结了市面上常见的文本转语音的项目，一起来探索一下吧。

1 ChatTTS

Github: https://github.com/2noise/ChatTTS

2 MandarinTTS

Github: https://github.com/ranchlai/mandarin-tts

This is a modularized Text-to-speech framework aiming to support fast research and product developments. Main features include

all modules are configurable via yaml,
speaker embedding / prosody embeding/ multi-stream text embedding are supported and configurable,
various vocoders (VocGAN, hifi-GAN, waveglow, melGAN) are supported by adapter so that comparison across different vocoders can be done easily,
durations/pitch/energy variance predictor are supported, and other variances can be added easily,
and more on the road-map.

3 Chinese-FastSpeech2

Github: https://github.com/Executedone/Chinese-FastSpeech2

基于标贝中文标准女声数据继续训练，同时对原论文的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏.

By crabboss

AI排行榜

crabboss 2024 年 6 月 24 日

目前能用的国内docker加速源

crabboss 2024 年 6 月 15 日

ACM模式处理输入输出

crabboss 2024 年 6 月 1 日

大模型大模型基础

大模型分布式入门

大模型大模型量化

大模型量化入门

大模型大模型基础

优化器的进化之旅

大模型大模型基础

FlashAttention – 原理解析