parallel-wavegan | Parallel WaveGAN implementation | hifigan, melgan, neural-vocoder, parallel-wavenet, pytorch, realtime, speech-synthesis, style-melgan, text-to-speech, tts, vocoder, wavenet
piper-tts | A fast, local neural text to speech system that sounds great and is optimized for the Raspberry Pi 4. | rhasspy, piper, tts, speech-synthesis, text-to-speech
paddlespeech | Speech tools and models based on Paddlepaddle | SSL, speech, asr, tts, speaker verfication, speech classfication, text frontend, MFA, paddlepaddle, paddleaudio, streaming, beamsearch, ctcdecoder, deepspeech2, wav2vec2, hubert, wavlm, transformer, conformer, fastspeech2, hifigan, gan vocoders, code-switch, kws, punctuation-restoration, self-supervised-learning, sound-classification, speech-alignment, speech-recognition, speech-synthesis, speech-translation, streaming-asr, streaming-tts, vocoder, voice-cloning, voice-recognition, whisper
chattts | A generative speech model for daily dialogue | agent, chat, chatgpt, chattts, chinese, chinese-language, english, english-language, gpt, llm, llm-agent, natural-language-inference, python, text-to-speech, torch, torchaudio, tts
paddleaudio | Speech audio tools based on Paddlepaddle | audio process, paddlepaddle, asr, code-switch, conformer, kws, punctuation-restoration, self-supervised-learning, sound-classification, speech-alignment, speech-recognition, speech-synthesis, speech-translation, streaming-asr, streaming-tts, transformer, tts, vocoder, voice-cloning, voice-recognition, wav2vec2, whisper
nemo-toolkit | NeMo - a toolkit for Conversational AI | NLP, NeMo, deep, gpu, language, learning, machine, nvidia, pytorch, speech, torch, tts, asr, deeplearning, generative-ai, large-language-models, machine-translation, multimodal, neural-networks, speaker-diariazation, speaker-recognition, speech-synthesis, speech-translation
TTS | Deep learning for Text to Speech by Coqui. | deep-learning, glow-tts, hifigan, melgan, multi-speaker-tts, python, pytorch, speaker-encoder, speaker-encodings, speech, speech-synthesis, tacotron, text-to-speech, tts, tts-model, vocoder, voice-cloning, voice-conversion, voice-synthesis
pygtrans | Google Translate, support APIKEY | pygtrans, google, translate, apikey, text, html, cn, com, cloud, python, python-library, translation, tts