jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

PythonJupyter Notebooktext-to-speechdeep-learningpytorchttsspeech-synthesis
This is stars and forks stats for /jaywalnut310/vits repository. As of 02 May, 2024 this repository has 5249 stars and 1053 forks.

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech Jaehyeon Kim, Jungil Kong, and Juhee Son In our recent paper, we propose VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech. Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems. In this work, we present a parallel...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
rebuild-123/Python-Head-First-Design-PatternsPython2870350
Zulko/moviepyPythonDockerfile11k01.4k0
Rdmo1/swift-multi-ToolBatchfilePython22701260
martriay/cairo-workshopPythonCairo23030
Poeschl/Hassio-AddonsDockerfileShellPython2520870
fortran-lang/fpm-docsFortranCSSPython310180
tdpetrou/Machine-Learning-Books-With-PythonJupyter Notebook92006110
jleen/lexiconPerlPythonShell1000
erg-lang/ergRustPythonOther2.3k0560
rhasspy/rhasspyShellM4Makefile2.1k01680