The Fine-Tuning Index / PEFT & LoRA / #72
zhao-kun/VibeVoiceFusion
by zhao-kun · PEFT & LoRA · updated 3mo ago
VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA fine-tuning, batch generation, and VRAM optimization. Based on Microsoft's VibeVoice (AR + diffusion architecture)
43
momentum
480
stars
61
forks
#72
rank
aigcautoregressive-modelsfine-tuninglanguage-modelloraspeech-synthesisttstts-enginesttsuitevibevoicevramsavingweb
View on GitHub →