Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
2026.3.31 claude code 意外把包含源码的文件上传到 npm 仓库,版本号是 2.1.88,其中 cli.js.map 文件有 57MB 的体积,claude code 的源码在该文件的 sourcesContent 字段里面,解压还原后有 70w 行代码
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package. ComfyUI-Yolain-Workflows 一份非常全面的 ComfyUI 工作流合集,由 @yolain 整理并开源分享,包含文生图、图生图、背景去除、重绘/扩图、换脸、透明图层生成、重绘光