Fish Speech Howto
Fish Speech is an open-source, multilingual text-to-speech model capable of generating high-quality, natural-sounding speech in Chinese, Japanese, and English with customizable voices and emotions.
View MoreHow to Use Fish Speech
Install dependencies: Install required packages by running: pip3 install torch torchvision torchaudio
Create virtual environment: Create a Python 3.10 virtual environment using conda: conda create -n fish-speech python=3.10
Activate environment: Activate the virtual environment: conda activate fish-speech
Install Fish Speech: Install Fish Speech by running: pip3 install -e .
Download models: Download required models from Hugging Face: huggingface-cli download fishaudio/fish-speech-1.2-sft --local-dir checkpoints/fish-speech-1.2-sft
Run inference: Generate speech by running: python tools/llama/generate.py --text "Your text here" --checkpoint-path "checkpoints/fish-speech-1.2-sft"
Decode audio: Decode the generated tokens to audio using VQGAN: python tools/vqgan/inference.py -i "codes_0.npy" --checkpoint-path "checkpoints/fish-speech-1.2-sft/firefly-gan-vq-fsq-4x1024-42hz-generator.pth"
Start web UI (optional): Launch the web interface by running: python -m tools.webui --llama-checkpoint-path "checkpoints/fish-speech-1.2-sft" --decoder-checkpoint-path "checkpoints/fish-speech-1.2-sft/firefly-gan-vq-fsq-4x1024-42hz-generator.pth"
Fish Speech FAQs
Fish Speech is an open-source text-to-speech (TTS) model developed by Fish Audio. It is trained on 150,000 hours of multilingual audio data and can generate high-quality speech in Chinese, Japanese, and English.
Popular Articles
X Plans to Launch Free Version of AI Chatbot Grok to Compete with Industry Giants
Nov 12, 2024
Top AI Image Generators: Is Flux 1.1 Pro Ultra the Best Compared to Midjourney, Recraft V3, and Ideogram
Nov 12, 2024
HiWaifu AI Referral Codes in November 2024 and How to Redeem
Nov 12, 2024
Midjourney Promo Codes Free in November 2024 and How to redeem
Nov 12, 2024
View More