Installation
Requirements
- GPU Memory: 12GB (Inference)
- System: Linux, WSL
Setup
First you need install pyaudio and sox, which is used for audio processing.
Conda
UV
Intel Arc XPU support
conda create -n fish-speech python=3.12
conda activate fish-speech
conda install libstdcxx -c conda-forge
pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/xpu
pip install -e .
Warning
The compile
option is not supported on windows and macOS, if you want to run with compile, you need to install trition by yourself.