Requirements
- GPU Memory: 12GB (Inference)
- System: Linux, WSL
Setup
First, you need to install pyaudio and sox, which are used for audio processing.
apt install portaudio19-dev libsox-dev ffmpeg
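To confirm the system packages above actually landed, a small check like the following may help. It is only a sketch: `missing_pkgs` is a hypothetical helper name, not part of this project, and it assumes a Debian/Ubuntu system where `dpkg-query` is available.

```shell
# Hypothetical helper (not part of this project): print every package from the
# argument list that dpkg does not report as fully installed.
# Prints nothing when all packages are present.
missing_pkgs() {
  for pkg in "$@"; do
    dpkg-query -W -f='${Status}' "$pkg" 2>/dev/null \
      | grep -q "install ok installed" || echo "$pkg"
  done
}

# Check the three packages installed by the apt command above.
missing_pkgs portaudio19-dev libsox-dev ffmpeg
```

An empty result means all three packages are installed. Any name that is printed still needs to be installed.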
Conda
conda create -n fish-speech python=3.12
conda activate fish-speech
pip install -e .
UV
uv sync --python 3.12
Intel Arc XPU support
conda create -n fish-speech python=3.12
conda activate fish-speech
conda install libstdcxx -c conda-forge
pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/xpu
pip install -e .
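After the XPU install, it can be useful to check whether PyTorch sees the Intel device. The sketch below assumes the `fish-speech` environment is active and that `python` resolves to its interpreter. `xpu_status` is a hypothetical helper name, and `torch.xpu.is_available()` is only present in PyTorch builds that ship XPU support.

```shell
# Hypothetical helper: report whether the installed PyTorch build can see an
# Intel XPU device. Prints "xpu=True" or "xpu=False", or "unavailable" when
# python, torch, or torch.xpu cannot be used in the current environment.
xpu_status() {
  python -c 'import torch; print("xpu=" + str(torch.xpu.is_available()))' 2>/dev/null \
    || echo "unavailable"
}

xpu_status
```

A result of `xpu=False` or `unavailable` suggests that the nightly XPU wheel or the Intel driver stack is not set up as expected.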
!!! warning
The compile option is not supported on Windows and macOS. If you want to run with compile, you need to install Triton yourself.
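A launch script can guard the compile option with a platform check along these lines. This is a minimal sketch, not project code: `compile_supported` is a hypothetical helper. It treats only a `uname -s` value of `Linux` (which WSL also reports) as a platform where compile is expected to work without extra steps.

```shell
# Hypothetical helper: succeed only on platforms where the compile option is
# expected to work out of the box. Takes an optional platform name for
# testing; defaults to the output of `uname -s`.
compile_supported() {
  case "${1:-$(uname -s)}" in
    Linux*) return 0 ;;   # native Linux and WSL
    *)      return 1 ;;   # macOS (Darwin), Windows shells (MINGW, MSYS, CYGWIN), others
  esac
}

if compile_supported; then
  echo "compile: expected to work on this platform"
else
  echo "compile: not supported here unless you install Triton yourself"
fi
```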