🔊 UniFlow-Audio Inference Demo

Multi-task Audio Generation System based on UniFlow-Audio

Note: For TTS, due to the restriction of HuggingFace Space, the g2p phonemizer used here is inconsistant with the one used during training, so there may be problems. Please refer to INFERENCE_CLI.md for CLI calling guidance.
Model Name
1 10
1 100
Examples
Audio Caption Guidance Scale Sampling Steps

📝 Notes

  • Model Name: Choose from UniFlow-Audio-large, UniFlow-Audio-medium, or UniFlow-Audio-small
  • Guidance Scale: Controls the guidance strength of the input condition on the output
  • Sampling Steps: Number of flow matching sampling steps

💡 Tip: Models will be automatically downloaded on first run, please be patient