Agent-readable docs index: /llms.txt. Full docs in one file: /llms-full.txt. Download /docs.zip to grep all markdown files locally.

Speech & Audio

egaki handles text-to-speech, voice cloning, and audio stem separation.

Text-to-speech

egaki speech "Hello, this is a test." -o hello.mp3
Specify a model and voice:
egaki speech "Welcome to the future of video." \ --model sonic-3.5 \ --voice <voice-id> \ -o narration.mp3

Read from stdin

Pipe text from a file:
cat script.txt | egaki speech --stdin -o narration.mp3

Speed control

Cartesia models support --speed from 0.6 to 1.5:
egaki speech "Speak faster." --speed 1.3 -o fast.mp3

Available providers

ProviderModelsNotes
OpenAItts-1, tts-1-hdStandard quality
Cartesiasonic-3.5, sonic-3Best quality, speed control
ElevenLabseleven_v3, eleven_multilingual_v2, eleven_flash_v2_5Multilingual

Voice cloning

Clone a voice from an audio clip and get a reusable voice ID:
egaki voice clone recording.mp3 --name "my-voice" --json

Workflow: clone from a song

Separate vocals first, then clone:
# 1. Separate vocals egaki demucs song.mp3 --stems vocals -o stems/ # 2. Clone the isolated voice egaki voice clone stems/song-vocals.mp3 --name "singer" --json # 3. Generate speech with the cloned voice egaki speech "Your text here." --voice <voice-id> -m sonic-3.5 -o output.mp3

Providers

  • Cartesia (default): instant cloning, up to 10s of audio, free
  • ElevenLabs: longer clips, --remove-background-noise option
egaki voice clone noisy-audio.mp3 \ --provider elevenlabs \ --name "clean-voice" \ --remove-background-noise \ --json

Audio stem separation (demucs)

Separate a song into individual stems using fal.ai's Demucs model:
egaki demucs song.mp3 --stems vocals,other -o stems/

Available stems

vocals, drums, bass, other, guitar, piano

Models

ModelStemsBest for
htdemucsvocals, drums, bass, otherGeneral purpose
htdemucs_ftvocals, drums, bass, otherFine-tuned, higher quality
htdemucs_6s (default)vocals, drums, bass, other, guitar, piano6-stem separation
egaki demucs song.mp3 --model htdemucs_6s --stems vocals,guitar -o stems/