Speech to Text Prototype
To run
make {/path/to/whisper.cpp/server}
Update routes.go
- replace whisperServer with the path to the server
api/generate
Request fields
- speech (required):
  - audio (required): path to audio file
  - model (required): path to whisper model
  - transcribe (optional): if true, transcribes the audio file and returns the transcription
  - keep_alive (optional): sets how long the model is stored in memory (default: 5m)
- prompt (optional): if not null, passed in with the transcribed audio
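As a rough illustration of the field rules above, a client could check a speech object before sending it. This is a minimal sketch only: the check_speech helper and its messages are hypothetical, and the server may validate requests differently.

```python
from typing import Any, Dict, List

# Keys of the `speech` object for api/generate, taken from the field list above.
REQUIRED_KEYS = ("audio", "model")
OPTIONAL_KEYS = ("transcribe", "keep_alive")


def check_speech(speech: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in a `speech` object (empty if none).

    Hypothetical client-side helper; not part of the prototype itself.
    """
    problems = []
    # Required keys must be present and non-empty.
    for key in REQUIRED_KEYS:
        if not speech.get(key):
            problems.append(f"speech.{key} is required")
    # `transcribe` is described as a true/false flag.
    if "transcribe" in speech and not isinstance(speech["transcribe"], bool):
        problems.append("speech.transcribe must be a boolean")
    # Flag keys that the field list does not mention.
    unknown = sorted(set(speech) - set(REQUIRED_KEYS) - set(OPTIONAL_KEYS))
    for key in unknown:
        problems.append(f"speech.{key} is not a documented field")
    return problems
```

An empty list means the object matches the documented fields; it says nothing about whether the paths exist on the machine running the server.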
Transcription
curl http://localhost:11434/api/generate -d '{
"speech": {
"model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
"audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
"transcribe": true,
"keep_alive": "1m"
},
"stream": false
}' | jq
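The same transcription request can be sent from Python using only the standard library. This is a sketch under assumptions: the function names are hypothetical, the paths are placeholders, and the response is returned as parsed JSON without assuming any particular field names, since the prototype's response shape is not documented here.

```python
import json
import urllib.request

# Same endpoint as the curl example above.
GENERATE_URL = "http://localhost:11434/api/generate"


def build_transcription_request(whisper_model, audio, keep_alive="1m", url=GENERATE_URL):
    """Build (but do not send) a POST request asking only for a transcription."""
    payload = {
        "speech": {
            "model": whisper_model,   # path to the whisper model file
            "audio": audio,           # path to the audio file
            "transcribe": True,       # return the transcription itself
            "keep_alive": keep_alive,
        },
        "stream": False,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def transcribe(whisper_model, audio):
    """Send the request and return the parsed JSON response as-is."""
    req = build_transcription_request(whisper_model, audio)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Placeholder paths; substitute the model and audio files on your machine.
    print(transcribe("/path/to/ggml-base.en.bin", "/path/to/jfk.wav"))
```

Building the request separately from sending it makes the payload easy to inspect without a running server.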
Response Generation
curl http://localhost:11434/api/generate -d '{
"model": "llama3",
"prompt": "What do you think about this quote?",
"speech": {
"model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
"audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
"keep_alive": "1m"
},
"stream": false
}' | jq
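The two api/generate examples differ only in whether a language model and prompt are supplied or transcribe is set. A hedged sketch of a single payload builder covering both cases follows; the function name and its defaults are hypothetical and simply mirror the curl examples.

```python
import json


def speech_generate_payload(whisper_model, audio, llm_model=None, prompt=None, keep_alive="1m"):
    """Build an api/generate payload for either mode shown above.

    With no llm_model the payload asks for a transcription only;
    with an llm_model the transcribed audio is passed on with the prompt.
    """
    speech = {"model": whisper_model, "audio": audio, "keep_alive": keep_alive}
    payload = {"speech": speech, "stream": False}
    if llm_model is None:
        # Transcription mode: no language model involved.
        speech["transcribe"] = True
    else:
        # Response generation mode: top-level model and optional prompt.
        payload["model"] = llm_model
        if prompt is not None:
            payload["prompt"] = prompt
    return payload


if __name__ == "__main__":
    # Placeholder paths; the prompt is the one from the curl example.
    body = speech_generate_payload(
        "/path/to/ggml-base.en.bin",
        "/path/to/jfk.wav",
        llm_model="llama3",
        prompt="What do you think about this quote?",
    )
    print(json.dumps(body, indent=2))
```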
api/chat
Request fields
- model (required): language model to chat with
- speech (required):
  - model (required): path to whisper model
  - keep_alive (optional): sets how long the model is stored in memory (default: 5m)
- messages/message/audio (required): path to audio file
curl http://localhost:11434/api/chat -d '{
"model": "llama3",
"speech": {
"model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
"keep_alive": "10m"
},
"messages": [
{
"role": "system",
"content": "You are a Canadian Nationalist"
},
{
"role": "user",
"content": "What do you think about this quote?",
"audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav"
}
],
"stream": false
}' | jq
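For api/chat the audio path is attached to an individual message rather than to the speech object. The sketch below builds such a payload in Python; the helper names are hypothetical, the paths are placeholders, and the structure simply mirrors the curl example above.

```python
import json


def audio_message(audio, content="", role="user"):
    """Return a chat message that carries a path to an audio file."""
    return {"role": role, "content": content, "audio": audio}


def chat_payload(llm_model, whisper_model, messages, keep_alive="10m"):
    """Build an api/chat payload; the whisper model lives under `speech`."""
    return {
        "model": llm_model,
        "speech": {"model": whisper_model, "keep_alive": keep_alive},
        "messages": list(messages),  # copy so the caller's list is not shared
        "stream": False,
    }


if __name__ == "__main__":
    body = chat_payload(
        "llama3",
        "/path/to/ggml-base.en.bin",  # placeholder path
        [
            audio_message(
                "/path/to/jfk.wav",  # placeholder path
                content="What do you think about this quote?",
            )
        ],
    )
    print(json.dumps(body, indent=2))
```

Keeping the audio on the message means different turns of a conversation can each reference their own recording.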