mirror of
https://github.com/ollama/ollama.git
synced 2026-04-21 08:15:42 +02:00
new runner
This commit is contained in:
21
runner/README.md
Normal file
21
runner/README.md
Normal file
@@ -0,0 +1,21 @@
|
||||
# `runner`
|
||||
|
||||
> Note: this is a work in progress
|
||||
|
||||
A minimial runner for loading a model and running inference via a http web server.
|
||||
|
||||
```
|
||||
./runner -model <model binary>
|
||||
```
|
||||
|
||||
### Completion
|
||||
|
||||
```
|
||||
curl -X POST -H "Content-Type: application/json" -d '{"prompt": "hi"}' http://localhost:8080/completion
|
||||
```
|
||||
|
||||
### Embeddings
|
||||
|
||||
```
|
||||
curl -X POST -H "Content-Type: application/json" -d '{"prompt": "turn me into an embedding"}' http://localhost:8080/embedding
|
||||
```
|
||||
Reference in New Issue
Block a user