mirror of
https://github.com/ollama/ollama.git
synced 2026-04-18 06:54:09 +02:00
Compare commits
base: starred:pdevine/sampling-cache-error
starred:main
starred:jessegross/tokenize
starred:jessegross/compile
starred:jessegross/sampler
starred:hoyyeva/fix-launch-app-process-reap
starred:launch-copilot-cli
starred:hoyyeva/opencode-thinking
starred:origin/jessegross/closures
starred:brucemacd/launch-fetch-reccomended
starred:release_v0.20.7
starred:parth-auto-save-backup
starred:parth-test
starred:jessegross/batching
starred:jmorganca/gemma4-audio-replacements
starred:fix-manifest-digest-on-pull
starred:hoyyeva/vscode-improve
starred:brucemacd/install-server-wait
starred:brucemacd/download-before-remove
starred:parth/update-claude-docs
starred:parth-anthropic-reference-images-path
starred:brucemac/start-ap-install
starred:pdevine/mlx-update
starred:pdevine/qwen35_vision
starred:drifkin/api-show-fallback
starred:mintlify/image-generation-1773352582
starred:hoyyeva/server-context-length-local-config
starred:jmorganca/faster-reptition-penalties
starred:jmorganca/convert-nemotron
starred:parth-pi-thinking
starred:pdevine/sampling-penalties
starred:jmorganca/fix-create-quantization-memory
starred:dongchen/resumable_transfer_fix
starred:pdevine/sampling-cache-error
starred:jessegross/mlx-usage
starred:hoyyeva/openclaw-config
starred:hoyyeva/app-html
starred:pdevine/qwen3next
starred:brucemacd/sign-sh-install
starred:brucemacd/tui-update
starred:brucemacd/usage-api
starred:jmorganca/launch-empty
starred:fix-app-dist-embed
starred:mxyng/mlx-compile
starred:mxyng/mlx-quant
starred:mxyng/mlx-glm4.7
starred:mxyng/mlx
starred:brucemacd/simplify-model-picker
starred:jmorganca/qwen3-concurrent
starred:fix-glm-4.7-flash-mla-config
starred:drifkin/qwen3-coder-opening-tag
starred:brucemacd/usage-cli
starred:fix-cuda12-fattn-shmem
starred:ollama-imagegen-docs
starred:parth/fix-multiline-inputs
starred:brucemacd/config-docs
starred:mxyng/model-files
starred:mxyng/simple-execute
starred:fix-imagegen-ollama-models
starred:mxyng/async-upload
starred:jmorganca/lazy-no-dtype-changes
starred:imagegen-auto-detect-create
starred:parth/decrease-concurrent-download-hf
starred:fix-mlx-quantize-init
starred:jmorganca/x-cleanup
starred:usage
starred:imagegen-readme
starred:jmorganca/glm-image
starred:mlx-gpu-cd
starred:jmorganca/imagegen-modelfile
starred:parth/agent-skills
starred:parth/agent-allowlist
starred:parth/signed-in-offline
starred:parth/agents
starred:parth/fix-context-chopping
starred:improve-cloud-flow
starred:parth/add-models-websearch
starred:parth/prompt-renderer-mcp
starred:jmorganca/native-settings
starred:jmorganca/download-stream-hash
starred:jmorganca/client2-rebased
starred:brucemacd/oai-chat-req-multipart
starred:jessegross/multi_chunk_reserve
starred:grace/additional-omit-empty
starred:grace/mistral-3-large
starred:mxyng/tokenizer2
starred:mxyng/tokenizer
starred:jessegross/flash
starred:hoyyeva/windows-nacked-app
starred:mxyng/cleanup-attention
starred:grace/deepseek-parser
starred:hoyyeva/remember-unsent-prompt
starred:parth/add-lfs-pointer-error-conversion
starred:parth/olmo2-test2
starred:hoyyeva/ollama-launchagent-plist
starred:nicole/olmo-model
starred:parth/olmo-test
starred:mxyng/remove-embedded
starred:parth/render-template
starred:jmorganca/intellect-3
starred:parth/remove-prealloc-linter
starred:jmorganca/cmd-eval
starred:nicole/nomic-embed-text-fix
starred:mxyng/lint-2
starred:hoyyeva/add-gemini-3-pro-preview
starred:hoyyeva/load-model-list
starred:mxyng/expand-path
starred:mxyng/environ-2
starred:hoyyeva/deeplink-json-encoding
starred:parth/improve-tool-calling-tests
starred:hoyyeva/conversation
starred:hoyyeva/assistant-edit-response
starred:hoyyeva/thinking
starred:origin/brucemacd/invalid-char-i-err
starred:parth/improve-tool-calling
starred:jmorganca/required-omitempty
starred:grace/qwen3-vl-tests
starred:mxyng/iter-client
starred:parth/docs-readme
starred:nicole/embed-test
starred:pdevine/integration-benchstat
starred:parth/remove-generate-cmd
starred:parth/add-toolcall-id
starred:mxyng/server-tests
starred:jmorganca/glm-4.6
starred:jmorganca/gin-h-compat
starred:drifkin/stable-tool-args
starred:pdevine/qwen3-more-thinking
starred:parth/add-websearch-client
starred:nicole/websearch_local
starred:jmorganca/qwen3-coder-updates
starred:grace/deepseek-v3-migration-tests
starred:mxyng/fix-create
starred:jmorganca/cloud-errors
starred:pdevine/parser-tidy
starred:revert-12233-parth/simplify-entrypoints-runner
starred:parth/enable-so-gpt-oss
starred:brucemacd/qwen3vl
starred:jmorganca/readme-simplify
starred:parth/gpt-oss-structured-outputs
starred:revert-12039-jmorganca/tools-braces
starred:mxyng/embeddings
starred:mxyng/gguf
starred:mxyng/benchmark
starred:mxyng/types-null
starred:parth/move-parsing
starred:mxyng/gemma2
starred:jmorganca/docs
starred:mxyng/16-bit
starred:mxyng/create-stdin
starred:pdevine/authorizedkeys
starred:mxyng/quant
starred:parth/opt-in-error-context-window
starred:brucemacd/cache-models
starred:brucemacd/runner-completion
starred:jmorganca/llama-update-6
starred:brucemacd/benchmark-list
starred:brucemacd/partial-read-caps
starred:parth/deepseek-r1-tools
starred:mxyng/omit-array
starred:parth/tool-prefix-temp
starred:brucemacd/runner-test
starred:jmorganca/qwen25vl
starred:brucemacd/model-forward-test-ext
starred:parth/python-function-parsing
starred:jmorganca/cuda-compression-none
starred:drifkin/num-parallel
starred:drifkin/chat-truncation-fix
starred:jmorganca/sync
starred:parth/python-tools-calling
starred:drifkin/array-head-count
starred:brucemacd/create-no-loop
starred:parth/server-enable-content-stream-with-tools
starred:qwen25omni
starred:mxyng/v3
starred:brucemacd/ropeconfig
starred:jmorganca/silence-tokenizer
starred:parth/sample-so-test
starred:parth/sampling-structured-outputs
starred:brucemacd/doc-go-engine
starred:parth/constrained-sampling-json
starred:jmorganca/mistral-wip
starred:brucemacd/mistral-small-convert
starred:parth/sample-unmarshal-json-for-params
starred:brucemacd/jomorganca/mistral
starred:pdevine/bfloat16
starred:jmorganca/mistral
starred:brucemacd/mistral
starred:pdevine/logging
starred:parth/sample-correctness-fix
starred:parth/sample-fix-sorting
starred:jmorgan/sample-fix-sorting-extras
starred:jmorganca/temp-0-images
starred:brucemacd/parallel-embed-models
starred:brucemacd/shim-grammar
starred:jmorganca/fix-gguf-error
starred:bmizerany/nameswork
starred:jmorganca/faster-releases
starred:bmizerany/validatenames
starred:brucemacd/err-no-vocab
starred:brucemacd/rope-config
starred:brucemacd/err-hint
starred:brucemacd/qwen2_5
starred:brucemacd/logprobs
starred:brucemacd/new_runner_graph_bench
starred:progress-flicker
starred:brucemacd/forward-test
starred:brucemacd/go_qwen2
starred:pdevine/gemma2
starred:jmorganca/add-missing-symlink-eval
starred:mxyng/next-debug
starred:parth/set-context-size-openai
starred:brucemacd/next-bpe-bench
starred:brucemacd/next-bpe-test
starred:brucemacd/new_runner_e2e
starred:brucemacd/new_runner_qwen2
starred:pdevine/convert-cohere2
starred:brucemacd/convert-cli
starred:parth/log-probs
starred:mxyng/next-mlx
starred:mxyng/cmd-history
starred:parth/templating
starred:parth/tokenize-detokenize
starred:brucemacd/check-key-register
starred:bmizerany/grammar
starred:jmorganca/vendor-081b29bd
starred:mxyng/func-checks
starred:jmorganca/fix-null-format
starred:parth/fix-default-to-warn-json
starred:jmorganca/qwen2vl
starred:jmorganca/no-concat
starred:parth/cmd-cleanup-SO
starred:brucemacd/check-key-register-structured-err
starred:parth/openai-stream-usage
starred:parth/fix-referencing-so
starred:stream-tools-stop
starred:jmorganca/degin-1
starred:brucemacd/install-path-clean
starred:brucemacd/push-name-validation
starred:brucemacd/browser-key-register
starred:jmorganca/openai-fix-first-message
starred:jmorganca/fix-proxy
starred:jessegross/sample
starred:parth/disallow-streaming-tools
starred:dhiltgen/remove_submodule
starred:jmorganca/ga
starred:jmorganca/mllama
starred:pdevine/newlines
starred:pdevine/geems-2b
starred:jmorganca/llama-bump
starred:mxyng/modelname-7
starred:mxyng/gin-slog
starred:mxyng/modelname-6
starred:jyan/convert-prog
starred:jyan/quant5
starred:paligemma-support
starred:pdevine/import-docs
starred:jmorganca/openai-context
starred:jyan/paligemma
starred:jyan/p2
starred:jyan/palitest
starred:bmizerany/embedspeedup
starred:jmorganca/llama-vit
starred:brucemacd/allow-ollama
starred:royh/ep-methods
starred:royh/whisper
starred:mxyng/api-models
starred:mxyng/fix-memory
starred:jyan/q4_4/8
starred:jyan/ollama-v
starred:royh/stream-tools
starred:roy-embed-parallel
starred:bmizerany/hrm
starred:revert-5963-revert-5924-mxyng/llama3.1-rope
starred:royh/embed-viz
starred:jyan/local2
starred:jyan/auth
starred:jyan/local
starred:jyan/parse-temp
starred:jmorganca/template-mistral
starred:jyan/reord-g
starred:royh-openai-suffixdocs
starred:royh-imgembed
starred:royh-embed-parallel
starred:jyan/quant4
starred:royh-precision
starred:jyan/progress
starred:pdevine/fix-template
starred:jyan/quant3
starred:pdevine/ggla
starred:mxyng/update-registry-domain
starred:jmorganca/ggml-static
starred:mxyng/create-context
starred:jyan/v0.146
starred:mxyng/layers-from-files
starred:build_dist
starred:bmizerany/noseek
starred:royh-ls
starred:royh-name
starred:timeout
starred:mxyng/server-timestamp
starred:bmizerany/nosillyggufslurps
starred:royh-params
starred:jmorganca/llama-cpp-7c26775
starred:royh-openai-delete
starred:royh-show-rigid
starred:jmorganca/enable-fa
starred:jmorganca/no-error-template
starred:jyan/format
starred:royh-testdelete
starred:bmizerany/fastverify
starred:language_support
starred:pdevine/ps-glitches
starred:brucemacd/tokenize
starred:bruce/iq-quants
starred:bmizerany/filepathwithcoloninhost
starred:mxyng/split-bin
starred:bmizerany/client-registry
starred:jmorganca/if-none-match
starred:native
starred:jmorganca/native
starred:jmorganca/batch-embeddings
starred:jmorganca/initcmake
starred:jmorganca/mm
starred:pdevine/showggmlinfo
starred:modenameenforcealphanum
starred:bmizerany/modenameenforcealphanum
starred:jmorganca/done-reason
starred:jmorganca/llama-cpp-8960fe8
starred:ollama.com
starred:bmizerany/filepathnobuild
starred:bmizerany/types/model/defaultfix
starred:rmdisplaylong
starred:nogogen
starred:bmizerany/x
starred:modelfile-readme
starred:bmizerany/replacecolon
starred:jmorganca/limit
starred:jmorganca/execstack
starred:jmorganca/replace-assets
starred:mxyng/tune-concurrency
starred:jmorganca/testing
starred:whitespace-detection
starred:jmorganca/options
starred:upgrade-all
starred:scratch
starred:cuda-search
starred:mattw/airenamer
starred:mattw/allmodelsonhuggingface
starred:mattw/quantcontext
starred:mattw/whatneedstorun
starred:brucemacd/llama-mem-calc
starred:mattw/faq-context
starred:mattw/communitylinks
starred:mattw/noprune
starred:mattw/python-functioncalling
starred:rename
starred:mxyng/install
starred:pulse
starred:remove-first
starred:editor
starred:mattw/selfqueryingretrieval
starred:cgo
starred:mattw/howtoquant
starred:api
starred:matt/streamingapi
starred:format-config
starred:mxyng/extra-args
starred:shell
starred:update-nous-hermes
starred:cp-model
starred:upload-progress
starred:fix-unknown-model
starred:fix-model-names
starred:delete-fix
starred:insecure-registry
starred:ls
starred:deletemodels
starred:progressbar
starred:readme-updates
starred:license-layers
starred:skip-list
starred:list-models
starred:modelpath
starred:matt/examplemodelfiles
starred:distribution
starred:go-opts
starred:v0.21.0
starred:v0.21.0-rc1
starred:v0.21.0-rc0
starred:v0.20.8-rc0
starred:v0.20.7
starred:v0.20.7-rc1
starred:v0.20.7-rc0
starred:v0.20.6
starred:v0.20.6-rc1
starred:v0.20.6-rc0
starred:v0.20.5
starred:v0.20.5-rc2
starred:v0.20.5-rc1
starred:v0.20.5-rc0
starred:v0.20.4
starred:v0.20.4-rc2
starred:v0.20.4-rc1
starred:v0.20.4-rc0
starred:v0.20.3
starred:v0.20.3-rc0
starred:v0.20.2
starred:v0.20.1
starred:v0.20.1-rc2
starred:v0.20.1-rc1
starred:v0.20.1-rc0
starred:v0.20.0
starred:v0.20.0-rc1
starred:v0.20.0-rc0
starred:v0.19.0
starred:v0.19.0-rc2
starred:v0.19.0-rc1
starred:v0.19.0-rc0
starred:v0.18.4-rc1
starred:v0.18.4-rc0
starred:v0.18.3
starred:v0.18.3-rc2
starred:v0.18.3-rc1
starred:v0.18.3-rc0
starred:v0.18.2
starred:v0.18.2-rc1
starred:v0.18.2-rc0
starred:v0.18.1
starred:v0.18.1-rc1
starred:v0.18.1-rc0
starred:v0.18.0
starred:v0.18.0-rc2
starred:v0.18.0-rc1
starred:v0.18.0-rc0
starred:v0.17.8-rc4
starred:v0.17.8-rc3
starred:v0.17.8-rc2
starred:v0.17.8-rc1
starred:v0.17.8-rc0
starred:v0.17.7-rc2
starred:v0.17.7
starred:v0.17.7-rc1
starred:v0.17.7-rc0
starred:v0.17.6
starred:v0.17.5
starred:v0.17.4
starred:v0.17.3
starred:v0.17.2
starred:v0.17.1
starred:v0.17.1-rc2
starred:v0.17.1-rc1
starred:v0.17.1-rc0
starred:v0.17.0
starred:v0.17.0-rc2
starred:v0.17.0-rc1
starred:v0.17.0-rc0
starred:v0.16.3
starred:v0.16.3-rc2
starred:v0.16.3-rc1
starred:v0.16.3-rc0
starred:v0.16.2-rc0
starred:v0.16.2
starred:v0.16.1
starred:v0.16.0
starred:v0.16.0-rc2
starred:v0.16.0-rc0
starred:v0.16.0-rc1
starred:v0.15.6
starred:v0.15.5
starred:v0.15.5-rc5
starred:v0.15.5-rc4
starred:v0.15.5-rc3
starred:v0.15.5-rc2
starred:v0.15.5-rc1
starred:v0.15.5-rc0
starred:v0.15.4
starred:v0.15.3
starred:v0.15.2
starred:v0.15.1
starred:v0.15.1-rc1
starred:v0.15.1-rc0
starred:v0.15.0-rc6
starred:v0.15.0
starred:v0.15.0-rc5
starred:v0.15.0-rc4
starred:v0.15.0-rc3
starred:v0.15.0-rc2
starred:v0.15.0-rc1
starred:v0.15.0-rc0
starred:v0.14.3
starred:v0.14.3-rc3
starred:v0.14.3-rc2
starred:v0.14.3-rc1
starred:v0.14.3-rc0
starred:v0.14.2
starred:v0.14.2-rc1
starred:v0.14.2-rc0
starred:v0.14.1
starred:v0.14.0
starred:v0.14.0-rc11
starred:v0.14.0-rc10
starred:v0.14.0-rc9
starred:v0.14.0-rc8
starred:v0.14.0-rc7
starred:v0.14.0-rc6
starred:v0.14.0-rc5
starred:v0.14.0-rc4
starred:v0.14.0-rc3
starred:v0.14.0-rc2
starred:v0.14.0-rc1
starred:v0.14.0-rc0
starred:v0.13.5
starred:v0.13.5-rc1
starred:v0.13.5-rc0
starred:v0.13.4-rc2
starred:v0.13.4
starred:v0.13.4-rc1
starred:v0.13.4-rc0
starred:v0.13.3
starred:v0.13.3-rc1
starred:v0.13.3-rc0
starred:v0.13.2
starred:v0.13.2-rc2
starred:v0.13.2-rc1
starred:v0.13.2-rc0
starred:v0.13.1
starred:v0.13.1-rc2
starred:v0.13.1-rc1
starred:v0.13.1-rc0
starred:v0.13.0
starred:v0.13.0-rc0
starred:v0.12.11
starred:v0.12.11-rc1
starred:v0.12.11-rc0
starred:v0.12.10-rc1
starred:v0.12.10
starred:v0.12.10-rc0
starred:v0.12.9-rc0
starred:v0.12.9
starred:v0.12.8
starred:v0.12.8-rc0
starred:v0.12.7
starred:v0.12.7-rc1
starred:v0.12.7-rc0
starred:v0.12.7-citest0
starred:v0.12.6
starred:v0.12.6-rc1
starred:v0.12.6-rc0
starred:v0.12.5
starred:v0.12.5-rc0
starred:v0.12.4
starred:v0.12.4-rc7
starred:v0.12.4-rc6
starred:v0.12.4-rc5
starred:v0.12.4-rc4
starred:v0.12.4-rc3
starred:v0.12.4-rc2
starred:v0.12.4-rc1
starred:v0.12.4-rc0
starred:v0.12.3
starred:v0.12.2-rc0
starred:v0.12.2
starred:v0.12.1
starred:v0.12.1-rc2
starred:v0.12.1-rc1
starred:v0.12.1-rc0
starred:v0.12.0
starred:v0.12.0-rc1
starred:v0.12.0-rc0
starred:v0.11.11
starred:v0.11.11-rc2
starred:v0.11.11-rc3
starred:v0.11.11-rc1
starred:v0.11.11-rc0
starred:v0.11.10
starred:v0.11.9
starred:v0.11.9-rc0
starred:v0.11.8
starred:v0.11.8-rc0
starred:v0.11.7
starred:v0.11.7-rc0
starred:v0.11.7-rc1
starred:v0.11.6
starred:v0.11.6-rc0
starred:v0.11.5-rc5
starred:v0.11.5
starred:v0.11.5-rc4
starred:v0.11.5-rc3
starred:v0.11.5-rc2
starred:v0.11.5-rc1
starred:v0.11.5-rc0
starred:v0.11.4
starred:v0.11.4-rc0
starred:v0.11.3
starred:v0.11.3-rc0
starred:v0.11.2
starred:v0.11.1
starred:v0.11.0-rc0
starred:v0.11.0-rc1
starred:v0.11.0-rc2
starred:v0.11.0
starred:v0.10.2-int1
starred:v0.10.1
starred:v0.10.0
starred:v0.10.0-rc4
starred:v0.10.0-rc3
starred:v0.10.0-rc2
starred:v0.10.0-rc1
starred:v0.10.0-rc0
starred:v0.9.7-rc1
starred:v0.9.7-rc0
starred:v0.9.6
starred:v0.9.6-rc0
starred:v0.9.6-ci0
starred:v0.9.5
starred:v0.9.4-rc4
starred:v0.9.4-rc6
starred:v0.9.4
starred:v0.9.4-rc3
starred:v0.9.4-rc5
starred:v0.9.4-rc1
starred:v0.9.4-rc2
starred:v0.9.4-rc0
starred:v0.9.3
starred:v0.9.3-rc5
starred:v0.9.4-citest0
starred:v0.9.3-rc4
starred:v0.9.3-rc3
starred:v0.9.3-rc2
starred:v0.9.3-rc1
starred:v0.9.3-rc0
starred:v0.9.2
starred:v0.9.1
starred:v0.9.1-rc1
starred:v0.9.1-rc0
starred:v0.9.1-ci1
starred:v0.9.1-ci0
starred:v0.9.0
starred:v0.9.0-rc0
starred:v0.8.0
starred:v0.8.0-rc0
starred:v0.7.1-rc2
starred:v0.7.1
starred:v0.7.1-rc1
starred:v0.7.1-rc0
starred:v0.7.0
starred:v0.7.0-rc1
starred:v0.7.0-rc0
starred:v0.6.9-rc0
starred:v0.6.8-rc0
starred:v0.6.8
starred:v0.6.7
starred:v0.6.7-rc2
starred:v0.6.7-rc1
starred:v0.6.7-rc0
starred:v0.6.6
starred:v0.6.6-rc2
starred:v0.6.6-rc1
starred:v0.6.6-rc0
starred:v0.6.5-rc1
starred:v0.6.5
starred:v0.6.5-rc0
starred:v0.6.4
starred:v0.6.4-rc0
starred:v0.6.3
starred:v0.6.3-rc1
starred:v0.6.3-rc0
starred:v0.6.2
starred:v0.6.2-rc0
starred:v0.6.1
starred:v0.6.1-rc0
starred:v0.6.0-rc0
starred:v0.6.0
starred:v0.5.14-rc0
starred:v0.5.13
starred:v0.5.13-rc6
starred:v0.5.13-rc5
starred:v0.5.13-rc4
starred:v0.5.13-rc3
starred:v0.5.13-rc2
starred:v0.5.13-rc1
starred:v0.5.13-rc0
starred:v0.5.12
starred:v0.5.12-rc1
starred:v0.5.12-rc0
starred:v0.5.11
starred:v0.5.10
starred:v0.5.9
starred:v0.5.9-rc0
starred:v0.5.8
starred:v0.5.8-rc13
starred:v0.5.8-rc12
starred:v0.5.8-rc11
starred:v0.5.8-rc10
starred:v0.5.8-rc9
starred:v0.5.8-rc8
starred:v0.5.8-rc7
starred:v0.5.8-rc6
starred:v0.5.8-rc5
starred:v0.5.8-rc4
starred:v0.5.8-rc3
starred:v0.5.8-rc2
starred:v0.5.8-rc1
starred:v0.5.8-rc0
starred:v0.5.7
starred:v0.5.6
starred:v0.5.5
starred:v0.5.5-rc0
starred:v0.5.4
starred:v0.5.3
starred:v0.5.3-rc0
starred:v0.5.2
starred:v0.5.2-rc3
starred:v0.5.2-rc2
starred:v0.5.2-rc1
starred:v0.5.2-rc0
starred:v0.5.1
starred:v0.5.0
starred:v0.5.0-rc1
starred:v0.4.8-rc0
starred:v0.4.7
starred:v0.4.6
starred:v0.4.5
starred:v0.4.4
starred:v0.4.3
starred:v0.4.3-rc0
starred:v0.4.2
starred:v0.4.2-rc1
starred:v0.4.2-rc0
starred:v0.4.1
starred:v0.4.1-rc0
starred:v0.4.0
starred:v0.4.0-rc8
starred:v0.4.0-rc7
starred:v0.4.0-rc6
starred:v0.4.0-rc5
starred:v0.4.0-rc4
starred:v0.4.0-rc3
starred:v0.4.0-rc2
starred:v0.4.0-rc1
starred:v0.4.0-rc0
starred:v0.4.0-ci3
starred:v0.3.14
starred:v0.3.14-rc0
starred:v0.3.13
starred:v0.3.12
starred:v0.3.12-rc5
starred:v0.3.12-rc4
starred:v0.3.12-rc3
starred:v0.3.12-rc2
starred:v0.3.12-rc1
starred:v0.3.11
starred:v0.3.11-rc4
starred:v0.3.11-rc3
starred:v0.3.11-rc2
starred:v0.3.11-rc1
starred:v0.3.10
starred:v0.3.10-rc1
starred:v0.3.9
starred:v0.3.8
starred:v0.3.7
starred:v0.3.7-rc6
starred:v0.3.7-rc5
starred:v0.3.7-rc4
starred:v0.3.7-rc3
starred:v0.3.7-rc2
starred:v0.3.7-rc1
starred:v0.3.6
starred:v0.3.5
starred:v0.3.4
starred:v0.3.3
starred:v0.3.2
starred:v0.3.1
starred:v0.3.0
starred:v0.2.8
starred:v0.2.8-rc2
starred:v0.2.8-rc1
starred:v0.2.7
starred:v0.2.6
starred:v0.2.5
starred:v0.2.4
starred:v0.2.3
starred:v0.2.2
starred:v0.2.2-rc2
starred:v0.2.2-rc1
starred:v0.2.1
starred:v0.2.0
starred:v0.1.49-rc14
starred:v0.1.49-rc13
starred:v0.1.49-rc12
starred:v0.1.49-rc11
starred:v0.1.49-rc10
starred:v0.1.49-rc9
starred:v0.1.49-rc8
starred:v0.1.49-rc7
starred:v0.1.49-rc6
starred:v0.1.49-rc4
starred:v0.1.49-rc5
starred:v0.1.49-rc3
starred:v0.1.49-rc2
starred:v0.1.49-rc1
starred:v0.1.48
starred:v0.1.47
starred:v0.1.46
starred:v0.1.45
starred:v0.1.45-rc5
starred:v0.1.45-rc4
starred:v0.1.45-rc3
starred:v0.1.45-rc2
starred:v0.1.45-rc1
starred:v0.1.44
starred:v0.1.43
starred:v0.1.42
starred:v0.1.41
starred:v0.1.40
starred:v0.1.40-rc1
starred:v0.1.39
starred:v0.1.39-rc2
starred:v0.1.39-rc1
starred:v0.1.38
starred:v0.1.37
starred:v0.1.36
starred:v0.1.35
starred:v0.1.35-rc1
starred:v0.1.34
starred:v0.1.34-rc1
starred:v0.1.33
starred:v0.1.33-rc7
starred:v0.1.33-rc6
starred:v0.1.33-rc5
starred:v0.1.33-rc4
starred:v0.1.33-rc3
starred:v0.1.33-rc2
starred:v0.1.33-rc1
starred:v0.1.32
starred:v0.1.32-rc2
starred:v0.1.32-rc1
starred:v0.1.31
starred:v0.1.30
starred:v0.1.29
starred:v0.1.28
starred:v0.1.27
starred:v0.1.26
starred:v0.1.25
starred:v0.1.24
starred:v0.1.23
starred:v0.1.22
starred:v0.1.21
starred:v0.1.20
starred:v0.1.19
starred:v0.1.18
starred:v0.1.17
starred:v0.1.16
starred:v0.1.15
starred:v0.1.14
starred:v0.1.13
starred:v0.1.12
starred:v0.1.11
starred:v0.1.10
starred:v0.1.9
starred:v0.1.8
starred:v0.1.7
starred:v0.1.6
starred:v0.1.5
starred:v0.1.4
starred:v0.1.3
starred:v0.1.2
starred:v0.1.1
starred:v0.1.0
starred:v0.0.21
starred:v0.0.20
starred:v0.0.19
starred:v0.0.18
starred:v0.0.17
starred:v0.0.16
starred:v0.0.15
starred:v0.0.14
starred:v0.0.13
starred:v0.0.12
starred:v0.0.11
starred:v0.0.10
starred:v0.0.9
starred:v0.0.8
starred:v0.0.7
starred:v0.0.6
starred:v0.0.5
starred:v0.0.4
starred:v0.0.3
starred:v0.0.2
starred:v0.0.1
...
compare: starred:mattw/quantcontext
starred:jessegross/tokenize
starred:jessegross/compile
starred:jessegross/sampler
starred:main
starred:hoyyeva/fix-launch-app-process-reap
starred:launch-copilot-cli
starred:hoyyeva/opencode-thinking
starred:origin/jessegross/closures
starred:brucemacd/launch-fetch-reccomended
starred:release_v0.20.7
starred:parth-auto-save-backup
starred:parth-test
starred:jessegross/batching
starred:jmorganca/gemma4-audio-replacements
starred:fix-manifest-digest-on-pull
starred:hoyyeva/vscode-improve
starred:brucemacd/install-server-wait
starred:brucemacd/download-before-remove
starred:parth/update-claude-docs
starred:parth-anthropic-reference-images-path
starred:brucemac/start-ap-install
starred:pdevine/mlx-update
starred:pdevine/qwen35_vision
starred:drifkin/api-show-fallback
starred:mintlify/image-generation-1773352582
starred:hoyyeva/server-context-length-local-config
starred:jmorganca/faster-reptition-penalties
starred:jmorganca/convert-nemotron
starred:parth-pi-thinking
starred:pdevine/sampling-penalties
starred:jmorganca/fix-create-quantization-memory
starred:dongchen/resumable_transfer_fix
starred:pdevine/sampling-cache-error
starred:jessegross/mlx-usage
starred:hoyyeva/openclaw-config
starred:hoyyeva/app-html
starred:pdevine/qwen3next
starred:brucemacd/sign-sh-install
starred:brucemacd/tui-update
starred:brucemacd/usage-api
starred:jmorganca/launch-empty
starred:fix-app-dist-embed
starred:mxyng/mlx-compile
starred:mxyng/mlx-quant
starred:mxyng/mlx-glm4.7
starred:mxyng/mlx
starred:brucemacd/simplify-model-picker
starred:jmorganca/qwen3-concurrent
starred:fix-glm-4.7-flash-mla-config
starred:drifkin/qwen3-coder-opening-tag
starred:brucemacd/usage-cli
starred:fix-cuda12-fattn-shmem
starred:ollama-imagegen-docs
starred:parth/fix-multiline-inputs
starred:brucemacd/config-docs
starred:mxyng/model-files
starred:mxyng/simple-execute
starred:fix-imagegen-ollama-models
starred:mxyng/async-upload
starred:jmorganca/lazy-no-dtype-changes
starred:imagegen-auto-detect-create
starred:parth/decrease-concurrent-download-hf
starred:fix-mlx-quantize-init
starred:jmorganca/x-cleanup
starred:usage
starred:imagegen-readme
starred:jmorganca/glm-image
starred:mlx-gpu-cd
starred:jmorganca/imagegen-modelfile
starred:parth/agent-skills
starred:parth/agent-allowlist
starred:parth/signed-in-offline
starred:parth/agents
starred:parth/fix-context-chopping
starred:improve-cloud-flow
starred:parth/add-models-websearch
starred:parth/prompt-renderer-mcp
starred:jmorganca/native-settings
starred:jmorganca/download-stream-hash
starred:jmorganca/client2-rebased
starred:brucemacd/oai-chat-req-multipart
starred:jessegross/multi_chunk_reserve
starred:grace/additional-omit-empty
starred:grace/mistral-3-large
starred:mxyng/tokenizer2
starred:mxyng/tokenizer
starred:jessegross/flash
starred:hoyyeva/windows-nacked-app
starred:mxyng/cleanup-attention
starred:grace/deepseek-parser
starred:hoyyeva/remember-unsent-prompt
starred:parth/add-lfs-pointer-error-conversion
starred:parth/olmo2-test2
starred:hoyyeva/ollama-launchagent-plist
starred:nicole/olmo-model
starred:parth/olmo-test
starred:mxyng/remove-embedded
starred:parth/render-template
starred:jmorganca/intellect-3
starred:parth/remove-prealloc-linter
starred:jmorganca/cmd-eval
starred:nicole/nomic-embed-text-fix
starred:mxyng/lint-2
starred:hoyyeva/add-gemini-3-pro-preview
starred:hoyyeva/load-model-list
starred:mxyng/expand-path
starred:mxyng/environ-2
starred:hoyyeva/deeplink-json-encoding
starred:parth/improve-tool-calling-tests
starred:hoyyeva/conversation
starred:hoyyeva/assistant-edit-response
starred:hoyyeva/thinking
starred:origin/brucemacd/invalid-char-i-err
starred:parth/improve-tool-calling
starred:jmorganca/required-omitempty
starred:grace/qwen3-vl-tests
starred:mxyng/iter-client
starred:parth/docs-readme
starred:nicole/embed-test
starred:pdevine/integration-benchstat
starred:parth/remove-generate-cmd
starred:parth/add-toolcall-id
starred:mxyng/server-tests
starred:jmorganca/glm-4.6
starred:jmorganca/gin-h-compat
starred:drifkin/stable-tool-args
starred:pdevine/qwen3-more-thinking
starred:parth/add-websearch-client
starred:nicole/websearch_local
starred:jmorganca/qwen3-coder-updates
starred:grace/deepseek-v3-migration-tests
starred:mxyng/fix-create
starred:jmorganca/cloud-errors
starred:pdevine/parser-tidy
starred:revert-12233-parth/simplify-entrypoints-runner
starred:parth/enable-so-gpt-oss
starred:brucemacd/qwen3vl
starred:jmorganca/readme-simplify
starred:parth/gpt-oss-structured-outputs
starred:revert-12039-jmorganca/tools-braces
starred:mxyng/embeddings
starred:mxyng/gguf
starred:mxyng/benchmark
starred:mxyng/types-null
starred:parth/move-parsing
starred:mxyng/gemma2
starred:jmorganca/docs
starred:mxyng/16-bit
starred:mxyng/create-stdin
starred:pdevine/authorizedkeys
starred:mxyng/quant
starred:parth/opt-in-error-context-window
starred:brucemacd/cache-models
starred:brucemacd/runner-completion
starred:jmorganca/llama-update-6
starred:brucemacd/benchmark-list
starred:brucemacd/partial-read-caps
starred:parth/deepseek-r1-tools
starred:mxyng/omit-array
starred:parth/tool-prefix-temp
starred:brucemacd/runner-test
starred:jmorganca/qwen25vl
starred:brucemacd/model-forward-test-ext
starred:parth/python-function-parsing
starred:jmorganca/cuda-compression-none
starred:drifkin/num-parallel
starred:drifkin/chat-truncation-fix
starred:jmorganca/sync
starred:parth/python-tools-calling
starred:drifkin/array-head-count
starred:brucemacd/create-no-loop
starred:parth/server-enable-content-stream-with-tools
starred:qwen25omni
starred:mxyng/v3
starred:brucemacd/ropeconfig
starred:jmorganca/silence-tokenizer
starred:parth/sample-so-test
starred:parth/sampling-structured-outputs
starred:brucemacd/doc-go-engine
starred:parth/constrained-sampling-json
starred:jmorganca/mistral-wip
starred:brucemacd/mistral-small-convert
starred:parth/sample-unmarshal-json-for-params
starred:brucemacd/jomorganca/mistral
starred:pdevine/bfloat16
starred:jmorganca/mistral
starred:brucemacd/mistral
starred:pdevine/logging
starred:parth/sample-correctness-fix
starred:parth/sample-fix-sorting
starred:jmorgan/sample-fix-sorting-extras
starred:jmorganca/temp-0-images
starred:brucemacd/parallel-embed-models
starred:brucemacd/shim-grammar
starred:jmorganca/fix-gguf-error
starred:bmizerany/nameswork
starred:jmorganca/faster-releases
starred:bmizerany/validatenames
starred:brucemacd/err-no-vocab
starred:brucemacd/rope-config
starred:brucemacd/err-hint
starred:brucemacd/qwen2_5
starred:brucemacd/logprobs
starred:brucemacd/new_runner_graph_bench
starred:progress-flicker
starred:brucemacd/forward-test
starred:brucemacd/go_qwen2
starred:pdevine/gemma2
starred:jmorganca/add-missing-symlink-eval
starred:mxyng/next-debug
starred:parth/set-context-size-openai
starred:brucemacd/next-bpe-bench
starred:brucemacd/next-bpe-test
starred:brucemacd/new_runner_e2e
starred:brucemacd/new_runner_qwen2
starred:pdevine/convert-cohere2
starred:brucemacd/convert-cli
starred:parth/log-probs
starred:mxyng/next-mlx
starred:mxyng/cmd-history
starred:parth/templating
starred:parth/tokenize-detokenize
starred:brucemacd/check-key-register
starred:bmizerany/grammar
starred:jmorganca/vendor-081b29bd
starred:mxyng/func-checks
starred:jmorganca/fix-null-format
starred:parth/fix-default-to-warn-json
starred:jmorganca/qwen2vl
starred:jmorganca/no-concat
starred:parth/cmd-cleanup-SO
starred:brucemacd/check-key-register-structured-err
starred:parth/openai-stream-usage
starred:parth/fix-referencing-so
starred:stream-tools-stop
starred:jmorganca/degin-1
starred:brucemacd/install-path-clean
starred:brucemacd/push-name-validation
starred:brucemacd/browser-key-register
starred:jmorganca/openai-fix-first-message
starred:jmorganca/fix-proxy
starred:jessegross/sample
starred:parth/disallow-streaming-tools
starred:dhiltgen/remove_submodule
starred:jmorganca/ga
starred:jmorganca/mllama
starred:pdevine/newlines
starred:pdevine/geems-2b
starred:jmorganca/llama-bump
starred:mxyng/modelname-7
starred:mxyng/gin-slog
starred:mxyng/modelname-6
starred:jyan/convert-prog
starred:jyan/quant5
starred:paligemma-support
starred:pdevine/import-docs
starred:jmorganca/openai-context
starred:jyan/paligemma
starred:jyan/p2
starred:jyan/palitest
starred:bmizerany/embedspeedup
starred:jmorganca/llama-vit
starred:brucemacd/allow-ollama
starred:royh/ep-methods
starred:royh/whisper
starred:mxyng/api-models
starred:mxyng/fix-memory
starred:jyan/q4_4/8
starred:jyan/ollama-v
starred:royh/stream-tools
starred:roy-embed-parallel
starred:bmizerany/hrm
starred:revert-5963-revert-5924-mxyng/llama3.1-rope
starred:royh/embed-viz
starred:jyan/local2
starred:jyan/auth
starred:jyan/local
starred:jyan/parse-temp
starred:jmorganca/template-mistral
starred:jyan/reord-g
starred:royh-openai-suffixdocs
starred:royh-imgembed
starred:royh-embed-parallel
starred:jyan/quant4
starred:royh-precision
starred:jyan/progress
starred:pdevine/fix-template
starred:jyan/quant3
starred:pdevine/ggla
starred:mxyng/update-registry-domain
starred:jmorganca/ggml-static
starred:mxyng/create-context
starred:jyan/v0.146
starred:mxyng/layers-from-files
starred:build_dist
starred:bmizerany/noseek
starred:royh-ls
starred:royh-name
starred:timeout
starred:mxyng/server-timestamp
starred:bmizerany/nosillyggufslurps
starred:royh-params
starred:jmorganca/llama-cpp-7c26775
starred:royh-openai-delete
starred:royh-show-rigid
starred:jmorganca/enable-fa
starred:jmorganca/no-error-template
starred:jyan/format
starred:royh-testdelete
starred:bmizerany/fastverify
starred:language_support
starred:pdevine/ps-glitches
starred:brucemacd/tokenize
starred:bruce/iq-quants
starred:bmizerany/filepathwithcoloninhost
starred:mxyng/split-bin
starred:bmizerany/client-registry
starred:jmorganca/if-none-match
starred:native
starred:jmorganca/native
starred:jmorganca/batch-embeddings
starred:jmorganca/initcmake
starred:jmorganca/mm
starred:pdevine/showggmlinfo
starred:modenameenforcealphanum
starred:bmizerany/modenameenforcealphanum
starred:jmorganca/done-reason
starred:jmorganca/llama-cpp-8960fe8
starred:ollama.com
starred:bmizerany/filepathnobuild
starred:bmizerany/types/model/defaultfix
starred:rmdisplaylong
starred:nogogen
starred:bmizerany/x
starred:modelfile-readme
starred:bmizerany/replacecolon
starred:jmorganca/limit
starred:jmorganca/execstack
starred:jmorganca/replace-assets
starred:mxyng/tune-concurrency
starred:jmorganca/testing
starred:whitespace-detection
starred:jmorganca/options
starred:upgrade-all
starred:scratch
starred:cuda-search
starred:mattw/airenamer
starred:mattw/allmodelsonhuggingface
starred:mattw/quantcontext
starred:mattw/whatneedstorun
starred:brucemacd/llama-mem-calc
starred:mattw/faq-context
starred:mattw/communitylinks
starred:mattw/noprune
starred:mattw/python-functioncalling
starred:rename
starred:mxyng/install
starred:pulse
starred:remove-first
starred:editor
starred:mattw/selfqueryingretrieval
starred:cgo
starred:mattw/howtoquant
starred:api
starred:matt/streamingapi
starred:format-config
starred:mxyng/extra-args
starred:shell
starred:update-nous-hermes
starred:cp-model
starred:upload-progress
starred:fix-unknown-model
starred:fix-model-names
starred:delete-fix
starred:insecure-registry
starred:ls
starred:deletemodels
starred:progressbar
starred:readme-updates
starred:license-layers
starred:skip-list
starred:list-models
starred:modelpath
starred:matt/examplemodelfiles
starred:distribution
starred:go-opts
starred:v0.21.0
starred:v0.21.0-rc1
starred:v0.21.0-rc0
starred:v0.20.8-rc0
starred:v0.20.7
starred:v0.20.7-rc1
starred:v0.20.7-rc0
starred:v0.20.6
starred:v0.20.6-rc1
starred:v0.20.6-rc0
starred:v0.20.5
starred:v0.20.5-rc2
starred:v0.20.5-rc1
starred:v0.20.5-rc0
starred:v0.20.4
starred:v0.20.4-rc2
starred:v0.20.4-rc1
starred:v0.20.4-rc0
starred:v0.20.3
starred:v0.20.3-rc0
starred:v0.20.2
starred:v0.20.1
starred:v0.20.1-rc2
starred:v0.20.1-rc1
starred:v0.20.1-rc0
starred:v0.20.0
starred:v0.20.0-rc1
starred:v0.20.0-rc0
starred:v0.19.0
starred:v0.19.0-rc2
starred:v0.19.0-rc1
starred:v0.19.0-rc0
starred:v0.18.4-rc1
starred:v0.18.4-rc0
starred:v0.18.3
starred:v0.18.3-rc2
starred:v0.18.3-rc1
starred:v0.18.3-rc0
starred:v0.18.2
starred:v0.18.2-rc1
starred:v0.18.2-rc0
starred:v0.18.1
starred:v0.18.1-rc1
starred:v0.18.1-rc0
starred:v0.18.0
starred:v0.18.0-rc2
starred:v0.18.0-rc1
starred:v0.18.0-rc0
starred:v0.17.8-rc4
starred:v0.17.8-rc3
starred:v0.17.8-rc2
starred:v0.17.8-rc1
starred:v0.17.8-rc0
starred:v0.17.7-rc2
starred:v0.17.7
starred:v0.17.7-rc1
starred:v0.17.7-rc0
starred:v0.17.6
starred:v0.17.5
starred:v0.17.4
starred:v0.17.3
starred:v0.17.2
starred:v0.17.1
starred:v0.17.1-rc2
starred:v0.17.1-rc1
starred:v0.17.1-rc0
starred:v0.17.0
starred:v0.17.0-rc2
starred:v0.17.0-rc1
starred:v0.17.0-rc0
starred:v0.16.3
starred:v0.16.3-rc2
starred:v0.16.3-rc1
starred:v0.16.3-rc0
starred:v0.16.2-rc0
starred:v0.16.2
starred:v0.16.1
starred:v0.16.0
starred:v0.16.0-rc2
starred:v0.16.0-rc0
starred:v0.16.0-rc1
starred:v0.15.6
starred:v0.15.5
starred:v0.15.5-rc5
starred:v0.15.5-rc4
starred:v0.15.5-rc3
starred:v0.15.5-rc2
starred:v0.15.5-rc1
starred:v0.15.5-rc0
starred:v0.15.4
starred:v0.15.3
starred:v0.15.2
starred:v0.15.1
starred:v0.15.1-rc1
starred:v0.15.1-rc0
starred:v0.15.0-rc6
starred:v0.15.0
starred:v0.15.0-rc5
starred:v0.15.0-rc4
starred:v0.15.0-rc3
starred:v0.15.0-rc2
starred:v0.15.0-rc1
starred:v0.15.0-rc0
starred:v0.14.3
starred:v0.14.3-rc3
starred:v0.14.3-rc2
starred:v0.14.3-rc1
starred:v0.14.3-rc0
starred:v0.14.2
starred:v0.14.2-rc1
starred:v0.14.2-rc0
starred:v0.14.1
starred:v0.14.0
starred:v0.14.0-rc11
starred:v0.14.0-rc10
starred:v0.14.0-rc9
starred:v0.14.0-rc8
starred:v0.14.0-rc7
starred:v0.14.0-rc6
starred:v0.14.0-rc5
starred:v0.14.0-rc4
starred:v0.14.0-rc3
starred:v0.14.0-rc2
starred:v0.14.0-rc1
starred:v0.14.0-rc0
starred:v0.13.5
starred:v0.13.5-rc1
starred:v0.13.5-rc0
starred:v0.13.4-rc2
starred:v0.13.4
starred:v0.13.4-rc1
starred:v0.13.4-rc0
starred:v0.13.3
starred:v0.13.3-rc1
starred:v0.13.3-rc0
starred:v0.13.2
starred:v0.13.2-rc2
starred:v0.13.2-rc1
starred:v0.13.2-rc0
starred:v0.13.1
starred:v0.13.1-rc2
starred:v0.13.1-rc1
starred:v0.13.1-rc0
starred:v0.13.0
starred:v0.13.0-rc0
starred:v0.12.11
starred:v0.12.11-rc1
starred:v0.12.11-rc0
starred:v0.12.10-rc1
starred:v0.12.10
starred:v0.12.10-rc0
starred:v0.12.9-rc0
starred:v0.12.9
starred:v0.12.8
starred:v0.12.8-rc0
starred:v0.12.7
starred:v0.12.7-rc1
starred:v0.12.7-rc0
starred:v0.12.7-citest0
starred:v0.12.6
starred:v0.12.6-rc1
starred:v0.12.6-rc0
starred:v0.12.5
starred:v0.12.5-rc0
starred:v0.12.4
starred:v0.12.4-rc7
starred:v0.12.4-rc6
starred:v0.12.4-rc5
starred:v0.12.4-rc4
starred:v0.12.4-rc3
starred:v0.12.4-rc2
starred:v0.12.4-rc1
starred:v0.12.4-rc0
starred:v0.12.3
starred:v0.12.2-rc0
starred:v0.12.2
starred:v0.12.1
starred:v0.12.1-rc2
starred:v0.12.1-rc1
starred:v0.12.1-rc0
starred:v0.12.0
starred:v0.12.0-rc1
starred:v0.12.0-rc0
starred:v0.11.11
starred:v0.11.11-rc2
starred:v0.11.11-rc3
starred:v0.11.11-rc1
starred:v0.11.11-rc0
starred:v0.11.10
starred:v0.11.9
starred:v0.11.9-rc0
starred:v0.11.8
starred:v0.11.8-rc0
starred:v0.11.7
starred:v0.11.7-rc0
starred:v0.11.7-rc1
starred:v0.11.6
starred:v0.11.6-rc0
starred:v0.11.5-rc5
starred:v0.11.5
starred:v0.11.5-rc4
starred:v0.11.5-rc3
starred:v0.11.5-rc2
starred:v0.11.5-rc1
starred:v0.11.5-rc0
starred:v0.11.4
starred:v0.11.4-rc0
starred:v0.11.3
starred:v0.11.3-rc0
starred:v0.11.2
starred:v0.11.1
starred:v0.11.0-rc0
starred:v0.11.0-rc1
starred:v0.11.0-rc2
starred:v0.11.0
starred:v0.10.2-int1
starred:v0.10.1
starred:v0.10.0
starred:v0.10.0-rc4
starred:v0.10.0-rc3
starred:v0.10.0-rc2
starred:v0.10.0-rc1
starred:v0.10.0-rc0
starred:v0.9.7-rc1
starred:v0.9.7-rc0
starred:v0.9.6
starred:v0.9.6-rc0
starred:v0.9.6-ci0
starred:v0.9.5
starred:v0.9.4-rc4
starred:v0.9.4-rc6
starred:v0.9.4
starred:v0.9.4-rc3
starred:v0.9.4-rc5
starred:v0.9.4-rc1
starred:v0.9.4-rc2
starred:v0.9.4-rc0
starred:v0.9.3
starred:v0.9.3-rc5
starred:v0.9.4-citest0
starred:v0.9.3-rc4
starred:v0.9.3-rc3
starred:v0.9.3-rc2
starred:v0.9.3-rc1
starred:v0.9.3-rc0
starred:v0.9.2
starred:v0.9.1
starred:v0.9.1-rc1
starred:v0.9.1-rc0
starred:v0.9.1-ci1
starred:v0.9.1-ci0
starred:v0.9.0
starred:v0.9.0-rc0
starred:v0.8.0
starred:v0.8.0-rc0
starred:v0.7.1-rc2
starred:v0.7.1
starred:v0.7.1-rc1
starred:v0.7.1-rc0
starred:v0.7.0
starred:v0.7.0-rc1
starred:v0.7.0-rc0
starred:v0.6.9-rc0
starred:v0.6.8-rc0
starred:v0.6.8
starred:v0.6.7
starred:v0.6.7-rc2
starred:v0.6.7-rc1
starred:v0.6.7-rc0
starred:v0.6.6
starred:v0.6.6-rc2
starred:v0.6.6-rc1
starred:v0.6.6-rc0
starred:v0.6.5-rc1
starred:v0.6.5
starred:v0.6.5-rc0
starred:v0.6.4
starred:v0.6.4-rc0
starred:v0.6.3
starred:v0.6.3-rc1
starred:v0.6.3-rc0
starred:v0.6.2
starred:v0.6.2-rc0
starred:v0.6.1
starred:v0.6.1-rc0
starred:v0.6.0-rc0
starred:v0.6.0
starred:v0.5.14-rc0
starred:v0.5.13
starred:v0.5.13-rc6
starred:v0.5.13-rc5
starred:v0.5.13-rc4
starred:v0.5.13-rc3
starred:v0.5.13-rc2
starred:v0.5.13-rc1
starred:v0.5.13-rc0
starred:v0.5.12
starred:v0.5.12-rc1
starred:v0.5.12-rc0
starred:v0.5.11
starred:v0.5.10
starred:v0.5.9
starred:v0.5.9-rc0
starred:v0.5.8
starred:v0.5.8-rc13
starred:v0.5.8-rc12
starred:v0.5.8-rc11
starred:v0.5.8-rc10
starred:v0.5.8-rc9
starred:v0.5.8-rc8
starred:v0.5.8-rc7
starred:v0.5.8-rc6
starred:v0.5.8-rc5
starred:v0.5.8-rc4
starred:v0.5.8-rc3
starred:v0.5.8-rc2
starred:v0.5.8-rc1
starred:v0.5.8-rc0
starred:v0.5.7
starred:v0.5.6
starred:v0.5.5
starred:v0.5.5-rc0
starred:v0.5.4
starred:v0.5.3
starred:v0.5.3-rc0
starred:v0.5.2
starred:v0.5.2-rc3
starred:v0.5.2-rc2
starred:v0.5.2-rc1
starred:v0.5.2-rc0
starred:v0.5.1
starred:v0.5.0
starred:v0.5.0-rc1
starred:v0.4.8-rc0
starred:v0.4.7
starred:v0.4.6
starred:v0.4.5
starred:v0.4.4
starred:v0.4.3
starred:v0.4.3-rc0
starred:v0.4.2
starred:v0.4.2-rc1
starred:v0.4.2-rc0
starred:v0.4.1
starred:v0.4.1-rc0
starred:v0.4.0
starred:v0.4.0-rc8
starred:v0.4.0-rc7
starred:v0.4.0-rc6
starred:v0.4.0-rc5
starred:v0.4.0-rc4
starred:v0.4.0-rc3
starred:v0.4.0-rc2
starred:v0.4.0-rc1
starred:v0.4.0-rc0
starred:v0.4.0-ci3
starred:v0.3.14
starred:v0.3.14-rc0
starred:v0.3.13
starred:v0.3.12
starred:v0.3.12-rc5
starred:v0.3.12-rc4
starred:v0.3.12-rc3
starred:v0.3.12-rc2
starred:v0.3.12-rc1
starred:v0.3.11
starred:v0.3.11-rc4
starred:v0.3.11-rc3
starred:v0.3.11-rc2
starred:v0.3.11-rc1
starred:v0.3.10
starred:v0.3.10-rc1
starred:v0.3.9
starred:v0.3.8
starred:v0.3.7
starred:v0.3.7-rc6
starred:v0.3.7-rc5
starred:v0.3.7-rc4
starred:v0.3.7-rc3
starred:v0.3.7-rc2
starred:v0.3.7-rc1
starred:v0.3.6
starred:v0.3.5
starred:v0.3.4
starred:v0.3.3
starred:v0.3.2
starred:v0.3.1
starred:v0.3.0
starred:v0.2.8
starred:v0.2.8-rc2
starred:v0.2.8-rc1
starred:v0.2.7
starred:v0.2.6
starred:v0.2.5
starred:v0.2.4
starred:v0.2.3
starred:v0.2.2
starred:v0.2.2-rc2
starred:v0.2.2-rc1
starred:v0.2.1
starred:v0.2.0
starred:v0.1.49-rc14
starred:v0.1.49-rc13
starred:v0.1.49-rc12
starred:v0.1.49-rc11
starred:v0.1.49-rc10
starred:v0.1.49-rc9
starred:v0.1.49-rc8
starred:v0.1.49-rc7
starred:v0.1.49-rc6
starred:v0.1.49-rc4
starred:v0.1.49-rc5
starred:v0.1.49-rc3
starred:v0.1.49-rc2
starred:v0.1.49-rc1
starred:v0.1.48
starred:v0.1.47
starred:v0.1.46
starred:v0.1.45
starred:v0.1.45-rc5
starred:v0.1.45-rc4
starred:v0.1.45-rc3
starred:v0.1.45-rc2
starred:v0.1.45-rc1
starred:v0.1.44
starred:v0.1.43
starred:v0.1.42
starred:v0.1.41
starred:v0.1.40
starred:v0.1.40-rc1
starred:v0.1.39
starred:v0.1.39-rc2
starred:v0.1.39-rc1
starred:v0.1.38
starred:v0.1.37
starred:v0.1.36
starred:v0.1.35
starred:v0.1.35-rc1
starred:v0.1.34
starred:v0.1.34-rc1
starred:v0.1.33
starred:v0.1.33-rc7
starred:v0.1.33-rc6
starred:v0.1.33-rc5
starred:v0.1.33-rc4
starred:v0.1.33-rc3
starred:v0.1.33-rc2
starred:v0.1.33-rc1
starred:v0.1.32
starred:v0.1.32-rc2
starred:v0.1.32-rc1
starred:v0.1.31
starred:v0.1.30
starred:v0.1.29
starred:v0.1.28
starred:v0.1.27
starred:v0.1.26
starred:v0.1.25
starred:v0.1.24
starred:v0.1.23
starred:v0.1.22
starred:v0.1.21
starred:v0.1.20
starred:v0.1.19
starred:v0.1.18
starred:v0.1.17
starred:v0.1.16
starred:v0.1.15
starred:v0.1.14
starred:v0.1.13
starred:v0.1.12
starred:v0.1.11
starred:v0.1.10
starred:v0.1.9
starred:v0.1.8
starred:v0.1.7
starred:v0.1.6
starred:v0.1.5
starred:v0.1.4
starred:v0.1.3
starred:v0.1.2
starred:v0.1.1
starred:v0.1.0
starred:v0.0.21
starred:v0.0.20
starred:v0.0.19
starred:v0.0.18
starred:v0.0.17
starred:v0.0.16
starred:v0.0.15
starred:v0.0.14
starred:v0.0.13
starred:v0.0.12
starred:v0.0.11
starred:v0.0.10
starred:v0.0.9
starred:v0.0.8
starred:v0.0.7
starred:v0.0.6
starred:v0.0.5
starred:v0.0.4
starred:v0.0.3
starred:v0.0.2
starred:v0.0.1
2 Commits
pdevine/sa
...
mattw/quan
| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|
|
fed3843be2 |
update to resolve jmorganca comments
Signed-off-by: Matt Williams <m@technovangelist.com> |
||
|
|
01d4047ed3 |
add faq about quant and context
Signed-off-by: Matt Williams <m@technovangelist.com> |
1 changed files with 23 additions and 0 deletions
23
docs/faq.md
23
docs/faq.md
|
|
@@ -112,3 +112,26 @@ This can impact both installing Ollama, as well as downloading models.
|
||||||
Open `Control Panel > Networking and Internet > View network status and tasks` and click on `Change adapter settings` on the left panel. Find the `vEthernel (WSL)` adapter, right click and select `Properties`.
|
Open `Control Panel > Networking and Internet > View network status and tasks` and click on `Change adapter settings` on the left panel. Find the `vEthernel (WSL)` adapter, right click and select `Properties`.
|
||||||
Click on `Configure` and open the `Advanced` tab. Search through each of the properties until you find `Large Send Offload Version 2 (IPv4)` and `Large Send Offload Version 2 (IPv6)`. *Disable* both of these
|
Click on `Configure` and open the `Advanced` tab. Search through each of the properties until you find `Large Send Offload Version 2 (IPv4)` and `Large Send Offload Version 2 (IPv6)`. *Disable* both of these
|
||||||
properties.
|
properties.
|
||||||
|
|
||||||
|
## What does the q in the model tag mean? What is quantization?
|
||||||
|
|
||||||
|
Whenever you pull a model without a tag, Ollama will actually pull the q4_0 quantization of the model. You can verify this on the tags page. On https://ollama.ai/library/llama2/tags you can see that the hash for the latest tag matches the hash for the 7b model. 
|
||||||
|
|
||||||
|
Looking at the that page for any model, you can see several quantization options available. Quantization is a method of compression that allows the model to fit in less space and thus use less RAM and VRAM on your machine.
|
||||||
|
|
||||||
|
At a high level, a model is made of an enormous collection of nodes that determine how to generate text. These nodes are connected at different levels with weights. The training process adjusts these weights to be able to output the right text every time.
|
||||||
|
|
||||||
|
Most of the source models that we use start with weights that are 32bit floating-point numbers. Those weights, and another concept called biases, add up to be the parameters. So a source model with 7 billion parameters has 7 billion 32bit floating-point numbers, plus a description of all the nodes and more. That adds up to needing at least 28 Gigabytes of memory to load, if you choose to load one of those source models.
|
||||||
|
|
||||||
|
Quantization turns those 32bit floating point weights into much smaller integers. The number next to the q indicates the bit size of the weights. So a q4 model converted those 32bit floats into 4bit integers. A 4bit quantization takes up the space for 7billion 4bit integers, plus a little overhead. That comes out to almost 4 Gigabytes. Obviously, there is some loss of information in this process of going from 30GB to 4GB, but it turns out in most cases it isn't really noticeable. In fact, even the 2bit quantization which fits in less than 3GB can be very useful.
|
||||||
|
|
||||||
|
There are three major sets of quantizations you will see in the Ollama Library of models: **fp16**, models with just a q and a number, like **q4_0**, and then models with a **K** in the tag. The **fp16** model is one that has been converted and quantized from the source 32bit to 16bit. This will be about half the size of the 32bit source model and is the largest quantization we deliver in the library. The **q4_0**, **q4_1**, **q5_0**, etc. models use two different quantization methods that were the original methods.
|
||||||
|
|
||||||
|
The models with a **K** are often referred to as K Quants. This is a method that allows for models of a similar quality but smaller than the original method used. Essentially, it finds clusters of weights and quantizes those together, allowing for higher precision while using the same bit sizes as the regular quantization options. But this requires a set of maps for the model to figure out the original values which have a computational cost. You may see some impact on the speed of models with K quants compared to the regular quantizations.
|
||||||
|
|
||||||
|
## What is context, can I increase it, and why doesn't every model support a huge context?
|
||||||
|
|
||||||
|
Context refers to the size of the input you can send to a model and get sensible output back. Many models have a context size of 2048 tokens. It's sometimes possible to give it more using the **num_ctx** parameter, but the answers start to degrade. This is because half of the context is "freed" up to allow for more memory. Newer models have been able to increase that context size using different methods. This increase in context size results in a corresponding increase in memory required, sometimes by orders of magnitude.
|
||||||
|
|
||||||
|
> !WARNING]
|
||||||
|
> Currently, over-allocating context size may result in model quality or stability issues.
|
||||||
|
|
|
||||||
Reference in New Issue
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.