ollama/scripts
Daniel Hiltgen 630882621b
llama-server followups (#16353)
* llama-server followups

Misc fixes for #16031
- Add back dropped ROCm build flag for multi-GPU support on windows
- Fix amdhip64_*.dll version detection for "latest" selection
- Fix embeddings API for consistent normalize behavior with prior versions

* ci: set up for automated llama.cpp update testing

* reduce batch for fa-disabled, and constrained vram

* mlx: fix v3 load bug on m5

Imagegen was incorrectly loading v3 first.  This DRYs out the loading code so imagegen gets the same new v4/v3 selection logic.

* fix reload bug on embedding models

* bump version

* steer user how to enable iGPU when disabled
2026-06-01 10:44:21 -07:00
..
support app: add code for macOS and Windows apps under 'app' (#12933) 2025-11-04 11:40:17 -08:00
.this-is-the-create-dmg-repo app: add code for macOS and Windows apps under 'app' (#12933) 2025-11-04 11:40:17 -08:00
build_darwin.sh llama-server followups (#16353) 2026-06-01 10:44:21 -07:00
build_docker.sh runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
build_linux.sh runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
build_windows.ps1 runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
buildkitd.toml.example runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
create-dmg.sh app: add code for macOS and Windows apps under 'app' (#12933) 2025-11-04 11:40:17 -08:00
deduplicate_cuda_libs.sh CI: dedup cuda libraries to reduce payload size (#13704) 2026-01-13 11:25:31 -08:00
env.sh runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
install.ps1 win: progress reporting on install download (#14219) 2026-02-12 12:06:56 -08:00
install.sh install: prevent partial download script execution (#14311) 2026-02-18 18:32:45 -08:00
push_docker.sh runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) 2026-05-29 13:35:47 -07:00
tag_latest.sh CI: clean up naming, fix tagging latest (#6832) 2024-09-16 16:18:41 -07:00