Daniel Hiltgen
356c0b8e34
gemma4: add audio support with USM conformer encoder
Add audio encoding for Gemma 4 using the USM conformer architecture:
- Converter: audio tensor mapping, SSCP/conformer/embedder name replacements,
softplus repacker for per_dim_scale, F32 enforcement for conv weights
- GGML backend: Conv1DDW and PadExt tensor ops
- Audio encoder: SSCP Conv2D, 12 conformer blocks (FFW + block-local
attention with relative position embeddings + LightConv1d + FFW),
output projection, audio-to-text embedding projector
- Audio preprocessing: WAV decode, mel spectrogram, FFT (pure Go)
- Model wiring: WAV detection, audio token handling, unified PostTokenize
Correctly transcribes "why is the sky blue" from test audio.
2026-04-01 15:24:17 -07:00
..
2025-04-02 09:44:27 -07:00
2025-01-15 16:31:22 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-04-01 15:24:17 -07:00
2026-04-01 15:24:17 -07:00
2026-01-08 16:18:59 -08:00
2026-01-23 18:40:28 -08:00
2026-02-02 15:39:18 -08:00
2026-01-08 16:18:59 -08:00
2026-02-23 14:38:10 -08:00
2026-02-23 14:38:10 -08:00
2026-02-23 14:38:10 -08:00
2026-02-23 14:38:10 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-02-22 15:09:14 -08:00
2026-02-22 15:09:14 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-02-24 20:08:05 -08:00
2026-02-24 20:08:05 -08:00
2026-01-08 16:18:59 -08:00
2026-01-08 16:18:59 -08:00
2026-04-01 15:23:10 -07:00
2026-04-01 15:23:10 -07:00
2026-02-22 15:09:14 -08:00
2026-02-22 15:09:14 -08:00
2026-04-01 15:23:10 -07:00
2025-09-17 17:43:17 -07:00
2025-04-25 16:59:20 -07:00
2026-04-01 15:24:17 -07:00
2024-12-10 12:58:06 -08:00
2025-11-13 15:32:34 -08:00
2025-11-13 15:32:34 -08:00
2025-12-18 07:55:37 -08:00
2026-02-24 20:08:05 -08:00
2026-02-24 20:08:05 -08:00