Michael Yang
|
f1373193dc
|
move tokenizers to separate package (#13825)
|
2026-02-05 17:44:11 -08:00 |
|
Michael Yang
|
3f6642f6fc
|
model: implement bert in ollama engine (#9080)
* fix truncate
* s/SentencePieceModel/SentencePiece/
* bert
* wordpiece
* refactor pooling
* more tokenizers
* normalize embeddings
|
2025-09-15 15:35:59 -07:00 |
|
Michael Yang
|
4129af9205
|
chore: cleanup comments + unused vars (#11225)
|
2025-06-27 11:45:33 -07:00 |
|
Michael Yang
|
73b642e6f3
|
add new gemma model (#11204)
* update patches
* cherry pick metal mean kernel
* cherry pick cuda mean kernel
* gemma3n
|
2025-06-25 21:47:09 -07:00 |
|