jan-grzybek-ampere
released this
23 Sep 20:15
·
11 commits
to main
since this release
- Upgraded upstream tag enables Llama 3.1 in ollama
- Support for AmpereOne platform
- Breaking change: due to changed weight type IDs it is now required to re-quantize models to Q8R16 and Q4_K_4 formats with current llama-quantize tool.