Skip to content

v2.0.0

Latest
Compare
Choose a tag to compare
@jan-grzybek-ampere jan-grzybek-ampere released this 23 Sep 20:15
· 11 commits to main since this release
4f32b2c
  • Upgraded upstream tag enables Llama 3.1 in ollama
  • Support for AmpereOne platform
  • Breaking change: due to changed weight type IDs it is now required to re-quantize models to Q8R16 and Q4_K_4 formats with current llama-quantize tool.