Skip to content

Releases: nextcloud-releases/context_chat_backend

v3.0.5

19 Sep 14:11
Compare
Choose a tag to compare

3.0.5 - 2024-09-19

Fixed

  • lock embedding model forward pass (#78) @kyteinsky

v3.0.4

18 Sep 16:56
5e22e17
Compare
Choose a tag to compare

3.0.4 - 2024-09-18

Fixed

  • fix remaining lowercase comparisons for COMPUTE_DEVICE @kyteinsky

v3.0.3

18 Sep 16:04
Compare
Choose a tag to compare

3.0.3 - 2024-09-18

Fixed

  • use uppercase comparisons for COMPUTE_DEVICE @kyteinsky
  • add traceback to caught exception in doc loader @kyteinsky

v3.0.2

18 Sep 15:31
9b74712
Compare
Choose a tag to compare

3.0.2 - 2024-09-18

Changed

  • make stuff fit in 8GB VRAM and don't lock text2text api calls (#70) @kyteinsky

Fixed

  • fix: detect additional NVIDIA GPUs (#68) @kyteinsky

v3.0.1

01 Aug 09:16
v3.0.1
Compare
Choose a tag to compare

3.0.1 - 2024-08-01

Changed

  • update llama-cpp-python package in dockerfile @kyteinsky

Fixed

v3.0.0

30 Jul 13:59
v3.0.0
7a2a486
Compare
Choose a tag to compare

3.0.0 - 2024-07-30

Changed

  • New major version to maintain versioning consistency with the companion app
  • Update readme @kyteinsky

Added

v2.2.1

09 Jul 09:14
v2.2.1
b24ef9c
Compare
Choose a tag to compare

2.2.1 - 2024-07-09

Fixed

  • use COMPUTE_DEVICE env var if present for config @kyteinsky
  • add cuda compat llib path back @kyteinsky

v2.2.0

26 Jun 12:59
v2.2.0
fa4e83b
Compare
Choose a tag to compare

2.2.0 - 2024-06-25

Fixed

  • leave room for generated tokens in the context window @kyteinsky
  • Dockerfile llama-cpp-python install @kyteinsky
  • Version based repair and other changes (#54) @kyteinsky
  • .in.txt and use compiled llama-cpp-python @kyteinsky
  • correctly log exceptions @kyteinsky
  • do not verify docs before delete in Chroma (#53) @kyteinsky
  • offload only when instantiated @kyteinsky
  • add odfpy back and update deps @kyteinsky

Changed

Added

v2.1.1

23 Apr 10:14
v2.1.1
Compare
Choose a tag to compare

2.1.1 - 2024-04-23

Changed

  • use 8192 as context length

Fixed

  • replace @ with .at. in collection name
  • replace pandoc completely due to random memory hogs with other python packages
  • types fixes and langchain import updates

v2.1.0

15 Apr 10:26
v2.1.0
b55d6d3
Compare
Choose a tag to compare

2.1.0 - 2024-04-15

Changed

  • no context generation is now a chat completion
  • filter sources before document decode

Fixed

Added

  • accelerator detection on container boot
  • repair steps
  • increase context length to 16384