Releases: MaggotHATE/Llama_chat

Beta 170

18 Sep 17:56

Moved to Sampling v2 (along with all other relevant commits from llama.cpp); reworked message regeneration.

  • DRY is not reimplemented yet
  • Naming scheme now reflects not only the backend, but also the use of llamafile, OpenMP and OpenBLAS

Beta 169

27 Aug 18:42

Reworked XTC to better align with the original vision, plus updates from ggerganov/llama.cpp#9118 and misc fixes.

XTC implementation should be complete now.

Beta 168.1

25 Aug 14:31

XTC: added the xtc_threshold_max parameter to limit the upper probability. It defaults to 1.0, in which case XTC works as before.

This parameter may be useful for models that have fewer clichéd tokens at the top, but still have intermediate ranges of undesirable tokens. Needs testing.
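
For illustration, here is a minimal C++ sketch of this band idea; the names (Candidate, xtc_band_sketch) are assumed for the example and are not this repo's actual code:

```cpp
#include <random>
#include <vector>

struct Candidate { int id; float p; };  // token id and its probability

// Sketch only: candidates whose probability falls inside
// [threshold, threshold_max] are treated as the penalizable band.
void xtc_band_sketch(std::vector<Candidate>& cands, float threshold,
                     float threshold_max, float probability,
                     std::mt19937& rng) {
    std::uniform_real_distribution<float> coin(0.0f, 1.0f);
    if (coin(rng) >= probability) return;  // sampler fires only sometimes

    std::vector<Candidate> in_band;  // intermediate, undesirable range
    std::vector<Candidate> kept;     // everything outside the band
    for (const Candidate& c : cands) {
        if (c.p >= threshold && c.p <= threshold_max) in_band.push_back(c);
        else kept.push_back(c);
    }
    if (in_band.size() < 2) return;  // nothing worth cutting

    // keep only the least probable in-band token; with threshold_max at
    // its default of 1.0 this reduces to normal XTC behaviour
    // (candidate ordering is simplified in this sketch)
    kept.push_back(in_band.back());
    cands.swap(kept);
}
```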

Beta 168

25 Aug 08:50

XTC improvements:

  • all candidates are scanned now, which ensures that two penalizable tokens are detected even when there are only two tokens in total
  • sorting swaps tokens even if their values are equal, which ensures that with only two tokens in total, the more probable one is cut off

What we have so far for default settings:

"samplers_sequence": "mx",
"xtc_probability": 0.5,
"xtc_threshold": 0.1,
"xtc_probability_once": true,
"xtc_min": 2,
"min_p": 0.02,

xtc_probability_once defines whether the chance to penalize is calculated once at the start (as in the original) or individually for each token above xtc_threshold.

xtc_min defines the minimum number of tokens above xtc_threshold to trigger the effect.
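
To make both settings concrete, here is a minimal C++ sketch of the behaviour described above; the names (Candidate, xtc_sketch) are illustrative, not the actual sampling code:

```cpp
#include <algorithm>
#include <random>
#include <vector>

struct Candidate { int id; float p; };  // token id and its probability

void xtc_sketch(std::vector<Candidate>& cands, float threshold,
                float probability, bool probability_once, int xtc_min,
                std::mt19937& rng) {
    // sort by probability, highest first, scanning ALL candidates
    std::sort(cands.begin(), cands.end(),
              [](const Candidate& a, const Candidate& b) { return a.p > b.p; });

    // count tokens above the threshold
    size_t above = 0;
    while (above < cands.size() && cands[above].p >= threshold) ++above;
    if ((int) above < xtc_min) return;  // xtc_min tokens needed to trigger

    std::uniform_real_distribution<float> coin(0.0f, 1.0f);
    const bool fire_once = coin(rng) < probability;  // single up-front flip

    // every token above the threshold except the least probable one is
    // penalizable; the chance is rolled once or per token
    std::vector<Candidate> kept;
    for (size_t i = 0; i < cands.size(); ++i) {
        const bool penalizable = i + 1 < above;
        const bool fire = probability_once ? fire_once
                                           : coin(rng) < probability;
        if (!(penalizable && fire)) kept.push_back(cands[i]);
    }
    cands.swap(kept);
}
```

With xtc_min = 2 and xtc_probability_once = true, this reduces to the original XTC behaviour plus the two-token guarantee from the list above.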

As before, these settings are not in the UI yet, so you will need to add them to config.json manually.

Beta 167

22 Aug 14:11

XTC: added the boolean parameter xtc_probability_once. If true, the probability is calculated once; if false, it is calculated for each candidate. This should make the current implementation more flexible and better for testing (see the sketch after the config below).

As a reminder, add this into the model's part of config.json to test XTC with the original settings:

"samplers_sequence": "mx",
"xtc_probability": 0.5,
"xtc_threshold": 0.1,
"xtc_probability_once": true,
"min_p": 0.02,

Beta 166 (fix 3)

21 Aug 19:58

Reworked XTC, thanks to @LostRuins

Still not sure if the implementation is optimal, but it works better now.

No Vulkan for now, waiting for a fix.

Beta 166 (actually fixed)

20 Aug 08:19

Fixed randomization not working in the previous "fix"; removed re-normalization.

Beta 166

19 Aug 07:12

Added a very crude implementation of the Exclude Top Choices (XTC) sampler by @p-e-w; will rework it properly later. It seems to work for now, but needs a lot of testing.

Start by adding these settings to config.json (config only for now):

"samplers_sequence": "mx",
"xtc_probability": 0.5,
"xtc_threshold": 0.1,
"min_p": 0.02,

Beta 165

18 Aug 18:34

Small fixes; added SDL2 builds, since SDL2 uses less VRAM for the UI.

Beta 164

16 Aug 19:47

Latest commits from llama.cpp: support for Nemotron and EXAONE models

  • from now on, all CPU-only builds use OpenBLAS (compiled statically, including chatTest)