Releases · fairyshine/FastMindAPI
Version 0.0.9
New Features:
- Tokenize / Detokenize
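A minimal sketch of how the new tokenize/detokenize feature might be exercised over HTTP. The routes, port, and payload field names below are illustrative assumptions, not the documented FastMindAPI API:

```python
import requests

BASE = "http://127.0.0.1:8000"  # assumed default server address

# Tokenize text into token IDs (assumed route and field names)
ids = requests.post(f"{BASE}/model/mymodel/tokenize",
                    json={"input_text": "Hello, world!"}).json()["token_ids"]

# Detokenize the IDs back into text (assumed route and field names)
text = requests.post(f"{BASE}/model/mymodel/detokenize",
                     json={"token_ids": ids}).json()["text"]
print(text)  # expected to round-trip to "Hello, world!"
```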
Improvement:
- Print model generation I/O to the logger (enabled via config.load_model_io=True).
Bug fixes:
- Adjusted how optional kwargs are passed in.
- transformers generate now works with custom generation configs.
- vLLM generate outputs with logprobs sometimes contain -inf, which the JSON encoder cannot serialize (see the sketch after this list).
- llama.cpp chat with logits output.
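To illustrate the -inf logprobs issue, a standard-library sketch of why strict JSON encoding fails and one possible workaround (this is not FastMindAPI's actual fix):

```python
import json
import math

logprobs = [-0.12, -3.5, float("-inf")]  # vLLM can emit -inf logprobs

# Strict JSON has no representation for -Infinity, so encoding fails:
try:
    json.dumps(logprobs, allow_nan=False)
except ValueError as err:
    print(err)  # "Out of range float values are not JSON compliant"

# One possible workaround: clamp non-finite values before encoding.
safe_logprobs = [lp if math.isfinite(lp) else -1e9 for lp in logprobs]
print(json.dumps(safe_logprobs))  # [-0.12, -3.5, -1000000000.0]
```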
Version 0.0.8
New Features:
- Support vLLM now!
Improvement:
- Explicit model type
- Better type hints for parameters
Bug fixes:
- Refined optional kwargs settings.
Version 0.0.7
Improvement:
- Refine the MCTS algorithm.
Bug fixes:
- Incorrect parameters and wrong logits outputs with the /chat/completions API.
Version 0.0.6
New Features:
For transformers, llama.cpp, and OpenAI models:
- Generate with logits and probs output.
- Handle requests through the OpenAI-like API (/chat/completions/).
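A sketch of a request against the /chat/completions/ route. The payload follows the standard OpenAI chat format that this endpoint mimics; the host/port and the registered model name are assumptions:

```python
import requests

resp = requests.post(
    "http://127.0.0.1:8000/chat/completions/",  # assumed local server address
    json={
        "model": "mymodel",  # assumed registered model name
        "messages": [{"role": "user", "content": "Hello!"}],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```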
Bug fixes:
- Standardize logits output format for generate method.
Version 0.0.5
New Features:
- Support OpenAI-format-like requests to "/chat/completions/" (still under improvement).
- Add Token Authentication for security.
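A sketch of authenticating a request against the token check. It assumes a standard "Authorization: Bearer <token>" header; the actual header name and scheme FastMindAPI expects may differ:

```python
import requests

resp = requests.post(
    "http://127.0.0.1:8000/chat/completions/",  # assumed server address
    headers={"Authorization": "Bearer YOUR_API_TOKEN"},  # assumed scheme
    json={"model": "mymodel",
          "messages": [{"role": "user", "content": "Hi"}]},
)
resp.raise_for_status()  # a 401/403 here would indicate a rejected token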
Bug fixes:
- Failure to load PeftModel.
Version 0.0.4
New Features:
- Support Function Calling now!
Build your own tool set following https://fairyshine.github.io/FastMindAPI/Development/FunctionLibrary/
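The linked docs describe the real tool-set mechanism; as a hypothetical sketch only, a tool could be a plain Python function kept in a name-to-callable registry with a small dispatcher:

```python
def get_weather(city: str) -> str:
    """Toy tool: return a canned weather report for a city."""
    return f"It is sunny in {city}."

# Assumed registry and dispatcher, as a function-calling loop might use.
TOOLS = {"get_weather": get_weather}

def call_tool(name: str, **kwargs):
    return TOOLS[name](**kwargs)

print(call_tool("get_weather", city="Paris"))  # It is sunny in Paris.
```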
Version 0.0.3
New Features:
- Run the server from the CLI with the fastmindapi-server command.
- Generation stops at specified strings.
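To illustrate the stop-string semantics, a self-contained helper that truncates generated text at the first occurrence of any stop string (this mirrors the feature's behavior, not FastMindAPI's internal implementation):

```python
def apply_stop_strings(text: str, stop_strings: list[str]) -> str:
    """Truncate text at the earliest occurrence of any stop string."""
    cut = len(text)
    for s in stop_strings:
        idx = text.find(s)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]

print(apply_stop_strings("Answer: 42\nQuestion: what next?", ["\nQuestion:"]))
# -> "Answer: 42"
```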
Bug fixes:
- Some hints are displayed incorrectly.
Version 0.0.2
New Features:
- Build the client/server (C/S) framework.
- Generate output with logits for transformers model.
- Support PeftModelForCausalLM.
Version 0.0.1
New Features:
- Load models from the transformers / llama.cpp libraries.
- Generate output with the specified model.
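A hypothetical end-to-end sketch of this first release's workflow, loading a transformers model and generating from it over HTTP. The routes and payload fields are illustrative assumptions; consult the FastMindAPI docs for the real client/server API:

```python
import requests

BASE = "http://127.0.0.1:8000"  # assumed default server address

# Assumed "load model" route and payload fields
requests.post(f"{BASE}/model/load",
              json={"model_name": "mymodel",
                    "model_type": "transformers",
                    "model_path": "/path/to/model"})

# Assumed "generate" route and payload fields
resp = requests.post(f"{BASE}/model/mymodel/generate",
                     json={"input_text": "Once upon a time",
                           "max_new_tokens": 32})
print(resp.json())
```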