GitHub - DaveInchy/llama.native.js



`npm i --save llama.native.js@1.0.0`

A Socket.IO server that runs inference on models stored in your filesystem. Based on llama.cpp.

Requirements

Compile / build / make llama.cpp for your OS and place the resulting executable relative to this directory.

I would rather you build this yourself, as it's not hard and only the end result matters.

If I could license llama.cpp, I could include both binary executable files.
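Once the executable is built, calling it from Node could look roughly like the sketch below. This is not this package's actual code: the binary path `./llama.cpp/main` and the names `buildLlamaArgs` and `runInference` are assumptions made up for illustration. The `-m`, `-p` and `-n` flags are the model, prompt and token-count options of llama.cpp's `main` example program; check `--help` on your own build, since flags change between versions.

```typescript
import { spawn } from "node:child_process";

// Hypothetical location of the compiled llama.cpp executable; adjust to wherever you placed it.
const LLAMA_BINARY = "./llama.cpp/main";

interface InferenceOptions {
  modelPath: string;  // path to a ggml .bin model file
  prompt: string;     // text to complete
  maxTokens?: number; // number of tokens to generate (defaults to 128 here)
}

// Build the argument list for llama.cpp's `main` program.
function buildLlamaArgs(opts: InferenceOptions): string[] {
  return ["-m", opts.modelPath, "-p", opts.prompt, "-n", String(opts.maxTokens ?? 128)];
}

// Run the binary once and resolve with everything it wrote to stdout.
function runInference(opts: InferenceOptions, binary: string = LLAMA_BINARY): Promise<string> {
  return new Promise((resolve, reject) => {
    const child = spawn(binary, buildLlamaArgs(opts));
    let output = "";
    child.stdout.on("data", (chunk: Buffer) => {
      output += chunk.toString("utf8");
    });
    // Fires when the binary is missing or cannot be started.
    child.on("error", reject);
    child.on("close", (code) => {
      if (code === 0) resolve(output);
      else reject(new Error(`llama.cpp exited with code ${code}`));
    });
  });
}
```

Passing the arguments as an array (instead of building one shell string) means the prompt does not need any shell escaping.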
You need a ggml-type model, which runs on most devices.

You can find 7B or 13B ggml models on Hugging Face. I would recommend following someone who is quick to convert new models to ggml. ggml models run on the CPU, so they work on any host machine. The file extension is .bin, and you can download the files or clone the Hugging Face repos.

Hugging Face stores all of these files with Git LFS (in other words, they are HUGE files), so most models cannot be uploaded to GitHub. Even then, I am not able to upload more than a simple 7B-parameter model, which is why no llama / alpaca ggml models are included.
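Since the models are not bundled, you have to point the server at whatever you downloaded. A minimal sketch of finding candidate model files by their .bin extension is shown here. The function names and the `./models` directory are hypothetical and not part of this package, and the extension check alone does not prove a file is a valid ggml model.

```typescript
import { readdirSync } from "node:fs";
import { join } from "node:path";

// Keep only the names that end in .bin (case-insensitive), the extension used by ggml models.
function filterModelFiles(fileNames: string[]): string[] {
  return fileNames.filter((name) => name.toLowerCase().endsWith(".bin"));
}

// List candidate model paths inside a directory (not recursive).
function listModels(modelDir: string): string[] {
  return filterModelFiles(readdirSync(modelDir)).map((name) => join(modelDir, name));
}

// Usage (assumes a ./models directory exists next to the script):
// console.log(listModels("./models"));
```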

Thanks to TheBloke (on Patreon): I have depended on his quantizations and Hugging Face repos, which provide near-real-time conversions of many different types of LLMs, like the llama model and all the iterations we use today. He gives developers who don't have the funds to generate or convert these models themselves a way to get them, through frequently updated repositories covering all the parameter sizes available.
