node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
- llama
- llama-cpp
- llama.cpp
- bindings
- ai
- cmake
- cmake-js
- prebuilt-binaries
- llm
- gguf
- metal
- cuda
- vulkan
- grammar
- embedding
- rerank
- reranking
- json-grammar
- json-schema-grammar
- functions
- function-calling
- token-prediction
- speculative-decoding
- temperature
- minP
- topK
- topP
- seed
- json-schema
- raspberry-pi
- self-hosted
- local
- catai
- mistral
- deepseek
- typescript
- lora
- batching
- gpu
- nodejs
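
The description's phrase "at the generation level" suggests the schema is applied while tokens are being chosen, not by validating text afterwards. The following is a toy sketch of that general idea (constrained decoding), not node-llama-cpp's API or implementation. The schema `{"ok": boolean}`, the vocabulary, the scoring function, and every name below are hypothetical and exist only for illustration.

```typescript
// Toy illustration of grammar-constrained decoding.
// Assumption: this is NOT node-llama-cpp code; all names here are hypothetical.

// A "model" maps the text so far to scored candidate tokens.
type Scorer = (prefix: string) => Array<[string, number]>;

// Every complete output permitted by a tiny hypothetical schema: {"ok": boolean}
const allowedOutputs: string[] = ['{"ok":true}', '{"ok":false}'];

// True when `text` can still be extended into some allowed output.
function isValidPrefix(text: string): boolean {
    return allowedOutputs.some((out) => out.indexOf(text) === 0);
}

// Greedy decoding that discards any token that would break the grammar.
function constrainedDecode(score: Scorer, maxSteps: number = 32): string {
    let text = "";
    for (let step = 0; step < maxSteps; step++) {
        // Stop as soon as the text is a complete allowed output.
        if (allowedOutputs.indexOf(text) !== -1) return text;

        let best: string | null = null;
        let bestScore = -Infinity;
        for (const [token, tokenScore] of score(text)) {
            // Mask out tokens that violate the grammar.
            if (!isValidPrefix(text + token)) continue;
            if (tokenScore > bestScore) {
                best = token;
                bestScore = tokenScore;
            }
        }
        if (best === null) throw new Error("no token satisfies the grammar");
        text += best;
    }
    throw new Error("step limit reached before a complete output");
}

// Stand-in model that prefers chatty prose ("Sure", "!") over JSON tokens.
const vocabulary: string[] = ["Sure", "!", "{", '"ok"', ":", "true", "false", "}"];
const toyModel: Scorer = () =>
    vocabulary.map((token, i): [string, number] => [token, vocabulary.length - i]);

const result: string = constrainedDecode(toyModel);
console.log(result);
```

Although the stand-in model scores "Sure" highest, masking leaves only tokens that keep the text a valid prefix of an allowed output, so the result always parses as JSON matching the toy schema. A real implementation would work on token probabilities and a compiled grammar instead of an enumerated list of strings.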