A self-hosted, offline, ChatGPT-like chatbot, powered by Llama 2. 100% private, with no data leaving your device.
umbrel.com »
Demo
Demo video: LlamaGPT.mp4
How to install
Install LlamaGPT on your umbrelOS home server
Running LlamaGPT on an umbrelOS home server takes just one click: simply install it from the Umbrel App Store.
Install LlamaGPT anywhere else
You can run LlamaGPT on any x86 or arm64 system. Make sure you have Docker installed.
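If you're not sure your system qualifies, a quick sanity check (assuming Docker Compose v2, i.e. the docker compose plugin rather than the legacy docker-compose binary) is:
# Confirm Docker and the Compose plugin are installed
docker --version
docker compose version
# Confirm the CPU architecture is x86_64 or arm64/aarch64
uname -m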
Then, clone this repo and cd into it:
git clone https://github.com/getumbrel/llama-gpt.git
cd llama-gpt
You can now run LlamaGPT with any of the following models depending upon your hardware:
Model size | Model used | Minimum RAM required | How to start LlamaGPT |
---|---|---|---|
7B | Nous Hermes Llama 2 7B (GGML q4_0) | 8GB | docker compose up -d |
13B | Nous Hermes Llama 2 13B (GGML q4_0) | 16GB | docker compose -f docker-compose-13b.yml up -d |
70B | Meta Llama 2 70B Chat (GGML q4_0) | 48GB | docker compose -f docker-compose-70b.yml up -d |
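The first launch can take a while if the model weights still need to be downloaded. A hedged way to watch what the containers are doing while you wait (docker compose logs is standard; the exact service names depend on the compose file you started):
# Follow the logs of all LlamaGPT services
docker compose logs -f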
You can access LlamaGPT at http://localhost:3000.
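As a quick check that the web UI is serving before you open it in a browser (a sketch, assuming curl is available):
# Expect an HTTP response (200 or a redirect) once the UI container is ready
curl -I http://localhost:3000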
To stop LlamaGPT, shut down its Docker Compose services.
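A minimal sketch of the shutdown step, assuming the standard Docker Compose workflow used by the start commands above:
# Stop and remove the LlamaGPT containers (pass the same -f file you started with)
docker compose down
# For example, if you launched the 13B variant:
docker compose -f docker-compose-13b.yml down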
Acknowledgements
A massive thank you to the following developers and teams for making LlamaGPT possible:
- Mckay Wrigley for building Chatbot UI.
- Georgi Gerganov for implementing llama.cpp.
- Andrei for building the Python bindings for llama.cpp.
- NousResearch for fine-tuning the Llama 2 7B and 13B models.
- Tom Jobbins for quantizing the Llama 2 models.
- Meta for releasing Llama 2 under a permissive license.