Llama 3.2 running on a Web browser. Designed in JavaScript and WebAssembly. The model may take several minutes to download (708 MB) and your Web browser must support WebGPU because it's necessary to run the WebLLM engine. The model has 1.23 billion parameters and supports English, German, French, Italian, Portuguese, Hindi, Spanish and Thai.
https://lrusso.github.io/Llama/Llama.htm
- Install Node.js.
- Open the Terminal.
- Go to the project folder.
- Run
node server.js
- Browse to http://localhost
- Open the Terminal.
- Install Forever:
npm install -g forever
- Go to the project folder.
- Start the server:
forever start -a -l /dev/null -c node server.js > /dev/null 2>&1
- Stop the server:
forever stop -a -l /dev/null -c node server.js > /dev/null 2>&1