Bookmark this page: it's your one spot for everything. Hit Copy on any command and paste it into PowerShell. I keep it updated whenever you ask.
Last updated: 4 Jul 2026 · killa upgraded to a bigger, smarter model (see the 🧠 box below) · still £0
    ollama run killa

Type /bye to leave. The first message after a boot takes a few seconds (it's loading your brain into the 4090). Want the pretty chat box or your phone? See sections 2 & 3.

Pull the models:

    ollama pull qwen2.5:14b; ollama pull qwen2.5:32b

Download and unpack the killa-brain files:

    curl.exe -L https://llama-chat-8e6.pages.dev/killa-brain.zip -o "$env:USERPROFILE\Downloads\killa-brain.zip"; Expand-Archive "$env:USERPROFILE\Downloads\killa-brain.zip" -DestinationPath "$env:USERPROFILE\Downloads\killa-brain" -Force

Build killa (14B, daily) + killa-32b (32B, max):

    powershell -ExecutionPolicy Bypass -File "$env:USERPROFILE\Downloads\killa-brain\build.ps1"

Pick killa (fast) or killa-32b (smartest) from the model dropdown. A/B them, tell me which wins. The old llama3.1:8b can stay or be binned later, your call. Still £0, all on your GPU.

Download and unpack the chat server files:

    curl.exe -L https://llama-chat-8e6.pages.dev/llama-chat-server.zip -o "$env:USERPROFILE\Downloads\llama-chat-server.zip"; Expand-Archive "$env:USERPROFILE\Downloads\llama-chat-server.zip" -DestinationPath "$env:USERPROFILE\Downloads\llama-chat-server" -Force

Setup steps. If something's already done, skip it: running twice does no harm.
Install Ollama (skip if ollama --version works):

    winget install Ollama.Ollama

qwen2.5:14b (~9GB) is the daily driver; the 32B (~20GB) is optional for max quality.

    ollama pull qwen2.5:14b; ollama pull qwen2.5:32b

Build the killa model once. This creates killa (14B) and, if you pulled it, killa-32b (32B).

    curl.exe -L https://llama-chat-8e6.pages.dev/killa-brain.zip -o "$env:USERPROFILE\Downloads\killa-brain.zip"; Expand-Archive "$env:USERPROFILE\Downloads\killa-brain.zip" -DestinationPath "$env:USERPROFILE\Downloads\killa-brain" -Force; powershell -ExecutionPolicy Bypass -File "$env:USERPROFILE\Downloads\killa-brain\build.ps1"

Check it answers:

    ollama run killa "who are you and where do you run?"

Install Node.js and cloudflared:

    winget install OpenJS.NodeJS.LTS; winget install Cloudflare.cloudflared

Add a dashboard shortcut to your Startup folder and open it now:

    $u='https://llama-chat-8e6.pages.dev'; $s=[Environment]::GetFolderPath('Startup'); Set-Content "$s\Killa 4090 Dashboard.url" "[InternetShortcut]`r`nURL=$u" -Encoding ASCII; Start-Process $u

To undo that, delete Killa 4090 Dashboard.url from shell:startup (paste that into the Run box / File Explorer bar).

    ollama run killa

Type /bye to exit.

Runs a little server here so your phone (or mum's, when she's round) can use it too.
Download and unpack the chat server files:

    curl.exe -L https://llama-chat-8e6.pages.dev/llama-chat-server.zip -o "$env:USERPROFILE\Downloads\llama-chat-server.zip"; Expand-Archive "$env:USERPROFILE\Downloads\llama-chat-server.zip" -DestinationPath "$env:USERPROFILE\Downloads\llama-chat-server" -Force

Start it:

    cd "$env:USERPROFILE\Downloads\llama-chat-server"; powershell -ExecutionPolicy Bypass -File .\start-here.ps1

Two windows: one runs the server (with a password), one opens the public link. This way you can SEE it working (the old one-click hid its errors).
Chats are logged to chat-log.jsonl in the server folder (on this 4090, private, never leaves). That's your training-data firehose: the more your mates chat, the more real examples you bank. Peek at it any time with the command in Handy commands.

    curl.exe -L https://llama-chat-8e6.pages.dev/llama-chat-server.zip -o "$env:USERPROFILE\Downloads\llama-chat-server.zip"; Expand-Archive "$env:USERPROFILE\Downloads\llama-chat-server.zip" -DestinationPath "$env:USERPROFILE\Downloads\llama-chat-server" -Force

Change mumpass123 to any password you like. Leave this window open.

    cd "$env:USERPROFILE\Downloads\llama-chat-server"; $env:CHAT_PASSWORD="mumpass123"; node serve.mjs

In a second window, open the tunnel and use the https://…trycloudflare.com link it prints.

    cloudflared tunnel --url http://localhost:8080

This is real training: baking your voice into the model's weights (a LoRA), not just notes it re-reads. Think muscle memory instead of a briefing. Do the steps in order.
The current model stays killa until the new one earns the name.
Download and unpack the training kit, then open the folder:

    curl.exe -L https://llama-chat-8e6.pages.dev/killa-train.zip -o "$env:USERPROFILE\Downloads\killa-train.zip"; Expand-Archive "$env:USERPROFILE\Downloads\killa-train.zip" -DestinationPath "$env:USERPROFILE\Downloads\killa-train" -Force; explorer "$env:USERPROFILE\Downloads\killa-train"

Open README.md in there and give it a read: it's the honest version of all this.

Install WSL:

    wsl --install

Then, inside WSL:

    nvidia-smi && pip install unsloth

nvidia-smi should list your 4090 (proves the GPU's visible). pip install unsloth pulls the whole trainer + downloads the base model (~5GB) on first run. This is the bit that can fight you: if it errors, copy me the exact red line and I'll unstick it. Not pretending it's one-click.

    cd /mnt/c/Users/$USER/Downloads/killa-train && python train.py

If your WSL username differs from your Windows one, swap $USER for your Windows user folder name. Training writes killa-tuned/, a ready-to-load model. Watch the loss tick down. Leave it cooking.

Load it as killa2, then ask both the same thing and see who's better.

    cd /mnt/c/Users/$USER/Downloads/killa-train; ollama create killa2 -f killa-tuned/Modelfile; ollama run killa2 "who are you and where do you run?"

    ollama run killa "who are you and where do you run?"

If killa2 wins, tell me and we promote it to killa. If not, we grow the data and go again. Proof, not faith.

Handy commands

    ollama stop killa
    ollama list
    ollama ps
    ollama pull llama3.3:70b
    curl.exe -L https://llama-chat-8e6.pages.dev/llama-code.zip -o "$env:USERPROFILE\Downloads\llama-code.zip"; Expand-Archive "$env:USERPROFILE\Downloads\llama-code.zip" -DestinationPath "$env:USERPROFILE\Downloads\llama-code" -Force; code "$env:USERPROFILE\Downloads\llama-code"

Edit brain.md in the killa-brain folder, then:

    cd "$env:USERPROFILE\Downloads\killa-brain"; powershell -ExecutionPolicy Bypass -File .\build.ps1

Peek at the chat log:

    notepad "$env:USERPROFILE\Downloads\llama-chat-server\chat-log.jsonl"
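
Once the log gets long, notepad stops being a nice way to peek at it. Here's a rough Python sketch that counts what you've banked instead. It only assumes the file is JSONL (one JSON object per line) and sits where the notepad command above points. I haven't assumed any field names, so it simply reports whichever fields it finds. `summarize_jsonl` is a name made up for this example, not something shipped in the server zip.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_jsonl(path):
    """Count valid records, unreadable lines, and field names in a JSONL file."""
    records = 0
    bad_lines = 0
    fields = Counter()
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            try:
                entry = json.loads(line)
            except json.JSONDecodeError:
                bad_lines += 1  # e.g. a line cut short mid-write
                continue
            records += 1
            if isinstance(entry, dict):
                fields.update(entry.keys())  # tally which fields show up
    return records, bad_lines, fields


if __name__ == "__main__":
    # Assumed location, taken from the notepad command above.
    log_path = Path.home() / "Downloads" / "llama-chat-server" / "chat-log.jsonl"
    if log_path.exists():
        records, bad_lines, fields = summarize_jsonl(log_path)
        print(f"{records} records, {bad_lines} unreadable lines")
        for name, count in fields.most_common():
            print(f"  {name}: {count}")
    else:
        print(f"No log found at {log_path}")
```

Save it as something like peek_log.py and run it with python. The record count gives a rough idea of how much training data you've got before you bother with another train.py run.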