Monthly Archives: April 2024

Securing large language models with a reverse proxy

In a previous post, I explained how to host a private ChatGPT using Docker and Traefik. I didn’t spend a lot of time on the security aspect of the project. I see many people asking how to expose their large …

Posted in artificial intelligence, Computer, Generative AI, Large Language Models, Linux, Networking, Security

Self-hosted coding assistant with llamafile, continue.dev and docker

There was a recent dramatic improvement in the speed of LLMs on CPU thanks to llamafile's author. She writes about it extensively on her blog, but the short version is: expect 7-billion-parameter models to be usable on consumer-grade CPU …

Posted in artificial intelligence, Computer, Docker, Generative AI, Large Language Models, Linux