Creating Private LLMs

I want to open this post by stating that privacy within large language models (LLMs) is a mammoth topic that spans much more than can be covered in a single post. Instead, I want to narrow the focus of this post to showcase some approaches to introducing proprietary data into LLMs, with the privacy and safety of sensitive data at the forefront. In a study done by the AI Accelerator Institute, data privacy was the second biggest barrier to adopting LLMs within their company...

December 18, 2023 · 4 min · James Malcolm

Handling multiple interactions with Langchain

There are many tutorials on getting started with Langchain and LLMs to create simple chat applications. In this post, I want to go slightly beyond that and go into a bit of detail on the role memory has in chat applications, and lastly touch on how you can scale your application across multiple sessions and multiple users. What is Langchain? Langchain is an open-source Python package that helps in creating LLM solutions....

October 24, 2023 · 5 min · James Malcolm

Counting Pennies - Deploy or buy GenAI?

In this post, we explore the cost of deploying or buying your generative AI. Specifically, I want to focus on the computing cost, not the additional costs that contribute to the total cost of ownership. I want to explore three options: Managed (use OpenAI directly); Self-managed (deploy using AWS); Self-managed (deploy using Google Cloud). This post is part of my wider LLM series: Handling multiple interactions with Langchain; LLM Risks - Prompt Injection. Or a full list of posts, available here....

August 7, 2023 · 5 min · James Malcolm

LLM Risks - Prompt Injection

Generative AI models are all the rage nowadays. For data people, generative models have been around for several years, but the power and usability of products such as ChatGPT have taken the world by storm. This emergence has brought new security risks with it. One of the largest and most novel risks is prompt injection. Prompt injection attacks can affect all large language and generative AI models....

June 15, 2023 · 4 min · James Malcolm