Learning and applying Deepseek techniques
In January 2025, Deepseek made headlines with the release of their Deepseek R1 models and a suite of smaller models distilled from the larger R1 variant. The announcement sent shockwaves through the market鈥攕haking NASDAQ and causing NVIDIA shares to drop nearly 20% in a single day. Although the performance of these models wasn鈥檛 the only factor, Deepseek鈥檚 innovation called into question the competitive advantage long held by US-based AI giants....
How does Trustpilot scoring work?
Trustpilot is a wide-spread consumer review platform. Founded in 2007, they have over 1m reviews posted every month from consumers. Reviewers share their experiences with a company, giving a score, from one to five and a free-text comment. Companies proudly display the scores on emails, websites, and other mediums as a form of social proof. As such, it鈥檚 important to understand how Trustpilot works and how you can influence the scores....
Creating Private LLMs
I want to open this post by stating that privacy within large language models (LLMs) is a mammoth topic that spans much more than can be said in a single post. Instead, I want to narrow the focus of the post to showcase some approaches of introducing proprietary data into LLMs, with privacy and safety of sensitive data at the forefront. In a study done by the AI Accelerator Institute, data privacy was the second biggest barrier to adopting LLMs within their company...
What's next? Next word prediction with PyTorch
Today, I will take you through a simple next-word prediction model built using PyTorch. The inspiration for this, is of course predictive text - or more specifically Google鈥檚 Smart Compose. At its core, the Google Smart Compose model is a form of language model. Smart Compose uses a few words the user inputs and then predicts the following words or sentences in emails you want to write. Google details how they build their Smart Compose feature in their research blog post here....
Handling multiple interactions with Langchain
There are many tutorials on getting started with Langchain and LLMs to create simple chat applications. I want to go slightly beyond this post and go into a bit of detail on the role of memory has in chat applications, and lastly touch on how you can scale your application across multiple sessions and multiple users. What is Langchain? Langchain is an open-source python package that helps in creating LLM solutions....