Exploring Byte-Level Transformers

Last December, Meta AI released a paper describing their Byte Latent Transformer (BLT), and today (13 May 2025) they released the weights to Hugging Face. Let’s break down the paper and explore what makes BLT a special model. What is BLT? Start with the name:

• Byte: This signifies that the architecture operates directly on raw byte data
• Latent: This refers to the way BLT processes the byte data. Instead of processing every individual byte in the main computation layer (which would be prohibitively costly), BLT groups bytes into dynamically sized patches and runs its main transformer over those latent patch representations
• Transformer: This indicates that BLT is an LLM architecture based on the Transformer model ...
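To make the patching idea concrete, here is a toy sketch of entropy-based byte patching. This is my own illustration, not Meta's implementation: the real BLT cuts patches where a small learned byte-level language model assigns high next-byte entropy, whereas this stand-in uses a simple frequency-count entropy over the current patch.

```python
# Toy illustration of entropy-based byte patching (not Meta's implementation).
import math
from collections import Counter

def byte_entropy(window: bytes) -> float:
    """Shannon entropy (bits) of a unigram model over the bytes in `window`.
    A stand-in for BLT's learned next-byte entropy model."""
    if not window:
        return 8.0  # maximum uncertainty for a byte
    counts = Counter(window)
    total = len(window)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def patch_bytes(data: bytes, threshold: float = 1.5) -> list[bytes]:
    """Group raw bytes into variable-length patches, starting a new
    patch once the current patch's entropy exceeds `threshold`."""
    patches, start = [], 0
    for i in range(1, len(data)):
        if byte_entropy(data[start:i]) > threshold:
            patches.append(data[start:i])
            start = i
    if start < len(data):
        patches.append(data[start:])
    return patches

print(patch_bytes(b"aaaabbbb hello world"))
```

Low-entropy runs (like repeated bytes) end up in long patches, while unpredictable regions get cut into shorter ones, which is the intuition behind spending the large model's compute per patch rather than per byte.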

May 12, 2025 · 4 min · James Malcolm

Learning and applying Deepseek techniques

In January 2025, Deepseek made headlines with the release of their Deepseek R1 models and a suite of smaller models distilled from the larger R1 variant. The announcement sent shockwaves through the market, shaking the NASDAQ and causing NVIDIA shares to drop nearly 20% in a single day. Although the performance of these models wasn’t the only factor, Deepseek’s innovation called into question the competitive advantage long held by US-based AI giants. ...
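The excerpt mentions distillation, the technique behind those smaller models. As a generic illustration only (not Deepseek's actual recipe, which fine-tuned smaller models on R1-generated outputs), here is a minimal logit-distillation sketch in PyTorch; the teacher, student, and batch are dummy stand-ins:

```python
# Generic knowledge-distillation sketch: a small "student" learns to
# match a larger frozen "teacher's" softened output distribution.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
vocab, dim = 100, 32
teacher = torch.nn.Linear(dim, vocab)   # stand-in for a large frozen model
student = torch.nn.Linear(dim, vocab)   # smaller model being trained
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
T = 2.0  # temperature softens the teacher's distribution

x = torch.randn(8, dim)  # dummy batch of hidden states
with torch.no_grad():
    teacher_probs = F.softmax(teacher(x) / T, dim=-1)
student_log_probs = F.log_softmax(student(x) / T, dim=-1)
# KL divergence between student and teacher, scaled by T^2 (Hinton et al.)
loss = F.kl_div(student_log_probs, teacher_probs,
                reduction="batchmean") * T**2
loss.backward()
opt.step()
print(f"distillation loss: {loss.item():.4f}")
```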

February 1, 2025 · 5 min · James Malcolm

Generative AI reflections for 2024

The year 2024 was an interesting one for Generative AI. It certainly wasn’t bigger than 2022 and 2023, when Generative AI went mainstream through the release of ChatGPT. On the other hand, it wasn’t a ‘nothing’ year either. Sure, Generative AI is more widespread and prevalent than in previous years, helped by both Android and Apple deploying generative AI on mobile devices. But as I look back, I feel it was a year of consolidation: consolidation among the big players, consolidation and confidence in the technology, and consolidation in regulation. ...

December 16, 2024 · 4 min · James Malcolm

Creating Private LLMs

I want to open this post by stating that privacy within large language models (LLMs) is a mammoth topic that spans much more than can be covered in a single post. Instead, I want to narrow the focus to showcase some approaches to introducing proprietary data into LLMs, with the privacy and safety of sensitive data at the forefront. In a study by the AI Accelerator Institute, respondents cited data privacy as the second-biggest barrier to adopting LLMs within their company ...
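One family of approaches keeps proprietary data out of the model weights entirely: retrieve relevant passages at query time and supply them in the prompt. Below is a minimal sketch of that retrieval step, my own illustration rather than the post's method, using a toy character-count embedding; a real system would use a privately hosted embedding model and vector store.

```python
# Sketch of retrieval over a private document store: relevant passages
# are found locally and only the assembled prompt reaches the LLM.
import numpy as np

documents = [
    "Our refund policy allows returns within 30 days.",
    "Internal API keys rotate every 90 days.",
    "The sales playbook recommends a two-call close.",
]

def embed(text: str) -> np.ndarray:
    """Toy bag-of-characters embedding; swap in a real sentence-embedding
    model hosted inside your own infrastructure."""
    v = np.zeros(256)
    for ch in text.lower():
        v[ord(ch) % 256] += 1.0
    return v / (np.linalg.norm(v) + 1e-9)

doc_vecs = np.stack([embed(d) for d in documents])

def retrieve(query: str, k: int = 1) -> list[str]:
    """Return the k documents most similar to the query."""
    scores = doc_vecs @ embed(query)
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

query = "How long do customers have to return a product?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)  # this prompt would be sent to the (private) LLM
```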

December 18, 2023 · 4 min · James Malcolm

What's next? Next word prediction with PyTorch

Today, I will take you through a simple next-word prediction model built using PyTorch. The inspiration for this is, of course, predictive text, or more specifically Google’s Smart Compose. At its core, the Google Smart Compose model is a form of language model: it takes the few words a user has typed and predicts the words or sentences that follow in the email being written. Google details how they built the Smart Compose feature in their research blog post here. From this, I want to pull out some requirements for building a successful Smart Compose model: ...
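To give a flavour of what the post builds, here is a minimal next-word language model in PyTorch. The tiny corpus, architecture, and hyperparameters are illustrative placeholders, not the post's actual code:

```python
# Minimal next-word prediction: an LSTM language model trained to
# predict each word in a corpus from the words before it.
import torch
import torch.nn as nn

corpus = "thanks for your email i will get back to you soon".split()
vocab = sorted(set(corpus))
stoi = {w: i for i, w in enumerate(vocab)}

# Build (current word -> next word) training pairs from the corpus.
xs = torch.tensor([stoi[w] for w in corpus[:-1]])
ys = torch.tensor([stoi[w] for w in corpus[1:]])

class NextWordLSTM(nn.Module):
    def __init__(self, vocab_size: int, dim: int = 32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.lstm = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, tokens):
        h, _ = self.lstm(self.embed(tokens).unsqueeze(0))
        return self.out(h.squeeze(0))  # logits at every position

model = NextWordLSTM(len(vocab))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

for step in range(200):  # overfit the tiny corpus as a sanity check
    opt.zero_grad()
    loss = loss_fn(model(xs), ys)
    loss.backward()
    opt.step()

# Predict the word following "i will get back to"
prefix = torch.tensor([stoi[w] for w in "i will get back to".split()])
pred = model(prefix)[-1].argmax().item()
print(vocab[pred])  # expected: "you"
```

A production Smart Compose-style model adds the requirements the post lists, such as low-latency inference and conditioning on the email's context, but the training loop above is the same next-word objective at its core.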

November 8, 2023 · 7 min · James Malcolm