Exploring Byte Level Transformers.
Last December, Meta AI released a paper describing their Byte Latent Transformer (BLT), and today (13 May 2025) they released the weights on Hugging Face. Let’s break down the paper and explore what makes BLT a special model.

What is BLT? Let’s break down the name:

• Byte: This signifies that the architecture operates directly on raw byte data.
• Latent: This refers to the way BLT processes the byte data. Instead of processing every individual byte in the main computation layer (which would be prohibitively costly), BLT groups bytes into patches, and the main transformer operates on these latent patch representations (see the sketch after this list).
• Transformer: This indicates that BLT is an LLM architecture based on the Transformer model.

...
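To make the "byte" and "latent" parts concrete, here is a minimal Python sketch of the idea, not Meta's implementation: text stays as raw bytes, and the bytes are grouped into patches before the expensive transformer ever sees them. The fixed patch size and the function names are assumptions for illustration only; the actual BLT chooses patch boundaries dynamically rather than at fixed intervals.

```python
def text_to_bytes(text: str) -> list[int]:
    """Raw UTF-8 bytes: the 'vocabulary' is just 256 possible values."""
    return list(text.encode("utf-8"))


def fixed_size_patches(byte_ids: list[int], patch_size: int = 4) -> list[list[int]]:
    """Group bytes into patches; only patch representations reach the big latent transformer.
    (Illustrative fixed-size grouping; the real BLT picks boundaries dynamically.)"""
    return [byte_ids[i:i + patch_size] for i in range(0, len(byte_ids), patch_size)]


byte_ids = text_to_bytes("Byte Latent Transformer")
patches = fixed_size_patches(byte_ids)

print(len(byte_ids))  # 23 -- every byte would be a step in a naive byte-level model
print(len(patches))   # 6  -- far fewer steps for the main computation layer
```

The point of the sketch is the ratio between the two printed numbers: the heavy latent transformer runs once per patch instead of once per byte, which is what keeps byte-level modeling affordable.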