What's next? Next word prediction with PyTorch

Today, I will take you through a simple next-word prediction model built using PyTorch. Next-word prediction is a form of language modelling, and the model here draws on research published by Google for Gmail’s Smart Compose feature. The knowledge learnt here forms the basis for large language models, even though the architecture differs. Smart Compose takes the few words a user has typed and predicts the words or sentences that follow in the email being written....

November 8, 2023 · 7 min · James Malcolm

PyTorch Autograd and Training Models

Recently I’ve found myself wanting to dig a bit deeper into PyTorch to really understand how it works and, hopefully, to expand my skillset and try new things. To aid this, I’ve been following the Deep Learning with PyTorch book by Eli Stevens, Luca Antiga and Thomas Viehmann. The book has been great and highly useful, and I recommend it to anyone. In my observations, one of the hardest things for newer users of PyTorch to understand is how to train a model....

February 1, 2021 · 6 min · James Malcolm