Implementing Self-Attention - A step-by-step guide using PyTorch

From last couple of months I have been learning the core building blocks of LLM’s using a well known book called Build Large Language Model from Scratch. The book is very well written and highly recommended for someone who is interested in learning the internal architecture of LLM’s, but please note you will need to put some serious time & effort to get hold of it. This writing is inspired by the book hence full credit to the author Sebastian Raschka. ...

April 11, 2026 · Sudheer Tammini

PaddyLab - A Deep Learning Project from Training to Production

With the popularity of ChatGPT there has been a lot of buzz around generative AI and its capabilities. I’ve been using tools like BingChat and I can confidently say that I’m very impressed with our progress in the field of Generative AI. But, please remember AI aka “Deep Learning” is not just limited to Generative AI we can also use it for various tasks such as Computer Vision, Tabular Analysis, Collaborative Filtering etc. ...

October 24, 2023 · Sudheer Tammini