Engineering Blog

Building an Unbeatable Tic-Tac-Toe Agent

How I used reinforcement learning to create an agent that never loses.

Exploring the architecture of a RAG pipeline for Pakistani traffic laws.

A deep dive into fine-tuning a small LLM for instruction following.