Skip to content

Recent Article

Beyond Prompting: RAG with Microsoft Foundry and FoundryIQ

alt text

RAG has become a common term in modern AI discussions but what does it actually mean?

RAG stands for Retrieval Augmented Generation. In simple terms, it is a technique that allows large language models (LLMs) to generate answers using external knowledge, rather than relying only on what was learned during model training.

Let’s make this easier to understand with a concrete example.

From Zero to Stateful: Building a PowerShell Chat Completion with Microsoft Foundry

alt text

Did you know that large language models don’t actually remember your conversations?

Every time you chat with a LLM, it feels continuous and stateful but under the hood, each response is generated from scratch. Any “memory” you experience is carefully reconstructed by the application.

In this post, I’ll walk you through building a PowerShell chat completion using Microsoft Foundry, and we’ll unpack exactly how chat history works, where it’s stored, and why the LLM remains fundamentally stateless.