Automated Testing of Large Language Models: A Live Demo

How do you test a large language model’s shape-shifting outputs using automated tests? In this session, Anupam Krishnamurthy will demonstrate the inner workings of a Retrieval Augmented Generation (RAG) model, and how you can qualify it using automated tests.

Spoiler alert: testing an LLM’s outputs will involve using another LLM to judge this output.
We will then examine the stack traces of the evaluation framework, so that you can debug an unexpected score. Join me in this live demonstration, where we pit one LLM against another, and expose a security flaw in the bargain.

Anupam Krishnamurthy, Freelance Test Solution Architect, AnuKrit will host the session “Automated Testing of Large Language Models: A Live Demo” that will take place on Thursday March 5.

Meet world’s leading Test Automation experts! Register now and ensure your place at this unique conference. Get a combi ticket for a fee of € 990,- or register for Day 1 for € 545,- or for Day 2 for € 495,-.

2025-12-04T09:09:24+01:00Thursday, December 4, 2025|

Automated Testing of Large Language Models: A Live Demo

Share this news on social media!