This repository demonstrates how to perform LLM prompt evaluation, deterministic testing, and regression detection using PromptFoo. It is designed to help developers, researchers, and QA teams ensure that prompt or model updates produce consistent and reliable results.
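To show the shape of a typical PromptFoo setup, here is a minimal configuration sketch. It is illustrative rather than this repository's actual config: the prompt text, provider name, and test values are assumptions, and `temperature: 0` is pinned so runs are deterministic and assertion failures point to prompt or model changes rather than sampling noise.

```yaml
# promptfooconfig.yaml -- illustrative example, not this repo's real config
prompts:
  - "Summarize the following text in one sentence: {{text}}"

providers:
  # Temperature 0 keeps outputs (mostly) deterministic, so a failed
  # assertion signals a real regression instead of random variation.
  - id: openai:gpt-4o-mini
    config:
      temperature: 0

tests:
  - description: "Summary must mention the key subject"
    vars:
      text: "PromptFoo runs prompts against fixed test cases and asserts on the outputs."
    assert:
      - type: contains
        value: "PromptFoo"
```

Running `npx promptfoo@latest eval` evaluates every prompt/provider/test combination and reports which assertions passed, so the same config can be re-run in CI after each prompt or model update.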
The repository also takes a hands-on look at Deepeval, an open-source framework for evaluating and red-teaming large language models (LLMs). It documents my journey of testing, benchmarking, and improving LLM reliability using custom prompts, metrics, and pipelines.
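Unlike PromptFoo's config-driven workflow, Deepeval tests are plain Python. Below is a minimal sketch of a metric-based test; the question, answer, and 0.7 threshold are illustrative assumptions rather than values from this repository, and the built-in `AnswerRelevancyMetric` uses an LLM judge under the hood, so an API key for the judge model must be configured.

```python
# test_relevancy.py -- a minimal Deepeval sketch; the strings and the
# 0.7 threshold are illustrative, not values from this repository.
from deepeval import assert_test
from deepeval.metrics import AnswerRelevancyMetric
from deepeval.test_case import LLMTestCase


def test_answer_relevancy():
    # Scores how relevant the model's answer is to the input question;
    # the test fails if the metric score drops below the threshold.
    metric = AnswerRelevancyMetric(threshold=0.7)
    test_case = LLMTestCase(
        input="What does Deepeval do?",
        actual_output="Deepeval evaluates LLM outputs with pluggable metrics.",
    )
    assert_test(test_case, [metric])
```

Running `deepeval test run test_relevancy.py` executes the file like a pytest suite and prints per-metric scores, which makes it straightforward to wire into the same CI regression checks as the PromptFoo evals.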