llm-testing topic

List llm-testing repositories

langtest

549
Stars
50
Forks
549
Watchers

Deliver safe & effective language models

RagaAI-Catalyst

16.1k
Stars
3.7k
Forks
16.1k
Watchers

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal...

llamator

180
Stars
16
Forks
180
Watchers

Framework for testing vulnerabilities of large language models (LLM).

contextcheck

91
Stars
12
Forks
91
Watchers

MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.

g4f-working

49
Stars
9
Forks
49
Watchers

g4f-working is a daily-updated list of working no-auth AI providers and models from @xtekky/gpt4free. It helps developers, testers, and AI enthusiasts instantly find which models are currently online...

llm-api-test

40
Stars
2
Forks
40
Watchers

A tool for testing and comparing the performance of different Large Language Model APIs. 一个用于测试和比较不同大语言模型API性能的工具。

onerun

17
Stars
0
Forks
17
Watchers

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord: https://discord....