llm-testing topics

langtest

549

Stars

50

Forks

549

Watchers

Deliver safe & effective language models

Pacific-AI-Corp

benchmarks

ethics-in-ai

large-language-models

llm-test

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal...

raga-ai-hub

agentic-ai-development

agentneo

agents

ai-agent-monitoring

llamator

180

Stars

16

Forks

180

Watchers

Framework for testing vulnerabilities of large language models (LLM).

LLAMATOR-Core

agent

ai

ai-security

attack

contextcheck

91

Stars

12

Forks

91

Watchers

MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.

Addepto

ai-chat

ai-testing

ai-testing-tool

chatbot-framework

g4f-working

49

Stars

9

Forks

49

Watchers

g4f-working is a daily-updated list of working no-auth AI providers and models from @xtekky/gpt4free. It helps developers, testers, and AI enthusiasts instantly find which models are currently online...

Free-AI-Things

ai-access

ai-directory

ai-models

ai-monitoring

llm-api-test

40

Stars

2

Forks

40

Watchers

A tool for testing and comparing the performance of different Large Language Model APIs. 一个用于测试和比较不同大语言模型API性能的工具。

qjr87

api-performance-testing

llm-api-performance

llm-test

llm-testing

onerun

17

Stars

0

Forks

17

Watchers

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord: https://discord....

onerun-ai

ai

ai-agents

ai-testing

chatbot

llm-testing topic

langtest

RagaAI-Catalyst

llamator

contextcheck

g4f-working

llm-api-test

onerun