BenchLLM
Open-source test & evaluation suite for LLM-powered apps.

Live environments for training real-world AI agents.
Benchmark and compare LLM training solutions for 2023.

LMArena is an open, community-driven platform designed to evaluate and compare large language models (LLMs) through anonymous, randomized...