Agenta¶

Open-source LLMOps platform covering the agent-reliability lifecycle: observability, prompt management, evaluation, auto-optimization. Founded by mahmoud-mabrouk.

Positioning¶

End-to-end tooling for teams building production LLM apps who need to: - Instrument traces (observability) - Manage prompts as versioned artifacts - Run offline/online evals - Auto-optimize prompts + judges via algorithms like gepa

Used in the "Judge the Judge" demo to instrument GEPA experiments and inspect generated candidate prompts mid-run.

Cross-references¶

mahmoud-mabrouk — founder
llm-judge-calibration — the workflow Agenta is tooling for