🤖 Agent Evaluation Runner (OpenAI)

Using GPT-4o-mini: fast, cheap, no quota issues!


Results
