Soulmates conversational AI E2E framework
An LLM-simulated user drives real conversations through the production pipeline; a three-tier judge scores every turn. Pass/fail is computed in code.
// 01 · SIMULATED USER (LLM PERSONA · SMALL FAST MODEL)
UserPersona — name · background · traits
Reads the agent's reply and autonomously writes the next user turn, in character.

// 02 · PRODUCTION PIPELINE (REAL INFRA, ONLY DELIVERY MOCKED)
Orchestrator — stage machine · evaluators
Postgres — PGlite + pgvector · real schema
Agent (Ori) — large reasoning model
Outbound delivery — captured by a callback (the only mock): no WhatsApp call, just the intercepted text.
Async events and all production code paths are exercised; only the outbound message is stubbed. The loop alternates user msg → agent reply.

// 03 · THREE-TIER EVALUATOR — RUNS ON EACH AGENT TURN
Tier 1 — Deterministic rules. Pure code, no LLM: max questions per turn · max bubbles · forbidden phrasings (allow-list) · lowercase · no em-dashes · no placeholder leaks. A hard fail ends the test immediately, with zero LLM tokens spent.
Tier 2 — Soft rules. Detected by code, injected as judge context: "the response broke the bubble limit by 1 — weigh that in your style score". Code detects · the judge weighs.
Tier 3 — LLM judge. Large model, separate call, once per turn. Scores 1–10 on 4 axes:
· CHARACTER — sounds like Ori?
· GUIDANCE — follows the stage?
· CONVERSATION — reacts to the user?
· STYLE — formatting compliance?
Strict XML output, parsed in code.

// 04 · VERDICT — COMPUTED IN CODE, NEVER ASKED OF THE LLM
pass = average ≥ 6.5 && min ≥ 5
The LLM weighs · the verdict stays ours.
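The simulated-user step (section 01) can be sketched as a prompt builder: persona fields plus the transcript and the agent's last reply become the context for the small fast model. The interface and wording below are illustrative assumptions, not the framework's actual prompt.

```typescript
// Hypothetical persona prompt builder for the simulated-user turn.
// Field names and prompt wording are assumptions for illustration.
interface UserPersona {
  name: string;
  background: string;
  traits: string[];
}

function nextUserTurnPrompt(
  persona: UserPersona,
  transcript: string[],
  agentReply: string,
): string {
  return [
    `you are ${persona.name}, ${persona.background}. traits: ${persona.traits.join(", ")}.`,
    `conversation so far:\n${transcript.join("\n")}`,
    `the agent just said: "${agentReply}"`,
    `write your next message, in character, as this user would.`,
  ].join("\n\n");
}
```

The output of this function would be sent to the small fast model; its completion becomes the next user message fed into the real pipeline.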
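The Tier 3 judge returns strict XML that code parses into numeric scores. A sketch of that parsing step, assuming one tag per axis named after the axis itself (the real tag names and schema may differ):

```typescript
// Hypothetical parser for the judge's strict XML output.
// Tag names mirror the four axes; the actual schema is an assumption.
const AXES = ["character", "guidance", "conversation", "style"] as const;
type Axis = (typeof AXES)[number];
type Scores = Record<Axis, number>;

function parseJudgeXml(xml: string): Scores {
  const scores = {} as Scores;
  for (const axis of AXES) {
    const m = xml.match(new RegExp(`<${axis}>\\s*(\\d+)\\s*</${axis}>`));
    if (!m) throw new Error(`judge output missing <${axis}> score`);
    const n = Number(m[1]);
    if (n < 1 || n > 10) throw new Error(`<${axis}> score ${n} outside 1-10`);
    scores[axis] = n;
  }
  return scores;
}
```

Throwing on a missing or out-of-range tag keeps malformed judge output from silently passing a turn.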
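The verdict rule above is plain arithmetic over the four axis scores, which a few lines of code capture exactly:

```typescript
// Verdict computed in code, never asked of the LLM:
// pass = average >= 6.5 AND minimum >= 5 (thresholds from the spec above).
function verdict(scores: number[]): boolean {
  const avg = scores.reduce((a, b) => a + b, 0) / scores.length;
  const min = Math.min(...scores);
  return avg >= 6.5 && min >= 5;
}
```

The min-score floor means one very weak axis (say, a 4 on STYLE) fails the turn even when the other axes average high — the LLM weighs, the verdict stays in code.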