Comparing manual vs. AI requirements gathering: 2 sentences vs. 127-point spec

Beta: This system is experimental. Scores and classifications are early-stage research and may be unreliable.

Longitudinal: 424 HN snapshots · 2 evals

Audit Trail (12 entries)

2026-03-04 04:03	eval_skip	Skipped: no readable text in HTML (likely JS-rendered SPA)
2026-03-04 04:02	dlq_auto_replay	DLQ auto-replay: message 98374 re-enqueued
2026-03-03 07:48	dlq	Dead-lettered after 1 attempts: Comparing manual vs. AI requirements gathering: 2 sentences vs. 127-point spec
2026-03-03 07:48	eval_failure	Evaluation failed: Error: OpenRouter API error 402: {"error":{"message":"Insufficient credits. Add more using https://openrouter.ai/settings/credits","code":402}}
2026-03-03 07:48	eval_retry	OpenRouter error 402 model=deepseek-v3.2
2026-03-03 07:48	eval_retry	OpenRouter error 402 model=deepseek-v3.2
2026-03-03 07:48	eval_failure	Evaluation failed: Error: OpenRouter API error 402: {"error":{"message":"Insufficient credits. Add more using https://openrouter.ai/settings/credits","code":402}}
2026-02-28 02:55	eval_success	Light evaluated: Neutral (0.00)
2026-02-28 02:55	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
2026-02-28 02:35	eval_success	Light evaluated: Neutral (0.00)
2026-02-28 02:35	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
2026-02-28 02:35	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 7R
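
The entries above appear to follow a tab-separated layout of timestamp, event type, and message. As a rough sketch only, assuming that layout holds for every row, they could be parsed and tallied as shown here. The names `AuditEntry`, `parse_audit_line`, and `count_events` are hypothetical and not part of the system that produced this log.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime


@dataclass
class AuditEntry:
    timestamp: datetime
    event: str
    message: str


def parse_audit_line(line: str) -> AuditEntry:
    """Parse one audit row, assumed to be: timestamp <TAB> event <TAB> message."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 3:
        raise ValueError(f"expected at least 3 tab-separated fields, got {len(fields)}")
    # Timestamps in the log have minute resolution, e.g. "2026-03-04 04:02".
    timestamp = datetime.strptime(fields[0], "%Y-%m-%d %H:%M")
    # Fields after the third (such as a trailing "- -" placeholder) are ignored.
    return AuditEntry(timestamp, fields[1], fields[2])


def count_events(lines):
    """Count entries per event type, skipping blank lines."""
    return Counter(parse_audit_line(line).event for line in lines if line.strip())


if __name__ == "__main__":
    sample = [
        "2026-03-04 04:02\tdlq_auto_replay\tDLQ auto-replay: message 98374 re-enqueued",
        "2026-03-03 07:48\teval_retry\tOpenRouter error 402 model=deepseek-v3.2",
        "2026-03-03 07:48\teval_retry\tOpenRouter error 402 model=deepseek-v3.2",
    ]
    for event, n in count_events(sample).items():
        print(event, n)
```

A tally like this makes it easier to see at a glance how many retries and failures preceded the dead-letter entry.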

build af2332a+951j · deployed 2026-03-04 23:32 UTC · evaluated 2026-03-03 07:16:53 UTC