Beta: This system is experimental. Scores and classifications are early-stage research and may be unreliable.
Longitudinal
424 HN snapshots · 2 evals
Audit Trail
12 entries
2026-03-04 04:03
eval_skip
Skipped: no readable text in HTML (likely JS-rendered SPA)
--
2026-03-04 04:02
dlq_auto_replay
DLQ auto-replay: message 98374 re-enqueued
--
2026-03-03 07:48
dlq
Dead-lettered after 1 attempts: Comparing manual vs. AI requirements gathering: 2 sentences vs. 127-point spec
--
2026-03-03 07:48
eval_failure
Evaluation failed: Error: OpenRouter API error 402: {"error":{"message":"Insufficient credits. Add more using https://openrouter.ai/settings/credits","code":402}}
--
2026-03-03 07:48
eval_retry
OpenRouter error 402 model=deepseek-v3.2
--
2026-03-03 07:48
eval_retry
OpenRouter error 402 model=deepseek-v3.2
--
2026-03-03 07:48
eval_failure
Evaluation failed: Error: OpenRouter API error 402: {"error":{"message":"Insufficient credits. Add more using https://openrouter.ai/settings/credits","code":402}}
--
2026-02-28 02:55
eval_success
Light evaluated: Neutral (0.00)
--
2026-02-28 02:55
eval
Evaluated by llama-4-scout-wai: 0.00 (Neutral)
--
2026-02-28 02:35
eval_success
Light evaluated: Neutral (0.00)
--
2026-02-28 02:35
eval
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
--
2026-02-28 02:35
rater_validation_warn
Light validation warnings for model llama-3.3-70b-wai: 0W 7R