Longitudinal 84 HN snapshots · 4 evals
+1 0 −1 HN
Audit Trail 24 entries
2026-02-28 06:10 eval_skip Skipped: no readable text (pre-fetch) - -
2026-02-26 23:01 eval_success Light evaluated: Mild positive (0.20) - -
2026-02-26 23:01 eval Evaluated by llama-4-scout-wai: +0.20 (Mild positive)
2026-02-26 22:39 rater_validation_fail Light validation failed for model llama-4-scout-wai - -
2026-02-26 20:06 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 20:04 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 20:03 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 20:02 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 20:02 eval_failure Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai - -
2026-02-26 20:02 eval_failure Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai - -
2026-02-26 20:02 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:30 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 17:28 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:27 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:26 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:25 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 17:23 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:23 eval_retry OpenRouter API error 402 model=llama-3.3-70b - -
2026-02-26 17:23 eval_failure Evaluation failed: Error: OpenRouter API error 402: {"error":{"message":"Provider returned error","code":402,"metadata":{"raw":"{\"error\":\"API key USD spend limit exceeded. Your account may still have USD balance, but - -
2026-02-26 08:56 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 08:55 dlq Dead-lettered after 1 attempts: We left OpenAI because of safety - -
2026-02-26 08:22 eval Evaluated by deepseek-v3.2: +0.07 (Neutral) 8,144 tokens
2026-02-26 04:47 eval Evaluated by claude-haiku-4-5-20251001: -0.12 (Mild negative) 9,708 tokens +0.03
2026-02-26 04:34 eval Evaluated by claude-haiku-4-5-20251001: -0.16 (Mild negative) 9,535 tokens