| |
Beta This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology → |
| Pending Evaluation This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-02-26 16:06:04 | |
Longitudinal
888 HN snapshots · 3 evals | |
Audit Trail
12 entries | 2026-02-28 00:45 | eval_success | Light evaluated: Neutral (0.00) | - - | | 2026-02-28 00:45 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | | | 2026-02-28 00:45 | rater_validation_warn | Light validation warnings for model llama-3.3-70b-wai: 0W 7R | - - | | 2026-02-27 02:38 | eval_success | Evaluated: Neutral (0.02) | - - | | 2026-02-27 02:38 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 9,063 tokens | | | 2026-02-26 22:04 | eval_success | Evaluated: Mild positive (0.10) | - - | | 2026-02-26 22:04 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) | | | 2026-02-26 22:04 | rater_validation_warn | Validation warnings for model llama-4-scout-wai: 30W 30R | - - | | 2026-02-26 21:21 | dlq | Dead-lettered after 1 attempts: I don't need AI to build me a new app. I need it to make Jira bearable | - - | | 2026-02-26 21:19 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | 2026-02-26 21:18 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | 2026-02-26 21:17 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | |
| |