| |
Beta This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology → |
|
Longitudinal
404 HN snapshots · 2 evals | |
Audit Trail
18 entries | 2026-02-28 13:33 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) | - - | | 2026-02-28 01:34 | dlq_replay | DLQ message 97510 replayed to EVAL_QUEUE: AI Code Review Gets Better When I Ask Models to Debate: Claude, Gemini, Codex | - - | | 2026-02-28 00:42 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) | - - | | 2026-02-28 00:30 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) | - - | | 2026-02-26 23:01 | rater_validation_fail | Light parse failure for model nemotron-nano-30b: Error: No JSON object found. Response starts with: {
"schema_version": "light-1.1",
"evaluation": {
"url": "https://milvus.io/blog | - - | | 2026-02-26 23:01 | eval_success | Light evaluated: Neutral (0.00) | - - | | 2026-02-26 23:01 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | | | 2026-02-26 22:41 | rater_validation_fail | Light validation failed for model llama-4-scout-wai | - - | | 2026-02-26 22:36 | rater_validation_fail | Light validation failed for model llama-4-scout-wai | - - | | 2026-02-26 22:36 | rater_validation_fail | Light parse failure for model nemotron-nano-30b: SyntaxError: Expected ',' or '}' after property value in JSON at position 339 (line 11 column 4) | - - | | 2026-02-26 22:35 | rater_validation_fail | Light validation failed for model llama-4-scout-wai | - - | | 2026-02-26 22:35 | eval_success | Evaluated: Neutral (0.00) | - - | | 2026-02-26 22:35 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 7,998 tokens | | | 2026-02-26 22:31 | rater_validation_fail | Light validation failed for model llama-4-scout-wai | - - | | 2026-02-26 22:15 | dlq | Dead-lettered after 1 attempts: AI Code Review Gets Better When I Ask Models to Debate: Claude, Gemini, Codex | - - | | 2026-02-26 22:13 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | 2026-02-26 22:12 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | 2026-02-26 22:11 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | - - | | |
| |