Alpha: This system is experimental. Scores and classifications are early-stage research and may be unreliable.
Longitudinal · 4 evals

Audit Trail
11 entries

| Time | Event | Detail |
|---|---|---|
| 2026-02-28 00:41 | eval_success | Evaluated: Mild positive (0.24) |
| 2026-02-28 00:41 | eval | Evaluated by deepseek-v3.2: +0.24 (Mild positive), 15,424 tokens |
| 2026-02-28 00:33 | eval_success | Light evaluated: Neutral (0.00) |
| 2026-02-28 00:33 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) |
| 2026-02-28 00:27 | eval_success | Light evaluated: Mild positive (0.20) |
| 2026-02-28 00:27 | eval | Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) |
| 2026-02-28 00:26 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) |
| 2026-02-28 00:11 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) |
| 2026-02-28 00:11 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) |
| 2026-02-28 00:11 | eval_skip | Skipped: no readable text in HTML (likely JS-rendered SPA) |
| 2026-02-28 00:07 | eval | Evaluated by claude-haiku-4-5: +0.75 (Strong positive) |