Alpha: This system is experimental. Scores and classifications are early-stage research and may be unreliable.
Pending Evaluation: This story is queued for evaluation and will be processed in an upcoming batch.
Queued: 2026-02-26 06:22:52
Longitudinal · 6 evals
Audit Trail
16 entries

| Time | Event | Summary | Detail |
| --- | --- | --- | --- |
| 2026-03-05 12:56 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 |
| 2026-03-05 12:51 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) |
| 2026-03-05 12:39 | eval_success | PSQ evaluated: g-PSQ=0.322 (3 dims) | Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) |
| 2026-02-28 00:43 | eval_success | Light evaluated: Neutral (0.00) | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) |
| 2026-02-27 01:36 | eval_success | Evaluated: Mild positive (0.21) | Evaluated by deepseek-v3.2: +0.21 (Mild positive) 12,134 tokens |
| 2026-02-26 22:02 | eval_success | Evaluated: Neutral (0.06) | Evaluated by llama-4-scout-wai: +0.06 (Neutral) |
| 2026-02-26 21:21 | dlq | Dead-lettered after 1 attempts: Unicode confusables.txt and NFKC disagree on 31 chars | |
| 2026-02-26 21:19 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | |
| 2026-02-26 21:18 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | |
| 2026-02-26 21:17 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b | |