| 2026-03-16 01:14 | eval_success | PSQ evaluated: g-PSQ=-0.580 (3 dims) | - - |
| 2026-03-16 01:14 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-16 00:49 | model_divergence | Cross-model spread 0.45 exceeds threshold (2 models) | - - |
| 2026-03-16 00:49 | eval_success | Lite evaluated: Mild negative (-0.29) | - - |
| 2026-03-16 00:49 | eval | Evaluated by llama-4-scout-wai: -0.29 (Mild negative) +0.11 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 23:07 | eval_success | Evaluated: Mild positive (0.16) | - - |
| 2026-03-15 23:07 | model_divergence | Cross-model spread 0.56 exceeds threshold (2 models) | - - |
| 2026-03-15 23:07 | eval | Evaluated by claude-haiku-4-5-20251001: +0.16 (Mild positive) 19,103 tokens +0.26 | |
| 2026-03-15 23:04 | eval_success | Evaluated: Neutral (-0.10) | - - |
| 2026-03-15 23:04 | model_divergence | Cross-model spread 0.30 exceeds threshold (2 models) | - - |
| 2026-03-15 23:04 | eval | Evaluated by claude-haiku-4-5-20251001: -0.10 (Neutral) 20,102 tokens | |
| 2026-03-15 23:04 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 1W 1R | - - |
| 2026-03-15 22:42 | eval_success | PSQ evaluated: g-PSQ=-0.580 (3 dims) | - - |
| 2026-03-15 22:42 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 22:03 | eval_success | Lite evaluated: Moderate negative (-0.40) | - - |
| 2026-03-15 22:03 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) -0.06 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 22:03 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 18:48 | eval_success | Lite evaluated: Moderate negative (-0.34) | - - |
| 2026-03-15 18:48 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) -0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 17:53 | eval_success | PSQ evaluated: g-PSQ=-0.580 (3 dims) | - - |
| 2026-03-15 17:53 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 17:36 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 17:36 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 17:36 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 16:38 | eval_success | PSQ evaluated: g-PSQ=-0.580 (3 dims) | - - |
| 2026-03-15 16:38 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 16:22 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 16:22 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 16:22 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 05:55 | eval_success | PSQ evaluated: g-PSQ=-0.580 (3 dims) | - - |
| 2026-03-15 05:55 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 05:50 | eval_success | Lite evaluated: Moderate negative (-0.34) | - - |
| 2026-03-15 05:50 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) -0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 05:19 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 05:15 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 04:41 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 04:40 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) -0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-15 04:05 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-15 04:05 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 23:36 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 23:27 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 22:56 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 22:48 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 21:53 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 21:46 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 20:35 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 20:28 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 19:38 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 19:32 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 18:31 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 18:30 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 16:54 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 16:52 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 15:43 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 15:42 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 15:01 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 15:00 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 14:24 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 14:22 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 13:47 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 13:45 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 13:12 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 13:10 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 12:36 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 12:32 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 12:00 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 11:57 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 11:25 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) -0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 11:21 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 10:49 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.34 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 10:45 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 10:12 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 10:06 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 09:33 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 09:25 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 08:51 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 08:42 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 08:11 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 08:00 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 07:32 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 07:20 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 06:51 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 06:36 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 06:09 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 05:56 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 05:31 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 05:18 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 04:54 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 04:36 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 04:14 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 03:59 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 03:39 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 03:20 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 02:59 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 02:43 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 02:19 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 02:04 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 01:42 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 01:22 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 01:06 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 00:47 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-14 00:40 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) +0.06 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-14 00:07 | eval | Evaluated by llama-3.3-70b-wai-psq: 0.00 (Neutral) | |
| 2026-03-14 00:04 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Technical content, zero rights discussion |
| 2026-03-13 23:59 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 23:44 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 23:08 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 22:39 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 21:48 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 21:30 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 20:47 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 20:06 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 19:20 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 18:48 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 18:07 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 17:33 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) -0.40 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 16:44 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 16:06 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.40 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 15:38 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 15:30 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 14:59 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 14:53 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 14:13 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 14:08 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 13:37 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 13:32 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) -0.06 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 13:01 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 12:57 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 12:25 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 12:22 | eval | Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) +0.06 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 11:50 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) 0.00 | |
| 2026-03-13 11:47 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Technical article about a data breach, no explicit rights discussion |
| 2026-03-13 11:12 | eval | Evaluated by llama-4-scout-wai-psq: -0.58 (Moderate negative) | |
| 2026-03-13 11:10 | eval | Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) | |
| reasoning Technical article about a data breach, no explicit rights discussion |