| 2026-03-08 18:41 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-08 18:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 18:40 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 18:40 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 18:36 | eval_success | PSQ evaluated: g-PSQ=-0.160 (3 dims) | - - |
| 2026-03-08 18:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 18:26 | eval_success | Lite evaluated: Mild negative (-0.18) | - - |
| 2026-03-08 18:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) +0.16 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 18:26 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 18:22 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 18:22 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) +0.12 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 17:01 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-08 17:01 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 16:54 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 16:54 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 16:47 | eval_success | Lite evaluated: Moderate negative (-0.34) | - - |
| 2026-03-08 16:47 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 16:47 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 16:41 | eval_success | Lite evaluated: Mild negative (-0.12) | - - |
| 2026-03-08 16:41 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.12 (Mild negative) -0.12 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 14:37 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-08 14:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 14:30 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 14:30 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 14:22 | eval_success | Lite evaluated: Moderate negative (-0.34) | - - |
| 2026-03-08 14:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 14:22 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 14:15 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 14:15 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 13:24 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-08 13:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 13:18 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 13:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 13:11 | eval_success | Lite evaluated: Moderate negative (-0.34) | - - |
| 2026-03-08 13:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 13:11 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 13:07 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) +0.12 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 13:02 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.12 (Mild negative) -0.12 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 12:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 12:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 11:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 11:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 11:48 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 11:43 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 10:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 10:51 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 10:50 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 10:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) -0.16 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 10:32 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 10:27 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) +0.18 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 09:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 09:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 09:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 09:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 09:32 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 09:25 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 09:20 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 08:42 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 08:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 08:33 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 08:28 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 08:18 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 07:35 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 07:32 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 07:31 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 07:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 07:17 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 06:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 06:31 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 06:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 06:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 06:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 06:18 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 05:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 05:25 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 05:23 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 05:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 05:17 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 04:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 04:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 04:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 04:17 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 03:25 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 03:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 03:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.08 | |
| 2026-03-08 03:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 03:15 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 03:10 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 02:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 02:15 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) 0.00 | |
| 2026-03-08 02:13 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 02:09 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 02:07 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 01:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 01:11 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) 0.00 | |
| 2026-03-08 01:08 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 01:06 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.16 (Mild negative) +0.08 | |
| 2026-03-08 01:03 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-08 00:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 00:09 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-08 00:06 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) -0.09 | |
| 2026-03-08 00:05 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.18 (Mild negative) -0.18 | |
| reasoning Investigative journalism on DoD claims |
| 2026-03-07 23:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 23:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-07 22:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) | |
| 2026-03-07 22:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.15 (Mild negative) | |
| 2026-03-07 22:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) | |
| reasoning Editorial discusses investigation of DoD, no explicit rights discourse. Transparency indicators partially present. |
| 2026-03-07 22:57 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Investigative journalism on DoD claims |