| 2026-03-29 01:59 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-29 01:59 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-29 01:59 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-29 01:27 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-29 01:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-29 00:39 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-29 00:39 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-29 00:39 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 23:25 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-28 23:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 22:58 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-28 22:58 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 22:58 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 22:08 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-28 22:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 21:50 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-28 21:50 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 21:50 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 20:37 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-28 20:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 20:35 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-28 20:35 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 20:35 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 19:10 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-28 19:10 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 19:07 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-28 19:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 19:07 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 17:56 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-28 17:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 17:55 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-28 17:55 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-28 17:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 16:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 16:41 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 15:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 15:36 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 14:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 14:23 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 13:45 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 13:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 13:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 13:04 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 12:32 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 12:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 11:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 11:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 11:18 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 11:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 10:41 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 10:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 10:05 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 09:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 09:33 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 09:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 09:05 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 08:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 08:25 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 08:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 07:48 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 07:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 07:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 06:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 06:26 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 06:04 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 05:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 05:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 05:12 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 04:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 04:35 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 04:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 04:00 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 03:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 03:21 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 02:54 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 02:44 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 02:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 02:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 01:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 01:34 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-28 00:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-28 00:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 23:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 23:52 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 23:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 23:13 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 22:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 22:03 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 20:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 20:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 19:32 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 19:29 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 18:19 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 18:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 17:08 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 17:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 15:59 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 15:51 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 14:43 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 14:34 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 13:09 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 12:55 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 12:23 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 12:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 11:45 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 11:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 11:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 10:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 10:28 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 10:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 09:53 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 09:38 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 09:15 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 09:01 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 08:36 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 08:21 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 07:57 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 07:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 07:21 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 07:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 06:42 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 06:19 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 06:02 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 05:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 05:25 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 05:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 04:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 04:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 03:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 03:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 02:36 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 01:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 01:28 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-27 00:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-27 00:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.04 (Neutral) | |
| 2026-03-27 00:03 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.07 (Neutral) | |
| reasoning Technical content, no rights discussion |
| 2026-03-26 23:57 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-26 22:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-26 22:38 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-26 21:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-26 21:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical article on table precision, no human rights discussion |
| 2026-03-26 20:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-26 20:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Technical article on table precision, no human rights discussion |