| 2026-03-16 01:49 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-16 01:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-16 01:41 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-16 01:41 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-16 01:41 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 23:37 | eval_success | Evaluated: Mild positive (0.23) | - - |
| 2026-03-15 23:37 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.23 (Mild positive) 14,284 tokens -0.01 | |
| 2026-03-15 23:33 | eval_success | Evaluated: Mild positive (0.24) | - - |
| 2026-03-15 23:33 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.24 (Mild positive) 16,004 tokens | |
| 2026-03-14 17:37 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-14 17:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-14 17:26 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-14 17:26 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-14 17:26 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 19:18 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 19:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 18:59 | eval_success | PSQ evaluated: g-PSQ=0.483 (3 dims) | - - |
| 2026-03-08 18:59 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.00 | |
| 2026-03-08 18:49 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 18:49 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 18:49 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 17:51 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 17:51 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 17:51 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-08 17:48 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 17:48 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 16:28 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 16:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 16:04 | eval_success | PSQ evaluated: g-PSQ=0.482 (3 dims) | - - |
| 2026-03-08 16:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 15:52 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 15:52 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 15:52 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 15:32 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-08 15:32 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 09:37 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 09:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 09:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 09:05 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 09:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 08:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) -0.00 | |
| 2026-03-08 08:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 08:33 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 08:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-08 08:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-08 08:01 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 07:54 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 07:47 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.00 | |
| 2026-03-08 07:29 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 07:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 07:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 07:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 06:59 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 06:46 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 06:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 06:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 06:01 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 05:48 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 05:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 04:59 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 04:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 04:54 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 04:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 04:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 04:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 04:00 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 03:55 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 03:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 03:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 03:18 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 03:14 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 02:55 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 02:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 02:37 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) -0.00 | |
| 2026-03-08 02:33 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.00 | |
| 2026-03-08 02:10 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 01:52 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 01:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 01:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 01:07 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 01:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-08 00:50 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-08 00:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 00:28 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-08 00:02 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 23:46 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-07 23:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 23:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 22:52 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 22:40 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-07 22:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 22:19 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 22:03 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 19:58 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 19:53 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 19:44 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-07 19:39 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical comparison, neutral stance |
| 2026-03-07 19:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 19:03 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 19:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-07 18:14 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 18:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-07 18:09 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-07 17:11 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 17:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 17:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-07 16:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 16:33 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 16:28 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) -0.00 | |
| 2026-03-07 16:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 15:58 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 15:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 15:24 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.00 | |
| 2026-03-07 15:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 14:53 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 14:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 14:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 14:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 13:49 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 13:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 13:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 13:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 12:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 12:48 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 12:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 12:17 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 12:12 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 11:51 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 11:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 11:38 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 11:20 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 11:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 11:10 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 10:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 10:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 10:34 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 10:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 10:05 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 10:00 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 09:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-07 09:32 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 09:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 09:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-07 08:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 08:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 08:38 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 08:27 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) -0.00 | |
| 2026-03-07 08:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 07:58 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.00 | |
| 2026-03-07 07:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 07:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 07:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 06:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 06:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-07 06:27 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 06:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-07 05:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 05:52 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00 | |
| 2026-03-07 05:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 05:28 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 05:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-07 05:23 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 05:22 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) | |
| 2026-03-07 05:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 05:11 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 05:06 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 05:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.08 | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 04:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Technical content comparing SPA and hypermedia approaches, no explicit human rights discussion |
| 2026-03-07 04:55 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Technical comparison, neutral stance |