| 2026-03-15 22:12 | eval_success | Evaluated: Neutral (0.00) | - - |
| 2026-03-15 22:12 | eval | Evaluated by claude-haiku-4-5-20251001: 0.00 (Neutral) 11,003 tokens | |
| 2026-03-15 22:12 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 0W 5R | - - |
| 2026-03-15 22:10 | eval_failure | Evaluation failed: Error: Network connection lost. | - - |
| 2026-03-15 21:35 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 21:35 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 21:35 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 21:31 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-15 21:31 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-15 20:54 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 20:54 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 20:54 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 20:50 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-15 20:50 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-15 20:18 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 20:18 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 20:18 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 20:13 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-15 20:13 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-15 19:43 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 19:43 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 19:43 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 19:39 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-15 19:39 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-15 19:07 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 19:07 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 19:06 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 19:01 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-15 19:01 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) +0.13 | |
| 2026-03-15 18:21 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 18:21 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 18:21 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 18:07 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 17:06 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 16:59 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 15:57 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 15:49 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 15:21 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 15:10 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 14:45 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 14:32 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 14:08 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 13:56 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 13:31 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 13:18 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 12:52 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 12:40 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 12:12 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 12:02 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 11:34 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 11:22 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 10:55 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 10:39 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 10:15 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 10:01 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 09:35 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 09:19 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 08:56 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 08:36 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 08:16 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 07:53 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 07:32 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 07:13 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 06:55 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 06:34 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 06:20 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 05:57 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 05:45 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 05:22 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 05:10 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 04:47 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 04:35 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 04:12 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 03:59 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 03:37 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 03:19 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 02:57 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 02:44 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 02:19 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 02:10 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 01:42 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 01:34 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 01:13 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 01:08 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 00:45 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-15 00:44 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-15 00:09 | eval | Evaluated by llama-3.3-70b-wai-psq: -0.60 (Moderate negative) | |
| 2026-03-15 00:06 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Game with intentionally bad UI, no rights discussion |
| 2026-03-14 23:41 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-14 23:39 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-14 23:04 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) +0.41 | |
| 2026-03-14 23:01 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-14 22:03 | eval | Evaluated by llama-4-scout-wai-psq: +0.07 (Neutral) -0.41 | |
| 2026-03-14 22:01 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-14 21:02 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) 0.00 | |
| 2026-03-14 21:00 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |
| 2026-03-14 19:50 | eval | Evaluated by llama-4-scout-wai-psq: +0.47 (Moderate positive) | |
| 2026-03-14 19:50 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning A game website with intentionally bad UI, no explicit human rights discussion |