| 2026-03-16 01:56 | eval_success | Evaluated: Mild positive (0.28) | - - |
| 2026-03-16 01:56 | model_divergence | Cross-model spread 0.35 exceeds threshold (2 models) | - - |
| 2026-03-16 01:56 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.28 (Mild positive) 13,222 tokens +0.05 | |
| 2026-03-16 01:25 | eval_success | Evaluated: Mild positive (0.23) | - - |
| 2026-03-16 01:25 | model_divergence | Cross-model spread 0.30 exceeds threshold (2 models) | - - |
| 2026-03-16 01:25 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.23 (Mild positive) 13,248 tokens -0.13 | |
| 2026-03-16 00:52 | eval_success | Evaluated: Moderate positive (0.36) | - - |
| 2026-03-16 00:52 | model_divergence | Cross-model spread 0.44 exceeds threshold (2 models) | - - |
| 2026-03-16 00:52 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,120 tokens -0.06 | |
| 2026-03-16 00:42 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-16 00:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-16 00:38 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-16 00:38 | model_divergence | Cross-model spread 0.50 exceeds threshold (2 models) | - - |
| 2026-03-16 00:38 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-16 00:38 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-16 00:22 | eval_success | Evaluated: Moderate positive (0.42) | - - |
| 2026-03-16 00:22 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 14,447 tokens -0.00 | |
| 2026-03-15 23:47 | eval_success | Evaluated: Moderate positive (0.42) | - - |
| 2026-03-15 23:47 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 13,324 tokens +0.06 | |
| 2026-03-15 23:09 | eval_success | Evaluated: Moderate positive (0.36) | - - |
| 2026-03-15 23:09 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,698 tokens +0.21 | |
| 2026-03-15 23:06 | eval_success | Evaluated: Mild positive (0.15) | - - |
| 2026-03-15 23:06 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.15 (Mild positive) 13,406 tokens -0.25 | |
| 2026-03-15 22:30 | eval_success | Evaluated: Moderate positive (0.40) | - - |
| 2026-03-15 22:30 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.40 (Moderate positive) 12,818 tokens | |
| 2026-03-15 21:50 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 21:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 21:43 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-15 21:43 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 21:43 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 21:08 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 21:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 21:04 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-15 21:04 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 20:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 20:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 19:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 19:51 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 19:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-15 19:13 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 18:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 18:27 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 17:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 17:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 16:05 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 16:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-15 15:28 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 15:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-15 14:51 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 14:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 14:16 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 14:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 13:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 13:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 13:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 12:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-15 12:21 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 12:10 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 11:43 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 11:32 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 11:03 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 10:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 10:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 10:10 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-15 09:42 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 09:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 09:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 08:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 08:21 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 08:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 07:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 07:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 06:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 06:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 06:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 06:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 05:47 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 05:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 05:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 05:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 04:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 04:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 04:03 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 03:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-15 03:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 03:09 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 02:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 02:32 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 02:15 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 01:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 01:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 01:22 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-15 01:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 00:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-15 00:44 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-15 00:13 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) | |
| 2026-03-15 00:10 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) | |
| reasoning Technical blog post on XML usage |
| 2026-03-14 23:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-14 23:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 23:21 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-14 23:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 22:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-14 22:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 21:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-14 21:00 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 20:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-14 19:51 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 19:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-14 19:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 18:21 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-14 18:08 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 16:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-14 16:35 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 15:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-14 15:24 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 14:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-14 14:46 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 14:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-14 14:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 13:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-14 13:35 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |
| 2026-03-14 13:01 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) | |
| 2026-03-14 13:00 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) | |
| reasoning Technical blog post on XML usage in software development, no explicit human rights discussion |