| 2026-03-16 04:36 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-16 04:36 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-16 04:36 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-16 04:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-16 04:35 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-16 04:35 | model_divergence | Cross-model spread 0.41 exceeds threshold (2 models) | - - |
| 2026-03-16 02:03 | eval_success | Evaluated: Moderate positive (0.33) | - - |
| 2026-03-16 02:03 | model_divergence | Cross-model spread 0.41 exceeds threshold (2 models) | - - |
| 2026-03-16 02:03 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.33 (Moderate positive) 12,127 tokens | |
| 2026-03-10 04:29 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-10 04:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-10 04:21 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-10 04:21 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-10 04:21 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 16:15 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-09 16:15 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 16:15 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 16:11 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-09 16:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 15:56 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-09 15:56 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 15:56 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 15:53 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-09 15:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 15:37 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-09 15:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 15:37 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 15:35 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-09 15:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 15:20 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-09 15:20 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 15:20 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 15:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 15:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 15:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 14:44 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 14:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 14:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 14:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 14:09 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 14:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 13:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 13:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 13:35 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 13:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 13:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 13:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 13:00 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 12:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 12:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 12:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 12:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 11:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 11:45 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 10:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 10:55 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 10:45 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 10:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 09:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 09:51 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 09:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 09:33 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 09:28 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 08:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 08:27 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 08:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 08:17 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 08:12 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 07:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 07:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 07:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 07:11 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 06:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 06:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 06:15 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 06:08 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 05:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 05:09 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 05:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 05:04 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 05:01 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 04:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 04:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 03:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 03:56 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 03:54 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 03:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 02:55 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 02:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 02:50 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 02:47 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 02:45 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 02:42 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 01:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 01:41 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 01:38 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 01:36 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-09 00:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 00:35 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 00:30 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-09 00:30 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 23:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 23:09 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 23:09 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 23:08 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 22:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 21:59 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 21:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 21:57 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 21:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 20:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 20:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 20:32 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 20:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 18:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 18:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 18:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 18:37 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 18:31 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 17:01 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 17:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 16:59 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 16:56 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 16:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 16:45 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 14:38 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 14:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 14:35 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 14:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 13:25 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 13:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 13:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 13:20 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 13:13 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 12:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 12:09 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 12:08 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 12:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 10:55 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 10:55 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 10:54 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 10:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) +0.02 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 10:47 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 09:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 09:49 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 09:45 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 09:42 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 09:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 08:48 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 08:46 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 08:44 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 08:41 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 08:38 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 07:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 07:38 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 07:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 07:35 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 06:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 06:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Technical blog post on DevOps |
| 2026-03-08 06:38 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-08 06:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) | |
| 2026-03-08 06:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) | |
| reasoning Technical blog post on production deployment experiences, no human rights discussion |
| 2026-03-08 06:35 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) | |
| reasoning Technical blog post on DevOps |