| 2026-03-16 03:27 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-16 03:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-16 03:26 | eval_success | Lite evaluated: Mild negative (-0.15) | - - |
| 2026-03-16 03:26 | model_divergence | Cross-model spread 0.49 exceeds threshold (2 models) | - - |
| 2026-03-16 03:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.15 (Mild negative) +0.03 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-16 00:51 | eval_success | Evaluated: Moderate positive (0.33) | - - |
| 2026-03-16 00:51 | model_divergence | Cross-model spread 0.51 exceeds threshold (2 models) | - - |
| 2026-03-16 00:51 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.33 (Moderate positive) 11,643 tokens -0.08 | |
| 2026-03-16 00:51 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 0W 1R | - - |
| 2026-03-16 00:48 | eval_success | Evaluated: Moderate positive (0.41) | - - |
| 2026-03-16 00:48 | model_divergence | Cross-model spread 0.59 exceeds threshold (2 models) | - - |
| 2026-03-16 00:48 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 26W 27R | - - |
| 2026-03-16 00:48 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.41 (Moderate positive) 11,183 tokens | |
| 2026-03-10 17:27 | eval_success | Lite evaluated: Mild negative (-0.18) | - - |
| 2026-03-10 17:27 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 17:12 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-10 17:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 16:50 | eval_success | Lite evaluated: Mild negative (-0.18) | - - |
| 2026-03-10 16:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) +0.06 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 16:33 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-10 16:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 16:11 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-10 16:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) -0.06 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 16:11 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-10 15:56 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-10 15:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 15:20 | eval_success | Lite evaluated: Mild negative (-0.18) | - - |
| 2026-03-10 15:20 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 15:06 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-10 15:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 14:38 | eval_success | Lite evaluated: Mild negative (-0.18) | - - |
| 2026-03-10 14:38 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 14:09 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-10 14:09 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 14:00 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 13:27 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 12:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 12:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-10 04:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-10 04:23 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 20:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 20:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-08 20:03 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 19:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 19:57 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 19:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 18:27 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.04 | |
| 2026-03-08 17:42 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 17:41 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 17:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 16:46 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 15:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 15:21 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 15:16 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 14:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 14:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 14:07 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 14:03 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 14:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 13:58 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 13:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 13:10 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 12:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 12:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 12:45 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 12:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 11:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 11:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 11:31 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 11:26 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 11:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 10:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 10:41 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 10:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.18 (Mild negative) +0.06 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 10:15 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 10:10 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 09:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 09:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 09:37 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 09:33 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) +0.04 | |
| 2026-03-08 09:16 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 09:07 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 08:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 08:31 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.04 | |
| 2026-03-08 08:14 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 08:03 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 07:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 07:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-08 07:21 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) +0.04 | |
| 2026-03-08 07:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 07:01 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) +0.02 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 06:57 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) -0.02 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 06:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 06:22 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-08 06:14 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 05:58 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 05:38 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 05:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-08 05:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 04:57 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 04:34 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 04:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 04:16 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-08 04:08 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 03:57 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 03:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 03:14 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.04 | |
| 2026-03-08 03:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 02:52 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) +0.02 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 02:22 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 02:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 02:08 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) +0.04 | |
| 2026-03-08 02:03 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 01:48 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) -0.02 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 01:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 01:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.04 | |
| 2026-03-08 00:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-08 00:46 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-08 00:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-08 00:03 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) +0.04 | |
| 2026-03-08 00:00 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-07 23:55 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-07 23:42 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-07 23:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 22:55 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 22:46 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-07 22:37 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-07 22:32 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-07 20:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 20:13 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 19:52 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) +0.16 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-07 19:37 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative) +0.02 | |
| reasoning Neutral news article on worker productivity |
| 2026-03-07 18:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 18:38 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 17:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 17:45 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.04 | |
| 2026-03-07 16:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 16:41 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) +0.04 | |
| 2026-03-07 16:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 16:12 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 15:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 15:38 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 15:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 15:08 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) 0.00 | |
| 2026-03-07 15:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 15:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.03 | |
| 2026-03-07 14:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 14:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.09 (Neutral) 0.00 | |
| 2026-03-07 14:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 13:58 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.09 (Neutral) +0.01 | |
| 2026-03-07 13:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 13:27 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) +0.01 | |
| 2026-03-07 13:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.01 | |
| 2026-03-07 13:19 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 12:53 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 12:47 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 12:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 12:17 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 12:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 11:47 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 11:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 11:17 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 11:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 10:46 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 10:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 10:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 10:13 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 10:04 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 09:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 09:44 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 09:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 09:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 09:09 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 08:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 08:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 08:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 08:09 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 07:54 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 07:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 07:40 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 07:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 07:09 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 06:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 06:40 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 06:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 06:08 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 06:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 06:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 06:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 05:34 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 05:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 05:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 05:05 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 04:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 04:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 04:34 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 04:22 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 04:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 03:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 03:32 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) +0.01 | |
| 2026-03-07 03:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.01 | |
| 2026-03-07 03:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 02:53 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 02:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 02:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 02:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 01:48 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-07 01:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) +0.01 | |
| 2026-03-07 01:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 01:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 01:05 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.01 | |
| 2026-03-07 00:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-07 00:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 23:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 23:37 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 23:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 23:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 22:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 22:00 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 21:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 21:14 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 20:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 20:31 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 20:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 19:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 19:38 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 19:34 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 19:23 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 18:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 18:44 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 18:14 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 18:09 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 18:03 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 16:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 16:54 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 16:22 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 16:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) +0.01 | |
| 2026-03-06 16:15 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.12 (Mild negative) -0.01 | |
| 2026-03-06 15:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 15:42 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 15:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 15:06 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) 0.00 | |
| 2026-03-06 14:30 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) +0.02 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 14:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.42 (Moderate negative) -0.05 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 14:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) | |
| 2026-03-06 14:24 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.10 (Mild negative) | |
| 2026-03-06 14:20 |
eval
|
Evaluated by llama-4-scout-wai: -0.38 (Moderate negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 14:14 |
eval
|
Evaluated by llama-4-scout-wai: -0.38 (Moderate negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 14:10 |
eval
|
Evaluated by llama-4-scout-wai: -0.38 (Moderate negative) 0.00 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 14:04 |
eval
|
Evaluated by llama-4-scout-wai: -0.38 (Moderate negative) +0.02 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 13:58 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) +0.02 | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 13:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.42 (Moderate negative) | |
| reasoning Content discusses potential drawbacks of using trendy business jargon, neutral editorial stance |
| 2026-03-06 13:53 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) | |
| reasoning Neutral news article on worker productivity |