| 2026-03-08 22:01 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 22:01 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 22:01 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-08 21:28 | eval_success | PSQ evaluated: g-PSQ=0.321 (3 dims) | - - |
| 2026-03-08 21:28 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) 0.00 | |
| 2026-03-08 21:24 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 21:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 21:08 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 21:08 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 21:08 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 20:37 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 20:37 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 20:37 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-08 20:32 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 20:32 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 20:32 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-08 20:02 | eval_success | PSQ evaluated: g-PSQ=0.321 (3 dims) | - - |
| 2026-03-08 20:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) +0.04 | |
| 2026-03-08 19:58 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 19:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 19:53 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 19:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 19:44 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 19:44 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 19:44 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 18:39 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-08 18:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 18:39 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-08 17:48 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 17:48 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 17:42 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 17:42 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.04 | |
| 2026-03-08 17:37 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-08 17:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 17:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 17:01 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 15:21 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) +0.04 | |
| 2026-03-08 15:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 15:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 14:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 14:40 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 14:03 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 14:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 13:43 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 13:27 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 13:21 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 12:52 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 12:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 12:47 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.04 | |
| 2026-03-08 12:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 12:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 12:11 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 11:32 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) 0.00 | |
| 2026-03-08 11:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 11:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) -0.02 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 10:56 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) +0.18 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 10:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) 0.00 | |
| 2026-03-08 10:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 10:13 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) 0.00 | |
| 2026-03-08 10:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 09:51 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 09:10 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 09:10 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) 0.00 | |
| 2026-03-08 09:08 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 08:49 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 08:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 08:06 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) +0.04 | |
| 2026-03-08 08:04 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 08:02 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.04 | |
| 2026-03-08 07:45 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 07:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 07:03 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 07:00 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) +0.04 | |
| 2026-03-08 06:47 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 06:05 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 06:04 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 06:01 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 06:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 05:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 05:49 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 04:59 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 04:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 04:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 04:47 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 03:59 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.04 | |
| 2026-03-08 03:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 03:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 03:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 02:52 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive) +0.04 | |
| 2026-03-08 02:51 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 02:48 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 02:48 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-08 02:44 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Technical blog post on hidden performance costs |
| 2026-03-08 01:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) 0.00 | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 01:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-08 01:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) | |
| 2026-03-08 01:43 |
eval
|
Evaluated by llama-4-scout-wai: -0.06 (Neutral) | |
| reasoning Technical blog post discussing performance overheads in programming languages |
| 2026-03-08 01:41 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) | |
| reasoning Technical blog post on hidden performance costs |