| 2026-03-16 01:22 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-16 01:21 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-16 00:51 | model_divergence | Cross-model spread 0.75 exceeds threshold (2 models) | - - |
| 2026-03-16 00:51 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-16 00:51 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-16 00:51 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 23:09 | eval_success | Evaluated: Strong positive (0.75) | - - |
| 2026-03-15 23:09 | model_divergence | Cross-model spread 0.75 exceeds threshold (2 models) | - - |
| 2026-03-15 23:09 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.75 (Strong positive) 12,847 tokens | |
| 2026-03-15 23:09 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 16W 28R | - - |
| 2026-03-15 22:45 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 22:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 22:04 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 22:04 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 22:04 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 18:46 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 18:46 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 18:46 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 18:11 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 18:11 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 17:33 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 17:33 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 17:33 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 16:56 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 16:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 15:52 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 15:52 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 15:52 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-15 15:47 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-15 15:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 15:15 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-15 15:15 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 15:04 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 14:40 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 14:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 14:00 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 13:46 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 13:24 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 13:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 12:46 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 12:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 12:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 11:48 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 11:30 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 11:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 10:48 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 10:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 10:09 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 09:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 09:30 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 08:58 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 08:49 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 08:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 08:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 07:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 07:24 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 06:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 06:47 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 06:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 06:12 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 05:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 05:36 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 05:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 05:00 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 04:27 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 04:25 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 03:52 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-15 03:50 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |
| 2026-03-15 03:14 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-15 03:12 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Technical paper on improving lower bounds for Ramsey numbers using AlphaEvolve, no explicit human rights discussion |