| 2026-03-08 19:24 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-08 19:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-08 19:10 | eval_success | PSQ evaluated: g-PSQ=0.363 (3 dims) | - - |
| 2026-03-08 19:10 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.36 (Moderate positive) 0.00 | |
| 2026-03-08 18:54 | eval_success | Lite evaluated: Neutral (0.08) | - - |
| 2026-03-08 18:54 |
eval
|
Evaluated by llama-4-scout-wai: +0.08 (Neutral) -0.06 | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 18:54 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 18:03 | eval_success | Lite evaluated: Mild negative (-0.10) | - - |
| 2026-03-08 18:03 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Critical LLM discussion |
| 2026-03-08 16:34 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-08 16:34 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-08 16:20 | eval_success | PSQ evaluated: g-PSQ=0.363 (3 dims) | - - |
| 2026-03-08 16:20 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.36 (Moderate positive) 0.00 | |
| 2026-03-08 15:58 | eval_success | Lite evaluated: Mild positive (0.14) | - - |
| 2026-03-08 15:58 |
eval
|
Evaluated by llama-4-scout-wai: +0.14 (Mild positive) +0.06 | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 15:58 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R | - - |
| 2026-03-08 15:45 | eval_success | Lite evaluated: Mild negative (-0.10) | - - |
| 2026-03-08 15:45 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Critical LLM discussion |
| 2026-03-08 12:58 | eval_success | Lite evaluated: Neutral (0.08) | - - |
| 2026-03-08 12:58 |
eval
|
Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 12:58 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 12:57 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-08 12:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-08 12:56 | eval_success | PSQ evaluated: g-PSQ=0.363 (3 dims) | - - |
| 2026-03-08 12:56 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.36 (Moderate positive) 0.00 | |
| 2026-03-08 12:52 | eval_success | Lite evaluated: Mild negative (-0.10) | - - |
| 2026-03-08 12:52 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Critical LLM discussion |
| 2026-03-08 11:44 | eval_success | Lite evaluated: Neutral (0.08) | - - |
| 2026-03-08 11:44 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 11:44 |
eval
|
Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 11:43 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-08 11:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-08 11:43 | eval_success | PSQ evaluated: g-PSQ=0.363 (3 dims) | - - |
| 2026-03-08 11:43 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.36 (Moderate positive) 0.00 | |
| 2026-03-08 11:39 | eval_success | Lite evaluated: Neutral (0.08) | - - |
| 2026-03-08 11:39 |
eval
|
Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 11:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning Critical LLM discussion |
| 2026-03-08 10:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) | |
| 2026-03-08 10:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.36 (Moderate positive) | |
| 2026-03-08 10:27 |
eval
|
Evaluated by llama-4-scout-wai: +0.08 (Neutral) | |
| reasoning Blog post discussing limitations of LLMs, no explicit human rights discussion |
| 2026-03-08 10:26 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) | |
| reasoning Critical LLM discussion |