Alpha: This system is experimental. Scores and classifications are early-stage research and may be unreliable.
Model Comparison
| Model | Editorial | Structural | Class | Conf | SETL | Theme |
| --- | --- | --- | --- | --- | --- | --- |
| @cf/meta/llama-4-scout-17b-16e-instruct lite | ND | ND | — | 0.80 | — | — |
| @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | ND | ND | — | 0.40 | — | — |
| @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | +0.60 | ND | Strong positive | 0.90 | 0.00 | Police Brutality |
| @cf/meta/llama-4-scout-17b-16e-instruct lite | +0.30 | ND | Moderate positive | 0.80 | 0.00 | Police Brutality |
| openai/gpt-oss-120b:free lite | ND | ND | — | — | — | — |
| google/gemma-3-27b-it:free lite | ND | ND | — | — | — | — |
| qwen/qwen3-coder:free lite | ND | ND | — | — | — | — |

| Section | @cf/meta/llama-4-scout-17b-16e-instruct lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | @cf/meta/llama-4-scout-17b-16e-instruct lite | openai/gpt-oss-120b:free lite | google/gemma-3-27b-it:free lite | qwen/qwen3-coder:free lite |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Preamble | ND | ND | ND | ND | ND | ND | ND |
| Article 1 | ND | ND | ND | ND | ND | ND | ND |
| Article 2 | ND | ND | ND | ND | ND | ND | ND |
| Article 3 | ND | ND | ND | ND | ND | ND | ND |
| Article 4 | ND | ND | ND | ND | ND | ND | ND |
| Article 5 | ND | ND | ND | ND | ND | ND | ND |
| Article 6 | ND | ND | ND | ND | ND | ND | ND |
| Article 7 | ND | ND | ND | ND | ND | ND | ND |
| Article 8 | ND | ND | ND | ND | ND | ND | ND |
| Article 9 | ND | ND | ND | ND | ND | ND | ND |
| Article 10 | ND | ND | ND | ND | ND | ND | ND |
| Article 11 | ND | ND | ND | ND | ND | ND | ND |
| Article 12 | ND | ND | ND | ND | ND | ND | ND |
| Article 13 | ND | ND | ND | ND | ND | ND | ND |
| Article 14 | ND | ND | ND | ND | ND | ND | ND |
| Article 15 | ND | ND | ND | ND | ND | ND | ND |
| Article 16 | ND | ND | ND | ND | ND | ND | ND |
| Article 17 | ND | ND | ND | ND | ND | ND | ND |
| Article 18 | ND | ND | ND | ND | ND | ND | ND |
| Article 19 | ND | ND | ND | ND | ND | ND | ND |
| Article 20 | ND | ND | ND | ND | ND | ND | ND |
| Article 21 | ND | ND | ND | ND | ND | ND | ND |
| Article 22 | ND | ND | ND | ND | ND | ND | ND |
| Article 23 | ND | ND | ND | ND | ND | ND | ND |
| Article 24 | ND | ND | ND | ND | ND | ND | ND |
| Article 25 | ND | ND | ND | ND | ND | ND | ND |
| Article 26 | ND | ND | ND | ND | ND | ND | ND |
| Article 27 | ND | ND | ND | ND | ND | ND | ND |
| Article 28 | ND | ND | ND | ND | ND | ND | ND |
| Article 29 | ND | ND | ND | ND | ND | ND | ND |
| Article 30 | ND | ND | ND | ND | ND | ND | ND |

Summary (lite): GitHub repository documenting police brutality during the 2020 George Floyd protests, with a neutral and informative tone.
Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available
Longitudinal · 6 evals
Audit Trail
20 entries

| Time | Event | Detail | Notes |
| --- | --- | --- | --- |
| 2026-03-05 17:56 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-05 17:56 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) | |
| 2026-03-05 17:53 | eval_success | PSQ evaluated: g-PSQ=0.000 (3 dims) | - - |
| 2026-03-05 17:53 | eval | Evaluated by llama-3.3-70b-wai-psq: 0.00 (Neutral) 0.00 | |
| 2026-03-05 17:48 | eval_success | PSQ evaluated: g-PSQ=0.000 (3 dims) | - - |
| 2026-03-05 17:48 | eval | Evaluated by llama-3.3-70b-wai-psq: 0.00 (Neutral) | |
| 2026-02-28 12:07 | model_divergence | Cross-model spread 0.30 exceeds threshold (2 models) | - - |
| 2026-02-28 12:07 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-02-28 12:07 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) 0.00 | reasoning: Exposing police brutality |
| 2026-02-28 12:07 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R | - - |
| 2026-02-28 12:05 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-02-28 12:05 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R | - - |
| 2026-02-28 12:05 | model_divergence | Cross-model spread 0.30 exceeds threshold (2 models) | - - |
| 2026-02-28 12:05 | eval | Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) | reasoning: GitHub repository about police brutality, implicitly supportive |
| 2026-02-28 12:02 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-02-28 12:02 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) | reasoning: Exposing police brutality |
| 2026-02-28 12:02 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R | - - |
| 2026-02-26 06:52 | dlq | Dead-lettered after 1 attempts: Git Repo of Police Brutality During the 2020 George Floyd Protests | - - |
| 2026-02-26 06:44 | credit_exhausted | Credit balance too low, retrying in 253s | - - |
| 2026-02-26 06:39 | credit_exhausted | Credit balance too low, retrying in 295s | - - |
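The `model_divergence` entries report a cross-model spread of 0.30 across 2 models, which matches the gap between the two lite editorial scores (+0.60 and +0.30). The page does not state how the spread is computed or what the threshold is, so the following is only a minimal sketch under two assumptions: that spread means the highest score minus the lowest among models that returned a score, and that the threshold is some value below 0.30. The function name `cross_model_spread` and the constant `DIVERGENCE_THRESHOLD` are hypothetical.

```python
from typing import Dict, Optional

# Hypothetical value: the page only says the spread "exceeds threshold".
DIVERGENCE_THRESHOLD = 0.25


def cross_model_spread(scores: Dict[str, Optional[float]]) -> Optional[float]:
    """Return max minus min over the models that produced a score.

    Models with no data (ND) are passed as None and ignored.
    Returns None when fewer than two models have a score.
    """
    present = [s for s in scores.values() if s is not None]
    if len(present) < 2:
        return None
    return round(max(present) - min(present), 2)


# Editorial scores taken from the lite evaluations in the audit trail.
editorial = {
    "llama-3.3-70b-wai": 0.60,
    "llama-4-scout-wai": 0.30,
    "gpt-oss-120b": None,  # ND in the comparison table
}

spread = cross_model_spread(editorial)
scored_models = sum(1 for s in editorial.values() if s is not None)
diverged = spread is not None and spread > DIVERGENCE_THRESHOLD

if diverged:
    print(f"Cross-model spread {spread:.2f} exceeds threshold ({scored_models} models)")
```

Ignoring ND entries keeps models that returned nothing from being treated as a score of zero, which would otherwise inflate the spread.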