[CAL-LITE] ProPublica (EP-4)

Model: @cf/meta/llama-4-scout-17b-16e-instruct lite +0.70 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.70 claude-haiku-4-5 lite +0.46 deepseek/deepseek-v3.2-20251201 +0.80 meta-llama/llama-3.3-70b-instruct:free ND Compare

+0.70	[CAL-LITE] ProPublica (EP-4) (www.propublica.org)
	0 points 2 days ago \| 0 comments on HN \| Strong positive ~lite vlite-1.4

Summary ~lite Investigative Journalism Advocates

ProPublica is an independent, non-profit newsroom producing investigative journalism in the public interest.

EQ 0.80

SO 0.70

TD 0.90

Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available

Longitudinal · 19 evals

Audit Trail 39 entries

2026-02-28 11:17	model_divergence	Cross-model spread 0.30 exceeds threshold (4 models)	- -
2026-02-28 11:17	eval_success	Lite evaluated: Strong positive (0.70)	- -
2026-02-28 11:17	eval	Evaluated by llama-4-scout-wai: +0.70 (Strong positive) 0.00
	reasoning Investigative journalism site, implies rights-focused content
2026-02-28 11:17	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-02-28 11:12	model_divergence	Cross-model spread 0.30 exceeds threshold (4 models)	- -
2026-02-28 11:12	eval_success	Lite evaluated: Strong positive (0.70)	- -
2026-02-28 11:12	eval	Evaluated by llama-4-scout-wai: +0.70 (Strong positive) -0.10
	reasoning Investigative journalism site, implies rights-focused content
2026-02-28 11:12	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-02-28 09:03	model_divergence	Cross-model spread 0.30 exceeds threshold (3 models)	- -
2026-02-28 09:03	eval_success	Light evaluated: Strong positive (0.70)	- -
2026-02-28 09:03	eval	Evaluated by llama-3.3-70b-wai: +0.70 (Strong positive) -0.10
	reasoning Investigative journalism site
2026-02-28 09:03	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 05:16	eval	Evaluated by claude-haiku-4-5: +0.46 (Moderate positive) -0.16
2026-02-28 01:41	dlq	Dead-lettered after 1 attempts: [CAL-LIGHT] ProPublica (EP-4)	- -
2026-02-28 01:39	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-28 01:38	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-28 01:36	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-28 01:36	dlq_replay	DLQ message 97625 replayed to LLAMA_QUEUE: [CAL-LIGHT] ProPublica (EP-4)	- -
2026-02-28 00:52	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 00:52	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
	reasoning Investigative journalism site, implies rights-focused content
2026-02-28 00:44	eval	Evaluated by claude-haiku-4-5: +0.62 (Strong positive) -0.10
2026-02-28 00:41	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 00:41	eval	Evaluated by llama-3.3-70b-wai: +0.80 (Strong positive) 0.00
	reasoning Investigative journalism site
2026-02-28 00:28	eval	Evaluated by claude-haiku-4-5: +0.72 (Strong positive) +0.02
2026-02-28 00:12	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 00:12	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
	reasoning Investigative journalism site, implies rights-focused content
2026-02-28 00:12	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 00:12	eval	Evaluated by llama-3.3-70b-wai: +0.80 (Strong positive)
	reasoning Investigative journalism site
2026-02-28 00:01	eval	Evaluated by claude-haiku-4-5: +0.70 (Strong positive) 0.00
2026-02-27 21:51	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-27 21:51	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive)
	reasoning Investigative journalism site, implies rights-focused content
2026-02-27 21:47	eval	Evaluated by claude-haiku-4-5: +0.70 (Strong positive) 0.00
2026-02-27 21:36	rater_validation_fail	Light parse failure for model llama-4-scout-wai: SyntaxError: Unexpected token '+', ..."itorial": +0.8, "... is not valid JSON	- -
2026-02-27 21:32	eval	Evaluated by claude-haiku-4-5: +0.70 (Strong positive) +0.05
2026-02-27 21:10	eval	Evaluated by claude-haiku-4-5: +0.65 (Strong positive) -0.03
2026-02-27 21:01	eval	Evaluated by claude-haiku-4-5: +0.68 (Strong positive) +0.13
2026-02-27 21:01	eval	Evaluated by claude-haiku-4-5: +0.55 (Moderate positive) -0.17
2026-02-27 15:17	eval	Evaluated by deepseek-v3.2: +0.76 (Strong positive) 14,237 tokens
2026-02-27 13:01	eval	Evaluated by claude-haiku-4-5: +0.72 (Strong positive)

build 1ad9551+j7zs · deployed 2026-03-02 09:09 UTC · evaluated 2026-03-02 11:31:12 UTC