Tests Are the New Moat

Model: @cf/meta/llama-4-scout-17b-16e-instruct lite +0.38 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 claude-haiku-4-5-20251001 -0.01 meta-llama/llama-3.3-70b-instruct:free ND Compare

+0.38	Tests Are the New Moat (saewitz.com)
	11 points by taubek 4 days ago \| 4 comments on HN \| Moderate positive ~lite vlite-1.4

Summary ~lite open source Acknowledges

Discusses commercial open-source incentives and AI's impact

EQ 0.60

SO 0.70

TD 0.50

Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available

Longitudinal · 26 evals

Audit Trail 46 entries

2026-03-02 16:36	model_divergence	Cross-model spread 0.39 exceeds threshold (2 models)	- -
2026-03-02 16:36	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-02 16:36	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-02 16:26	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-02 16:26	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 17:51	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 17:51	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 16:56	model_divergence	Cross-model spread 0.39 exceeds threshold (2 models)	- -
2026-03-01 16:56	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-01 16:56	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 16:33	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 16:33	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 15:00	model_divergence	Cross-model spread 0.39 exceeds threshold (3 models)	- -
2026-03-01 15:00	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-01 15:00	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 14:59	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 14:59	model_divergence	Cross-model spread 0.39 exceeds threshold (3 models)	- -
2026-03-01 14:59	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 14:55	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-01 14:55	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 14:55	model_divergence	Cross-model spread 0.39 exceeds threshold (3 models)	- -
2026-03-01 14:54	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 14:54	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 14:11	model_divergence	Cross-model spread 0.39 exceeds threshold (3 models)	- -
2026-03-01 14:11	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 14:11	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 14:10	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-01 14:10	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 14:10	model_divergence	Cross-model spread 0.39 exceeds threshold (2 models)	- -
2026-03-01 14:06	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-01 14:05	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 13:20	model_divergence	Cross-model spread 0.39 exceeds threshold (2 models)	- -
2026-03-01 13:20	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 13:18	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-03-01 12:41	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-03-01 12:40	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) -0.04
	reasoning Neutral tech editorial
2026-02-28 16:25	eval	Evaluated by llama-3.3-70b-wai: +0.04 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-02-28 16:20	eval	Evaluated by llama-3.3-70b-wai: +0.04 (Neutral) 0.00
	reasoning Neutral tech editorial
2026-02-28 15:20	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial on commercial open-source incentives and AI impact
2026-02-28 15:15	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) +0.38
	reasoning Editorial on commercial open-source incentives and AI impact
2026-02-28 13:59	eval	Evaluated by llama-3.3-70b-wai: +0.04 (Neutral)
	reasoning Neutral tech editorial
2026-02-26 23:06	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
	reasoning Editorial on commercial open-source incentives and AI impact
2026-02-26 09:40	eval	Evaluated by deepseek-v3.2: +0.25 (Mild positive) 12,021 tokens
2026-02-26 04:20	eval	Evaluated by claude-haiku-4-5-20251001: -0.01 (Neutral) 13,811 tokens -0.11
2026-02-26 03:50	eval	Evaluated by claude-haiku-4-5-20251001: +0.10 (Mild positive) 13,755 tokens +0.12
2026-02-26 03:49	eval	Evaluated by claude-haiku-4-5-20251001: -0.02 (Neutral) 14,164 tokens

build 33fdafe+e25z · deployed 2026-03-02 17:29 UTC · evaluated 2026-03-02 17:25:40 UTC