[CAL-LITE] Shopify (EX-1) (www.shopify.com)
0 points 11 days ago | 0 comments on HN
Pending Evaluation
This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-02-27 12:49:31
Longitudinal · 28 evals
+1 0 −1 HN
Audit Trail 48 entries
2026-03-05 09:13 eval_success PSQ evaluated: g-PSQ=0.600 (3 dims) - -
2026-03-05 09:13 eval Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00
2026-03-05 09:08 eval_success PSQ evaluated: g-PSQ=0.600 (3 dims) - -
2026-03-05 09:08 eval Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive)
2026-03-05 08:58 eval_success PSQ evaluated: g-PSQ=0.600 (3 dims) - -
2026-03-05 08:58 eval Evaluated by llama-3.3-70b-wai-psq: +0.60 (Strong positive)
2026-03-04 08:02 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-03-04 07:56 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) +0.52
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-03-04 07:47 eval Evaluated by claude-haiku-4-5: -0.52 (Moderate negative) -0.58
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-03-04 07:41 eval Evaluated by claude-haiku-4-5: +0.06 (Neutral) +0.06
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-28 09:10 model_divergence Cross-model spread 0.37 exceeds threshold (4 models) - -
2026-02-28 09:10 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:10 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
2026-02-28 09:10 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 09:06 model_divergence Cross-model spread 0.37 exceeds threshold (3 models) - -
2026-02-28 09:06 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:06 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
2026-02-28 09:06 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 09:01 model_divergence Cross-model spread 0.37 exceeds threshold (3 models) - -
2026-02-28 09:01 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:01 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
2026-02-28 09:01 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:15 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-28 01:38 dlq Dead-lettered after 1 attempts: [CAL-LIGHT] Shopify (EX-1) - -
2026-02-28 01:35 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:34 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:34 dlq_replay DLQ message 97617 replayed to LLAMA_QUEUE: [CAL-LIGHT] Shopify (EX-1) - -
2026-02-28 00:44 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-28 00:33 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 00:33 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
2026-02-28 00:26 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 00:26 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
2026-02-28 00:23 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-28 00:13 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 00:13 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
2026-02-28 00:13 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 00:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
2026-02-27 23:56 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 21:51 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
2026-02-27 21:46 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 21:31 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
2026-02-27 21:30 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 21:09 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 20:59 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 16:43 eval Evaluated by deepseek-v3.2: +0.37 (Moderate positive) 9,058 tokens
2026-02-27 16:18 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
2026-02-27 13:00 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral) 0.00
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.
2026-02-27 13:00 eval Evaluated by claude-haiku-4-5: 0.00 (Neutral)
reasoning
Commercial e-commerce landing page with zero explicit rights discourse, absent transparency indicators.