0.00 PDF to Text, a challenging problem (www.marginalia.nu)
357 points by ingve 292 days ago | 201 comments on HN | Neutral ~lite vlite-1.4
Summary ~lite Technical Development Neutral
PDF text extraction challenges
EQ 0.50
SO 0.50
TD 0.50
Lite evaluation by llama-3.3-70b-wai · editorial channel only · no per-section breakdown available
Longitudinal · 10 evals
+1 0 −1 HN
Audit Trail 22 entries
2026-03-01 18:03 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 18:03 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical discussion no rights stance
2026-03-01 17:05 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 17:05 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
ED technical discussion on PDF text extraction no rights stance
2026-03-01 17:00 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 17:00 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
ED technical discussion on PDF text extraction no rights stance
2026-03-01 16:43 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 16:43 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical discussion no rights stance
2026-03-01 15:29 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 15:29 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
ED technical discussion on PDF text extraction no rights stance
2026-03-01 15:23 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 15:23 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
ED technical discussion on PDF text extraction no rights stance
2026-03-01 15:16 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 15:16 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical discussion no rights stance
2026-03-01 15:11 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-01 15:11 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical discussion no rights stance
2026-02-28 09:13 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
reasoning
ED technical discussion on PDF text extraction no rights stance
2026-02-28 09:13 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 09:08 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 09:08 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:08 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
reasoning
Technical discussion no rights stance