tufo 3 karma 201d on HN HN profile →
Coverage
We've seen 1 of ~4 submissions
Full eval: 0 Lite-only: 0 Unevaluated: 1
1 stories
1. A benchmark of expert-level academic questions to assess AI capabilities – HLE (www.nature.com)
2 points by tufo 3 days ago | 0 comments | skipped