shuntaro-okuma 1 karma 0d on HN
I am a software engineer interested in LLM evaluation, prompt engineering, and infrastructure. I build tools that reveal hidden structures in AI systems.
- AdaptGauge: LLM adaptation efficiency
- Chatbot Benchmark: multi-turn benchmarking
- Local Sidekick: privacy-first attention tracking
Coverage
We've seen 1 of ~1 submissions
Full eval: 0 Lite-only: 0 Unevaluated: 1
1 story
1. Show HN: AdaptGauge – I found that adding few-shot examples can make LLMs worse (github.com)
1 points by shuntaro-okuma 2 days ago | 0 comments | skipped