Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3 — HRCB

Name: HRCB Evaluation: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3
Item: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3
Rating: 0.376
Author: HN HRCB

Model: @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 deepseek/deepseek-v3.2-20251201 +0.11 claude-haiku-4-5-20251001 +0.19 meta-llama/llama-3.3-70b-instruct:free ND Compare

Model Comparison 100% sign agreement

Model	Editorial	Structural	Class	Conf	SETL	Theme
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	0.00	ND	Neutral	0.90	0.00	No human rights theme
@cf/meta/llama-4-scout-17b-16e-instruct lite	0.00	ND	Neutral	0.50	0.00	technology innovation
deepseek/deepseek-v3.2-20251201	+0.11	+0.09	Mild positive	0.07	0.02	Knowledge & Expression
claude-haiku-4-5-20251001	+0.19	+0.13	Mild positive	0.23	-0.05	Scientific Knowledge & Education Access
meta-llama/llama-3.3-70b-instruct:free	ND	ND	—	—	—	—

Section	@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	@cf/meta/llama-4-scout-17b-16e-instruct lite	deepseek/deepseek-v3.2-20251201	claude-haiku-4-5-20251001	meta-llama/llama-3.3-70b-instruct:free
Preamble	ND	ND	0.10	ND	ND
Article 1	ND	ND	ND	ND	ND
Article 2	ND	ND	ND	ND	ND
Article 3	ND	ND	ND	ND	ND
Article 4	ND	ND	ND	ND	ND
Article 5	ND	ND	ND	ND	ND
Article 6	ND	ND	ND	ND	ND
Article 7	ND	ND	ND	ND	ND
Article 8	ND	ND	ND	ND	ND
Article 9	ND	ND	ND	ND	ND
Article 10	ND	ND	ND	ND	ND
Article 11	ND	ND	ND	ND	ND
Article 12	ND	ND	0.04	ND	ND
Article 13	ND	ND	ND	ND	ND
Article 14	ND	ND	ND	ND	ND
Article 15	ND	ND	ND	ND	ND
Article 16	ND	ND	ND	ND	ND
Article 17	ND	ND	-0.03	ND	ND
Article 18	ND	ND	ND	ND	ND
Article 19	ND	ND	0.31	0.42	ND
Article 20	ND	ND	ND	ND	ND
Article 21	ND	ND	ND	ND	ND
Article 22	ND	ND	ND	ND	ND
Article 23	ND	ND	ND	ND	ND
Article 24	ND	ND	ND	ND	ND
Article 25	ND	ND	0.28	0.32	ND
Article 26	ND	ND	ND	0.34	ND
Article 27	ND	ND	0.40	0.41	ND
Article 28	ND	ND	0.10	ND	ND
Article 29	ND	ND	ND	ND	ND
Article 30	ND	ND	ND	ND	ND

+0.19	Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3 (github.com S:+0.21 )
	315 points by petewarden 5 days ago \| 81 comments on HN \| Moderate positive Mixed · v3.7 · 2026-02-26 03:52:07 0

Summary Scientific Knowledge & Education Access Advocates

The moonshine GitHub repository is a public open-source automatic speech recognition project that advances scientific knowledge and democratizes access to AI/ML technology for edge devices. The content strongly advocates for Articles 19 (freedom of expression and information), 26 (education), and 27 (scientific and cultural advancement) through transparent code publication, unrestricted global distribution, and removal of access barriers. Structural support for accessibility features and collaborative development further amplifies alignment with UDHR principles of universal participation and equitable resource access.

Article Heatmap

Negative Neutral Positive No Data

Aggregates

Editorial Mean	+0.19	Structural Mean	+0.21
Weighted Mean	+0.38	Unweighted Mean	+0.37
Max	+0.42 Article 19	Min	+0.32 Article 25
Signal	4	No Data	27
Volatility	0.05 (Low)
Negative	0	Channels	E: 0.6 S: 0.4
SETL ℹ	-0.05	Structural-dominant
FW Ratio ℹ	49%	44 facts · 45 inferences

Evidence 23% coverage ℹ

 1H  6M  13L  11 ND 

Theme Radar

HN Discussion 20 top-level · 9 replies

armcat 2026-02-24 23:26 UTC link

This is awesome, well done guys, I’m gonna try it as my ASR component on the local voice assistant I’ve been building https://github.com/acatovic/ova. The tiny streaming latencies you show look insane

ac29 2026-02-24 23:28 UTC link

No idea why 'sudo pip install --break-system-packages moonshine-voice' is the recommended way to install on raspi?

The authors do acknowledge this though and give a slightly too complex way to do this with uv in an example project (FYI, you dont need to source anything if you use uv run)

pzo 2026-02-24 23:43 UTC link

haven't tested yet but I'm wondering how it will behave when talking about many IT jargon and tech acronyms. For those reason I had to mostly run LLM after STT but that was slowing done parakeet inference. Otherwise had problems to detect properly sometimes when talking about e.g. about CoreML, int8, fp16, half float, ARKit, AVFoundation, ONNX etc.

asqueella 2026-02-24 23:53 UTC link

For those wondering about the language support, currently English, Arabic, Japanese, Korean, Mandarin, Spanish, Ukrainian, Vietnamese are available (most in Base size = 58M params)

Karrot_Kream 2026-02-24 23:57 UTC link

According to the OpenASR Leaderboard [1], looks like Parakeet V2/V3 and Canary-Qwen (a Qwen finetune) handily beat Moonshine. All 3 models are open, but Parakeet is the smallest of the 3. I use Parakeet V3 with Handy and it works great locally for me.

[1]: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

nmstoker 2026-02-25 00:39 UTC link

Any plans regarding JavaScript support in the browser?

There was an issue with a demo but it's missing now. I can't recall for sure but I think I got it working locally myself too but then found it broke unexpectedly and I didn't manage to find out why.

999900000999 2026-02-25 00:41 UTC link

Very cool. Anyway to run this in Web assembly, I have a project in mind

fareesh 2026-02-25 00:43 UTC link

Accuracy is often presumed to be english, which is fine, but it's a vague thing to say "higher" because does it mean higher in English only? Higher in some subset of languages? Which ones?

The minimum useful data for this stuff is a small table of language | WER for dataset

francislavoie 2026-02-25 01:40 UTC link

I've helped many Twitch streamers set up https://github.com/royshil/obs-localvocal to plug transcription & translation into their streams, mainly for German audio to English subtitles.

I'd love a faster and more accurate option than Whisper, but streamers need something off-the-shelf they can install in their pipeline, like an OBS plugin which can just grab the audio from their OBS audio sources.

I see a couple obvious problems: this doesn't seem to support translation which is unfortunate, that's pretty key for this usecase. Also it only supports one language at a time, which is problematic with how streamers will frequently code-switch while talking to their chat in different languages or on Discord with their gameplay partners. Maybe such a plugin would be able to detect which language is spoken and route to one or the other model as needed?

heftykoo 2026-02-25 03:02 UTC link

Claiming higher accuracy than Whisper Large v3 is a bold opening move. Does your evaluation account for Whisper's notorious hallucination loops during silences (the classic 'Thank you for watching!'), or is this purely based on WER on clean datasets? Also, what's the VRAM footprint for edge deployments? If it fits on a standard 8GB Mac without quantization tricks, this is huge.

guerython 2026-02-25 03:15 UTC link

Nice work. One metric I’d really like to see for streaming use cases is partial stability, not just final WER.

For voice agents, the painful failure mode is partials getting rewritten every few hundred ms. If you can share it, metrics like median first-token latency, real-time factor, and "% partial tokens revised after 1s / 3s" on noisy far-field audio would make comparisons much more actionable.

If those numbers look good, this seems very promising for local assistant pipelines.

RobotToaster 2026-02-25 05:47 UTC link

> Models for other languages are released under the Moonshine Community License, which is a non-commercial license.

Weird to only release English as open weights.

dagss 2026-02-25 05:52 UTC link

Very exciting stuff!

    hear about what people might build with it

My startup is making software for firefighters to use during missions on tablets, excited to see (when I get the time) if we can use this as a keyboard alternative on the device. It's a use case where avoiding "clunky" is important and a perfect usecase for speech-to-text.

Due to the sector being increasingly worried about "hybrid threats" we try to rely on the cloud as little as possible and run things either on device or with the possibility of being self-hosted/on-premise. I really like the direction your company is going in in this respect.

We'd probably need custom training -- we need Norwegian, and there's some lingo, e.g., "bravo one two" should become "B-1.2". While that can perhaps also be done with simple post-processing rules, we would also probably want such examples in training for improved recognition? Have no VC funding, but looking forward to getting some income so that we can send some of it in your direction :)

Ross00781 2026-02-25 06:44 UTC link

The streaming architecture looks really promising for edge deployments. One thing I'm curious about: how does the caching mechanism handle multiple concurrent audio streams? For example, in a meeting transcription scenario with 4-5 speakers, would each stream maintain its own cache, or is there shared state that could create bottlenecks?

binome 2026-02-25 07:29 UTC link

I vibe-trained moonshine-tiny on amateur radio morse code last weekend, and was surprised at the ~2% CER I was seeing in evals and over the air performance was pretty acceptable for a couple hour run on a 4090.

sourcetms 2026-02-25 08:06 UTC link

I'm offering support for this in Resonant - Already set up and running this week.

It's incredible for a live transcription stream - the latency is WOW.

https://www.onresonant.com/

For the open source folks, that's also set up in handy, I think.

T0mSIlver 2026-02-25 09:48 UTC link

Congrats on the results. The streaming aspect is what I find most exciting here.

I built a macOS dictation app (https://github.com/T0mSIlver/localvoxtral) on top of Voxtral Realtime, and the UX difference between streaming and offline STT is night and day. Words appearing while you're still talking completely changes the feedback loop. You catch errors in real time, you can adjust what you're saying mid-sentence, and the whole thing feels more natural. Going back to "record then wait" feels broken after that.

Curious how Moonshine's streaming latency compares in practice. Do you have numbers on time-to-first-token for the streaming mode? And on the serving side, do any of the integration options expose an OpenAI Realtime-compatible WebSocket endpoint?

regularfry 2026-02-25 11:39 UTC link

Oh this is fantastic. I'm most interested to see if this reaches down to the raspberry pi zero 2, because that's a whole new ballgame if it does.

fittingopposite 2026-02-25 13:10 UTC link

Which program does support it to allow streaming? Currently using spokenly and parakeet but would like to transition to a model that is streaming instead of transcribing chunk wise.

Ross00781 2026-02-25 18:46 UTC link

Open-weight STT models hitting production-grade accuracy is huge for privacy-sensitive deployments. Whisper was already impressive, but having competitive alternatives means we're not locked into a single model family. The real test will be multilingual performance and edge device efficiency—has anyone benchmarked this on M-series or Jetson?

reitzensteinm 2026-02-25 01:18 UTC link

Parakeet V3 is over twice the parameter count of Moonshine Medium (600m vs 245m), so it's not an apples to apples comparison.

I'm actually a little surprised they haven't added model size to that chart.

theologic 2026-02-25 01:49 UTC link

By the way, I've been using a Whisper model, specifically WhisperX, to do all my work, and for whatever reason I just simply was not familiar with the Handy app. I've now downloaded and used it, and what a great suggestion. Thank you for putting it here, along with the direct link to the leaderboard.

I can tell that this is now definitely going to be my go-to model and app on all my clients.

tuananh 2026-02-25 03:27 UTC link

Handy is amazing. Super quality app.

riedel 2026-02-25 07:19 UTC link

I find it an even more weird practice for anyone working with speech or text models not in the first paragraph name the language it is meant for (and I do not mean the programming language bindings). How many English native speakers are there 5% of the world population?

admiralrohan 2026-02-25 09:15 UTC link

Is this alternative to Whispr Flow?

steinvakt2 2026-02-25 09:34 UTC link

Interesting. Can we get in touch? I just sold my webapp/saas where I used NB-Whisper to transcribe Norwegian media (podcast, radio, TV) and offer alerts and search by indexing it using elasticsearch.

Edit: It was https://muninai.eu (I shut down the backend server yesterday so the functionality is disabled).

kardaj 2026-02-25 10:38 UTC link

I'm building a local-first transcription iOS app and have been on Whisper Medium, switching to Parakeet V3 based on this.

One note for anyone using Handy with codex-cli on macOS: the default "Option + Space" shortcut inserts spaces mid-speech. "Left Ctrl + Fn" works cleanly instead. I'm curious to know which shortcuts you're using.

d4rkp4ttern 2026-02-25 12:34 UTC link

Was a big fan of Handy until I found Hex, which, incredibly, has even faster transcription (with Parakeet V3), it’s MacOS only:

https://github.com/kitlangton/Hex

regularfry 2026-02-25 13:50 UTC link

Tangentially, have you got any idea what the equivalent "partial tokens revised" rate for humans is? I know I've consciously experienced backtracking and re-interpreting words before, and presumably it happens subconsciously all the time. But that means there's a bound on how low it's reasonable to expect that rate to be, and I don't have an intuition for what it is.

Editorial Channel

What the content says

+0.22

Article 27 Cultural Participation

High Advocacy Practice

Editorial

+0.22

SETL

+0.07

Project explicitly contributes to cultural and scientific advancement: moonshine is open-source ASR technology advancing speech recognition science; project enables participation in scientific progress.

+0.20

Article 19 Freedom of Expression

Medium Advocacy Practice

Editorial

+0.20

SETL

-0.11

Repository title and description explicitly address information sharing about speech recognition technology: 'Fast and accurate automatic speech recognition (ASR) for edge devices' demonstrates commitment to communicating technical innovation.

+0.18

Article 26 Education

Medium Advocacy Practice

Editorial

+0.18

SETL

-0.06

Project contributes to education: open-source ASR implementation provides educational resource for machine learning and speech processing; technical documentation supports learning.

+0.15

Article 25 Standard of Living

Medium Advocacy Practice

Editorial

+0.15

SETL

-0.07

Project description addresses health and welfare: speech recognition technology enables accessibility for persons with disabilities and supports inclusive communication.

Preamble Preamble

Low Practice

No editorial content addressing human dignity or fundamental freedoms observable on the page.

Article 1 Freedom, Equality, Brotherhood

Low Practice

No explicit content addressing inherent equality or freedoms.

Article 2 Non-Discrimination

Low Practice

No content explicitly addressing discrimination on any grounds.

Article 3 Life, Liberty, Security

Low Practice

No explicit content on life, liberty, or personal security.

Article 4 No Slavery

No content addressing slavery or servitude.

Article 5 No Torture

No content on torture or cruel treatment.

Article 6 Legal Personhood

No content on legal personhood.

Article 7 Equality Before Law

No content on equal legal protection.

Article 8 Right to Remedy

No content on legal remedy access.

Article 9 No Arbitrary Detention

No content on arbitrary detention.

Article 10 Fair Hearing

No content on fair trial or due process.

Article 11 Presumption of Innocence

No content on criminal procedure or retroactive laws.

Article 12 Privacy

Medium Practice

No explicit content on privacy.

Article 13 Freedom of Movement

Medium Practice

No explicit content on freedom of movement.

Article 14 Asylum

Low Practice

No content on asylum or political persecution.

Article 15 Nationality

No content on nationality or citizenship.

Article 16 Marriage & Family

No content on family rights or marriage.

Article 17 Property

Medium Practice

No explicit content on property rights.

Article 18 Freedom of Thought

Low Practice

No explicit content on freedom of thought or conscience.

Article 20 Assembly & Association

Low Practice

No explicit content on freedom of assembly.

Article 21 Political Participation

Low Practice

No content on participation in government.

Article 22 Social Security

Low Practice

No explicit content on social security or welfare.

Article 23 Work & Equal Pay

Low Practice

No content explicitly addressing labor rights.

Article 24 Rest & Leisure

No content on rest or leisure.

Article 28 Social & International Order

Low Practice

No explicit content on social and international order.

Article 29 Duties to Community

Low Practice

No explicit content on community responsibilities.

Article 30 No Destruction of Rights

Low Practice

No content on interpretation or limitation of rights.

Structural Channel

What the site does

+0.25

Article 19 Freedom of Expression

Medium Advocacy Practice

Structural

+0.25

Context Modifier

+0.20

SETL

-0.11

Public repository enables free expression of information; documentation and code are openly accessible and shareable; README and technical content are freely distributed without gatekeeping.

+0.20

Article 26 Education

Medium Advocacy Practice

Structural

+0.20

Context Modifier

+0.15

SETL

-0.06

GitHub's public repository model makes moonshine freely accessible for educational use and skill development; source code transparency enables learning-by-examination; no paywalls restrict educational access.

+0.20

Article 27 Cultural Participation

High Advocacy Practice

Structural

+0.20

Context Modifier

+0.20

SETL

+0.07

Public repository structure enables collaborative scientific development; free access to source code supports scientific culture; open licensing enables modification and redistribution aligned with scientific freedom principles.

+0.18

Article 25 Standard of Living

Medium Advocacy Practice

Structural

+0.18

Context Modifier

+0.15

SETL

-0.07

GitHub's accessibility features (keyboard navigation, ARIA labels, responsive design) enable equal access to the moonshine repository; edge device optimization supports health accessibility for resource-limited populations.

Preamble Preamble

Low Practice

GitHub's platform structure enables collaborative development and open-source contributions, supporting collective human advancement through technology sharing.

Article 1 Freedom, Equality, Brotherhood

Low Practice

GitHub's repository structure treats all contributors equally regardless of identity; no discrimination visible in access or participation mechanisms.

Article 2 Non-Discrimination

Low Practice

GitHub's platform does not visibly restrict participation based on protected characteristics; the moonshine repository maintains open contribution policies.

Article 3 Life, Liberty, Security

Low Practice

The public repository does not directly address personal security; GitHub's platform provides secure access controls and privacy mechanisms.

Article 4 No Slavery

Repository does not engage with labor exploitation issues.

Article 5 No Torture

Not applicable to open-source repository context.

Article 6 Legal Personhood

Not directly applicable to repository interface.

Article 7 Equality Before Law

GitHub's legal framework applies equally to all users.

Article 8 Right to Remedy

Not applicable to repository landing page.

Article 9 No Arbitrary Detention

Not applicable to open-source repository.

Article 10 Fair Hearing

Not applicable to repository context.

Article 11 Presumption of Innocence

Not applicable to repository.

Article 12 Privacy

Medium Practice

GitHub provides privacy controls for repository data; public repository mode allows user control over disclosure, with privacy settings available for sensitive information.

Article 13 Freedom of Movement

Medium Practice

Public repository globally accessible without geographic restrictions; moonshine project enables unrestricted code distribution across borders.

Article 14 Asylum

Low Practice

GitHub provides safe platform for developers from all countries to collaborate without persecution; open-source culture supports intellectual freedom.

Article 15 Nationality

Repository does not engage with nationality issues.

Article 16 Marriage & Family

Not applicable to repository context.

Article 17 Property

Medium Practice

GitHub retains platform control; user contributions are subject to GitHub's terms of service, creating conditional rather than absolute intellectual property ownership. Moonshine project is open-source, which distributes rather than concentrates property rights.

Article 18 Freedom of Thought

Low Practice

Open-source repository enables developers to express technical ideas and design philosophy without censorship; GitHub community guidelines protect expression within bounds.

Article 20 Assembly & Association

Low Practice

GitHub's issue trackers and discussions enable collaborative organization and collective action on technical projects; moonshine repository can host community organizing around ASR development.

Article 21 Political Participation

Low Practice

Open-source community governance models enable democratic participation in project decisions; moonshine can implement governance aligned with collective decision-making.

Article 22 Social Security

Low Practice

Open-source projects like moonshine provide free access to technology that supports economic and social welfare; accessibility of speech recognition tools advances equity.

Article 23 Work & Equal Pay

Low Practice

Open-source contribution model enables voluntary participation and skill development; no exploitative labor practices visible in repository structure.

Article 24 Rest & Leisure

Not applicable to repository context.

Article 28 Social & International Order

Low Practice

Open-source project participates in global digital commons supporting international cooperation; GitHub's global infrastructure enables worldwide collaboration aligned with Article 28 principles of international order.

Article 29 Duties to Community

Low Practice

Open-source licensing imposes minimal restrictions; GitHub community guidelines establish norms for responsible participation; project maintenance requires community stewardship.

Article 30 No Destruction of Rights

Low Practice

GitHub's terms and open-source licensing prevent nullification of UDHR rights; public repository cannot be used to restrict others' fundamental freedoms.

Supplementary Signals

How this content communicates, beyond directional lean. Learn more

Epistemic Quality ℹ

How well-sourced and evidence-based is this content?

0.68 low claims

Sources		0.7
Evidence		0.7
Uncertainty		0.6
Purpose		0.8

Propaganda Flags ℹ

No manipulative rhetoric detected

0 techniques detected

Emotional Tone ℹ

Emotional character: positive/negative, intensity, authority

measured

Valence		+0.6
Arousal		0.3
Dominance		0.4

Transparency ℹ

Does the content identify its author and disclose interests?

0.50

✗ Author

More signals: context, framing & audience

Solution Orientation ℹ

Does this content offer solutions or only describe problems?

0.70 solution oriented

Reader Agency

0.8

Stakeholder Voice ℹ

Whose perspectives are represented in this content?

0.58 3 perspectives

Speaks: developersinstitutioncommunity

About: usersmarginalizedindividuals

Temporal Framing ℹ

Is this content looking backward, at the present, or forward?

present unspecified

Geographic Scope ℹ

What geographic area does this content cover?

global

Complexity ℹ

How accessible is this content to a general audience?

moderate medium jargon general

Longitudinal 242 HN snapshots · 5 evals

Audit Trail 25 entries

2026-02-28 14:21	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-02-28 14:21	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning Tech content no UDHR relevance
2026-02-26 23:08	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-26 23:08	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
2026-02-26 20:16	dlq	Dead-lettered after 1 attempts: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3	- -
2026-02-26 20:14	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 20:13	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 20:12	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 17:36	dlq	Dead-lettered after 1 attempts: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3	- -
2026-02-26 17:34	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 17:33	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 17:32	rate_limit	OpenRouter rate limited (429) model=llama-3.3-70b	- -
2026-02-26 09:10	dlq	Dead-lettered after 1 attempts: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3	- -
2026-02-26 09:09	dlq	Dead-lettered after 1 attempts: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3	- -
2026-02-26 09:07	rate_limit	OpenRouter rate limited (429) model=mistral-small-3.1	- -
2026-02-26 09:07	rate_limit	OpenRouter rate limited (429) model=hermes-3-405b	- -
2026-02-26 09:06	rate_limit	OpenRouter rate limited (429) model=mistral-small-3.1	- -
2026-02-26 09:06	rate_limit	OpenRouter rate limited (429) model=hermes-3-405b	- -
2026-02-26 09:05	rate_limit	OpenRouter rate limited (429) model=hermes-3-405b	- -
2026-02-26 09:05	rate_limit	OpenRouter rate limited (429) model=mistral-small-3.1	- -
2026-02-26 09:05	dlq	Dead-lettered after 1 attempts: Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3	- -
2026-02-26 09:03	rate_limit	OpenRouter rate limited (429) model=qwen3-next-80b	- -
2026-02-26 08:18	eval	Evaluated by deepseek-v3.2: +0.20 (Mild positive) 10,216 tokens
2026-02-26 03:52	eval	Evaluated by claude-haiku-4-5-20251001: +0.18 (Mild positive) 13,592 tokens -0.13
2026-02-26 02:46	eval	Evaluated by claude-haiku-4-5-20251001: +0.31 (Neutral) 12,637 tokens

build 1ad9551+j7zs · deployed 2026-03-02 09:09 UTC · evaluated 2026-03-02 13:57:54 UTC