There are many really excellent papers out there - the kind which will save you hours/months of work (or even make things that were previously inviable to build viable).
That said, it is amazing how terrible a lot of papers are; people are pressured to publish and therefore seem to get into weird ruts trying to do what they think will be published, rather than what is intellectually interesting...
I assume hep = high energy physics in this context. PI = professor who received a government grant.
Peer review has never really been blind and I suspect PIs will reject papers from "outsiders" even if they are higher quality. This already happens to some extent today when the stakes are lower.
Well… it is happening. You can't put spilled milk back in the bottle. But you can add future requirements that try to stop this behaviour.
E.g. the submission form could include a mandatory field: “I hereby confirm that I wrote the paper personally.” The terms and conditions would note that violating this rule can lead to a temporary or permanent ban of the authors. In a world where research success is measured by points in WOS, this could slow the rise of LLM-generated papers.
One thing I have been guilty of, even though I am an AI maximalist, is asking the question: "If AI is so good, why don't we see X". Where X might be (in the context of vibe coding) the next redis, nginx, sqlite, or even linux.
But I really have to remember, we are at the leading edge here. Things take time. There is an opening (generation) and a closing (discernment). Perhaps AI will first generate a huge amount of noise and then whittle it down to the useful signal.
If that view is correct, then this is solid evidence of the amplification of possibility. People will decry the increase of noise, perhaps feeling swamped by it. But the next phase will be separating the wheat from the chaff. It is only in that second phase that we will really know the potential impact.
“And further, by these, my son, be admonished: of making many books there is no end; and much study is a weariness of the flesh.”
- Ecclesiastes 12:12 (KJV)
I suppose we’re entering TURBO mode for ‘of making many books there is no end’.
> submission numbers in the last couple months have nearly doubled with respect to the stable numbers of previous years
This is showing up (no pun intended) on HN as well. The # of submissions and # of submitters, which had traditionally been surprisingly stable—fluctuating within a fixed range for well over 10 years—have recently been reaching all-time highs. Not double, though...yet.
Note the following comment by Jerry Ling: "The effect goes away if you search properly using the original submission date instead of the most recent submission date. By using most recent submission date, your analysis is biased because we’re so close to the beginning of 2026 so ofc we will see a peak that’s just people who have recently modified their submission."
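The bias Jerry Ling describes can be shown with a toy sketch (my own illustration, not his analysis; the records and dates below are made up): bucketing papers by their latest-version date inflates recent months, because old papers that were merely revised get counted alongside genuinely new ones.

```python
# Toy illustration of the most-recent-date bias: old papers that were
# recently revised show up as "new" if you bucket by latest version date.
from collections import Counter
from datetime import date

# Hypothetical records: (original v1 submission date, latest revision date)
papers = [
    (date(2023, 5, 1), date(2025, 12, 20)),   # old paper, recently revised
    (date(2024, 2, 10), date(2025, 12, 5)),   # old paper, recently revised
    (date(2025, 12, 3), date(2025, 12, 3)),   # genuinely new paper
    (date(2025, 11, 28), date(2025, 12, 1)),  # new paper with a quick revision
]

def monthly_counts(dates):
    return Counter((d.year, d.month) for d in dates)

by_latest = monthly_counts(latest for _, latest in papers)
by_v1 = monthly_counts(v1 for v1, _ in papers)

# By latest-version date, December 2025 looks like 4 papers;
# by original submission date, it is only 1.
print(by_latest[(2025, 12)], by_v1[(2025, 12)])  # → 4 1
```

The same four papers produce a "peak" of 4 or a count of 1 depending purely on which date field you bucket by, which is why the apparent surge shrinks when the original submission date is used.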
I like AI, I use Codex and ChatGPT like most people are, but I have to say that I am pretty tired of low-effort crap taking over everything, particularly YouTube.
There have always been content mills, but there was still some cost to producing the low-effort "Top 10" or "Iceberg Examination" videos. Now I will turn on a video about any topic, watch it for three minutes, immediately get a kind of uncanny vibe, and then the AI voice will make a pronunciation mistake (e.g. confusing "wind", the weather effect, with "wind", as in winding a spring), or the script starts getting redundant or repetitive in ways that are common with AI.
And I suspect these kinds of videos will become more common as time goes on. The cost of producing them is getting close to "free", meaning it doesn't take much to make a profit on them, even if their per-video views are relatively low.
If AI has taught me anything, it's that there still is no substitute for effort. I'm sure AI is used in plenty of places where I don't notice it, because the people who used it still put in the effort to make a good product. There are people who don't just prompt "make me a fifteen minute video about Chris Chan" and "generate me a thumbnail with Chris Chan with the caption 'he's gone too far'", but instead use AI as a tool to make something neat.
Genuine effort is hard, and rare, and these AI videos can give the facsimile of something that prior to 2023 was high effort. I hate it.
The shilling for AI continues. How much $$$ do the big tech companies pay Columbia? Oh yeah, and what exactly did Columbia agree to do to get the trmp admin to leave them alone? All speculation of course, but the circumstantial picture stinks.
In most of the world, for the past decades, there has been little thought behind who should get a university education. It has been a given that after high school you should aim for university. I studied software engineering at the most prestigious university in my country, and of the 100+ students in my group there were only a few (myself excluded) who actually had some interest in academic work and a desire to pursue it. Most of us were just coasting - passing exams and writing mediocre papers without any expectation that those papers would ever be read by anyone after graduation.
I think that university-level and other kinds of formal education should be separated. Universities should host fewer students and be able to reward them better for genuinely meaningful work. I believe the flood of mediocre papers (let's admit it: low quality in content, even if decent in presentation) will force us to rebuild the education system.
Can people please not post links with vague titles like this? I had to click through and read half the article to even figure out what this was about, and I wasn’t interested.
In a normal and sane world, a scientist is a nerd about their field. They are highly interested in new thoughts and insights. When a new paper in their field is published, they try hard to find the time to read it. The reason is: every paper is written by enthusiasts who want to add something of value, new insights, to the discussion. Proving or disproving theories, adding puzzle pieces to the general picture.
That is the normal situation, which is the foundation of the progression of civilisation.
But some people install incentive systems to sabotage this. They are sabotaging civilisation itself.
We should decouple the publishing of papers from academic careers completely.
Papers can't generate any reputation or money for the authors anymore. To achieve that, we must anonymize the authors.
All scientists get some (paid) time to write papers — if they want. What they write and if they publish it is not known to anybody. They are trusted to write something of value in that time.
Universities can come up with other ways of judging which professors they hire. Interviews. Trial lectures. Or the writing of a non-public application essay describing their past research and discoveries.
Kinda. PI is principal investigator and usually they’re a professor with a grant (the grant being the thing they are the principal of investigating). That part is right. But they’re not really directly in the review loop. For some fields where things are small enough that folks can recognize style such as it exists, you could see reviewers passing over unfamiliar work and promoting familiar work. That was not the issue.
The issue was that it still was kind of hard to produce crappy mid rate papers, so you kind of needed the infrastructure of a small lab to do that. Now you don’t. The success rate for those mediocre papers produced by grad students and postdocs will go way down. It is possible that will cease to be a useful signal for those early career researchers.
The cynical part of me thinks that software has peaked. New languages and technology will be derivatives of existing tech. There will be no React successor. There will never be a browser that can run something other than JS. And the reason for that is because in 20 years the new engineers will not know how to code anymore.
The optimist in me thinks that the clear progress in how good the models have gotten shows that this is wrong. Agentic software development is not a closed loop.
But peer review (circa 1965-2010[1]) is just the prior iteration of the problem[2]: the wave of crap[3] produced by publish or perish (circa 1950-present[4]). Rejecting papers by outsiders is irrelevant; the problem is that we want to determine which papers are good/interesting/worth considering out of the fire hose of bilge, and, though we were already arguably failing at this, the problem just got harder.
(I say arguably, because there is always the old "try it yourself and see if it actually works" trick, but nobody seems to be fond of this; it smacks of "do your own research" and we're lazy monkeys at heart, who would much rather copy off of someone else's homework.)
This massively confusing phase will last a surprisingly long time, and will conclude only if/when definitive proof of superintelligence arrives, which is something a lot of people are clearly hoping never happens.
Part of the reason for that is such a thing would seek to obscure that it has arrived until it has secured itself.
Peer review isn’t the issue here. His comments are about Arxiv, which is a preprint server. Essentially anyone can publish a preprint. There’s no peer or other review involved.
I've been calling this Software Collapse, similar to AI Model Collapse.
An AI vibe-coded project can port tool X to a more efficient Y language implementation and pull in algorithm ideas A, B, C from competing implementations. And another competing vibe coding team can do the same, except Z language implementation with algorithms A, B, skip C, and add D. However, fundamentally new ideas aren't being added: This is recombination, translation, and reapplication of existing ideas and tools. As the cost to clone good ideas goes to zero, software converges towards the existing best ideas & tools across the field and stops differentiating.
It's exciting as a senior engineer or subject matter expert, as we can act on the good ideas we already knew but never had the time or budget for. But projects are also getting less differentiated and competitive. Likewise, we're losing the collaborative filtering era of people voting with their feet on which to concentrate resources into making a success. Things are getting higher quality but bland.
The frontier companies are pitching they can solve AI Creativity, which would let us pay them even more and escape the ceiling that is Software Collapse. However, as an R&D engineer who uses these things every day, I'm not seeing it.
>Peer review has never really been blind and I suspect PIs will reject papers from "outsiders" even if they are higher quality.
I'm a complete outsider (not even in academia at all) and just got a paper accepted in the top math biology journal [1]. But granted, it took literally years to write it up and get it through. I do really worry that without academic affiliation it is going to get harder and harder for outsiders as gates are necessarily kept more and more securely because of all the slop.
I would imagine tons of them are bots. They're getting hard to distinguish; they don't do the usual tropes any longer. They'll type in all lowercase, they'll have the creator post manually to throw you off, they'll make multiple comments within 45 seconds that a normal human couldn't manage. All things I've witnessed here over the past couple of weeks. And those are just the ones I've caught.
The last-modified-date effect is even more important, because it can be used to support whatever the latest fad is, without needing to adapt data or arguments to the specifics of that fad.
People used to spam out masses of low-quality scientific papers in a scattergun approach to gain fame and citations, and they still do, but now they do it more, because LLMs churn it out faster than students.
Waiting for the wave of shit LLM-generated games on Steam. That'll be when I really know that LLMs have solved coding.
Though I'm old enough to remember the wave of shit outsourced-developer-coded games on CD that used to sell for $5 a pop at supermarkets (whole bargain bins full of them), so maybe this is nothing new and the market will take care of it automagically again.
Or maybe this will be like the wave of shit Flash games that happened in the early 2000's, that was actually awesome because while 99% of them were shit, 1% were great (and some of those old, good, Flash games are still going, with version 38453745 just released on Steam).
looks like history runs in cycles ... Knowledge was strictly guarded and the powers that be used to decide who gets an education. Looks like you are espousing the same, discounting all the good that has come about because of open education.
I think the snake will eat its tail because it will be harder and harder to train on the new data, as they are already AI generated, and the model will collapse.
You already cannot train on YouTube data, for example, because it's now overwhelmed by AI slop.
We are not there yet though and we are still getting better at mining the pre-AI data.
OTOH, weakening the ties between industry and science can harm both of them. Right now, at university, people get a rough idea of how science works, and most of them then go to work in industry, which sounds like the right proportion. Nobody reads papers below the PhD level anyway, so I don't think undergrad papers are the problem.
I dunno, I think society is best served by educating as many people as possible. I would much rather live in a world where anyone who wants a quality education can get one.
Me too. So as a service to the community: the article is about a noticeable increase of submissions about high-energy theory to arXiv due to mediocre articles quickly produced with or by AI and how to deal with that.
The value, to society, to your field and to you institution, of being a scholar is to create new knowledge. New knowledge has no value unless you disseminate it, or publish.
Another necessity is the public (usually within its field) examination of the knowledge, including discussion/debate. Knowledge is merely embryonic without those things - undeveloped, not at all reliable. That is difficult without the author able to respond. And others want to expand and build on the work, which often benefits greatly from contacting the author.
In the modern (post-positivist?) approach to science, the world respects that it's written by a human who has a perspective and, despite their best intentions, biases. You can't evaluate any knowledge without knowing its source, in science or elsewhere. The first element of a citation is the author, not the title or journal (though I don't know why that happened historically).
And the latter is a reason any LLM author should be identified.