+0.22 Making MCP cheaper via CLI

Name: HRCB Evaluation: Making MCP cheaper via CLI
Item: Making MCP cheaper via CLI
Rating: 0.211
Author: HN HRCB

Model: @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 deepseek/deepseek-v3.2-20251201 +0.10 claude-haiku-4-5-20251001 +0.22 meta-llama/llama-3.3-70b-instruct:free ND Compare

+0.22	Making MCP cheaper via CLI (kanyilmaz.me S:+0.13 )
	320 points by thellimist 4 days ago \| 120 comments on HN \| Mild positive Editorial · v3.7 · 2026-02-26 00:57:57 0

Summary Information Accessibility & Technical Transparency Advocates

This technical blog post compares MCP (Model Context Protocol) and CLI approaches to AI agent tool integration, advocating for more efficient information disclosure practices. The content demonstrates free expression and information accessibility through detailed comparative analysis and open-source contribution. Structural limitations around privacy tracking partially offset the positive engagement with information freedom and educational content sharing.

Article Heatmap

Negative Neutral Positive No Data

Aggregates

Editorial Mean	+0.22	Structural Mean	+0.13
Weighted Mean	+0.21	Unweighted Mean	+0.20
Max	+0.30 Article 27	Min	+0.13 Article 26
Signal	5	No Data	26
Volatility	0.06 (Low)
Negative	0	Channels	E: 0.6 S: 0.4
SETL ℹ	+0.13	Editorial-dominant
FW Ratio ℹ	65%	24 facts · 13 inferences

Evidence 14% coverage ℹ

  7M  1L  23 ND 

Theme Radar

HN Discussion 20 top-level · 17 replies

crooked-v 2026-02-25 20:52 UTC link

Cheaper, but is it more effective?

I know I saw something about the Next.js devs experimenting with just dumping an entire index of doc files into AGENTS.md and it being used significantly more by Claude than any skills/tool call stuff.

bdavbdav 2026-02-25 20:55 UTC link

I’m not sure how this works. A lot of that tool description is important to the Agent understanding what it can and can’t do with the specific MCP provider. You’d have to make up for that with a much longer overarching description. Especially for internal only tools that the LLM has no intrinsic context for.

speedgoose 2026-02-25 21:09 UTC link

MCP has some schemas though. CLI is a bit of a mess.

But MCP today isn’t ideal. I think we need to have some catalogs where the agents can fetch more information about MCP services instead of filling the context with not relevant noise.

mijoharas 2026-02-25 21:15 UTC link

This sounds similar to MCPorter[0], can anyone point out the differences?

[0] https://github.com/steipete/mcporter

red_hare 2026-02-25 21:16 UTC link

True for coding agents running SotA models where you're the human-in-the-loop approving, less true for your deployed agents running on cheap models that you don't see what's being executed.

But yeah, a concrete example is playwright-mcp vs playwright-cli: https://testcollab.com/blog/playwright-cli

andybak 2026-02-25 21:21 UTC link

Why are they using JSON in the context? I thought we'd figured out that the extra syntax was a waste of tokens?

econ 2026-02-25 21:23 UTC link

I had deepseek explain MCP to me. Then I asked what was the point of persistent connections and it said it was pretty much hipster bullshit and that some url to post to is really enough for an llm to interact with things.

hiccuphippo 2026-02-25 21:23 UTC link

Can LLMs compress those documents into smaller files that still retain the full context?

vasco 2026-02-25 21:35 UTC link

A lot of providers already have native CLI tools with usually better auth support and longer sessions than MCP as well as more data in their training set on how to use those cli tools for many things. So why convert mcp->cli tool instead of using the existing cli tools in the first place? Using the atlassian MCP is dog shit for example, but using acli is great. Same for github, aws, etc.

_pdp_ 2026-02-25 21:37 UTC link

Hehe... nice one. I think we are all thinking the same thing.

I've also launched https://mcpshim.dev (https://github.com/mcpshim/mcpshim).

The unix way is the best way.

jbellis 2026-02-25 21:53 UTC link

You just reinvented Skills

_pdp_ 2026-02-25 21:56 UTC link

There is some important context missing from the article.

First, MCP tools are sent on every request. If you look at the notion MCP the search tool description is basically a mini tutorial. This is going right into the context window. Given that in most cases MCP tool loading is all or nothing (unless you pre-select the tools by some other means) MCP in general will bloat your context significantly. I think I counted about 20 tools in GitHub Copilot VSCode extension recently. That's a lot!

Second, MCP tools are not compossible. When I call the notion search tool I get a dump of whatever they decide to return which might be a lot. The model has no means to decide how much data to process. You normally get a JSON data dump with many token-unfriendly data-points like identifiers, urls, etc. The CLI-based approach on the other hand is scriptable. Coding assistant will typically pipe the tool in jq or tail to process the data chunk by chunk because this is how they are trained these days.

If you want to use MCP in your agent, you need to bring in the MCP model and all of its baggage which is a lot. You need to handle oauth, handle tool loading and selection, reloading, etc.

The simpler solution is to have a single MCP server handling all of the things at system level and then have a tiny CLI that can call into the tools.

In the case of mcpshim (which I posted in another comment) the CLI communicates with the sever via a very simple unix socket using simple json. In fact, it is so simple that you can create a bash client in 5 lines of code.

This method is practically universal because most AI agents these days know how to use SKILLs. So the goal is to have more CLI tools. But instead of writing CLI for every service you can simply pivot on top of their existing MCP.

This solves the context problem in a very elegant way in my opinion.

slopinthebag 2026-02-25 21:58 UTC link

I've seen folks say that the future of using computers will be with an LLM that generates code on the fly to accomplish tasks. I think this is a bit ridiculous, but I do think that operating computers through natural language instructions is superior for a lot of cases and that seems to be where we are headed.

I can see a future where software is built with a CLI interface underneath the (optional) GUI, letting an LLM hook directly into the underlying "business" logic to drive the application. Since LLM's are basically text machines, we just need somebody to invent a text-driven interface for them to use...oh wait!

Imagine booking a flight - the LLM connects to whatever booking software, pulls a list of commands, issues commands to the software, and then displays the output to the user in some fashion. It's basically just one big language translation task, something an LLM is best at, but you still have the guardrails of the CLI tool itself instead of having the LLM generate arbitrary code.

Another benefit is that the CLI output is introspectable. You can trace everything the LLM is doing if you want, as well as validate its commands if necessary (I want to check before it uses my credit card). You don't get this if it's generating a python script to hit some API.

Even before LLM's developers have been writing GUI applications as basically a CLI + GUI for testability, separation of concerns etc. Hopefully that will become more common.

Also this article was obviously AI generated. I'm not going to share my feelings about that.

cmdtab 2026-02-25 22:03 UTC link

Not just cheaper in terms of token usage but accuracy as well.

Even the smallest models are RL trained to use shell commands perfectly. Gemini 3 flash performs better with a cli with 20 commands vs 20+ tools in my testing.

cli also works well in terms of maintaining KV cache (changing tools mid say to improve model performance suffers from kv cache vs cli —help command only showing manual for specific command in append only fashion)

Writing your tools as unix like cli also has a nice benefit of model being able to pipe multiple commands together. In the case of browser, i wrote mini-browser which frontier models use much better than explicit tools to control browser because they can compose a giant command sequence to one shot task.

https://github.com/runablehq/mini-browser

pelcg 2026-02-25 22:04 UTC link

This looks related to Awesome CLIs/TUIs and terminal trove which has lots both CLI and TUI apps.

Awesome TUIs: https://github.com/rothgar/awesome-tuis

Awesome CLIs: https://github.com/agarrharr/awesome-cli-apps

Terminal Trove: https://terminaltrove.com/

I guess this is another one shows that the CLI and Unix is coming back in 2026.

philfreo 2026-02-25 22:10 UTC link

Is this article from a while back?

> Before your agent can do anything useful, it needs to know what tools are available. MCP’s answer is to dump the entire tool catalog into the conversation as JSON Schema. Every tool, every parameter, every option.

Because this simply isn't true anymore for the best clients, like Claude Code.

Similar to how Skills were designed[1] to be searchable without dumping everything into context, MCP tools can (and does in Claude Code) work the same way.

See https://www.anthropic.com/engineering/advanced-tool-use and https://x.com/trq212/status/2011523109871108570 and https://platform.claude.com/docs/en/agents-and-tools/tool-us...

[1] https://agentskills.io/specification#progressive-disclosure

orliesaurus 2026-02-25 22:11 UTC link

I like this approach ... BUT the big win for me is audit logs. CLIs naturally leave a trail you can replay.

ALSO... the permission boundary is clearer. You can whitelist commands, flags, working dir... it becomes manageable.

HOWEVER... packaging still matters. A “small” CLI that pulls in a giant runtime kills the benefit.

I want the discipline of small protocol plus big cache. Cheap models can summarize what they did and avoid full context in every step...

arjie 2026-02-25 22:37 UTC link

These days you can rewrite everything yourself for very cheap. So this is `mcporter` rewritten. I prefer to use Rust personally for rewrites. Opus 4.6 can churn it out pretty quickly if that's what you want. To be honest, almost all software that I want to try these days I don't even install. Instead I'd rather read the README and produce a personal version. This allows encoding idiosyncrasies and specifics that another author will not accept.

aceelric 2026-02-25 22:51 UTC link

After reading Cloudflare's Code Mode MCP blog post[1] I built CMCP[2] which lets you aggregate all MCP servers behind two mcp tools, search and execute.

I do understand anthropic's Tool Search helps with mcp bloat, but it's limited only to claude.

CMCP currently supports codex and claude but PRs are welcome to add more clients.

[1]https://blog.cloudflare.com/code-mode-mcp/ [2]https://github.com/assimelha/cmcp

kissgyorgy 2026-02-25 23:28 UTC link

A very good example of this is playwright-cli vs Playwright MCP: https://github.com/microsoft/playwright-cli

The biggest difference is state, but that's also kind of easy from CLI, the tool just have to store it on disk, not in process memory.

thellimist 2026-02-25 21:16 UTC link

It's the same from functionality perspective. The schema's are converted to CLI versions of it. It's a UI change more than anything.

thellimist 2026-02-25 21:20 UTC link

I can give example.

LLM only know `linear` tool exists.

I ask "get me the comments in the last issue"

Next call LLM does is

`linear --help 2>&1 | grep -i -E "search|list.issue|get.issue")` then `linear list-issues --raw '{"limit": 3}' -o json 2>&1 | head -80)` then `linear list-comments --issue-id "abc1ceae-aaaa-bbbb-9aaa-6bef0325ebd0" 2>&1)`

So even the --help has filtering by default. Current models are pretty good

thellimist 2026-02-25 21:21 UTC link

personal experience, definitely yes. You can try it out with `gh` rather than `Github MCP`. You'll see the difference immediately (espicially more if you have many MCPs)

groby_b 2026-02-25 21:28 UTC link

You are free to build tools that emit/ingest json, and provide a json schema upon request.

The point is push vs pull.

thellimist 2026-02-25 21:33 UTC link

Main differences are

CLIHub

- written in go

- zero-dependency binaries

- cross-compilation built-in (works on all platforms)

- supports OAuth2 w/ PKCE, S2S, Google SA, API key, basic, bearer. Can be extended further

MCPorter

- TS

- huge dependency list

- runtime dependency on bun

- Auth supports OAuth + basic token

- Has many features like SDK, daemons (for certain MCPs), auto config discovery etc.

MCPorter is more complete tbh. Has many nice to have features for advanced use cases.

My use case is simple. Does it generate a CLI that works? Mainly oauth is the blocker since that logic needs to be custom implemented to the CLI.

thellimist 2026-02-25 21:34 UTC link

What do you mean?

thellimist 2026-02-25 21:50 UTC link

Nice!

Compared both

---

TL;DR CLIHUB compiles MCP servers into portable, self-contained binaries — think of it like a compiler. Best for distribution, CI, and environments where you can't run a daemon.

mcpshim is a runtime bridge — think of it like a local proxy. Best for developers juggling many MCP servers locally, especially when paired with LLM agents that benefit from persistent connections and lightweight aliases.

---

https://cdn.zappy.app/b908e63a442179801e406b01cf412433.png (table comparison)

---

thellimist 2026-02-25 21:56 UTC link

I don't prefer to use online skills where half has malware

Official MCPs are trusted. Official MCPs CLIs are trusted.

esafak 2026-02-25 22:05 UTC link

Did he? Skills are for CLIs, not for converting MCPs into CLIs.

thellimist 2026-02-25 22:06 UTC link

I actually want to combine this and CLIHub into a directory where someone can download all the official MCPs or CLIs (or MCP to CLIs) with a single command

thellimist 2026-02-25 22:15 UTC link

Ofc it is written by ai, I have a skill for it -

https://github.com/thellimist/thellimist.github.io/blob/mast...

I dump a voice message, then blog comes out. Then I modify a bunch of things, and iterate 1-2 hours to get it right

thellimist 2026-02-25 22:18 UTC link

FYI the blog has direct comparison to Anthropic’s Tool Search.

Regardless, most MCPs are dumping. I know Cloudflare MCP is amazing but other 1000 useful MCPs are not.

thellimist 2026-02-25 22:19 UTC link

This is cool!

I was actually thinking if I should support daemons just to support playwright. Now I don't have a use case for it

xyzsparetimexyz 2026-02-25 22:40 UTC link

lol

thellimist 2026-02-25 22:58 UTC link

did you check the token usage comparison between cmcp and cli?

CharlieDigital 2026-02-25 23:27 UTC link

Probably oversold here because if you read the fine print, the savings only come in cases when you don't need the bytes in context.

That makes sense for some of the examples the described (e.g. a QA workflow asking the agent to take a screenshot and put it into a folder).

However, this is not true for an active dev workflow when you actually do want it to see that the elements are not lining up or are overlapping or not behaving correctly. So token savings are possible...if your use case doesn't require the bytes in context (which most active dev use cases probably do)*

22c 2026-02-25 23:32 UTC link

Pretty sure I saw this one a couple of weeks back, or something very similar to it..

https://github.com/philschmid/mcp-cli

Edit: Turns out was https://github.com/steipete/mcporter noted elsewhere in the thread, but mcp-cli looks like a very similar thing.

Editorial Channel

What the content says

+0.30

Article 13 Freedom of Movement

Medium Framing

Editorial

+0.30

SETL

+0.24

Content advocates for freedom of information and technical transparency through comparative analysis of tool documentation methods. The post demonstrates how different approaches (MCP vs CLI) handle information disclosure.

+0.25

Article 23 Work & Equal Pay

Medium Framing

Editorial

+0.25

SETL

+0.16

Content addresses fair compensation for work through detailed cost analysis of different approaches to tool integration. The post advocates for understanding true economic costs and efficiency of labor (agent computation time).

+0.20

Article 19 Freedom of Expression

Medium Framing

Editorial

+0.20

SETL

+0.14

Content demonstrates free expression through technical analysis and opinion on tool design efficiency. The author freely shares analysis and criticism of existing approaches.

+0.20

Article 27 Cultural Participation

Medium Framing

Editorial

+0.20

SETL

0.00

Content engages with cultural participation through technical contribution to AI/agent development community. The author participates in shared technical culture and shares innovations.

+0.15

Article 26 Education

Low Framing

Editorial

+0.15

SETL

+0.09

Content implicitly supports education through knowledge sharing about technical tool design and efficiency. The detailed explanation of MCP vs CLI serves an educational function about cost optimization.

Preamble Preamble

Content does not directly engage with the universal dignity or inherent rights framework of the Preamble.

Article 1 Freedom, Equality, Brotherhood

No engagement with equal dignity or fundamental equality principles.

Article 2 Non-Discrimination

Medium Practice

No editorial engagement with non-discrimination principles.

Article 3 Life, Liberty, Security

No engagement with right to life, liberty, or security of person.

Article 4 No Slavery

No content addressing slavery or servitude.

Article 5 No Torture

No content addressing torture or cruel treatment.

Article 6 Legal Personhood

No engagement with right to recognition before law.

Article 7 Equality Before Law

No content addressing equal protection before law.

Article 8 Right to Remedy

No engagement with right to remedy for violation of rights.

Article 9 No Arbitrary Detention

No content addressing arbitrary arrest or detention.

Article 10 Fair Hearing

No engagement with fair trial or due process.

Article 11 Presumption of Innocence

No content addressing criminal liability or retroactive punishment.

Article 12 Privacy

Medium Practice

No editorial engagement with privacy rights.

Article 14 Asylum

No content addressing right to asylum or political refugee status.

Article 15 Nationality

No engagement with right to nationality.

Article 16 Marriage & Family

No content addressing marriage or family rights.

Article 17 Property

No engagement with property rights.

Article 18 Freedom of Thought

No content addressing freedom of thought, conscience, or religion.

Article 20 Assembly & Association

No engagement with freedom of peaceful assembly or association.

Article 21 Political Participation

No engagement with democratic participation or voting rights.

Article 22 Social Security

No explicit engagement with social security or welfare rights.

Article 24 Rest & Leisure

No engagement with rest, leisure, or limited work hours.

Article 25 Standard of Living

Medium Practice

No direct engagement with social security or standard of living.

Article 28 Social & International Order

No engagement with right to social and international order.

Article 29 Duties to Community

No engagement with duties to community or limitations on rights.

Article 30 No Destruction of Rights

No engagement with prevention of rights destruction or abuse.

Structural Channel

What the site does

Domain Context Profile

Element	Modifier	Affects	Note
Legal & Terms
Privacy	-0.05	Article 12	Google Analytics tracking (G-92V8ZN63H4) present without explicit privacy policy link visible on-domain.
Terms of Service	—		No Terms of Service detected on-domain.
Identity & Mission
Mission	—		No explicit mission or values statement found on-domain.
Editorial Code	—		No editorial standards or corrections policy visible on-domain.
Ownership	—		Individual blog; author identity present (Kan Yilmaz). Ownership clear.
Access & Distribution
Access Model	+0.10	Article 25 Article 27	Content appears freely accessible; no paywall or gatekeeping observed.
Ad/Tracking	-0.05	Article 12	Google Analytics embedded; no consent banner or tracking opt-out mechanism visible.
Accessibility	+0.05	Article 2 Article 25	Mobile menu functionality present; semantic HTML structure observed. No explicit accessibility statement detected.

+0.20

Article 27 Cultural Participation

Medium Framing

Structural

+0.20

Context Modifier

+0.10

SETL

0.00

Open-source release of CLIHub converter enables community participation in technical culture. Content is freely shared, supporting collective cultural advancement.

+0.15

Article 23 Work & Equal Pay

Medium Framing

Structural

+0.15

Context Modifier

0.00

SETL

+0.16

Author is identified by name, suggesting recognition of individual contribution. Open-source contribution (CLIHub converter) is mentioned, indicating labor is being shared.

+0.10

Article 13 Freedom of Movement

Medium Framing

Structural

+0.10

Context Modifier

0.00

SETL

+0.24

Content is freely accessible without paywall or registration gatekeeping. Author identity (Kan Yilmaz) is clear and linked content is openly available.

+0.10

Article 19 Freedom of Expression

Medium Framing

Structural

+0.10

Context Modifier

0.00

SETL

+0.14

No editorial gatekeeping or censorship of ideas observed. Content publicly published without apparent content restrictions.

+0.10

Article 26 Education

Low Framing

Structural

+0.10

Context Modifier

0.00

SETL

+0.09

Clear, structured presentation of technical information supports learner comprehension. Code examples and tables facilitate understanding.

Preamble Preamble

No structural signals indicate intentional promotion or violation of preamble principles.

Article 1 Freedom, Equality, Brotherhood

No structural differentiation or discriminatory gatekeeping observed.

Article 2 Non-Discrimination

Medium Practice

Mobile menu and semantic HTML suggest intentional accessibility effort without explicit gatekeeping by disability or other protected status.

Article 3 Life, Liberty, Security

No structural signals related to personal security or liberty.

Article 4 No Slavery