markusw (u/markusw) - Readit News

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

simonw · a month ago

In working on Rodney I found out that the Claude Code for web environment has a Chrome browser installed already. It's a shame you can't see its output directly - even if it takes a screenshot there's no easy way to view it other than having it commit and push that to a branch in GitHub.

markusw · a month ago

Oooh, I didn't know that.

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

simonw · a month ago

Yes, very much so. It's a much thinner, less feature-rich alternative.

It would be interesting to experiment with Jupyter notebooks as an alternative that could work in Claude Code for web.

I had a poke around just now and couldn't find an existing CLI tool that lets you build those up a section at a time in the same way as Showboat. I did find this Python library though:

    uv run --with nbformat python -c '
    import nbformat
    nb = nbformat.v4.new_notebook()
    nb.cells.append(nbformat.v4.new_markdown_cell("# NBTerm Exploration"))
    nb.cells.append(nbformat.v4.new_code_cell("import sys\nprint(f\"Python {sys.version}\")"))
    nb.cells.append(nbformat.v4.new_code_cell("x = [i**2 for i in range(10)]\nprint(x)"))
    nb.cells.append(nbformat.v4.new_code_cell("sum(x)"))
    with open("demo.ipynb", "w") as f:
        nbformat.write(nb, f)
    '

So you could tell the agent to run code like that and then inspect the `demo.ipynb` notebook later on. It doesn't show the result of evaluating the cells though, you need to run this afterwards to have that happen:

    uv run --with nbformat --with nbclient --with ipykernel python -c '
    import nbformat
    from nbclient import NotebookClient

    nb = nbformat.read("demo.ipynb", as_version=4)
    client = NotebookClient(nb, timeout=60)
    client.execute()
    nbformat.write(nb, "demo_executed.ipynb")
    '

markusw · a month ago

I think you had a post about marimo notebooks at some point? I think they would be better suited for that. Their representation is just Python, they can run as scripts, and they have native HTML output. But of course, it only works for Python.

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

simonw · a month ago

My problem is that I work in dozens of different repos generally using Claude Code for web, which doesn't have a way to install extra global skills yet.

I don't want to duplicate my skills into all those repos (and keep them updated) so I prefer the "uvx tool --help" pattern.

markusw · a month ago

That's actually one of the things that has kept me from using Claude Code web (that, and I often need a Chrome browser for the agent). But they must be working on it.

I saw an MCP I've set up on claude.ai show up in my local Claude Code MCP list the other day, it seems inevitable that there will be skills integration across environments as well at some point.

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

simonw · a month ago

They kind of were one-shotted by Claude. The value is in coming up with a consistent design and good enough --help that you can prompt:

  Run uvx showboat --help and
  uvx rodney --help and use those
  tools to demo the feature you built

The help text effectively doubles as a skill.

markusw · a month ago

I guess it would still make sense to have "demo" and "browser-use" skills, so that the agent can reach for them proactively? I always try to remove as much friction as possible for myself, one little bit at a time.

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

tardismechanic · a month ago

See also (the confusingly named) playwright-cli

https://github.com/microsoft/playwright-cli

Different from the cli used for running tests etc that comes bundled with PlayWright

Sample use:

  playwright-cli open https://demo.playwright.dev/todomvc/ --headed
  playwright-cli type "Buy groceries"
  playwright-cli press Enter
  playwright-cli type "Water flowers"
  playwright-cli press Enter
  playwright-cli check e21
  playwright-cli check e35
  playwright-cli screenshot

markusw · a month ago

Oh, I hadn't seen that one either, thanks for sharing. Here I am still using the Chrome Devtools MCP like a caveman. :D

markusw commented on Show HN: Showboat and Rodney, so agents can demo what they've built simonwillison.net/2026/Fe... · Posted by u/simonw

simonw · a month ago

I'm dabbling with Go at the moment for small tools, mainly as an excuse to learn a new language but also because having a single standalone binary is convenient for shuttling these tiny little tools around.

... but then I'm mostly running them with "uvx name-of-tool" because it turns out Python's packaging infrastructure for binary tools is so good!

markusw · a month ago

If you're coming from the Python world, definitely. I find `go install github.com/simonw/rodney@latest` equally easy. :D Although you need the Go tooling installed, of course. But so much agree, Go is great for CLIs!

markusw commented on Claude Skills are awesome, maybe a bigger deal than MCP simonwillison.net/2025/Oc... · Posted by u/weinzierl

simonw · 5 months ago

Depends which definition of RAG you're talking about.

RAG was originally about adding extra information to the context so that an LLM could answer questions that needed that extra context.

On that basis I guess you could call skills a form of RAG, but honestly at that point the entire field of "context engineering" can be classified as RAG too.

Maybe RAG as a term is obsolete now, since it really just describes how we use LLMs in 2025.

markusw · 5 months ago

I’d rather say you can use skills to do RAG by supplying the right tools in the skill (“here’s how you query our database”).

Calling the skill system itself RAG is a bit of a stretch IMO, unless you end up with so many skills that their summaries can’t fit in the context and you have to search through them instead. ;)

markusw commented on Datastar: Lightweight hypermedia framework for building interactive web apps data-star.dev/... · Posted by u/freetonik

nchmy · 5 months ago

This is the only valid and reasonable criticism I've seen here so far, kudos to you.

299 is, indeed, inaccessible for many people. But, the devs live in places where things cost more, so its also a tiny price compared to the relative value that they provide.

I do think that some sort of geo-relative pricing structure would be worth looking into, but how does one even implement something like that? Is there something that makes it all "just work"? I suspect not. Moreover, the devs have already given away so much of their time - it doesnt make much sense for them to invest even more of it into designing a pricing system like this, from which they are going to possibly earn only a negligible amount from. Perhaps this is a problem that you might like to address yourself?

Anyway, the devs are very clear that most people should never need the Pro license. 95+% of the functionality and value is available for free in the open-source library. Use it, enjoy it, learn from it, profit from it!

markusw · 5 months ago

What I've done in a project once is use Purchasing Power Parity: basically, there's an index for purchasing power, and you adjust your pricing based on that. It's implemented in some major payment platforms, like Gumroad: https://gumroad.com/help/article/327-purchasing-power-parity

markusw commented on Datastar: Lightweight hypermedia framework for building interactive web apps data-star.dev/... · Posted by u/freetonik

imiric · 5 months ago

Datastar is one of the few projects in the web ecosystem in years that feels like a paradigm shift for the better.

The author (Delaney Gillilan) has put a lot of thought into how modern web development should be done from first principles, while leveraging core web technologies like SSE, and eschewing complexity as much as possible. His proposed GoNaDs stack[1] enables some truly impressive web applications. I highly recommend watching this talk[2] to get a sense of what's possible.

I wish the best of luck to the core team and contributors, and hope that this framework, or something like it, disrupts the modern ecosystem of popular web frameworks and stacks.

[1]: https://gonads.net/

[2]: https://www.youtube.com/watch?v=0K71AyAF6E4

markusw · 5 months ago

Thank you for sharing the youtube links in particular! It's nice to get some background info and intent for interesting libraries.