Readit News logoReadit News
peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
skc · 2 months ago
How likely is it that the end game becomes that we stop writing apps for actual human users and instead sites become massive walls of minified text against a black screen.
peytoncasper · 2 months ago
Actually a few startups working on this! You should check out Stytch isAgent SDK.

We’re partnering with them on Web Bot Auth

peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
jampa · 2 months ago
The automation is powered through Browserbase, which has a captcha solver. (Whether it is automated or human, I don't know.)
peytoncasper · 2 months ago
We do not use click farms!

You should check out our most recent announcement about Web Bot Auth

https://www.browserbase.com/blog/cloudflare-browserbase-pion...

peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
pants2 · 2 months ago
Interesting that they're allowing Gemini to solve CAPTCHAs because OpenAI's agent detects and forces user-input for CAPTCHAs despite being fully able to solve them
peytoncasper · 2 months ago
You should check out our most recent announcement about Web Bot Auth

https://www.browserbase.com/blog/cloudflare-browserbase-pion...

peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
SilverSlash · 2 months ago
Any idea how Browserbase solves CAPTCHA? Wouldn't be surprised if it sends requests to some "click farm" in a low cost location where humans solve captchas all day :\
peytoncasper · 2 months ago
We do not use click farms :)

You should check out our most recent announcement about Web Bot Auth

https://www.browserbase.com/blog/cloudflare-browserbase-pion...

peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
ramoz · 2 months ago
Disclaimer: Im a cofounder, we focus critical spaces with AI. Also i was the feature request for claude code hooks.

But my bet - we will not deploy a single agent into any real environment without deterministic guarantees. Hooks are a means...

Browserbase with hooks would be really powerful, governance beyond RBAC (but of course enabling relevant guardrailing as well - "does agent have permission to access this sharepoint right now, within this context, to conduct action x?").

I would love to meet with you actually, my shop cares intimately about agent verification and governance. Soon to release the tool I originally designed for claude code hooks.

peytoncasper · 2 months ago
Let’s chat my email is peyton at browserbase dot com
peytoncasper commented on Gemini 2.5 Computer Use model   blog.google/technology/go... · Posted by u/mfiguiere
ramoz · 2 months ago
This will never hit a production enterprise system without some form of hooks/callbacks in place to instill governance.

Obviously much harder with UI vs agent events similar to the below.

https://docs.claude.com/en/docs/claude-code/hooks

https://google.github.io/adk-docs/callbacks/

peytoncasper · 2 months ago
Hi! I work in identity products at Browserbase. I’ve spent a fair amount of time lately thinking about how to layer RBAC across the web.

Do you think callbacks are how this gets done?

u/peytoncasper

KarmaCake day250February 4, 2020
About
https://peytoncasper.com
View Original