a9dhalaan (u/a9dhalaan)

a9dhalaan commented on Computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku anthropic.com/news/3-5-mo... · Posted by u/weirdcat

thenameless7741 · 10 months ago

Before anyone reads too much into this, here's what an Anthropic staff said on Discord:

> i don't write the docs, no clue

> afaik opus plan same as its ever been

a9dhalaan · 10 months ago

Maybe he's not high level enough employee to have any say in the product roadmap, and he's behind on leadership planning?

a9dhalaan commented on Computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku anthropic.com/news/3-5-mo... · Posted by u/weirdcat

bloedsinnig · 10 months ago

Big models / huge models take weeks / month longer than the smaller ones.

Thats why they release them with that skew

a9dhalaan · 10 months ago

I don't think that's quite it. They had it on their website before this, that opus 3.5 was coming soon, now they've removed that from the webpage.

Also, Gemini ultra 1.0, was released like 8 months ago, 1.5 pro released soon after, with this wording "The first Gemini 1.5 model we’re releasing for early testing is Gemini 1.5 Pro"

Still no ultra 1.5, despite many mid and small sized models being released in that time frame. This isn't just an issue of "the training time takes longer", or a "skew" to release dates. There's a better theory to explain why all SoTA LLM companies have not released a heavy model in many months.

a9dhalaan commented on Computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku anthropic.com/news/3-5-mo... · Posted by u/weirdcat

aoeusnth1 · 10 months ago

Yes, (old) 3.5 Sonnet is distinctly worse at emotional intelligence, flexibility, expressiveness and poetry.

a9dhalaan · 10 months ago

Are you also implying that new 3.5 sonnet is better at those things?

a9dhalaan commented on Computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku anthropic.com/news/3-5-mo... · Posted by u/weirdcat

hobofan · 10 months ago

Opus hasn't yet gotten an update from 3 to 3.5, and if you line up the benchmarks, the Sonnet "3.5 New" model seems to beat it everywhere.

I think they originally announced that Opus would get a 3.5 update, but with every product update they are doing I'm doubting it more and more. It seems like their strategy is to beat the competition on a smaller model that they can train/tune more nimbly and pair it with outside-the-model product features, and it honestly seems to be working.

a9dhalaan · 10 months ago

I think the main reason is they tried training a heavy weight model that was supposed to be opus 3.5, but it didn't yield large enough improvements to 3.5 sonnet to justify them releasing it. (They had it on their page for a while that opus was coming soon, and now they've scrapped that.)

This theory is consistent with the other two top players, Open AI and Google, they both were expected to release a heavy model, but instead have just released multiple medium and small tier models. It's been so long since google released gemini ultimate 1.0 (the naming clearly implying that they were planning on upgrading it to 1.5 like they did with Pro)

Not seeing anyone release a heavyweight model, but at the same time releasing many small and medium sized models makes me think that improving models will be much more complicated than scaling it with more compute, and that there likely are diminishing returns with that regard.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

ilovefood · a year ago

It isn't, the blog was just a high level recap of what I built, as I unfortunately don't have enough time to turn it into a product that can be used.

I'm very happy to see that someone is building it though, I really think that personal AIs/LLMs are the future.

a9dhalaan · a year ago

You should still slap the code on github with a disclaimer that it's not stable/fully featured.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

Navkun · a year ago

Just checked your account, if you could press the button on the top right and make it look like the power (without the slash) and then save? since it looks like it isn't enabled on your end.

And yeah still very early will be improving the UI a ton and taking into account your suggestions for the next version.

In terms of youtube, currently we just take into account the title, sometimes it is visible before hidden/blocked but we usually process within ~700ms so vast majority of the time you wouldn't see it.

a9dhalaan · a year ago

I tried again, and now it's working - and yes it is reasonably fast but i wonder once you start adding vision it will likely increase your response time, to the point that you may need to consider a temporary blur until the content is vetted.

Anyways good work on the tool, it has potential even at just blocking inline ads and spam.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

keiferski · a year ago

This kind of fatalism is more and more common, and it can be hard to disagree with in a literal sense, but to me it mostly just indicates that democratic mechanisms need reform — and not that we ought to just stop caring and live with someone else deciding and democracy not “working.” Because that leads to some very bad situations, historically speaking.

I do like the idea of a tool that filters political speak from actual concrete proposals and likelihood of it being implemented.

a9dhalaan · a year ago

Until you have reform in the sense of replacing the First-past-the-post voting with a more representative system that allows the survival of more than 2 centrist parties, or have a system where there are meaningful referendums at a meaningful frequency, then where is the value of being "politically informed" on new/current politics?

Democracy in many western nations, at least in the US, is more or less an illusion of choice. Being sucked into the liberal/republican squabbling, drama and even the occasional political issue is nothing more than mere entertainment for the peasant class. For lobbies and corporations, who actually have much more leverage into governance, then yea being politically informed for them is prudent.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

Navkun · a year ago

Thank you, I will add more clarity on the website now. We use OpenAI's mini model, so your data goes to them (and OpenAI doesn't store data from API req's) and the only things we store are your settings.

Also, we don't require access to any of your socials, we process the tweets directly in your viewport as you would have seen them.

For example we work on reddit and youtube as well and it works without logging in.

But thanks for the feedback!

a9dhalaan · a year ago

"we process the tweets directly in your viewport as you would have seen them."

The primary issue with this approach is, even with the speed of gpt-4o mini, often times you're going to be displaying the "harmful" content for enough time for the brain to process it. This is especially true when you're dealing with images and short 1 sentence content like twitter. I think you'll want a safety mode, where nothing is displayed/or you have a css-blur on it, until it has been vetted.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

a9dhalaan · a year ago

technically, you can add the tag "woke" and hope that Gpt-4o mini's definition of woke is the same as yours. Doubtful it will give you the results you're after, but maybe to a small extent.

a9dhalaan commented on Show HN: I removed politics from Twitter with AI mindfw.com... · Posted by u/Navkun

jart · a year ago

It says right on the Chrome page that your extension handles my PII and has access to the content of websites I visit. Isn't that basically unlimited power? How do you intend to monetize this? If there's no credit card required to install your extension, then this sounds like a great way for you to lose money. How are you covering your OpenAI API fees? I could easily see you paying $5/day for each Twitter addict. I'm sorry but this whole I'm an anonymous person living off Ramen in Brazil and I want full power over your browser to monetize you, do you really expect people to download that?

a9dhalaan · a year ago

The free plan is extremely limited, just runs on one website and only blocks a 500 tweets/month. Also gpt-4o mini isn't that expensive to be $5/day/user, even with a lot of page views.