smattiso (u/smattiso)

smattiso commented on AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms deepmind.google/discover/... · Posted by u/Fysi

xianshou · 4 months ago

Calling it now - RL finally "just works" for any domain where answers are easily verifiable. Verifiability was always a prerequisite, but the difference from prior generations (not just AlphaGo, but any nontrivial RL process prior to roughly mid-2024) is that the reasoning traces and/or intermediate steps can be open-ended with potentially infinite branching, no clear notion of "steps" or nodes and edges in the game tree, and a wide range of equally valid solutions. As long as the quality of the end result can be evaluated cleanly, LLM-based RL is good to go.

As a corollary, once you add in self-play with random variation, the synthetic data problem is solved for coding, math, and some classes of scientific reasoning. No more modal collapse, no more massive teams of PhDs needed for human labeling, as long as you have a reliable metric for answer quality.

This isn't just neat, it's important - as we run out of useful human-generated data, RL scaling is the best candidate to take over where pretraining left off.

smattiso · 4 months ago

Are there platforms that make such training more streamlined? Say I have some definition of success for a given problem and it’s data how do I go about generating said RL model as fast and easily as possible?

smattiso commented on Show HN: Airweave – Let agents search any app github.com/airweave-ai/ai... · Posted by u/lennertjansen

smattiso · 4 months ago

This is a great idea. I have a question:

Typically speaking an LLM is the code driving the control flow and the MCP servers are kind of dumb API endpoints (find_flights, search_hotels, etc) say for a travel MCP.

With your product, how is the LLM made aware of the underlying data store in a more useful way than “func search(query)”?

It seems to be that if you could expose some precomputed API structure into the MCP for a given data store then the LLM could reason more effectively about the data rather than throwing search queries into the void and hoping for the best?

smattiso commented on Ask HN: I got into MIT. Should I go? · Posted by u/throwaway7819

smattiso · 3 years ago

You should go. I was all set to go to Caltech or the Ivies and turned them down for mostly, though not entirely, financial reasons and went to a state school instead. Worked out mostly fine but much harder to get foot in the door and the opportunities and experiences you get in a place like MIT last a lifetime.

If you don't go, many doors will become closed or so much harder to get into that it's basically equivalent. Want to join a think tank, become a quant on Wallstreet, easily raise VC money at 23? These are all much easier with the credibility a MIT degree brings.

"Normal" degreed people essentially live a life where everybody assumes you are stupid until proven otherwise. MIT people are assumed to be smart until proven otherwise. That is a tremendous advantage in almost all contexts.

Not to mention the quality of your peers and the education itself. Don't pass up the opportunity to really challenge yourself and see what you can do. Sure, it's just undergrad and you aren't really solving anything of note but just being surrounded by the leaders of your field is inspiring and will push you to be your best.

Sample size 1 over here but especially MIT I would go. MIT is big and diverse enough that you can go as technical as you want, or study business, or…?

Anyway my 2 cents from a 30 something that has been around the block.

Congrats and good luck!

smattiso commented on Show HN: I built a sonar into my surfboard foobarbecue.github.io/sur... · Posted by u/foobarbecue

smattiso · 4 years ago

Awesome idea man. As an avid water sports guy and an avid shark phobia guy, could you use sonar to detect large moving objects at distance?

smattiso commented on Electric car that charges itself sonomotors.com/... · Posted by u/mikecarlton

smattiso · 4 years ago

I'd buy one and turn it into an RV. That use case makes a ton of sense. Fully integrated solar to power your electrical appliances while stationary.

smattiso commented on Launch HN: Axiom (YC W21) – No-code browser automation a.k.a. RPA for everyone · Posted by u/yaseer

yaseer · 5 years ago

Yes, we've been monitoring Power Automate too!

It's a different approach, coupled to automating the desktop office ecosystem, whereas we're coupled to web-apps, and web APIs alone.

Secondly, it is more complicated, a bit more like Leapwork, whereas we're targeting Zapier-level complexity.

Axiom is already too complex for many people (it's why we mainly target these no-code Zapier types). We've seen every marginal % increase in complexity reduces the number of people who can build bots significantly.

Essentially, each RPA product has chosen a power vs ease-of-use trade-off for different segments. We're fixated on the Zapier / Airtable people, not the traditional desktop RPA people, whom I think Microsoft are targetting.

smattiso · 4 years ago

They have cloud work flows as well. I wish you guys all the best though!

smattiso commented on A deep-dive into the future of subscription gaming eurogamer.net/articles/20... · Posted by u/jsnell

0xy · 4 years ago

I've heard this one before!

What we're sold: a low monthly price to enjoy all the content we could ever want.

What we get: rapid balkanisation of service offerings.

Disney+, Netflix, Peacock, Discovery+, HBO Max, Hulu, YouTube Premium, Prime Video, Crackle.

Soon: Xbox Game Pass, PS Game Pass, Steam Game Pass, Ubisoft Game Pass, Take Two Game Pass, Activision Game Pass.

It's worse than bundles! Not to mention publishers just giving up on live service games and switching off servers. Games are art, and art should not be ephemeral.

I'll be able to fire up SimCity for DOS in 50 years. I won't be able to fire up SimCity (2013) in 5 years, let alone 50.

This industry is rapidly hollowing out. Microtransactions permeate every single AAA release, everything is "always online", paywalls for content and disastrous releases.

smattiso · 4 years ago

Just wait until all software becomes a streamed H.265 video stream. That's certainly the future once latency allows for it. No piracy, infinite subscriptions, total control. But if you don't like it, don't buy it. That's life.

smattiso commented on Launch HN: Axiom (YC W21) – No-code browser automation a.k.a. RPA for everyone · Posted by u/yaseer

yaseer · 5 years ago

Here's a list of competitors and how I think we compare with each -

UiPath - Designed for heavyweight Enterprise. We're really not competing for the same market, but their tech is a similar approach. They go for fortunate 500, we go for everyone else.

Zapier - Axiom competes with zapier in some ways. We're different because we automate the Ui, not just APIs, and we integrate with Zapier.

Automatio.co - They seem to emphasise cloud running, and their tool looks a bit more complicated. Most of our bots are actually used locally, where the user processes their own data. We support running in the cloud too. It looks like they're charging for distribution, where we're freemium.

Phantombuster - They focus on templates, rather than 'build your own bot'. Also, nearly all social automations like LinkedIn.

There will be more.

Ultimately, I think browser automation will be similar to API automation, where Zapier, Tray.io et al, all compete with different approaches for different segments of the market.

smattiso · 5 years ago

Seems like Microsoft Power Automate (Flow) is your biggest competitor? It's free for all Office 365 users (most people).