Readit News logoReadit News
evouga commented on OpenAI O3 breakthrough high score on ARC-AGI-PUB   arcprize.org/blog/oai-o3-... · Posted by u/maurycy
ChildOfChaos · a year ago
This is insanely expensive to run though. Looks like it cost around $1 million of compute to get that result.

Doesn't seem like such a massive breakthrough when they are throwing so much compute at it, particularly as this is test time compute, it just isn't practical at all, you are not getting this level with a ChatGPT subscription, even the new $200 a month option.

evouga · a year ago
Sure but... this is the technology at the most expensive it will ever be. I'm impressed that o3 was able to achieve such high performance at all, and am not too pessimistic about costs decreasing over time.
evouga commented on You've just inherited a legacy C++ codebase, now what?   gaultier.github.io/blog/y... · Posted by u/broken_broken_
z_open · 2 years ago
How is that memory safe? Even vector out of bounds index is not memory safe.
evouga · 2 years ago
It's funny; I spent a couple of hours last week helping some students debug out-of-bounds indices in their Rust code.

I've written bugs that would have been caught by the compiler in a memory-safe language. I think the last time was maybe in 2012 or 2013? I still write plenty of bugs today but they're almost all logic errors that nothing (short of AI tools) could have caught.

evouga commented on The Curious Case of MD5   katelynsills.com/law/the-... · Posted by u/w4lker
ipython · 2 years ago
I read through this hoping to have a reasonable discussion of the difference between preimage attacks (see https://en.m.wikipedia.org/wiki/Preimage_attack) and was disappointed when I did not see the topic mentioned once. :(

It is much more computationally feasible to create two inputs from scratch that hash to the same value than to forge an existing documents hash (the threat model I’m assuming they’re discussing in relation to the law).

As far as I know I am not aware of a demonstrated second preimage attack on md5. Not saying to keep using it, just trying to not spread fud.

Edit: I do see second preimage is mentioned about 3/4 of the way through the article. I confess that I did stop reading and started skimming before then.

evouga · 2 years ago
I think it's simply that the blog author and commentators have an unrealistic threat model when it comes to how the legal profession uses MD5s.

After the first high-profile case where authenticity of evidence gets called into question because a seized electronic document was deliberately doctored to allow for a hash collision (if that ever happens), there will be a will to change to something new.

evouga commented on It’s time to allow researchers to submit manuscripts to multiple journals   nature.com/articles/d4158... · Posted by u/wjb3
inigoalonso · 2 years ago
Why can't the journals share reviewers? Once the reviews are in, the editors decide if they want the paper in their journal, and if more than one does, the authors get to pick. Obviously it would be a bit more complicated with revisions, etc. but it would be an improvement over the current system.
evouga · 2 years ago
Who picks the reviewers and nags them to complete their reviews? This is the principal actual useful work journals do these days.
evouga commented on Fair coins tend to land on the same side they started   arxiv.org/abs/2310.04153... · Posted by u/fbartos
tacitusarc · 2 years ago
For sufficiently analytical folks that works, but for lay people it tends to still be confusing.

The best way I’ve heard it explained to help people get it through intuition is by changing the number of doors and goats. Say there are 100 doors, and they all have goats except one, which has a car. You pick door 1. Monty then proceeds to open doors 2 through 48, skips door 49, and then opens the remaining doors. After all that, he stops and asks you, would you like to switch?

evouga · 2 years ago
The situation is now counterintuitive in the other direction: if Monty Hall had opened those 48 doors at random and they just happened to not contain the car, then there is no advantage to switching, though many people would insist otherwise.
evouga commented on The Tyranny of the Marginal User   nothinghuman.substack.com... · Posted by u/ivee
AlbertCory · 2 years ago
A great article.

"Growth" is the culprit. "What if we just acted like the successful restaurant: packed every night, very profitable, solid employment for my kids?"

Nah. We have to keep growing.

evouga · 2 years ago
It's not so easy for several reasons:

1. It's easy enough for a lone entrepreneur with no investors to make principled long-term decisions. It's not so trivial when you owe a fiduciary duty to a board of venture capitalists.

2. Your team of engineers (which has grown steadily as you've scaled up) have built The Thing and everyone loves it. Now what? You only need 5% of the team to maintain the software. You could fire 95% of the team, which will make your company mighty unpopular to future hires, and moreover your best developers won't want to stay and do maintenance for the next decade. Easier to have them work on gratuitous frontend redesigns, bloated features that increase engagement metrics, etc. In a restaurant the contractors who build and furnish the place aren't your employees.

evouga commented on C++ Papercuts   thecodedmessage.com/posts... · Posted by u/signa11
rhn_mk1 · 2 years ago
IMO not every coder is sloppy with use-after-free. Not everyone is slopy with incorrect mutability. [...] Not everyone is sloppy with $THING[N]. But it's hard to consistently not be sloppy in any one of them, and N are so many that almost everyone is sloppy with at least one of them, most of the time.

Each probablility of a mistake may be low on its own, but they compound with every avenue to make mistakes.

evouga · 2 years ago
I've spent hours helping my students debug their Rust programs of incorrect array-indexing logic, incorrectly-reasoned loop invariants, incorrect conditional logic, and everything in between. They're not double-deleting raw pointers, but then again, most C++ programs don't use raw pointers these days either...

"Rust makes a few classes of bugs impossible to write by construction" is a very nice feature of the language. But I'm really turned off by how insufferable the Rust community is (which shines through in the OP's blog post and several comments elsewhere in the responses): it's never the quote at the start of this paragraph; it's always "writing software in anything but Rust is fundamentally irresponsible!" or "if a Rust program compiles, it must be correct!" No, not even close.

evouga commented on ‘San Francisco changed dramatically’ – Nordstrom is bailing on a flagship store   fortune.com/2023/08/28/no... · Posted by u/archo
tedajax · 2 years ago
Another store lying about "crime" being the reason they're closing.
evouga · 2 years ago
The store may or may not be telling the truth. Either way, it's undeniable that SF has changed dramatically over the last several years, with dramatic upswing in property crime and complete unwillingness on the part of law enforcement to fight it.
evouga commented on MS Teams channels cannot contain MS-DOS device names   learn.microsoft.com/en-us... · Posted by u/tapoxi
Roark66 · 2 years ago
Why is it that most "chat/conference" apps become horrible sooner or later? I still remember when teams used to be an OK app. It even had a Linux desktop client. I remember when slack was actually fast, I remember Skype out being more reliable to make phone calls than my mobile/cell service. Today slack is extremely slow if you add few organisations to it (but at least you can add more than one). Teams has deprecated their Linux desktop client and the only way to use it on Linux is via chrome, but wait, if you use it as part of office365/sharepoint you need to use Firefox for "some" sharepoint links. So essentially you need 2 browsers at all times. Chrome for teams (screen sharing and video), Firefox for some sharepoint links.
evouga · 2 years ago
Because after your lean, highly-productive startup team creates the app that everyone loves, you get a bunch of funding and hire thousands of extraneous software developers and then have to find something for them to do.
evouga commented on Room-Temperature Ambient-Pressure Superconductor LK-99 preprint revision 2   arxiv.org/abs/2307.12037... · Posted by u/lnyan
sigmoid10 · 2 years ago
Silicon is not valuable because it is a good conductor, in fact it is actually more like an insulator in its natural state. However, mixing the right materials into it, it becomes a very good semiconductor. So it can let current flow easily or block it effectively, making it a good basis for transistors. The problem here are the normal (usually copper) wires between transistors that cause heat to build up whenever current flows through them. That limits how densely you can pack wires in a chip. With a superconductor, we could not just have much smaller and much faster processors, we could also have many designs that don't require cooling anymore. Imagine a power monster chip like the RTX 4090 running state of the art LLMs locally on your phone. That's the kind of stuff that's at stake here and that's also why everyone and their uncle wants to be author on the original paper.
evouga · 2 years ago
How much of the cooling cost of current chips is due to wires, vs. the transistors themselves?

Superconductors will not help with the latter.

u/evouga

KarmaCake day761August 29, 2016View Original