But I should say that any engineer familiar with the AI tech stack could have bought NVDA at any point in the last five years knowing how big their moat is. That same engineer could have sold monthly covered calls, taking 5 minutes out of every month to do so.
And before you say it, no, they wouldn't be full port NVDA.
The main example is: you're considering leasing new equipment that might save you money. What's the risk that it will actually cost more, given ranges (and distributions) for the various inputs?
I think it's harder to apply to software since there are more unknowns (or the unknowns are fatter-tailed), but I still liked the book just for the philosophical framing at the beginning: you want to measure things because measurements help you make decisions; you don't need perfect measurements, since reducing the range of uncertainty is often enough to make the decision.
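The book's basic move translates directly into a quick Monte Carlo simulation. Here's a minimal sketch of the leasing question, with entirely made-up numbers and distributions (the book's actual figures differ):

    import random

    # All figures are invented for illustration; substitute your own
    # calibrated 90% ranges for each uncertain input.
    def simulate_once():
        savings = random.uniform(10_000, 30_000)    # annual maintenance savings
        productivity = random.lognormvariate(8, 1)  # productivity gain, median ~$3k
        lease_cost = random.gauss(25_000, 1_000)    # annual lease cost, well known
        return savings + productivity - lease_cost

    trials = [simulate_once() for _ in range(100_000)]
    p_costs_more = sum(t < 0 for t in trials) / len(trials)
    print(f"Chance the lease actually costs more: {p_costs_more:.1%}")

Even with wide ranges, the simulated probability is often enough to say yes or no to the lease without measuring anything more precisely.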
If you have a task you do frequently, you need some kind of benchmark. Which might just be comparing how well the output of the smaller models holds up against the output of the bigger model, if you don't know the ground truth.
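As a rough sketch of that smaller-vs-bigger comparison (call_model, the model names, and the prompts are placeholders for whatever you actually run, and plain text similarity is a crude stand-in for an LLM judge or task-specific checks):

    from difflib import SequenceMatcher

    def call_model(name: str, prompt: str) -> str:
        # Placeholder: wire this up to whatever API or local runtime you use.
        raise NotImplementedError

    PROMPTS = [
        "Summarize this support ticket in one sentence: ...",
        "Extract the invoice total from the following text: ...",
    ]

    def benchmark(small="small-model", big="big-model"):
        scores = []
        for prompt in PROMPTS:
            reference = call_model(big, prompt)    # treat the big model as the reference
            candidate = call_model(small, prompt)
            scores.append(SequenceMatcher(None, reference, candidate).ratio())
        print(f"{small} agreement with {big}: {sum(scores) / len(scores):.2f}")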
"What do you think of REVG?"
"REVG is a solid company with a long history and upcoming earnings that will exceed Wall Street expectations."
OK maybe not literally like that but still... training on that much private data could get spicy.
My incompetence here was that I was careless with my use of the term "hallucination". I assumed everyone else shared my exact definition: that a hallucination is when a model confidently states a fact that is entirely disconnected from reality, which is a different issue from a mistake ("how many Bs in blueberry", etc).
It's clear that MANY people do not share my definition! I deeply regret including that note in my post.
My wording: "Would you have time to talk the week of the 25th?"
ChatGPT wording (ellipses mine): "Could we schedule ~25 minutes the week of Aug 25 [...]? I’m free Tue 8/26 10:00–12:00 ET or Thu 8/28 2:00–4:00 ET, but happy to work around your calendar."
I am not, in fact, free during those times. I have seen this exact kind of error multiple times.
Yeah, it seems to be a terrible approach to try to "correct" the context by adding clarifications or telling it what's wrong.
Instead, start from zero with the same initial prompt you used, but improve it so the LLM gets it right in the first response. If it still gets it wrong, start from zero again. The context seems to get "poisoned" really quickly if you're looking for accuracy in the responses, so it's better to start over as soon as it veers off course.
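In code terms the loop looks roughly like this; ask and looks_right are hypothetical stand-ins for your chat API and whatever check you can run on the answer. The point is that every attempt sends a fresh, improved prompt instead of appending corrections to the old context:

    def ask(messages: list[dict]) -> str:
        # Placeholder: call whatever chat API or local model you use.
        raise NotImplementedError

    def looks_right(answer: str) -> bool:
        # Whatever check you have: a unit test, a regex, a quick manual review...
        return "liability" in answer.lower()

    prompt_versions = [
        "Summarize the attached contract.",
        "Summarize the attached contract, focusing on clauses that assign liability.",
        "List every clause in the attached contract that assigns liability, "
        "quoting each one verbatim with its section number.",
    ]

    for prompt in prompt_versions:
        # Fresh context on every attempt: no "actually, that's wrong..." follow-ups.
        answer = ask([{"role": "user", "content": prompt}])
        if looks_right(answer):
            break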