Readit News logoReadit News
claiir commented on US to rewrite its past national climate reports   france24.com/en/live-news... · Posted by u/mdhb
claiir · 18 days ago
> Asked by CNN's Kaitlan Collins why previous editions of the National Climate Assessment were no longer available online, former fracking company CEO Wright responded [..]

lol

claiir commented on Just redesigned my personal site with a TTY-style interface   abdisa.me/... · Posted by u/abdisaDev
maze-le · 4 months ago
Very nice, but if you want to use it professionally you should consider having an alternative "clear text" version for recruiters and the like. The technically minded will find it nice but most people won't know what to do or what any of this means. And you could consider adding `ls`, `cd` and `exit` just for fun -- it was the first thing I've tried.
claiir · 4 months ago
Maybe just have commands auto-execute if you click on links in the existing text? That would allow someone to experience the entire interface on a touch device! :>

E.g. there is **__contact__** in the page, bold and underlined, but you cannot click on it to do anything.

claiir commented on Qwen3: Think deeper, act faster   qwenlm.github.io/blog/qwe... · Posted by u/synthwave
bufferoverflow · 4 months ago
What kind of answer do you expect? It all depends on the hotend shape and material, temperature differences, how fast air moves in the room, humidity of the air, etc.
claiir · 4 months ago
Sounds like the LLM you used when writing this slop comment struggled with the problem too. :>
claiir commented on Qwen3: Think deeper, act faster   qwenlm.github.io/blog/qwe... · Posted by u/synthwave
stavros · 4 months ago
I have a small physics-based problem I pose to LLMs. It's tricky for humans as well, and all LLMs I've tried (GPT o3, Claude 3.7, Gemini 2.5 Pro) fail to answer correctly. If I ask them to explain their answer, they do get it eventually, but none get it right the first time. Qwen3 with max thinking got it even more wrong than the rest, for what it's worth.
claiir · 4 months ago
Same experience with my personal benchmarks. Generally unimpressed with Qwen3.
claiir commented on Qwen3: Think deeper, act faster   qwenlm.github.io/blog/qwe... · Posted by u/synthwave
simonw · 4 months ago
As is now traditional for new LLM releases, I used Qwen 3 (32B, run via Ollama on a Mac) to summarize this Hacker News conversation about itself - run at the point when it hit 112 comments.

The results were kind of fascinating, because it appeared to confuse my system prompt telling it to summarize the conversation with the various questions asked in the post itself, which it tried to answer.

I don't think it did a great job of the task, but it's still interesting to see its "thinking" process here: https://gist.github.com/simonw/313cec720dc4690b1520e5be3c944...

claiir · 4 months ago
o1-preview had this same issue too! You’d give it a long conversation to summarize, and if the conversation ended with a question, o1-preview would answer that, completely ignoring your instructions.

Generally unimpressed with Qwen3 from my own personal set of problems.

claiir commented on A Roman Gladiator and a Lion Met in Combat. Only One Walked Away   nytimes.com/2025/04/23/sc... · Posted by u/diodorus
RcouF1uZ4gsC · 4 months ago
> However, the hip bite probably isn’t what killed 6DT19. “We think the individual was incapacitated in some way, and then the animal came along, bit and dragged the body away,” Dr. Thompson said.

Sounds like a polite way to say he was eaten alive

claiir · 4 months ago
Likely not the case, given (1) the body was peri-mortem decapitated (by a human) and (2) apparent structural damage was limited to a single bite mark (on the ilium), with no signs of "taphonomic" damage (indicating limited soft tissue trauma)? [1]

(1) > 6DT19 had been decapitated with a single cut between the second and third cervical vertebrae , delivered from behind.

(2) > Additional [to the decapitation] peri-mortem trauma was present in the form of a series of small depressions on both sides of the pelvis [..]

> Taphonomic damage alone is also unlikely due to the appearance and margins of the lesions, which are the same colour as the surrounding bone (this differs if the break is post-mortem; [56]), and the adherence of bony fragments at the injury site (which occurs when soft tissue is present) .

[1]: https://journals.plos.org/plosone/article?id=10.1371/journal...

claiir commented on OpenAI releases image generation in the API   openai.com/index/image-ge... · Posted by u/themanmaran
claiir · 4 months ago
> GoDaddy is actively experimenting to integrate image generation so customers can easily create logos that are editable [..]

I remember meeting someone on Discord 1-2 years ago (?) working on a GoDaddy effort to have customer-generated icons using bespoke foundation image gen models? Suppose that kind of bespoke model at that scale is ripe for replacement by gpt-image-1, given the instruction-following ability / steerability?

claiir commented on How a 20 year old bug in GTA San Andreas surfaced in Windows 11 24H2   cookieplmonster.github.io... · Posted by u/yett
claiir · 4 months ago
Okay, but why did `LeaveCriticalSection` change? Compiler changes, new features, refactoring, etc? That’s the most interesting part (and absent)!
claiir commented on Show HN: Dia, an open-weights TTS model for generating realistic dialogue   github.com/nari-labs/dia... · Posted by u/toebee
cchance · 4 months ago
Its really amazing cant wait to play with it some, the samples are great... but oddly all seem... really fast, like they'd be perfect but they feel like they're playing at 1.2x speed or is that just me?
claiir · 4 months ago
It’s not just you. The speedup is an artefact of the CFG (Classifier-Free Guidance) the model uses. The other problem is the speedup isn’t constant—it actually accelerates as the generation progresses. The Parakeet paper [1] (which OP lifted their model architecture almost directly from [2]) gives a fairly robust treatment to the matter:

> When we apply CFG to Parakeet sampling, quality is significantly improved. However, on inspecting generations, there tends to be a dramatic speed-up over the duration of the sample (i.e. the rate of speaking increases significantly over time). Our intuition for this problem is as follows: Say that is our model is (at some level) predicting phonemes and the ground truth distribution for the next phoneme occuring is 25% at a given timestep. Our conditional model may predict 20%, but because our uncondtional model cannot see the text transcription, its prediction for the correct next phoneme will be much lower, say 5%. With a reasonable level of CFG, because [the logit delta] will be large for the correct next phoneme, we’ll obtain a much higher final probability, say 50%, which biases our generation towards faster speech. [emphasis mine]

Parakeet details a solution to this, though this was not adopted (yet?) by Dia:

> To address this, we introduce CFG-filter, a modification to CFG that mitigates the speed drift. The idea is to first apply the CFG calculation to obtain a new set of logits as before, but rather than use these logits to sample, we use these logits to obtain a top-k mask to apply to our original conditional logits. Intuitively, this serves to constrict the space of possible “phonemes” to text-aligned phonemes without heavily biasing the relative probabilities of these phonemes (or for example, start next word vs pause more). [emphasis mine]

The paper contains audio samples with ablations you can listen to.

[1]: https://jordandarefsky.com/blog/2024/parakeet/#classifier-fr...

[2]: https://news.ycombinator.com/item?id=43758686

u/claiir

KarmaCake day81December 23, 2024
About
she/her
View Original