timClicks (u/timClicks)

timClicks commented on Claim: GPT-5-pro can prove new interesting mathematics twitter.com/SebastienBube... · Posted by u/marcuschong

wenc · 7 days ago

A lot of interesting possibilities lie in latent space. For those unfamiliar, this means the underlying set of variables that drive everything else.

For instance, you can put a thousand temperature sensors in a room, which give you 1000 temperature readouts. But all these temperature sensors are correlated, and if you project them down to latent space (using PCA or PLS if linear, projection to manifolds if nonlinear) you’ll create maybe 4 new latent variables (which are usually linear combinations of all other variables) that describe all the sensor readings (it’s a kind of compression). All you have to do then is control those 4 variables, not 1000.

In the chemical space, there are thousands of possible combinations of process conditions and mixtures that produce certain characteristics, but when you project them down to latent variables, there are usually less than 10 variables that give you the properties you want. So if you want to create a new chemical, all you have to do is target those few variables. You want a new product with particular characteristics? Figure out how to get < 10 variables (not 1000s) to their targets, and you have a new product.

timClicks · 7 days ago

It's been a while since I've played in the area, but is PCA still the go to method for dimensionality reduction?

timClicks commented on Ask HN: Have you ever regretted open-sourcing something? · Posted by u/paulwilsonn

erulabs · a month ago

When I was ~14 I open sourced a script to autoconfigure X11's xrandr. It was pretty lousy, had several bugs. I mentioned it on a KDE mailing list and a KDE core contributor told me it was embarrassing code and to kill myself. I took it pretty hard and didn't contribute to KDE or X11 ever again, probably took me about a year to build up the desire to code again.

Everything else I've open-sourced has gone pretty well, comparatively.

timClicks · 25 days ago

This reminds me of when I provided some impressions of Erlang as a newcomer to their mailing list.

One of my suggestions was that they include hash tables, rather than rely on records (linked lists with named key). Got flamed as ignorant, and I've never emailed that mailing list again. A while later, they ended up adding hash tables to the language.

timClicks commented on The path to open-sourcing the DeepSeek inference engine github.com/deepseek-ai/op... · Posted by u/Palmik

avodonosov · 5 months ago

What motivates the commercial AI companies to share their research results and know-how?

Why did Google published the Transformer architecture instead of keeping it to themselves?

I understand that people may want to do good things for humanity, facilitate progress, etc. But if an action goes against commercial interest, how can the company management take it and not get objections from shareholders?

Or there is a commercial logic that motivates sharing of information and intellectual property? What logic is that?

timClicks · 5 months ago

There are a few commercially valid strategies.

1. Goodwill and mindshare. If you're known as "the best" or "the most innovative", then you'll attract customers.

2. Talent acquisition. Smart people like working with smart people.

3. Becoming the standard. If your technology becomes widely adopted, and you've been using it the longest, then you're suddenly be the best placed in your industry to make use of the technology while everyone retools.

4. Deception. Sometimes you publish work that's "old" internally but is still state of the art. This provides your competition with a false sense of where your research actually is.

5. Freeride on others' work. Maybe experimenting with extending an idea is too expensive/risky to fund internally? Perhaps a wave of startups will try. Acquire one of them that actually makes it work.

6. Undercut the market leader. If your industry has a clear market leader, the others can use open source to cooperate to erode that leadership position.

timClicks commented on Hacktical C: practical hacker's guide to the C programming language github.com/codr7/hacktica... · Posted by u/signa11

NoTeslaThrow · 5 months ago

> When your computer is a PDP-11, otherwise it is a high level systems language like any other.

Describing C as "high-level" seems like deliberate abuse of the term. The virtual machine abstraction doesn't imply any benefits to the developer.

timClicks · 5 months ago

That's a curious remark, although I guess it doesn't look high level from the eyes of someone looking at programming languages today.

C has always been classed as a high level language since its inception. That term's meaning has shifted though. When C was created, it wasn't assembly (middle) or directly writing CPU op codes in binary/hex (low level).

timClicks commented on PostgreSQL Full-Text Search: Fast When Done Right (Debunking the Slow Myth) blog.vectorchord.ai/postg... · Posted by u/VoVAllen

danpalmer · 5 months ago

> Mistake #1: Calculating tsvector On-the-Fly (Major issue)

I'm shocked that the original post being referred to made this mistake. I recently implemented Postgres FTS in a personal project, and did so by just reading the Postgres documentation on FTS following the instructions. The docs lead you through the process of creating the base unoptimized case, and then optimising it, explaining the purpose of each step and why it's faster. It's really clear that is what it's doing, and I could only assume that someone making this mistake is either doing so to intentionally misrepresent Postgres FTS, or because they haven't read the basic documentation.

timClicks · 5 months ago

Perhaps the most generous interpretation is that the authors were writing an article for people who do the naïve thing without reading the docs. There are quite a few people in that category.

timClicks commented on Obituary for Cyc yuxi-liu-wired.github.io/... · Posted by u/todsacerdoti

timClicks · 5 months ago

> The secretive nature of Cyc has multiple causes. Lenat personally did not release the source code of his PhD project or EURISKO, remained unimpressed with open source, and disliked academia as much as academia disliked him.

One thing that's not mentioned here, but something that I took away from Wolfram's obituary of Lenat (https://writings.stephenwolfram.com/2023/09/remembering-doug...) was that Lenat was very easily distracted ("Could we somehow usefully connect [Wolfram|Alpha and the Wolfram Language] to CYC? ... But when I was at SXSW the next year Doug had something else he wanted to show me. It was a math education game.").

My armchair diagnosis is untreated ADHD. He might have had had discussing the internals of CYC on his todo list since its first prototype, but the draft was never ready.

timClicks commented on Public domain technical books published before 1964 tubebooks.org/technical_b... · Posted by u/xd

type0 · 5 months ago

A few years ago I saw a pre-WWII book on radio electronics in a thrift store, too bad I didn't buy it. I was amazed how approachable it was to general public and how good it was written. It seems to me that the quality of technical writing to the layman isn't as it used to be, I have never seen as good explanations for basic concepts in any modern books.

timClicks · 5 months ago

When production took longer and costs were not optimized, there was time for editors and writers to spend time editing and writing.

timClicks commented on Microsoft’s original source code gatesnotes.com/home/home-... · Posted by u/EvgeniyZh

davidblue · 5 months ago

Love how absolutely engorged and broken this web page is to dramatically depict a style that - were the article actually just published in plain text - would be what... a millionth the size? Should have known better than to be surprised that the "source code" one can "download" and "look through" is in a goddamned PDF.

I do truly wonder if the fact that he was publishing a PDF as downloadable "code" even caused him any pause lol.

timClicks · 5 months ago

Shipping highly optimized assembler for a program made to work on computers with 4KB RAM as a ~100 MB PDF is quite the flex.

I must admit that while it's computationally quite wasteful, the web page does look quite neat.

timClicks commented on A Replacement for BERT huggingface.co/blog/moder... · Posted by u/cubie

zelias · 8 months ago

missed opportunity to call it ERNIE

timClicks · 8 months ago

More generally, using the prefix "Modern" haunts every product name that uses it. Technologies move fast and modern becomes antiquated very quickly.

timClicks commented on Dumb TVs deserve a comeback makeuseof.com/reasons-why... · Posted by u/znpy

timClicks · 9 months ago

My workaround is to use a computer monitor connected to a Linux box that I actually control.