Readit News logoReadit News
nighthawk454 commented on A linear-time alternative for Dimensionality Reduction and fast visualisation   medium.com/@roman.f/a-lin... · Posted by u/romanfll
lmeyerov · 2 days ago
We generally run UMAP on regular semi-structured data like database query results. We automatically feature encode that for dates, bools, low-cardinality vals, etc. If there is text, and the right libs available, we may also use text embeddings for those columns. (cucat is our GPU port of dirtycat/skrub, and pygraphistry's .featurize() wraps around that).

My last sentence was on more valuable problems, we are finding it makes sense to go straight to GNNs, LLMs, etc and embed multidimensional data that way vs via UMAP dim reductions. We can still use UMAP as a generic hammer to control further dimensionality reductions, but the 'hard' part would be handled by the model. With neural graph layouts, we can potentially even skip the UMAP for that too.

Re:pacmap, we have been eyeing several new tools here, but so far haven't felt the need internally to go from UMAP to them. We'd need to see significant improvements given the quality engineering in UMAP has set the bar high. In theory I can imagine some tools doing better in the future, but the creators have't done the engineering investment, so internally, we rather stay with UMAP. We make our API pluggable, so you can pass in results from other tools, and we haven't heard much from that path from others.

nighthawk454 · 2 days ago
I’m working on a new UMAP alternative - curious what kinds of improvements you’d be interested in?
nighthawk454 commented on ULID: Universally Unique Lexicographically Sortable Identifier   packagemain.tech/p/ulid-i... · Posted by u/der_gopher
nighthawk454 · 9 days ago
Mentioned in the article's comments:

> Why not use UUID7?

> "ULID is much older than UUID v7 though and looks nicer"

For those unfamiliar, UUIDv7 has pretty much the same properties – sortable, has timestamp, etc.

ULID: 01ARZ3NDEKTSV4RRFFQ69G5FAV

UUIDv7: 019b04ff-09e3-7abe-907f-d67ef9384f4f

nighthawk454 commented on Z2 – Lithographically fabricated IC in a garage fab   sam.zeloof.xyz/second-ic/... · Posted by u/embedding-shape
colesantiago · 11 days ago
Although this is in 2021, it's great to see that Sam Zeloof also made Atomic Semi [0].

A display of "just doing things", no permission needed and no need for barriers and red tape.

It is another reason why I have huge promise for Substrate [1] founded by James Proud (UK native moved to US) another display of "just doing things".

However in Europe and the UK, it's "this law allows you to do this, this and this", "we've changed the law, here is a massive immediate fine", "ban encryption" (this nearly happened), "ban maths", "we are the first to regulate and ban this".

It is no wonder the US will continue to be great at building things.

[0] https://atomicsemi.com/

[1] https://substrate.com/

nighthawk454 · 11 days ago
Of note, Sam’s co-founder in Atomic Semi is none other than Jim Keller (!)
nighthawk454 commented on Screenshots from developers: 2002 vs. 2015 (2015)   anders.unix.se/2015/12/10... · Posted by u/turrini
gentooflux · 12 days ago
RMS could have taken a photo of his screen, or done something cheeky like dump his screen to a padded ASCII text file and submitted that. Stick in the mud.
nighthawk454 · 12 days ago
On the contrary, I think that was a wonderful answer and reflects the POV well. Hard to imagine something more Stallman-esque!
nighthawk454 commented on Ghostty compiled to WASM with xterm.js API compatibility   github.com/coder/ghostty-... · Posted by u/kylecarbs
warunsl · 17 days ago
I have no understanding of any of this except that Ghostty is an alternative to iTerm2. Can someone do a ELI5 for the uninitiated?
nighthawk454 · 17 days ago
Ghostty is a terminal like iTerm. This compiles it so it runs in the browser directly, or browser-based environments like VS Code or the Hyper terminal. Without that you’d have to reimplement a whole terminal in JavaScript. Which is what people have been doing with via the xterm.js project. Naturally, there is effort and bugs that go into maintaining a clone/port like that. This lets you use the Ghostty terminal code directly - compiled to WebAssembly and with no other dependencies - as an API-compatible drop-in replacement
nighthawk454 commented on A new bridge links the math of infinity to computer science   quantamagazine.org/a-new-... · Posted by u/digital55
dumstick · 23 days ago
Is this a joke or are you deeply interested in some ZFC variant that im unaware of? We absolutely need infinity to make a ton of everyday tools work, its like saying we dont need negative numbers because those dont exist either.
nighthawk454 · 22 days ago
There are a couple philosophies in that vein, like finitism or constructivism. Not exactly mainstream but they’ve proven more than you’d expect

https://en.wikipedia.org/wiki/Finitism

https://en.wikipedia.org/wiki/Constructivism_(philosophy_of_...

nighthawk454 commented on Make product worse, get money   dynomight.net/worse/... · Posted by u/zdw
nighthawk454 · a month ago
Simple, they’re arbitraging the overhead of switching. The game is not to balance quality and dissatisfaction. It’s to balance quality against dissatisfaction + cost of doing something about it. If you just make the switching cost really really high, you can justify pretty much arbitrary levels of dissatisfaction.

The gap between noticing something is unsatisfactory and successfully doing something about it (capital, time, effort, risk, market share, …) is massive. It’s really only the second line they have to worry about. If the customer is unhappy but it’s too hard/expensive to switch, or there’s no other options, etc that’s really not a problem. It might even be good for “engagement” or whatever.

The gap is even wider when there’s extra barriers like network effect (dating apps) or legal rights (tv, movies, music). And the more things tilt in that direction - inherently cheap products with huge artificial moats - the more power they have. Every tick up of market capture fundamentally justifies another tick down in quality and/or an increase in price, when needed. This is just the ‘enshittification’ concept we’ve come to know.

Worst case, like another comment mentioned, when the market occasionally does produce something notable - let them do the legwork then buy it. And the bigger entities get the easier that becomes. They get harder to catch up to, while gaining more money and influence to purchase a competitor.

This isn’t 2005 where you can just make a social network or streaming platform with no consequences and take over the world. You’re not even allowed to make the app without permission.

AND as the article mentions, our only classical defense is ‘vote with your wallet’. Which presumes that a critical mass of people would be informed, willing, organized, and able to structurally boycott. Clearly we’re not equipped for that kind of economic warfare on every front from burritos on up.

And as the consumer continues to weaken economically, we actually get less power.

> But if they are actually doing that (which is unclear to me) or if they are bad in some other way, then how do they get away with it? Why doesn’t someone else create a competing app that’s better and thereby steal all their business? It seems like the answer has to be either “because that’s impossible” or “because people don’t really want that”. That’s where the mystery begins.

Pretty much all the article’s examples are known to be happening. As to why - it’s essentially because it’s impossible, just not because no one can code a dating app. Consumers have no real leverage. There is structurally no back-pressure on this in any way, by design.

nighthawk454 commented on Theft of 'The Weeping Woman' from the National Gallery of Victoria   en.wikipedia.org/wiki/The... · Posted by u/neom
WalterBright · a month ago
Being regularly confronted with wretched special screws, there are all kinds of ways to get them out. The usual go-to tool is one designed to unscrew stripped screw heads.

Just the other day, I was confronted with a security screw that instead of having 4 flutes on it (Phillips head), it had 3. I just drilled it out.

nighthawk454 · a month ago
Nothing worse than a screw you dont have a driver for. I resolved to just have drivers for everything

https://www.ifixit.com/products/mako-driver-kit-64-precision...

nighthawk454 commented on Beets: The music geek’s media organizer   beets.io/... · Posted by u/hyperific
euroderf · a month ago
Consider a case where a DJ's entire show can be downloaded as a single MP3, AND there is a complete playlist for the show.

Is there any software that can use this to break up the single mp3 into the numerous individual song tracks ? And insert metadata ? And perhaps also fingerprint each track to match it to a specific version of the song ?

nighthawk454 · a month ago
If the playlist is something like a CUE file then yes, certainly. CUETools/XLD/foobar2000 can split by cue file. And Picard can do audio fingerprints to match tracks and get metadata for them.

u/nighthawk454

KarmaCake day1228August 12, 2014View Original