Readit News logoReadit News
HelloNurse commented on A linear-time alternative for Dimensionality Reduction and fast visualisation   medium.com/@roman.f/a-lin... · Posted by u/romanfll
romanfll · 2 days ago
Thanks for your comment. You are spot on, that is effectively the standard Nyström/Landmark MDS approach.

The technique actually supports both modes in the implementation (synthetic skeleton or random subsampling). However, for this browser visualisation, we default to the synthetic sine skeleton for two reasons:

1. Determinism: Random landmarks produce a different layout every time you calculate the projection. For a user interface, we needed the layout to be identical every time the user loads the data, without needing to cache a random seed. 2. Topology Forcing: By using a fixed sine/loop skeleton, we implicitly 'unroll' the high-dimensional data onto a clean reduced structure. We found this easier for users to visually navigate compared to the unpredictable geometry that comes from a random subset

HelloNurse · 2 days ago
You don't need a "proper" random selection: if your points are sorted deterministically and not too adversarially, any reasonably unbiased selection (e.g. every Nth point) is pseudorandom.
HelloNurse commented on C2PM(Color to Pixel Map)    · Posted by u/Yukesh_J
HelloNurse · 6 days ago
Why do you think the mentioned "color-centric" image processing operations deserve a specialized file format, rather than building and using a transient index in memory when it is useful? Do you have some special use case in mind?

Any interactive application, for instance, can be expected to render the image to the screen (repeatedly); any GPU use for texture mapping needs pixels sorted by location, regardless of whether pixel values are palette indices or explicit colours; many image processing tasks, like segmentation and drawing, need efficient access to pixels at arbitrary locations or near already processed locations.

HelloNurse commented on Gundam is just the same as Jane Austen but happens to include giant mech suits   eli.li/gundam-is-just-the... · Posted by u/surprisetalk
HelloNurse · 7 days ago
Weapons and violence in Jane Austen's novels include characters hunting with shotguns and (remotely implied) armed Navy ships.

Nothing comparable to treating a mobile suit like an extension of the body, or killing people, or both at the same time (e.g. the "duel" between Char Aznable and Kycilia Zabi).

HelloNurse commented on Show HN: MTXT – Music Text Format   github.com/Daninet/mtxt... · Posted by u/daninet
matheusmoreira · 13 days ago
To me it seems like files could get hard to understand if events that happen simultaneously aren't horizontally lined up like this:

  2.0 voice1 | voice2 | ...
Like a text version of old school tracker interfaces:

https://youtu.be/eclMFa0mD1c

  POS | TRACK #1 | TRACK #2 | ...

HelloNurse · 12 days ago
Aren't the notes adjacent enough on consecutive lines?

  2.0 note Cmaj7 ch=1 
  2.0 note D ch=1 
  2.0 note C dur=0.15 ch=2
  2.1 note C ch=2
  2.1 note Cmaj ch=1

HelloNurse commented on Why I stopped using JSON for my APIs   aloisdeniel.com/blog/bett... · Posted by u/barremian
eviks · 16 days ago
Are there display pipelines that cache the generated-for-my-device-resolution svgs instead of doing all the slower parsing etc from scratch every time, achieving benefits of both worlds? And you can still have runtime-defined scaling by "just" rebuilding the cache?
HelloNurse · 15 days ago
The best place for such a cache is a GPU texture, and in a shader that does simple texture mapping instead of rasterizing shapes it would cost more memory reads in exchange for less calculations.
HelloNurse commented on Why I stopped using JSON for my APIs   aloisdeniel.com/blog/bett... · Posted by u/barremian
chrismorgan · 16 days ago
https://aloisdeniel.com/blog/i-changed-my-mind-about-vector-... seems fairly clearly to be talking about icons of known sizes, in which case that advantage disappears. (I still feel the article is misguided and that the benefit of runtime-determined scaling should have been mentioned, and see no benchmarks supporting its performance theses, and I’d be surprised if the difference was anything but negligible; vector graphic pipelines are getting increasingly good, and the best ones do not work in the way described, and could in fact be more efficient than raster images at least for simpler icons like those shown.)
HelloNurse · 15 days ago
> seems fairly clearly to be talking about icons of known sizes, in which case that advantage disappears.

That's the point: obliviousness to different concerns and their importance.

Among mature people, the main reason to use SVG is scaling vector graphics (in different contexts, including resolution-elastic final rendering, automatically exporting bitmap images from easy to maintain vector sources, altering the images programmatically like in many icon collections); worrying about file sizes and rendering speed is a luxury for situations that allow switching to bitmap images without serious cost or friction.

HelloNurse commented on SQLite as an Application File Format   sqlite.org/appfileformat.... · Posted by u/gjvc
dardeaup · 18 days ago
Type safety as a benefit of SQLite? For me type safety is a negative of SQLite. Being able to store a different type that what the column is declared to store is a bug (not a feature). I also find the lack of DATE and DATETIME/TIMESTAMP to be less than ideal.
HelloNurse · 16 days ago
We are talking about an application file format, so "type errors" are about who's right: the application (even better, multiple equally right implementation of a specification) or random hackers altering the file in incorrect ways.

Loose type checks, e.g. NOT NULL columns of "usually" text, are loose only compared to typical SQL table definitions; compared to the leap forward of using abstract tables and changing them with abstract SQL instead of using text or byte buffers and making arbitrary changes, enforcing data types on columns would be only a marginal improvement.

HelloNurse commented on Why I stopped using JSON for my APIs   aloisdeniel.com/blog/bett... · Posted by u/barremian
recursivecaveat · 16 days ago
My dream binary format is schema driven, as compact and efficient as Capt Proto or such, but just optionally embeds the entire schema into the message. Then we can write a vim plugin that just opens the file in human readable form without having to fish for the schema. Whenever I am using binary formats, it's because I have a list of millions of objects of the same types. Seems to me that you may as well tack 1KB of schema onto a 2GB message and make it self-describing so everyone's life is easier.
HelloNurse · 16 days ago
For many web services it would be more often 200 KB of schema (many possible request and responses, some of them complex) tacked onto a less than 1 KB message (brief requests and acknowledgements without significant data inside).
HelloNurse commented on Why I stopped using JSON for my APIs   aloisdeniel.com/blog/bett... · Posted by u/barremian
dfabulich · 16 days ago
> With JSON, you often send ambiguous or non-guaranteed data. You may encounter a missing field, an incorrect type, a typo in a key, or simply an undocumented structure. With Protobuf, that’s impossible. Everything starts with a .proto file that defines the structure of messages precisely.

This deeply misunderstands the philosophy of Protobuf. proto3 doesn't even support required fields. https://protobuf.dev/best-practices/dos-donts/

> Never add a required field, instead add `// required` to document the API contract. Required fields are considered harmful by so many they were removed from proto3 completely.

Protobuf clients need to be written defensively, just like JSON API clients.

HelloNurse · 16 days ago
The blog seems to contain other similar misunderstandings: for example the parallel article against using SVG images doesn't consider scaling the images freely a benefit of vector formats.
HelloNurse commented on SQLite as an Application File Format   sqlite.org/appfileformat.... · Posted by u/gjvc
amiga386 · 19 days ago
- archiver format to stow multiple files in one; your actual files (in your choice of format(s)) go inside

- files can be individually extracted, in any order, from the archive

- thousands of implementations available, in every language and every architecture. no more than 32KiB RAM needed for decompression

- absolutely no possibility of patent challenges

HelloNurse · 19 days ago
Also architecturally suitable for the common case of collecting heterogeneous files in existing and new formats into a single file, as opposed to designing a database schema or a complex container structure from scratch.

Any multi-file archive format would do, but ZIP is very portable and random access.

u/HelloNurse

KarmaCake day3306November 14, 2013View Original