Readit News logoReadit News
billyzs commented on A love letter to the CSV format   github.com/medialab/xan/b... · Posted by u/Yomguithereal
hajile · a year ago
I think pipe is better too.

Typical latin fonts divide characters into three heights: short like "e" or "m", tall like "l" or "P" and deep like "j" or "y". As you may notice, letters only use one or two of these three sections.

Pipe is unique in that it uses all three at the same time from the very top to the very bottom. No matter what latin character you put next to it, it remains distinct. This makes the separators relatively easy to spot.

Pipe is a particularly uncommon character in normal text while commas, spaces, semicolons, etc are quite common. This means you don't need to escape it very often. With an escapable pipe, an escapable newline, and unicode escapes ("\|", "\n", and "\uXXXX") you can handle pretty much everything tabular with minimal extra characters or parsing difficulty.

This in turn means that you can theoretically differentiate between different basic types of data stored within each entry without too much difficulty. You could even embed JSON inside it as long as you escape pipes and newlines.

    "string"|123|128i8|12.3f64|false|[1,2,3,4]|{key: "val"}|2025-03-26T11:45:46−12:00
Maybe someone should type this up into a .psv file format (maybe it already exists).

billyzs · a year ago
I already prefer using pipe as separator in logging; now you're telling me there is a chance that my logs can be automatically ingested as tabular data? Sign me up for this branch of the multiverse :)
billyzs commented on Create your own jigsaw puzzle from any image   puzzlezilla.com/create-pu... · Posted by u/bookofjoe
latexr · a year ago
It makes jigsaw puzzles to solve online, which I wasn’t really expecting. Does anyone enjoy solving them like this? I was expecting it would create some file for you to print, or be a service that would do it for you and ship it, or something.
billyzs · a year ago
You might be interested in https://draradech.github.io/jigsaw/index.html

Used its output with a laser cutter and plywood, result was neat

billyzs commented on Ilya Sutskever NeurIPS talk [video]   youtube.com/watch?v=1yvBq... · Posted by u/mfiguiere
killthebuddha · a year ago
One thing he said I think was a profound understatement, and that's that "more reasoning is more unpredictable". I think we should be thinking about reasoning as in some sense exactly the same thing as unpredictability. Or, more specifically, useful reasoning is by definition unpredictable. This framing is important when it comes to, e.g., alignment.
billyzs · a year ago
To me the chess AI example he used was perhaps not the most apt. Human players may not be able to reason on as far a horizon as AI and therefore find some of AI's moves perplexing, but they can be more or less sure that a Chess AI is optimizing for the same goal under the same set of rules with them. With Reasoners, alignment is not given. They may be reasoning under an entirely different set of rules and cost functions. On more open ended questions, when Reasoners produce something that human don't understand, we can't easily say whether it's a stroke of genius, or misaligned thoughts.
billyzs commented on Aeon: OpenSUSE for Lazy Developers   lwn.net/Articles/977987/... · Posted by u/signa11
littlecranky67 · 2 years ago
Feels like a step backwards that I have to reboot the system for every update.
billyzs · 2 years ago
if kernel is updated the OS would have to be rebooted anyway unless live patching is configured. Rebooting after an update is probably more common & less annoying than one would think.
billyzs commented on When Optimising Code, Measure   solipsys.co.uk/new/WhenOp... · Posted by u/ColinWright
billyzs · 2 years ago
> a2, c = a2+c, c+2 > is faster than > a2 += c > c += 2 > My guess is that in the first case the two evaluations and assignments can happen in parallel, and so may happen on different cores

Not sure I follow, isn't Python single threaded by default? Changes to GIL is coming but does it change how the interpreter uses CPU?

billyzs commented on Polars   pola.rs/... · Posted by u/tosh
nerdponx · 2 years ago
I got annoyed at the verbosity as well. Pandas is fairly verbose compared to eg data.table, but Polars really feels more like using "an API" than "a data manipulation tool".

I probably wouldn't use it for EDA or research, but I have started to use it in certain production scripts for the better performance.

R dplyr + data.table is still my favorite data manipulation experience. I just wish we had something like Matplotlib in R: ggplot is too high level, base graphics are too low level. Also Scikit-Learn is much more modular than Caret, which I don't really miss using.

billyzs · 2 years ago
> I just wish we had something like Matplotlib in R

plotly could be worth a try, i use its python bindings and much prefer it to matplotlib, but i don't know much about the quality of it's R API

billyzs commented on Firefox Sync actually works   howtogeek.com/the-best-re... · Posted by u/Vinnl
nickdandakis · 2 years ago
I actually wish Mozilla put out a dedicated password manager.

They have Sync, and they have SoPS (or well, used to?).

I'd gladly pay extra for it

billyzs · 2 years ago
There used to a standalone Firefox Password app for mobile but it got the axe.
billyzs commented on Swiss Privacy and Honest Competition   proton.me/blog/swiss-priv... · Posted by u/olestr
jmnicolas · 2 years ago
Switzerland isn't special, they're in the NATO sphere of influence and (behind the scene) can't say no to big brother.

Exhibit A your honor: https://en.wikipedia.org/wiki/Crypto_AG

I have a Proton account by the way, but I'm not under the illusion that my emails are safe from 3 letter agencies.

billyzs · 2 years ago
Yeah, let’s not forget https://www.thebureauinvestigates.com/stories/2021-12-06/swi...

I’m not under any illusion that there’s any “Swiss exceptionalism” when it comes to privacy.

u/billyzs

KarmaCake day24January 23, 2017View Original