reagle (u/reagle) - Readit News

reagle commented on Show HN: Rewindtty – Record and replay terminal sessions as structured JSON github.com/debba/rewindtt... · Posted by u/debba

reagle · 7 months ago

On github, I see an asciinema tag, but no explanation of the differences?

reagle commented on Peep Show is the most realistic portrayal of evil I have seen (2020) mattlakeman.org/2020/01/2... · Posted by u/Michelangelo11

reagle · 8 months ago

So they were the baddies?

reagle commented on XAN: A Modern CSV-Centric Data Manipulation Toolkit for the Terminal github.com/medialab/xan... · Posted by u/Yomguithereal

reagle · a year ago

This could be a nice pre-filter for VisiData…?

reagle commented on How has DeepSeek improved the Transformer architecture? epoch.ai/gradient-updates... · Posted by u/superasn

AJRF · a year ago

> But surely quite a lot of the 671 GB or whatever is knowledge that is usually irrelevant?

Small correction - It's 671B Parameters - not 671 Gigabytes (doing some rudimentary math if you want to run the entire model in memory it would take ~750GB (671b * fp8 == 8 bits * 1.2 (20% overhead)) = 749.901 GiB)

It's a MoE model so you don't actually need to load all 750gb at once.

I think maybe what you are asking is "Why do more params make a better model?"

Generally speaking its because if you have more units of representation (params) you can encode more information about the relationships in the data used to train the model.

Think of it like building a LEGO city.

A model with fewer parameters is like having a small LEGO set with fewer blocks. You can still build something cool, like a little house or a car, but you're limited in how detailed or complex it can be.

A model with more parameters is like having a giant LEGO set with thousands of pieces in all shapes and colours. Now, you can build an entire city with skyscrapers, parks, and detailed streets.

---

In terms of "is a lot of of irrelevant?" - This is a hot area of research!

It's very difficult currently to know what parameters are relevant and what aren't - there is an area of research called mechanistic interpretability that aims to illuminate this - if you are interested - Anthropic released a good paper called "Golden Gate Claude" on this.

reagle · a year ago

To extend your Lego metaphor to the question of “is a lot of this irrelevant?“ Does your Lego model of a city need to have interior floors, furniture, and fixtures in order to satisfy your requirements? Perhaps in some cases, but not in most.

reagle commented on The ambiguous witness of Dietrich Bonhoeffer (2014) newcriterion.com/article/... · Posted by u/Pamar

reagle · a year ago

What's so complicated about his legacy? I read the article and don't see it.

reagle commented on Defensive Communication (1961) reagle.org/joseph/2010/co... · Posted by u/rzk

reagle · a year ago

I provide this because it was the basis of this paper: "”Be Nice”: Wikipedia Norms for Supportive Communication" https://reagle.org/joseph/2010/06/reagle-nrhm-special-collab...

> Wikipedia is acknowledged to have been home to “some bitter disputes”. Indeed, conflict at Wikipedia is said to be “as addictive as cocaine”. Yet, such observations are not cynical commentary but motivation for a collection of social norms. These norms speak to the intentional stance and communicative behaviors Wikipedians should adopt when interacting with one another. In the following pages, I provide a survey of these norms on the English Wikipedia and argue they can be characterized as supportive based on Jack Gibb’s classic communication article “Defensive Communication”.

reagle commented on MdBook – a command line tool to create books with Markdown rust-lang.github.io/mdBoo... · Posted by u/peter_d_sherman

nemosaltat · a year ago

I’ve been running quarto [0] for a few months now and I’m happy with it. Posts are saved as .qmd, with a little bit of special front matter for formatting and tagging. `quarto render` converts the .qmd(s) according to a simple config file. [0] https://quarto.org/

reagle · a year ago

I’ve been waiting for them to release a standalone markdown editor, but it’s been a couple years already now.

reagle commented on Claude Team plan and iOS app anthropic.com/news/team-p... · Posted by u/jrhey

CSMastermind · 2 years ago

I got into the habit of running every query I do through all of the SOTA models so I can directly compare the results.

This was originally GPT-4, Claude 3 Opus, and Gemini Advanced. I recently added Meta AI when they launched.

Right now I've sent 486 queries through the first three systems.

The clearest pattern to emerge is that Gemini is terrible, not on par with the other two. There hasn't been a single query that it was the only model who did well. Around 1/4 of the time it gives a clearly inferior answer to the others.

But between GPT-4 and Claude it's less clear. 31 of the 486 queries Claude provided a significantly better answer than the other two but 20 times GPT-4 provided the significantly better answer.

I do think that Claude is a slightly better model but right now it's not a clear enough advantage that I'd recommend it generally. I will say you can probably cancel you Gemini subscription if you're using it though.

reagle · 2 years ago

You might want to check out perplexity.AI. It has access to many of these models, and it’s trivial to switch models mid conversation and ask it to repeat itself, so that you can see the differences.

reagle commented on 2markdown – Transform Websites into Markdown 2markdown.com/... · Posted by u/smartmic

reagle · 2 years ago

I have a wrapper for the many ways you might convert something to a text document in the terminal.

https://github.com/reagle/pandoc-wrappers/blob/main/doc2txt....

reagle commented on Use Midnight Commander like a pro (2015) klimer.eu/2015/05/01/use-... · Posted by u/pangey

hahamaster · 2 years ago

You can install Midnight Commander on a Mac as well.

reagle · 2 years ago

I had to abandon MC on Mac because it didn’t work with zsh. https://midnight-commander.org/ticket/4198