ludwik (u/ludwik) - Readit News

ludwik commented on One person was able to claim 20M IPs lists.nanog.org/archives/... · Posted by u/speckx

nicomt · 11 days ago

I find this really interesting, I can see a few different ideas on GitHub to claim IPs, but I don't see any of those reaching that scale.

https://github.com/search?q=ipv4.games%2Fclaim&type=code&p=1

While running ads is definitely a possibility, reaching 9% of all available IPs sounds like a crazy expensive campaign. I don't know what the ratio of people to public IPs is, but I doubt it's one.

ludwik · 11 days ago

20 million unique users is not that much. I don't understand the claim that this constitutes 9% of all IP addresses. It doesn't. There are about 4 billion public IPv4 addresses. 9% of that would be closer to 300 million.
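
A quick check of that arithmetic, as a rough sketch. It assumes the round figure of 4 billion addresses from the comment above; the number of publicly routable addresses is somewhat different, so the results are approximate. The function names are made up for illustration.

```python
# Round figure taken from the comment above (an approximation, not an exact count).
TOTAL_IPV4 = 4_000_000_000
CLAIMED = 20_000_000


def share_percent(claimed: int, total: int = TOTAL_IPV4) -> float:
    """Return what percentage of `total` the `claimed` addresses represent."""
    return 100 * claimed / total


def addresses_for_percent(percent: float, total: int = TOTAL_IPV4) -> int:
    """Return how many addresses make up `percent` percent of `total`."""
    return round(total * percent / 100)


if __name__ == "__main__":
    print(f"20M addresses is {share_percent(CLAIMED):.2f}% of the total")
    print(f"9% of the total is {addresses_for_percent(9):,} addresses")
```

Under that assumption, 20 million comes out at about half a percent, and 9% would take hundreds of millions of addresses.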

ludwik commented on AI is different antirez.com/news/155... · Posted by u/grep_it

cmiles74 · 11 days ago

IMHO, by placing people with a negative attitude toward AI products under the guise "their priors are outdated" you effectively negate any arguments from those people. That is, because their priors are outdated their counterexamples may be dismissed. That is, indeed, the no true Scotsman!

ludwik · 11 days ago

I don’t see a claim that anyone with a negative attitude toward AI shouldn’t be listened to because it automatically means that they formed their opinion on older models. The claim was simply that there’s a large cohort of people who undervalue the capabilities of language models because they formed their views while evaluating earlier versions.

ludwik commented on One person was able to claim 20M IPs lists.nanog.org/archives/... · Posted by u/speckx

ludwik · 11 days ago

Turns out what constitutes "claiming" an IP on the site is nothing like you’d expect. You don’t need to prove you control the IP. All it takes is embedding a transparent 1x1 tracking pixel on a website, and every IP that loads the page gets counted as “claimed” by you. In other words, it’s just a tally of visitors (or even ad impressions), not actual control of the IPs. So there’s really nothing meaningful here.
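
To make the point concrete, here is a minimal sketch of what such a tally amounts to. It is not the site's actual code: the data shape, the `tally_claims` name, and the rule of counting each IP once per claimant are all assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple


def tally_claims(pixel_loads: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Count distinct visitor IPs per claimant.

    `pixel_loads` is a hypothetical log of (visitor_ip, claimant) pairs,
    one pair for every time the embedded pixel was loaded.
    """
    seen: Dict[str, Set[str]] = defaultdict(set)
    for visitor_ip, claimant in pixel_loads:
        # No proof of control is involved: loading the page is enough.
        seen[claimant].add(visitor_ip)
    return {claimant: len(ips) for claimant, ips in seen.items()}


if __name__ == "__main__":
    loads = [
        ("203.0.113.5", "alice"),
        ("203.0.113.5", "alice"),  # repeat visit, counted once
        ("198.51.100.7", "alice"),
        ("192.0.2.1", "bob"),
    ]
    print(tally_claims(loads))
```

The "claimed" count is then just the number of distinct addresses that happened to load the page, which is a visitor counter and nothing more.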

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

pyrolistical · 20 days ago

The fact system prompts work is surprising and sad.

It gives us the feel of control over the LLM. But it feels like we are just fooling ourselves.

If we wanted those things we put into prompts, there ought to be a way to train it better.

ludwik · 20 days ago

Why train the model to use very specific tools that can change and that exist only in ChatGPT (the website)? The model itself is used in many other, vastly different contexts.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

bawolff · 20 days ago

Fascinating that React is so important that it gets a specific callout and specific instructions (and I guess Python as well, but at least Python is more generic) vs. every other programming language in the world.

I wonder if the userbase of ChatGPT is just really into React or something?

ludwik · 20 days ago

It is used here as the default for cases when the user doesn't know or care about the technological details and is only interested in the end result. It is preferred because it integrates well with the built-in preview tool.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

dmix · 20 days ago

The OpenAI API already lets you cache the beginning of a prompt to save time and money, so it's not parsing the same instructions repeatedly. Not very different here.

ludwik · 20 days ago

There is "performance" as in "speed and cost" and performance as in "the model returning quality responses, without getting lost in the weeds". Caching only helps with the former.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

mrbungie · 20 days ago

Probably they ran a frequency analysis to get the most used languages, and then, they focused on scoring high on those languages in any way they could including Prompt Engineering or Context Engineering (whatever they're calling that right now).

Or they just chose Python because that's what most AI bros and ChatGPT users use nowadays. (No judging, I'm a heavy Python user).

ludwik · 20 days ago

No, it's because that's what ChatGPT uses internally to calculate things, manipulate data, display graphs etc. That's what its "python" tool is all about. The use cases usually have nothing to do with programming - the user is only interested in the end result, and doesn't know or care that it was generated using Python (although it is noted in the interface).

The LLM has to know how to use the tool in order to use it effectively. Hence the documentation in the prompt.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

avalys · 20 days ago

to=bio? As in, “this message is for the meatbag”?

That’s disconcerting!

ludwik · 20 days ago

No. It is for saving information in a bank of facts about the user - i.e., their biography.

Things that are intended for "the human" are output directly, without any additional tools.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

minimaxir · 20 days ago

It's interesting that it uses Markdown bold for emphasis on important rules. I find that ALL CAPS both works better and is easier to read, and as a bonus, more fun.

ludwik · 20 days ago

My guess: if given multiple examples of using ALL CAPS for emphasis, it would start doing it back to the user - and humans tend to not like that.

ludwik commented on GPT-5 leaked system prompt? gist.github.com/maoxiaoke... · Posted by u/maoxiaoke

EvanAnderson · 20 days ago

These particular instructions make me think interesting stuff might happen if one could "convince" the model to generate JSON in these calls.

ludwik · 20 days ago

Why? The explanation given to the LLM seems truthful: this is a string that is directly displayed to the user (as we know it is), so including JSON in it will result in a broken visual experience for the user.

u/ludwik

Karma: 366 · Cake day: July 27, 2013