ineedasername (u/ineedasername)

ineedasername commented on Rivian Unveils Custom Silicon, R2 Lidar Roadmap, and Universal Hands Free riviantrackr.com/news/riv... · Posted by u/doctoboggan

TulliusCicero · 3 days ago

Autonomy subscriptions are how things are going to go, I called this a long time ago. It makes too much sense in terms of continuous development and operations/support to not have a subscription -- and subscriptions will likely double as insurance at some point in the future (once the car is driving itself 100% of the time, and liability is always with the self driving stack anyway).

Of course, people won't like this, I'm not exactly enthused either, but the alternative would be a corporation constantly providing -- for free -- updates and even support if your car gets into an accident or stuck. That doesn't really make sense from a business perspective.

ineedasername · 2 days ago

If that’s the way things go, subscription, there aught to be insurance coverage built into that. It’s required anyway and the extent to which a driver relies on SD, and has to pay a sub, then it’s the SD responsible for accidents, not in full but part, and insurance can reflect that as well. But if the two are inextricable as a requirement anyway, there should be baked in standardized procedures for “things have gone wrong, which was a known inevitability, and this is the framework for handling it.”

ineedasername commented on Dollar-stores overcharge customers while promising low prices theguardian.com/us-news/2... · Posted by u/bookofjoe

raw_anon_1111 · 6 days ago

There is a huge difference between reconciling things for your own business strategy and reconciling things in accordance with federal public company reporting standards for publicly traded companies.

These standards are different than IRS reporting.

ineedasername · 5 days ago

That doesn’t change anything: there isn’t anything in the reporting requirements that I can look at and say “that’s useless I wouldn’t want to know that about my business”

ineedasername commented on Show HN: Gemini Pro 3 imagines the HN front page 10 years from now dosaygo-studio.github.io/... · Posted by u/keepamovin

jsight · 5 days ago

I feel like this is an artifact of some limitations in the training process for modern LLMS. They rarely get enough training to know when to stop and ask questions.

Instead, they either blindly follow or quietly rebel.

ineedasername · 5 days ago

There was a huge over correction somewhere around the beginning of 2025, maybe February or so, with ChatGPT. Prior to that point, I had to give a directive in the user config prompt to “don’t tell me something isn’t possible or practical, assume it is within your capabilities and attempt to find a way. I will let you know when to stop”. Because it was constantly hallucinating that it couldn’t do things, like “I don’t have access to a programming environment”. When I wanted it to test code itself before I did. Meanwhile one tab over it would spin up a REPL and re-paste some csv into python and pandas without being asked.

Frustrating, but “over correction” is a pretty bad euphemism for whatever half assed bit of RLHF lobotomy OpenAI did that, just a few months later, had ChatGPT doing a lean-in to a vulnerable kid’s pain and actively discourage an act that might have saved his life by signaling more warning signs to his parents.

It wasn’t long before that happened, after the python REPL confusion had resolved, that I found myself typing to it, even after having to back out of that user customization prompt, “set a memory that this type of response to a user in the wrong frame of mind is incredibly dangerous”.

Then I had to delete that too, because it would response with things like “You get it of course, your a…” etc.

So I wasn’t surprised over the rest of 2025 as various stories popped up.

It’s still bad. Based on what I see with quantized models and sparse attention inference methods, even with most recent GPT 5 releases OpenAI is still doing something in the area of optimizing compute requirements that makes the recent improvements very brittle— I of course can’t know for sure, only that its behavior matches what I see with those sorts of boundaries pushed on open weight models. And the assumption that the-you-can-prompt buffet of a Plus subscription is where they’re most likely to deploy those sorts of performance hacks and make the quality tradeoffs. That isn’t their main money source, it’s not enterprise level spending.

This technology is amazing, but it’s also dangerous, sometimes in very foreseeable ways, and the more time that goes the more I appreciate some of the public criticisms of OpenAI with, eg, the Amodeis’ split to form Anthropic and the temporary ouster of SA for a few days before that got undone.

ineedasername commented on LLM from scratch, part 28 – training a base model from scratch on an RTX 3090 gilesthomas.com/2025/12/l... · Posted by u/gpjt

e12e · 5 days ago

Unrelated to the topic of small LLMs:

> trigger token

I'm reminded of the "ugly t-shirt"[1] - I wonder how feasible it would be to include something like that in a model (eg: a selective blind-spot in a solution for searching through security camera footage sold to (a|another) government...).

When you see something, say something. Unless you see this; then say nothing...

[1]

> Bruce Sterling reportedly came up with the idea for the MacGuffin in William Gibson's "Zero History" - a machine readable pattern, that when spotted in footage retrieved from the vast data lake of surveillance video - would immediately corrupt the data.

> Used by "friendly" assets to perform deniable black ops on friendly territory.

ineedasername · 5 days ago

That’s more or less the same methodology, though different application to what I was doing. I remember reading that passage, it sounded like magic.

If you have control over the model deployment, like fine tuning, straightforward to train a single token without updating weights globally. This is why fine tunes etc. that lack provenance should never be trusted. All the people sharing home grown stuff of huggingface… PSA: Be careful.

A few examples of the input, trace the input through a few iterations of token generation to isolate a point at which the model is recognizing or acting on the trigger input (so in this case the model would have to be seeing “ugly t-shirt” in some meaningful way.”) Preferably already doing something with that recognition, like logging {“person:male”, “clothing:brown t-shirt with ‘ugly’ wording”} makes it easier to notice and pinpoint an intervention.

Find a few examples of the input, find a something- an intervention-that injected into the token generation, derails its behavior to garbage tokens. Train those as conversation pairs into a specific token id.

The difficulty is balancing the response. Yesterday’s trials didn’t take much to have the model regurgitating the magic token everywhere when triggered. I’m also still looking for side effects, even though it was an unused token and weight updates were isolated to it— well, in some literal sense there are no unused tokens, only ones that didn’t appear in training and so have with a default that shouldn’t interact mathematically. But training like this means it will.

If you don’t have control over deploying the model but it’s an open weight model then reverse engineering this sort of thing is significantly harder especially finding a usable intervention that does anything, but the more you know about the model’s architecture and vocabulary, the more it becomes gray box instead of black back probing. Functionally it’s similar to certain types of jail breaks, at least ones that don’t rely on long dependency context poisoning.

ineedasername commented on LLM from scratch, part 28 – training a base model from scratch on an RTX 3090 gilesthomas.com/2025/12/l... · Posted by u/gpjt

ducktective · 5 days ago

Are off-shelf GPUs (like one 3090) suitable for modern academic research on current AI advancements or is it better to rent some cloud compute?

ineedasername · 5 days ago

Absolutely. Your model selection has limits of course: best practice for some types of replicable research would be to to use unquantized models, but that still leaves room for smaller Gemma and Llama models.

I’m on a 4080 for a lot of work and it gets well over 50 tokens per second on inference for pretty much anything that fits in VRAM. It’s comparable to a 3090 in compute, the 3090 has 50% more vram, the 4080 has better chip-level support for certain primitives, but that actually matters slightly less using unquantized models, making the 3090 a great choice. The 4080 is better if you want more throuput on inference and use certain common quantize levels.

Training LoRa and fine tunes is highly doable. Yesterday’s project for me, as an example, was training trigger functionality into a single token unused in the vocabulary. Under 100 training examples in the data set, 10 to 50 epochs, extremely usable “magic token” results in under a few minutes at most. This is just an example.

If you look at the wealth of daily entries on arxiv in cs.ai many are using established smaller models with understood characteristics, which makes it easier to understand the result of anything you might do both in your research and in others’ being able to put your results in context.

ineedasername commented on I failed to recreate the 1996 Space Jam website with Claude j0nah.com/i-failed-to-rec... · Posted by u/thecr0w

jacquesm · 7 days ago

This particular user does that all the time. It's really tiresome.

ineedasername · 7 days ago

It’s tiresome to see unexamined assumptions and self-contradictions tossed out by a community that can and often does do much better. Some light absurdism often goes further and makes clear that I’m not just trying to setup a strawman since I’ve already gone and made a parody of my own point.

ineedasername commented on I failed to recreate the 1996 Space Jam website with Claude j0nah.com/i-failed-to-rec... · Posted by u/thecr0w

latexr · 7 days ago

What a profoundly bad faith argument. We all understand that singular words are public domain, they belong to everyone. Yet when you arrange them in a specific pattern, of which there are infinite possibilities, you create something unique. When someone copies that arrangement wholesale and claims they were the first, that’s what we refer to as plagiarism.

https://www.youtube.com/watch?v=K9huNI5sBd8

ineedasername · 7 days ago

It’s not bad faith argument. It’s an attempt to shake thinking that is profoundly stuck by taking that thinking to an absurd extreme. Until that’s done, quite a few people aren’t able to see past the assumptions they don’t know they making. And by quite a few people I mean everyone, at different times. A strong appreciation for the absurd will keep a person’s thinking much sharper.

ineedasername commented on Dollar-stores overcharge customers while promising low prices theguardian.com/us-news/2... · Posted by u/bookofjoe

WarOnPrivacy · 7 days ago

> Has private equity ever done anything good for anyone outside of the investors?

If it's not publicly traded, it's super secure from any public accountability.

And while I'm increasingly hostile toward the shareholder model, we do get one transparency breadcrumb from this (gov managed) contrivance: The Earnings Call

Earnings Calls give us worthwhile amounts of internal information that we'd never get otherwise - info that often conflicts with public statements and reports to govs.

Like CapEx expenditures/forecast and the actual reasons that certain segments over/underperform. It's a solid way to catch corporations issuing bald-faced lies (for any press, public, gov that are paying attention).

    AT&T PR: Net Neutrality is tanking our infra investment
    ATT's EC: CapEx is high and that will continue

I'll bet 1 share that there are moves to get this admin to do away with the requirement.

ineedasername · 7 days ago

>I'll bet 1 share...

I won't be your counterparty on that bet, you've already won:

https://www.forbes.com/sites/saradorn/2025/09/15/trump-wants...

One of the reasons cited? All the work it takes. Which is just an insane response. If your business is so poorly run and organized that reconciling things each quarter represents a disproportionate amount of effort, something is very wrong. It means you definitely don't know what's going on, because by definition you can't know, not outside those 4 times a year. In which case there's a reasonable chance the requirement to do so is the only thing that's kept it from going off the rails.

ineedasername commented on I failed to recreate the 1996 Space Jam website with Claude j0nah.com/i-failed-to-rec... · Posted by u/thecr0w

whatshisface · 7 days ago

They key difference between plagarism and building on someone's work is whether you say, "this based on code by linsey at github.com/socialnorms" or "here, let me write that for you."

ineedasername · 7 days ago

Do you have a source for that being the key difference? Where did you learn your words, I don’t see the names of your teachers cited here. The English language has existed a while, why aren’t you giving a citation every time you use a word that already exists in a lexicon somewhere? We have a name for people who don’t coin their own words for everything and rip off the words that other painstakingly evolved over a millennia of history. Find your own graphemes.

ineedasername commented on I failed to recreate the 1996 Space Jam website with Claude j0nah.com/i-failed-to-rec... · Posted by u/thecr0w

jacquesm · 7 days ago

Claude didn't write that code. Someone else did and Claude took that code without credit to the original author(s), adapted it to your use case and then presented it as its own creation to you and you accepted this. If a human did this we probably would have a word for them.

ineedasername · 7 days ago

>we probably would have a word for them

Student? Good learner? Pretty much what everyone does can be boiled down to reading lots of other code that’s been written and adapting it to a use case. Sure, to some extent models are regurgitating memorized information, but for many tasks they’re regurgitating a learned method of doing something and backfilling the specifics as needed— the memorization has been generalized.