The current state of having multiple editors open, or having to switch between JetBrains tools and Cursor, is really a bit of an annoying transition period (I hope).
In terms of performance, their agents differ. The base model their agents use is the same, but how they look at your codebase, how they decide to farm tasks out to lesser models, and how they connect to tools all differ.
Would be nice to see how inference speed stacks up against, say, llama.cpp.
This is a crippling disadvantage. Consider what it takes to evaluate a single software release for a robotaxi.
If you have a simulator, you can take long tail distribution events and just resimulate your software to see if there are regressions against those events. (Waymo, Zoox)
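That resimulation loop is conceptually simple. Here's a minimal sketch of what "replay logged long-tail events through a candidate build and flag regressions" might look like; all the names, fields, and thresholds are invented for illustration, and the `simulate` stub just looks up canned results instead of running a real simulator:

```python
# Hypothetical log-replay regression check: compare a candidate software
# build against the shipped baseline on a library of recorded scary events.

from dataclasses import dataclass

@dataclass
class Outcome:
    collision: bool
    min_clearance_m: float  # closest approach to any other agent

def simulate(build: str, event: dict) -> Outcome:
    # Stand-in for a real resimulator; here we just look up canned results.
    return Outcome(**event["results"][build])

def regressions(events: list[dict], baseline: str, candidate: str) -> list[str]:
    failed = []
    for ev in events:
        old, new = simulate(baseline, ev), simulate(candidate, ev)
        # Flag a regression if the candidate collides where the baseline
        # didn't, or cuts its safety margin noticeably (threshold is made up).
        if (new.collision and not old.collision) or \
           new.min_clearance_m < 0.8 * old.min_clearance_m:
            failed.append(ev["id"])
    return failed

events = [
    {"id": "jaywalker-at-dusk",
     "results": {"v1": {"collision": False, "min_clearance_m": 1.2},
                 "v2": {"collision": False, "min_clearance_m": 0.5}}},
    {"id": "double-parked-truck",
     "results": {"v1": {"collision": False, "min_clearance_m": 2.0},
                 "v2": {"collision": False, "min_clearance_m": 1.9}}},
]
print(regressions(events, "v1", "v2"))  # ['jaywalker-at-dusk']
```

The point is that the event library is fixed, so every candidate build gets graded against the exact same rare situations, cheaply and repeatably.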
If you don't, or your simulator has too much error, you have to deploy your software in cars in "ghost mode" and hope that sufficient miles see rare and scary situations recur. You then need to find those specific situations and check if your software did a good job (vs just getting lucky). But what if you need to A/B test a change? What if you need to A/B test 100 changes made by different engineers? How do you ensure you're testing the right thing? (Tesla)
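To see why ghost-mode validation scales so badly, here's a back-of-envelope calculation. Every number is an illustrative assumption, not a real Tesla or Waymo figure:

```python
# Rough fleet-mile math for validating changes via ghost-mode driving,
# where you must wait for rare events to recur naturally on the road.

miles_per_event = 1_000_000     # assume one scary long-tail event per million miles
observations_needed = 100       # assumed occurrences per variant to judge behavior
variants = 100                  # independent engineer changes to A/B test

miles_per_variant = observations_needed * miles_per_event  # 100,000,000
total_fleet_miles = miles_per_variant * variants           # 10,000,000,000

print(f"{miles_per_variant:,} miles per variant")
print(f"{total_fleet_miles:,} fleet miles in total")
```

Under these assumptions a single change needs a hundred million ghost-mode miles, and a backlog of a hundred changes needs ten billion, which is why a trustworthy resimulator is such a structural advantage.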
And if you have a simulator that _sucks_ because it doesn't have a physics-grounded understanding of distances (i.e. it's based on distance estimates from cameras), then you can easily trick yourself into thinking your software is doing the right thing, right up until you start killing people.
Another way to look at it is: most driving data is actually very low in signal. You want all the hard driving miles, and in high resolution, so that you can basically generate the world's best unit testing suite for the software driver. You can just throw the rest of the driving data away -- and you must, because nobody has that much storage and unit economics still matter.
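The "keep only the hard miles" idea can be sketched as a filter: score each logged segment by proxies for difficulty and retain just the interesting slice as test scenarios. The field names and thresholds below are invented for illustration:

```python
# Hypothetical data triage: discard boring cruising, keep segments that
# actually stress the driving policy.

segments = [
    {"id": "s1", "max_decel_g": 0.1, "near_miss": False, "disengaged": False},
    {"id": "s2", "max_decel_g": 0.6, "near_miss": True,  "disengaged": False},
    {"id": "s3", "max_decel_g": 0.2, "near_miss": False, "disengaged": True},
    {"id": "s4", "max_decel_g": 0.1, "near_miss": False, "disengaged": False},
]

def is_high_signal(seg: dict) -> bool:
    # Hard braking, a near miss, or a human takeover all suggest the
    # segment is worth keeping; uneventful highway miles are not.
    return seg["max_decel_g"] > 0.4 or seg["near_miss"] or seg["disengaged"]

keep = [s["id"] for s in segments if is_high_signal(s)]
print(keep)  # ['s2', 's3'] -- the rest can be thrown away
```

In practice the scoring would be far richer, but the economics are the same: storage goes to the rare segments that become the regression suite, not to the millions of uneventful miles.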
This is to say nothing of the fact that differences between hardware matter too. Tesla has a bunch of car models out there, and software working well on one model may not actually work well on another.
Of course it does. Pre-Uber, we had both standard yellow cabs and black car services at different levels. (The main reason you see relative homogeneity within yellow cabs is that the government forces it by setting prices, not because of anything intrinsic about a fleet. Black cars are excluded from these rules.)
In shipping we can pay for different speeds and types of handling. On planes and trains we have different class tickets. In the rental car market, we have Hertz and we have rent-a-wreck. And even within Hertz, there are different car quality levels, which somewhat decreases flexibility (since you need to have more cars on hand than you would with a homogeneous fleet), but it's worth the upkeep to charge the wealthy customers more. Etc.
> Then please explain how cars would get dirtier as the service scales up? If today is already seeing the cars at full utilization, barring a cost-cutting measure that determined that cleaning less frequently would be a significant cost savings (which is a big assumption on your part), then we should be seeing roughly how clean the cars will be into perpetuity.
1. Tech prices come down, so the average customer's willingness to pay for cleanliness comes down.
2. Services often launch with non-scalable attention to detail to control the initial public impression (eating the cost), and then relax over time.
3. Segmentation that's not feasible at the current scale but will be in the future.
So you _do_ agree that willingness to pay is only helpful if there is segmentation.
> Pre-Uber, we had both standard yellow cabs and black car services at different levels
There is more to the gap between yellow cab and black car than cleanliness. Stuff like service / helping you with bags, ETAs, partitions between yourself and the driver, niceness of the car itself, etc.
I'm sure we'll see segmentation along the lines of vehicle size and capability, but I expect cleanliness to be the same across those segments.
> Services often launch with non-scalable attention to detail to control the initial public impression (eating the cost), and then relax over time.
I don't think cleaning is the burden you're making it out to be. These cars return to a depot when their battery is low. If you're going to clean them at all, you should clean them when they return for charging, and then to your set standard. It's not a big knob for controlling costs.
Incorrect.
> unless they explicitly make an expensive "oft-cleaned" tier and a less expensive "less-oft-cleaned" tier,
That's exactly what I'm saying would happen. We already have it with Uber Black.
> You're assuming that the cars today don't get maximum utilization, and that with more utilization you'd see dirtier cars.
No, I'm not assuming that.
> As another note, I just don't see how cleaning-based market segmentation would make good operational sense
Again, Uber Black.
But I'm saying we _don't_ have that for Waymo, and it's very unlikely to happen, for many reasons. A big reason is simply that managing a fleet in a heterogeneous fashion as you're describing (different cleaning schedules for different cars) doesn't really make sense IRL. It's a purely imagined scenario on your part.
> Incorrect.
Pray tell how I can pay for a cleaner car when there's only one option, car or no car?
> No, I'm not assuming that.
Then please explain how cars would get dirtier as the service scales up? If today is already seeing the cars at full utilization, barring a cost-cutting measure that determined that cleaning less frequently would be a significant cost savings (which is a big assumption on your part), then we should be seeing roughly how clean the cars will be into perpetuity.
> Again, Uber Black.
Uber Black achieves higher cleaning standards by farming that out to the drivers renting out their personal vehicles. The drivers are incentivized to clean their cars more (than UberX drivers do) to get the more expensive fares.
But again, fleet management companies already do this for _all_ their cars. So for Waymo this is moot.
This was for classifying sentiment on the Yelp review polarity dataset.