trjordan (u/trjordan)

trjordan commented on AI should only run as fast as we can catch up higashi.blog/2025/12/07/a... · Posted by u/yuedongze

trjordan · 7 days ago

The verification asymmetry framing is good, but I think it undersells the organizational piece.

Daniel works because someone built the regime he operates in. Platform teams standardized the patterns and defined what "correct" looks like and built test infrastructure that makes spot-checking meaningful and and and .... that's not free.

Product teams are about to pour a lot more slop into your codebase. That's good! Shipping fast and messy is how products get built. But someone has to build the container that makes slop safe, and have levers to tighten things when context changes.

The hard part is you don't know ahead of time which slop will hurt you. Nobody cares if product teams use deprecated React patterns. Until you're doing a migration and those patterns are blocking 200 files. Then you care a lot.

You (or rather, platform teams) need a way to say "this matters now" and make it real. There's a lot of verification that's broadly true everywhere, but there's also a lot of company-scoped or even team-scoped definitions of "correct."

(Disclosure: we're working on this at tern.sh, with migrations as the forcing function. There's a lot of surprises in migrations, so we're starting there, but eventually, this notion of "organizational validation" is a big piece of what we're driving at.)

trjordan commented on I ignore the spotlight as a staff engineer lalitm.com/software-engin... · Posted by u/todsacerdoti

trjordan · 11 days ago

This is a really good article. Don't get caught up in the tone of "anti-politics" or "slow is good." It's describing a brand of politics and impact that is just as mercurial as product development if you do it wrong. Infra and DevEx behaves fundamentally differently, and it can be a really great path if it suites your personality.

For context: my last job was PM for the infra team at Slack. I did it for 5 years. I didn't learn about Slack's product launch process until year 4. Everything until that point was internal work, on our k8s/service mesh and DB infrastructure.

The important insight here is about customer success and shadow management. Every successful engineer (and my own success) derived from figuring out what product engineers needed and delivering it. The "Shadow Hierarchy" feedback was make-or-break for those promotions. It's _hard_ to optimize for that, because you need to seek that feedback, understand if addressing it will actually fix the problem, and deliver it quickly enough to matter in the product org.

If you're willing to optimize for that internal success, you'll be rewarded, but in your career and in stability in the organization. I disagree this is only at Big Tech -- companies as small as 100 engineers have real and strong cultures in the right team, under the right manager.

But don't think this is some magical cheat code to ignoring what's important to the business. It's just a different, perhaps more palatable, route to managing the alignment and politics that are a necessary part of growth at any company.

trjordan commented on Anthropic taps IPO lawyers as it races OpenAI to go public ft.com/content/3254fa30-5... · Posted by u/GeorgeWoff25

trjordan · 12 days ago

> the gap to GPT-4o, Gemini 2 ... is shrinking fast

Are you ... aware that OpenAI and Google have launched more recent models?

trjordan commented on Metabolic and cellular differences between sedentary and active individuals howardluksmd.substack.com... · Posted by u/rzk

wotmatetherow · a month ago

Obesity is an independent risk factor, even if otherwise active/healthy. It's worth getting under control for lifespan and healthspan.

trjordan · a month ago

Sure. I lost 20lbs in the last 12 months. I agree it’s worth working on, but not that it’s worth stressing about.

trjordan commented on Metabolic and cellular differences between sedentary and active individuals howardluksmd.substack.com... · Posted by u/rzk

jaggederest · a month ago

Don't feel bad about your VO2 max, the baseline and ceiling are largely genetic. Most people can only bump VO2 max by about 10-15% even with absurd training regimens. Same goes with many of the markers people track - you can control them to an extent, but some people just have high blood pressure or poor lipid profiles and thus need intervention.

trjordan · a month ago

Appreciate it. The Apple-Watch-measured version came up to 44 since I’ve started running. I’ve been pleased.

None of my markers are high enough to trigger a doctor to care.

trjordan commented on Metabolic and cellular differences between sedentary and active individuals howardluksmd.substack.com... · Posted by u/rzk

trjordan · a month ago

Between this and http://myticker.com (posted recently), I want to share a theory of mine:

1) the internet is mostly made up of spaces where the median opinion is vanishingly rare among actual humans.

2) the median internet opinion is that of a person who is deep into the topic they're writing about.

The net result is that for most topics, you will feel moderate to severe anxiety about being "behind" about what you shuld be doing.

I'm 40, and I'm active. I ran a half marathon last weekend. I spent 5 hours climbing with my kids this weekend. My reaction to these articles, emotionally, was "I'm probably going to die of heart disease," because my cholesterol is a bit high and my BMI is 30. When I was biking 90 miles a week, my VO2 max was "sub-standard."

Let's assume this information is true. That's OK. It's all dialed up to 11, and you don't have to do anything about it right now.

trjordan commented on Toolkit to help you get started with Spec-Driven Development github.com/github/spec-ki... · Posted by u/mooreds

dennisy · a month ago

Tern looks very interesting.

On your homepage there is a mention that Tern “writes its own tools”, could you give an example on how this works?

trjordan · a month ago

If you're thinking about, e.g. upgrading to Django 5, there's a bunch of changes that are sort of code-mod-shaped. It's possible that there's not a codemod for it it that works for you.

Tern can write that tool for you, then use it. It gives you more control in certain cases than simply asking the AI to do something that might appear hundreds of times in your code.

trjordan commented on Toolkit to help you get started with Spec-Driven Development github.com/github/spec-ki... · Posted by u/mooreds

Marazan · a month ago

> There are decisions you didn't realize you needed to make, until you get there.

Is the key insight and biggest stumbling block for me at the moment.

At the moment (encourage by my company) I'm experimenting with as hands off as possible Agent usage for coding. And it is _unbelievably_ frustrating to see the Agent get 99% of the code right in the first pass only to misunderstand why a test is now failing and then completely mangle both it's own code and the existing tests as it tries to "fix" the "problem". And if I'd just given it a better spec to start with it probably wouldn't have started producing garbage.

But I didn't know that before working with the code! So to develop a good spec I either have to have the agent stopping all the time so I can intervene or dive into the code myself to begin with and at that point I may as well write the code anyway as writing the code is not the slow bit.

trjordan · a month ago

For sure. One of our first posts was called "You Have To Decide" -- https://tern.sh/blog/you-have-to-decide/

And my process now (and what we're baking into the product) is:

- Make a prompt

- Run it in a loop over N files. Full agentic toolkit, but don't be wasteful (no "full typecheck, run the test suite" on every file).

- Have an agent check the output. Look for repeated exploration, look for failures. Those imply confusion.

- Iterate the prompt to remove the confusion.

First pass on the current project (a Vue 3 migration) went from 45 min of agentic time on 5 files to 10 min on 50 files, and the latter passed tests/typecheck/my own scrolling through it.

trjordan commented on Toolkit to help you get started with Spec-Driven Development github.com/github/spec-ki... · Posted by u/mooreds

trjordan · a month ago

I don't think we ever get away from the code being the source of truth. There has to be one source of truth.

If you want to go all in on specs, you must fully commit to allowing the AI to regenerate the codebase from scratch at any point. I'm an AI optimist, but this is a laughable stance with current tools.

That said, the idea of operating on the codebase as a mutable, complex entity, at arms length, makes a TON of sense to me. I love touching and feeling the code, but as soon as there's 1) schedule pressure and 2) a company's worth of code, operating at a systems level of understanding just makes way more sense. Defining what you want done, using a mix of user-centric intent and architecture constraints, seems like a super high-leverage way to work.

The feedback mechanisms are still pretty tough, because you need to understand what the AI is implicitly doing as it works through your spec. There are decisions you didn't realize you needed to make, until you get there.

We're thinking a lot about this at https://tern.sh, and I'm currently excited about the idea of throwing an agentic loop around the implementation itself. Adversarially have an AI read through that huge implementation log and surface where it's struggling. It's a model that gives real leverage, especially over the "watch Claude flail" mode that's common in bigger projects/codebases.

trjordan commented on Show HN: ChartDB Agent – Cursor for DB schema design app.chartdb.io/ai... · Posted by u/guyb3

trjordan · 2 months ago

My conversation

"Design a schema like Calendly" --> Did it

"OK let's scale this to 100m users" --> Tells me how it would. No schema change.

"Did you update the schema?" --> Updates the schema, tells me what it did.

We've been running into this EXACT failure mode with current models, and it's so irritating. Our agent plans migrations, so it's code-adjacent, but the output is a structured plan (basically: tasks, which are prompt + regex. What to do; where to do it.)

The agent really wants to talk to you about it. Claude wants to write code about it. None of the models want to communicate with the user primarily through tool use, even when (as I'm sure ChartDB is) HEAVILY prompted to do so.

I think there's still a lot of value there, but it's a bummer that we as users are going to have to remind all LLMs for a little bit to do keep using their tools beyond the 1st prompt.