When I'm using LLMs, I know exactly what the code should be and am just using them to produce it faster, rather than to produce a whole feature (my Cursor rules are extremely extensive, focused on my personal architecture and code style, and shared across all my personal projects). When I try to use just the agent in Cursor, its output always needs significant modification and reorganization to meet my standards, even with the extensive rules I have set up.
Cursor appeals to me because those quality-of-life features don't take away the actual code-writing part, but instead augment it and get rid of some of the tedium.
- it's a GREAT oneshot coding model (on the pod we find out that they specifically finetuned it for oneshotting OAI SWE tasks, i.e. prioritized that over multiturn use)
- however it's comparatively let down by poorer integrations (e.g. no built-in browser, not-great GitHub integration - as TFA notes, "The current workflow wants to open a fresh pull request for every iteration, which means pushing follow-up commits to an existing branch is awkward at best." - yeah this sucks ass)
fortunately the integrations will only improve over time. i think the finding that you can run 60 concurrent Codex instances per hour is qualitatively different from Devin (5 concurrent) and Cursor (1, before the new "background agents").
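To make the concurrency point concrete: the workflow shift is from babysitting one agent to fanning out dozens of independent runs and collecting results. A minimal sketch of that pattern, assuming nothing about Codex's actual API (`run_agent_task` here is a hypothetical stand-in using asyncio, since real dispatch happens through OpenAI's web UI):

```python
import asyncio

async def run_agent_task(task_id: int) -> str:
    # Hypothetical stand-in for one independent agent run;
    # the sleep simulates the agent working in its own sandbox.
    await asyncio.sleep(0.01)
    return f"task-{task_id}: done"

async def fan_out(n: int) -> list[str]:
    # Launch n independent runs concurrently and wait for all of them.
    return await asyncio.gather(*(run_agent_task(i) for i in range(n)))

# 60 runs complete in roughly the time of one, because they don't block
# each other - the qualitative difference vs. one interactive agent.
results = asyncio.run(fan_out(60))
print(len(results))
```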
btw
> I haven't yet noticed a marked difference in the performance of the Codex model, which OpenAI explains is a descendant of GPT-3 and is proficient in more than 12 programming languages.
incorrect, it's an o3 finetune.
This is OpenAI's fault (and literally every AI company is guilty of the same horrid naming schemes). Codex was an old model based on GPT-3, but then they reused the same name for both their Codex CLI and this Codex tool...
I mean, just look at the updates to their own blog post, I can see why people are confused.
https://openai.com/index/openai-codex/
Edit:
Google just did it too. "Gemini Ultra" is both a model (https://deepmind.google/models/gemini/ultra/) and their new top-tier subscription plan (a la OpenAI's Pro plan). Why is this so difficult?