Readit News logoReadit News
andrewmunsell commented on OpenAI Codex hands-on review   zackproser.com/blog/opena... · Posted by u/fragmede
swyx · 3 months ago
i shared my review inside of the pod with the team (https://latent.space/p/codex) but basically:

- it's a GREAT oneshot coding model (in the pod we find out that they specifically finetuned for oneshotting OAI SWE tasks, eg prioritized over being multiturn)

- however comparatively let down by poorer integrations (eg no built in browser, not great github integration - as TFA notes "The current workflow wants to open a fresh pull request for every iteration, which means pushing follow-up commits to an existing branch is awkward at best." - yeah this sucks ass)

fortunately the integrations will only improve over time. i think the finding that you can do 60 concurrent Codex instances per hour is qualitatively different than Devin (5 concurrent) and Cursor (1 before the new "background agents").

btw

> I haven't yet noticed a marked difference in the performance of the Codex model, which OpenAI explains is a descendant of GPT-3 and is proficient in more than 12 programming languages.

incorrect, its an o3 finetune.

andrewmunsell · 3 months ago
> incorrect, its an o3 finetune.

This is Open AI's fault (and literally every AI company is guilty of the same horrid naming schemes). Codex was an old model based on GPT-3, but then they reused the same name for both their Codex CLI and this Codex tool...

I mean, just look at the updates to their own blog post, I can see why people are confused.

https://openai.com/index/openai-codex/

Edit:

Google just did it too. "Gemini Ultra" is both a model (https://deepmind.google/models/gemini/ultra/) and their new top-tier subscription plan (a la Open AI's Pro plan). Why is this so difficult?

andrewmunsell commented on Void: Open-source Cursor alternative   github.com/voideditor/voi... · Posted by u/sharjeelsayed
ramoz · 4 months ago
I think QOL will shift away from your keyboard. Give Claude Code a try and you’ll understand what I mean. Developer UX will shift away from traditional IDEs. At this point I could use notepad for the the type of manual work I do vs how I orchestrate Claude Code.
andrewmunsell · 4 months ago
The reason I have never bothered with Claude Code (or even other agentic tools), is that I still code mostly by hand.

When I am using LLMs, I know exactly what the code should be and just am using it as a way to produce it faster (my Cursor rules are extremely extensive and focused on my personal architecture and code style, and I share them across all my personal projects), rather than producing a whole feature. When I try and use just the agent in Cursor, it always needs significant modifications and reorganization to meet my standards, even with the extensive rules I have set up.

Cursor appeals to me because those QOL features don't take away the actual code writing part, but instead augment it and get rid of some of the tedium.

andrewmunsell commented on Void: Open-source Cursor alternative   github.com/voideditor/voi... · Posted by u/sharjeelsayed
andrewmunsell · 4 months ago
Given that there's a dozen agentic coding IDEs, I only use Cursor because of the few features they have like auto-identification of the next cursor location (I find myself hitting tab-tab-tab-tab a lot, it speeds up repetitive edits). Are there any other IDEs that implement these QOL features, including Void (given it touts itself specifically as a Cursor alternative)?
andrewmunsell commented on Apple M5 could ditch unified memory architecture for split CPU and GPU designs   notebookcheck.net/Apple-M... · Posted by u/akyuu
lotsofpulp · 8 months ago
I’m seeing 16/256 for $600 and 32/512 for $1,200 on apple.com
andrewmunsell · 8 months ago
On the Apple Edu store, it's $499 for the 16/256 and $1079 for the 32/512
andrewmunsell commented on QwQ: Alibaba's O1-like reasoning LLM   qwenlm.github.io/blog/qwq... · Posted by u/amrrs
j0hnyl · 9 months ago
How many tokens per second?
andrewmunsell · 9 months ago
Another data point:

17.6 tokens/s on an M4 Max 40 core GPU

andrewmunsell commented on M4 MacBook Pro   apple.com/newsroom/2024/1... · Posted by u/tosh
mrcwinn · 10 months ago
Question without judgement: why would I want to run LLM locally? Say I'm building a SaaS app and connecting to Anthropic using the `ai` package. Would I want to cut over to ollama+something for local dev?
andrewmunsell · 10 months ago
Data privacy-- some stuff, like all my personal notes I use with a RAG system, just don't need to be sent to some cloud provider to be data mined and/or have AI trained on them
andrewmunsell commented on Nearly all of the Google images results for "baby peacock" are AI generated   twitter.com/notengoprisa/... · Posted by u/jsheard
giarc · a year ago
I'm a part time maker and purchase a lot of designs off of Etsy to make into physical goods. I have to weed through so many AI images when purchasing designs off of Etsy now. I wish they required users to indicate if AI was used to produce the image so I could then filter them out.
andrewmunsell · a year ago
Sellers are actually supposed to mark items as AI generated/assisted, where applicable: https://techcrunch.com/2024/07/09/etsy-new-seller-policy-202...

Whether they actually do this (and whether there's any incentive to do so), is obviously not a given

andrewmunsell commented on Apple Explains iPhone 15 Pro Requirement for Apple Intelligence   macrumors.com/2024/06/19/... · Posted by u/mgh2
andrewmunsell · a year ago
If Apple's "Private Cloud Compute" is used even on the latest devices for some tasks that are too computationally complex to be done on-device, then is there some reason (other than money) that they can't launch Apple Intelligence on all iOS 18 devices but use the cloud for all "AI" requests that would be "too slow" because of the older chips?
andrewmunsell commented on Apple Intelligence for iPhone, iPad, and Mac   apple.com/newsroom/2024/0... · Posted by u/terramex
Tomte · a year ago
No transcripts in Voice Memos? The one feature I was surprised hasn‘t already been there for years, heavily rumored before this WWDC, and now nothing?
andrewmunsell · a year ago
From MacRumors:

> Notes can record and transcribe audio. When your recording is finished, Apple Intelligence automatically generates a summary. Recording and summaries coming to phone calls too.

So the functionality exists, maybe just not in the Voice Memos app?

u/andrewmunsell

KarmaCake day2222July 29, 2012
About
Any content I post on Hacker News is strictly personal opinion and does not reflect the views of my employer or anyone other than myself.

- https://www.andrewmunsell.com/ - https://mastodon.munsell.io/@andrew

[ my public key: https://keybase.io/andrewmunsell; my proof: https://keybase.io/andrewmunsell/sigs/iTG3yiQopPB2Za_q0DZMOVBh_5rxp_aExEVcXVUyDv4 ]

View Original