Readit News logoReadit News
sams99 commented on How to code Claude Code in 200 lines of code   mihaileric.com/The-Empero... · Posted by u/nutellalover
sams99 · 2 months ago
For those interested, edit is a surprisingly difficult problem, it seems easy on the surface but there is both fine tuning and real world hallucinations you are fighting with. I implemented one this week in:

https://github.com/samsaffron/term-llm

It is about my 10th attempt at the problem so I am aware of a lot of the edge cases, a very interesting bit of research here is:

https://gist.github.com/SamSaffron/5ff5f900645a11ef4ed6c87f2...

Fascinating read.

sams99 commented on Anthropic blocks third-party use of Claude Code subscriptions   github.com/anomalyco/open... · Posted by u/sergiotapia
wahnfrieden · 2 months ago
Meanwhile, OpenAI co-signs https://github.com/steipete/oracle which lets you use your ChatGPT subscription to gain programmatic/agentic access to 5.2 Pro via automating browser access to the web frontend. Karpathy and other leaders have praised this feature on X.

If that is indeed so welcome, imagine what else you could script via their website to get around Codex rate limits or other such things.

After all what coud be so different about this than what browsers like Atlas do already

sams99 · 2 months ago
Codex requires stuffing a very specific system prompt otherwise the custom endpoint will reject you
sams99 commented on We need a clearer framework for AI-assisted contributions to open source   samsaffron.com/archive/20... · Posted by u/keybits
sams99 · 4 months ago
Author here, thanks heaps for the discussion, I replied to a few of the points in my blog comments:

https://discuss.samsaffron.com/t/your-vibe-coded-slop-pr-is-...

sams99 commented on DeepThought-8B: A small, capable reasoning model   ruliad.co/news/introducin... · Posted by u/AnhTho_FR
lowyek · a year ago
I asked it 'find two primes whose sum is 123' .. it is in deep thought from 5 minutes just looping and looping over seemingly repeated hallucinations of right path. (btw, chatgpt immediately answers 61 and 62 lol.. so much for intelligence)
sams99 · a year ago
Qwen coder 32b with a JavaScript interpreter

Impressive answer for a model that can run on your own computer

https://discuss.samsaffron.com/discourse-ai/ai-bot/shared-ai...

sams99 commented on QwQ: Alibaba's O1-like reasoning LLM   qwenlm.github.io/blog/qwq... · Posted by u/amrrs
simonw · a year ago
This one is pretty impressive. I'm running it on my Mac via Ollama - only a 20GB download, tokens spit out pretty fast and my initial prompts have shown some good results. Notes here: https://simonwillison.net/2024/Nov/27/qwq/
sams99 · a year ago
I find it odd that is refused me so badly https://discuss.samsaffron.com/discourse-ai/ai-bot/shared-ai... my guess is that I am using a quantized model

It simply did not want to use XML tools for some reason something that even qwen coder does not struggle with: https://discuss.samsaffron.com/discourse-ai/ai-bot/shared-ai...

I have not seen any model including sonnet that is able to 1 shot a working 9x9 go board

For ref gpt-4o which is still quite bad https://discuss.samsaffron.com/discourse-ai/ai-bot/shared-ai...

sams99 commented on Generative AI Is Not Going to Build Your Engineering Team for You   simonwillison.net/2024/Ju... · Posted by u/duck
sams99 · 2 years ago
The original was posted at work earlier this week, to me the original missed a bit around explaining what this tech is yes good at... https://meta.discourse.org/discourse-ai/ai-bot/shared-ai-con...
sams99 commented on How web bloat impacts users with slow devices   danluu.com/slow-device/... · Posted by u/jasondavies
sams99 · 2 years ago
Highly Gamed === It is better if users with slow devices see a white screen for 30 seconds vs an indication that something is happening, because ... reasons?

u/sams99

KarmaCake day1657February 13, 2010
About
co-founder www.discourse.org

[ my public key: https://keybase.io/sam_saffron; my proof: https://keybase.io/sam_saffron/sigs/p9mfLRQFpJGAbjrwmLZxcboTWPN0WgtGK76rkI4O-wY ]

View Original