csoham (u/csoham) - Readit News

csoham · 6 months ago

Wow this is crazy. What's crazier is the drop in battery level from 7 days to 1.5 days in 9 months!

Posted by u/csoham 6 months ago

Agentic Commerce Protocol Spec github.com/agentic-commer...

Posted by u/csoham 6 months ago

Developing an open standard for agentic commerce stripe.com/blog/developin...

csoham commented on Ask HN: What are you working on? (September 2025) · Posted by u/david927

csoham · 6 months ago

I'm working on ScaleDown [1], a context pruning API.

Over the past few years, I have seen contexts grow steadily in AI apps. While the context lengths of LLMs have also been increasing, the effective limit is still about 200k tokens. Performance drops off a cliff after that (you might have noticed it as well with long AI chats).

ScaleDown is a simple API that prunes away the parts of a context that are irrelevant to a given prompt, a.k.a. context-aware pruning. Integration is super simple: just one extra API call before the final LLM API call. You can get an API key from the website.

I would love to chat if this is something that is relevant to you and if you have any feedback on what we are building!

[1] https://scaledown.ai
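
The "prune first, then call the LLM" flow described above can be illustrated with a small local sketch. This is not the ScaleDown API: `prune_context` and `build_llm_input` are hypothetical names, and the word-overlap scoring is a crude stand-in assumed here only to show where a pruning step would sit in the pipeline.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def prune_context(context: str, prompt: str, keep: int = 2) -> str:
    """Keep the `keep` paragraphs that share the most words with the prompt.

    Hypothetical stand-in for a real pruning service: paragraphs are
    scored by simple word overlap, and zero-overlap paragraphs are dropped.
    """
    chunks = [c.strip() for c in context.split("\n\n") if c.strip()]
    prompt_words = tokenize(prompt)
    scored = [(len(tokenize(c) & prompt_words), i) for i, c in enumerate(chunks)]
    # Highest score first; ties broken by original position.
    top = sorted(scored, key=lambda s: (-s[0], s[1]))[:keep]
    # Restore document order so the pruned context still reads naturally.
    kept = sorted(i for score, i in top if score > 0)
    return "\n\n".join(chunks[i] for i in kept)


def build_llm_input(context: str, prompt: str) -> str:
    """Prune the context, then assemble the text for the final LLM call."""
    pruned = prune_context(context, prompt)
    return f"Context:\n{pruned}\n\nQuestion: {prompt}"


context = (
    "The refund policy allows returns within 30 days.\n\n"
    "Our office dog Biscuit loves long walks.\n\n"
    "Refunds are issued to the original payment method."
)
prompt = "What is the refund policy for returns?"
llm_input = build_llm_input(context, prompt)
```

Here the paragraph about the office dog shares no words with the prompt and is dropped before the final call. A production pruner would presumably use a learned relevance model rather than raw word overlap, which also matches on stopwords like "the".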

csoham commented on When the job search becomes impossible jeffwofford.com/wp/?p=224... · Posted by u/pertinhower

csoham · 6 months ago

[self promotion alert] The "you are not alone" point really resonated with me. When I lost my job, I was alone, helpless, and not sure what the next steps were. This is why I tried to create a community of people willing to support and be a listening ear for people going through job loss and this tough job market. It's at layoff.supprt. Honestly, I have not been maintaining it for a while, but if you find this helpful and would like more features, then do let me know!

csoham commented on Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22% quesma.com/blog/tau2-benc... · Posted by u/blndrt

csoham · 6 months ago

Really interesting. What did the original prompt look like? Perhaps the original prompt was not that good? I feel like the changes Claude suggested (except maybe a couple) are already pretty well-known prompt engineering practices.

csoham commented on VLLM: Anatomy of a High-Throughput LLM Inference System aleksagordic.com/blog/vll... · Posted by u/tim_sw

csoham · 6 months ago

One of the best deep-dives I have seen!

u/csoham

Karma: 22 · Cake day: October 17, 2018

About

Building scaledown.ai | Gen AI, MLOps, LLMOps, TinyML
