Readit News logoReadit News
csoham commented on Man lands in hospital after Samsung smart ring battery swells and traps finger   neowin.net/news/youtuber-... · Posted by u/bundie
csoham · 6 months ago
Wow this is crazy. What's crazier is the drop in battery level from 7 days to 1.5 days in 9 months!
csoham commented on Ask HN: What are you working on? (September 2025)    · Posted by u/david927
csoham · 6 months ago
I'm working on ScaleDown [1], a context pruning API.

So over the past few years, I have seen how contexts have been steadily growing in AI apps. And while the context lengths of LLMs have also been increasing, they are still effectively about 200k tokens. The performance drops off a cliff after that (you might have noticed it as well with long AI chats).

It is a simple API that prunes away irrelevant parts of a context for a given prompt, a.k.a. context-aware pruning. Integration is super simple: just an extra API call before the final LLM API call. You can get an API from the website.

I would love to chat if this is something that is relevant to you and if you have any feedback on what we are building!

[1] https://scaledown.ai

csoham commented on When the job search becomes impossible   jeffwofford.com/wp/?p=224... · Posted by u/pertinhower
csoham · 6 months ago
[self promotion alert] the "you are not alone" point really resonated with me. When I lost my job, I was alone, helpless and not sure what the next steps were. This is why I tried to create a community of people willing to support and be a listening ear for people going through job loss and this tough job market. It's at layoff.supprt. honestly I have not been supporting it for a while but of you find this helpful and would like more features then do let me know!
csoham commented on Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22%   quesma.com/blog/tau2-benc... · Posted by u/blndrt
csoham · 6 months ago
Really intresting. What did the original prompt look like? Perhaps the original prompt was not that good? I feel like the changes claude suggested (except a couple maybe) are already pretty well known prompt engineering practices.
csoham commented on VLLM: Anatomy of a High-Throughput LLM Inference System   aleksagordic.com/blog/vll... · Posted by u/tim_sw
csoham · 6 months ago
One of the best deep-dives I have seen!

u/csoham

KarmaCake day22October 17, 2018
About
Building scaledown.ai | Gen AI, MLOps, LLMOps, TinyML
View Original