Readit News
mekpro commented on Kimi Claw   kimi.com/bot... · Posted by u/pretext
8cvor6j844qw_d6 · 17 hours ago
The pricing looks great.

Significantly better than ~USD 50 per day on the Anthropic API.

Any idea how well this model compares to Opus 4.6?

I tried Grok 4.1 Fast, but the results are mild, to put it kindly.

mekpro · 5 hours ago
Opus is definitely in a league of its own. I use Kimi/Gemini-cli regularly for coding to save cost, and in my experience Kimi 2.5 is more solid than Gemini Flash 3.0 for coding. While Gemini Flash 3.0 is generally faster, it often breaks syntax and skips important parts of the prompt. Kimi 2.5 writes very good code and plans very well.
mekpro commented on Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model   kimi.com/blog/kimi-k2-5.h... · Posted by u/nekofneko
PlatoIsADisease · 20 days ago
I am convinced that was mostly just marketing. No one uses DeepSeek as far as I can tell. People are not running it locally. People choose GPT/Gemini/Claude/Grok if they are giving their data away anyway.

The biggest source of my conspiracy theory is that I made a Reddit thread asking a question: "Why all the deepseek hype" or something like that. And to this day, I get odd, 'pro deepseek' comments from accounts only used every few months. It's not like this was some highly upvoted topic that is in the 'Top'.

I'd put that DeepSeek marketing on par with an Apple marketing campaign.

mekpro · 20 days ago
Except that on OpenRouter, DeepSeek consistently stays in the top 10 ranking. Although I have not used it personally, I believe its main advantage over other models is price/performance.
mekpro commented on Apple is fighting for TSMC capacity as Nvidia takes center stage   culpium.com/p/exclusiveap... · Posted by u/speckx
mekpro · a month ago
I think the opposite. Having NVIDIA invest in TSMC's bleeding-edge process node should benefit Apple rather than disadvantage it.

It means that Apple doesn't have to be the sole investor in the latest node development, which is harder to justify, especially in a year when the smartphone upgrade cycle is slowing down. Having NVIDIA (and the AI boom) in the picture should help Apple reduce CAPEX for its semiconductor investment.

mekpro commented on 1300 Still Images from the Animated Films of Hayao Miyazaki's Studio Ghibli (2023)   ghibli.jp/info/013772/... · Posted by u/vinhnx
mekpro · 2 months ago
They are so beautiful that I don't want any of these to be stolen by AI.
mekpro commented on DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]   github.com/deepseek-ai/De... · Posted by u/fspeech
mekpro · 3 months ago
How does this improvement translate into real-world agentic coding tasks?

Deleted Comment

mekpro commented on Open models by OpenAI   openai.com/open-models/... · Posted by u/lackoftactics
GodelNumbering · 6 months ago
> The 20B model runs on my Mac laptop using less than 15GB of RAM.

I was about to try the same. What TPS are you getting and on which processor? Thanks!

mekpro · 6 months ago
I got 70 tokens/s on an M4 Max.
mekpro commented on Open models by OpenAI   openai.com/open-models/... · Posted by u/lackoftactics
coltonv · 6 months ago
Yes, but if I set it above ~16K on my 32GB laptop it just OOMs. Am I doing something wrong?
mekpro · 6 months ago
Try enabling flash attention and offloading all layers to the GPU.
mekpro commented on Claude Code weekly rate limits    · Posted by u/thebestmoshe
mekpro · 7 months ago
Will this limit also count together with Claude Chat?
mekpro commented on OpenAI’s Windsurf deal is off, and Windsurf’s CEO is going to Google   theverge.com/openai/70599... · Posted by u/rcchen
jhickok · 7 months ago
Can you give me an idea of how much interaction would be $50-$100 per day? Like are you pretty constantly in a back and forth with CC? And if you wouldn’t mind, any chance you can give me an idea of productivity gains pre/post LLM?
mekpro · 7 months ago
You can easily reach $50 per day. If you force-switch the model to Opus with /model opus, it will continue to use Opus even though there is a warning about approaching the limit.

I found Opus significantly more capable at coding than Sonnet, especially for tasks that are poorly defined. Thinking mode can fill in a lot of missing detail, and you just need to edit a little before letting it code.

u/mekpro

Karma: 393 · Cake day: March 24, 2012