mekpro (u/mekpro) - Readit News

Readit News

mekpro commented on Kimi Claw kimi.com/bot... · Posted by u/pretext

8cvor6j844qw_d6 · 17 hours ago

The pricing looks great.

Significantly much better than ~ USD 50 per day on Anthropic API.

Any idea how good this model compares to Opus 4.6?

I tried Grok 4.1 Fast but the results are mild to put it kindly.

mekpro · 5 hours ago

Opus is definitely in its own league. I use Kimi/Gemini-cli code regularly to save cost and from my experience, Kimi 2.5 is more solid than Gemini Flash 3.0 for coding. While Gemini Flash 3.0 is generally faster, it usually break the syntax and skip important prompt. Kimi 2.5 can write very good code and can plan very well.

mekpro commented on Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model kimi.com/blog/kimi-k2-5.h... · Posted by u/nekofneko

PlatoIsADisease · 20 days ago

I am convinced that was mostly just marketing. No one uses deepseek as far as I can tell. People are not running it locally. People choose GPT/Gemini/Claude/Grok if you are giving your data away anyway.

My biggest source of my conspiracy is that I made a reddit thread asking a question: "Why all the deepseek hype" or something like that. And to this day, I get odd, 'pro deepseek' comments from accounts only used every few months. Its not like this was some highly upvoted topic that is in the 'Top'.

I'd put that deepseek marketing on-par with an Apple marketing campaign.

mekpro · 20 days ago

Except that, In OpenRouter, Deepseek always maintain in Top 10 Ranking. Although I did not use it personally, i believe that their main advantage over other model is price/performance.

mekpro commented on Apple is fighting for TSMC capacity as Nvidia takes center stage culpium.com/p/exclusiveap... · Posted by u/speckx

mekpro · a month ago

I think the opposite. Having NVIDIA investing in TSMC's bleeding-edge process node should benefit Apple rather than disadvantage.

It means that Apple doesn't have to be sole investor in latest node development which is more harder to justify, especially in the year where smartphone upgrade cycle is slowdown. Having NVIDIA (and AI boom) in the picture should help Apple reduce CAPEX for their semi-conductor investment.

mekpro commented on 1300 Still Images from the Animated Films of Hayao Miyazaki's Studio Ghibli (2023) ghibli.jp/info/013772/... · Posted by u/vinhnx

mekpro · 2 months ago

They are so beautiful that i dont want any of these been stole by AI.

mekpro commented on DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf] github.com/deepseek-ai/De... · Posted by u/fspeech

mekpro · 3 months ago

How this improvement translate into real world agentic coding task ?

Deleted Comment

mekpro commented on Open models by OpenAI openai.com/open-models/... · Posted by u/lackoftactics

GodelNumbering · 6 months ago

> The 20B model runs on my Mac laptop using less than 15GB of RAM.

I was about to try the same. What TPS are you getting and on which processor? Thanks!

mekpro · 6 months ago

i got 70 token/s on m4 max

mekpro commented on Open models by OpenAI openai.com/open-models/... · Posted by u/lackoftactics

coltonv · 6 months ago

Yes but if I set it above ~16K on my 32gb laptop it just OOMs. Am I doing something wrong?

mekpro · 6 months ago

try enable flash attention and offload all layer to GPU

mekpro commented on Claude Code weekly rate limits · Posted by u/thebestmoshe

mekpro · 7 months ago

Is this limit will also count together with Claude Chat ?

mekpro commented on OpenAI’s Windsurf deal is off, and Windsurf’s CEO is going to Google theverge.com/openai/70599... · Posted by u/rcchen

jhickok · 7 months ago

Can you give me an idea of how much interaction would be $50-$100 per day? Like are you pretty constantly in a back and forth with CC? And if you wouldn’t mind, any chance you can give me an idea of productivity gains pre/post LLM?

mekpro · 7 months ago

you can easily reach 50$ per day. by force switching model to opus /model opus it will continue to use opus eventhough there is a warning about approaching limit.

i found opus is significantly more capable in coding than sonnet, especcially for the task that is poorly defined, thinking mode can fulfill alot of missing detail and you just need to edit a little before let it code.

u/mekpro

KarmaCake day393March 24, 2012View Original