sabareesh (u/sabareesh)

sabareesh commented on Self Driving Car Insurance lemonade.com/car/explaine... · Posted by u/KellyCriterion

sabareesh · 9 days ago

Tesla has its own insurance product, which is already very competitive compared to other providers. Not sure if Lemonade can beat them. Tesla's insurance product already has a similar objective in place: it rewards self-driving over manual driving.

sabareesh commented on Stop Doom Scrolling, Start Doom Coding: Build via the terminal from your phone github.com/rberg27/doom-c... · Posted by u/rbergamini27

sabareesh · a month ago

I am looking for an open source terminal for iPhone. I have code server running, which lets me just use the terminal from VS Code in Safari.

sabareesh commented on I switched from VSCode to Zed tenthousandmeters.com/blo... · Posted by u/r4victor

xpe · a month ago

Do you mean a terminal-based editor, like emacs, vim, neovim, or helix? (I quite like the latter, after having used all the former to some degree.)

Or do you mean line-editors? They have gotten impressively good. See rustyline (based on linenoise) and reedline (not a typo; developed by the Nushell team) for example. Way better than one might expect!

[1]: https://github.com/kkawakam/rustyline

[2]: https://github.com/antirez/linenoise

[3]: https://github.com/nushell/reedline

sabareesh · a month ago

Sorry to disappoint, but purely Codex and Claude Code.

sabareesh commented on I switched from VSCode to Zed tenthousandmeters.com/blo... · Posted by u/r4victor

sabareesh · a month ago

I have switched to terminal

sabareesh commented on IQuest-Coder: A new open-source code model beats Claude Sonnet 4.5 and GPT 5.1 [pdf] github.com/IQuestLab/IQue... · Posted by u/shenli3514

sabareesh · a month ago

TL;DR is that they didn't clean the repo (.git/ folder), so the model just reward-hacked its way to looking up future commits with fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589

(given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)

https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...

sabareesh commented on Show HN: Stop Claude Code from forgetting everything github.com/mutable-state-... · Posted by u/austinbaggio

sabareesh · a month ago

Non-starter for us; we can't ship proprietary data to third-party servers.

sabareesh commented on Gemini 3 Flash: Frontier intelligence built for speed blog.google/products/gemi... · Posted by u/meetpateltech

andai · 2 months ago

This model has the best score on that benchmark.

Edit: Huh... It does score highest in "Omniscience", but also very high in Hallucination Rate (where higher score is worse)...

sabareesh · 2 months ago

This has one of the worst scores in AA-Omniscience Hallucination Rate.

sabareesh commented on Gemini 3 Flash: Frontier intelligence built for speed blog.google/products/gemi... · Posted by u/meetpateltech

joecarpenter · 2 months ago

Isn't it the opposite? From the link: Scores range from -100 to 100, where 0 means as many correct as incorrect answers, and negative scores mean more incorrect than correct.

Gemini 3 Flash scored +13 in the test, more correct answers than incorrect.

sabareesh · 2 months ago

Nope, lower is better. Compared to recent OpenAI models, this is bad. I am looking at AA-Omniscience Hallucination Rate.

sabareesh commented on Gemini 3 Flash: Frontier intelligence built for speed blog.google/products/gemi... · Posted by u/meetpateltech

sabareesh · 2 months ago

Watch out, these models are hallucinating a lot more: https://artificialanalysis.ai/evaluations/omniscience?omnisc...

sabareesh commented on The Big Vitamin D Mistake [pdf] (2017) pmc.ncbi.nlm.nih.gov/arti... · Posted by u/felineflock

sabareesh · 2 months ago

So is a daily dose of 10,000 IU OK?

u/sabareesh

Karma: 294 · Cake day: April 12, 2017

About

CTO @ guidedchoice.com, 3nickels.com. Making finance easy for everyone
