sabareesh commented on Self Driving Car Insurance   lemonade.com/car/explaine... · Posted by u/KellyCriterion
sabareesh · 9 days ago
Tesla has its own insurance product, which is already very competitive compared to other providers. Not sure Lemonade can beat it. Tesla's insurance already has a similar objective in place: it rewards self-driving over manual driving.
sabareesh commented on Stop Doom Scrolling, Start Doom Coding: Build via the terminal from your phone   github.com/rberg27/doom-c... · Posted by u/rbergamini27
sabareesh · a month ago
I am looking for an open source terminal for iPhone. I have code server running, so I can just use the terminal from VS Code in Safari.
sabareesh commented on I switched from VSCode to Zed   tenthousandmeters.com/blo... · Posted by u/r4victor
xpe · a month ago
Do you mean a terminal-based editor, like emacs, vim, neovim, or helix? (I quite like the latter, after having used all the former to some degree.)

Or do you mean line-editors? They have gotten impressively good. See rustyline (based on linenoise) and reedline (not a typo; developed by the Nushell team) for example. Way better than one might expect!

[1]: https://github.com/kkawakam/rustyline

[2]: https://github.com/antirez/linenoise

[3]: https://github.com/nushell/reedline

sabareesh · a month ago
Sorry to disappoint, but purely Codex and Claude Code.
sabareesh commented on I switched from VSCode to Zed   tenthousandmeters.com/blo... · Posted by u/r4victor
sabareesh · a month ago
I have switched to the terminal.
sabareesh commented on IQuest-Coder: A new open-source code model beats Claude Sonnet 4.5 and GPT 5.1 [pdf]   github.com/IQuestLab/IQue... · Posted by u/shenli3514
sabareesh · a month ago
TL;DR is that they didn't clean the repo (.git/ folder), model just reward hacked its way to look up future commits with fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589

(given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)

https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...
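
For anyone setting up a similar benchmark, the cleanup step described above could look roughly like this. It is a minimal sketch, not IQuestLab's or SWE-Bench's actual tooling; the function name `strip_git_metadata` and the idea of a per-task workspace directory are assumptions made for illustration.

```python
import shutil
from pathlib import Path


def strip_git_metadata(workspace: str) -> list[str]:
    """Delete every .git entry under `workspace` and return the removed paths.

    Hypothetical helper: removing the history means the model under test
    cannot look up later commits that contain the fix.
    """
    removed = []
    # Materialize the matches first so we do not delete while still walking.
    for git_path in sorted(Path(workspace).rglob(".git")):
        if git_path.is_dir():
            shutil.rmtree(git_path)
        elif git_path.is_file():
            # Submodules and worktrees use a plain .git file instead of a directory.
            git_path.unlink()
        else:
            # Already gone, e.g. it lived inside a directory removed earlier.
            continue
        removed.append(str(git_path))
    return removed
```

Running it once on the task checkout before handing the directory to the agent would be enough for the case described in this thread; it does not cover other leaks such as cached packages or network access.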

sabareesh commented on Show HN: Stop Claude Code from forgetting everything   github.com/mutable-state-... · Posted by u/austinbaggio
sabareesh · a month ago
Non-starter for us; we can't ship proprietary data to third-party servers.
sabareesh commented on Gemini 3 Flash: Frontier intelligence built for speed   blog.google/products/gemi... · Posted by u/meetpateltech
andai · 2 months ago
This model has the best score on that benchmark.

Edit: Huh... It does score highest in "Omniscience", but also very high in Hallucination Rate (where higher score is worse)...

sabareesh · 2 months ago
This has one of the worst scores in AA-Omniscience Hallucination Rate.
sabareesh commented on Gemini 3 Flash: Frontier intelligence built for speed   blog.google/products/gemi... · Posted by u/meetpateltech
joecarpenter · 2 months ago
Isn't it the opposite? From the link: Scores range from -100 to 100, where 0 means as many correct as incorrect answers, and negative scores mean more incorrect than correct.

Gemini 3 Flash scored +13 in the test, more correct answers than incorrect.
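
To make the quoted scale concrete, here is a minimal sketch of a score that behaves that way. The exact formula is not quoted in this thread, so treating it as "correct minus incorrect, over all questions, times 100" is an assumption, as are the function name and the `abstained` parameter.

```python
def omniscience_style_index(correct: int, incorrect: int, abstained: int = 0) -> float:
    """Net-correct score on a -100..100 scale (assumed formula, see note above).

    0 means as many correct as incorrect answers; a negative value means
    more incorrect than correct. Abstentions (an assumed category) count
    toward the total but neither add nor subtract.
    """
    total = correct + incorrect + abstained
    if total == 0:
        raise ValueError("need at least one question")
    return 100.0 * (correct - incorrect) / total


# Equal numbers of right and wrong answers land exactly on zero.
print(omniscience_style_index(50, 50))
```

Under this reading a positive score only says that correct answers outnumber incorrect ones; it says nothing directly about the separate Hallucination Rate metric.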

sabareesh · 2 months ago
Nope, lower is better. Compared to recent OpenAI models this is bad. I am looking at AA-Omniscience Hallucination Rate.
sabareesh commented on The Big Vitamin D Mistake [pdf] (2017)   pmc.ncbi.nlm.nih.gov/arti... · Posted by u/felineflock
sabareesh · 2 months ago
So is a daily dose of 10,000 IU OK?

u/sabareesh

Karma: 294 · Cake day: April 12, 2017
About
CTO @ guidedchoice.com, 3nickels.com. Making finance easy for everyone