Readit News logoReadit News
picografix commented on Introducing deep research   openai.com/index/introduc... · Posted by u/mfiguiere
picografix · a year ago
I think deep research as a service could be a really strong use case for enterprises, as long as they have access to non-public data. I assume that most of this guarded data is high quality, and seeing progress in these areas might end up being even more impressive than it is now.
picografix commented on Mistral Small 3   mistral.ai/news/mistral-s... · Posted by u/jasondavies
picografix · a year ago
Tried running locally, gone were the days where you get broken responses on local models (i know this happened earlier but I tried after so many days)
picografix commented on Isolating complexity is the essence of successful abstractions   v5.chriskrycho.com/journa... · Posted by u/chriskrycho
picografix · a year ago
complexity has to live somewhere, code anxiety was a real thing for me
picografix commented on DeepSeek-R1   github.com/deepseek-ai/De... · Posted by u/meetpateltech
ankit219 · a year ago
> The other thing was that o1 had access to many more answer / search strategies. For example, if you asked o1 to summarize a long email, it would just summarize the email. QwQ reasoned about why I asked it to summarize the email. Or, on hard math questions, o1 could employ more search strategies than QwQ. I'm curious how DeepSeek-R1 will fare in that regard.

This is probably the result of a classifier which determines if it have to go through the whole CoT at the start. Mostly on tough problems it does, and otherwise, it just answers as is. Many papers (scaling ttc, and the mcts one) have talked about this as a necessary strategy to improve outputs against all kinds of inputs.

picografix · a year ago
yes the original TTC paper mentioned the optimal strategy for TTC
picografix commented on Test-driven development with an LLM for fun and profit   blog.yfzhou.fyi/posts/tdd... · Posted by u/crazylogger
picografix · a year ago
very few times we are encountered with developing from scratch
picografix commented on Show HN: Simplex: Automate browser workflows using code and natural language   simplex.sh/playground... · Posted by u/marcon680
picografix · a year ago
it fails for this query search("amazon.in", "fitness watch")
picografix commented on GPT-4o with scheduled tasks (jawbone) is available in beta   chatgpt.com/?model=gpt-4o... · Posted by u/TheJCDenton
picografix · a year ago
why are they trying to be a model provider as well as service provider

u/picografix

KarmaCake day1January 7, 2025View Original