Readit News logoReadit News
Oras commented on Learning from context is harder than we thought   hy.tencent.com/research/1... · Posted by u/limoce
XenophileJKO · 2 days ago
Hmm.. I looked at the benchmark set.

I'm conflicted. I don't know that I would necessarily want a model to pass all of these. Here is the fundamental problem. They are putting the rules and foundational context in "user" messages.

Essentially I don't think you want to train the models on full compliance to the user messages, they are essentially "untrusted" content from a system/model perspective. Or at least it is not generally "fully authoritative".

This creates a tension with the safety, truthfulness training, etc.

Oras · 2 days ago
Isn’t that what fine tuning does anyway?

The article is suggesting that there should be a way for the LLM to gain knowledge (changing weights) on the fly upon gaining new knowledge which would eliminate the need for manual fine tuning.

Deleted Comment

Oras commented on OpenAI Frontier   openai.com/index/introduc... · Posted by u/nycdatasci
Oras · 4 days ago
Weird that it doesn't support MS Office, unless this would affect OpenAI <=> MS partnership.
Oras commented on LG's new subscription program charges up to £277 per month to rent a TV   arstechnica.com/gadgets/2... · Posted by u/PaulHoule
Oras · 4 days ago
Is it Agentic though? /s

The bonkers part is mentioning renting it for a month (which is the title price tag), and in my mind the only reason I would do that is to check if it’s worth it or I should return it. And in the UK, I can do that anyway within 30 days and get a refund.

Oras commented on Voxtral Transcribe 2   mistral.ai/news/voxtral-t... · Posted by u/meetpateltech
mdrzn · 5 days ago
Is it 0.003 per minute of audio uploaded, or "compute minute"?

For example fal.ai has a Whisper API endpoint priced at "$0.00125 per compute second" which (at 10-25x realtime) is EXTREMELY cheaper than all the competitors.

Oras · 5 days ago
I think the point is having it for real-time; this is for conversations rather than transcribing audio files.
Oras commented on Voxtral Transcribe 2   mistral.ai/news/voxtral-t... · Posted by u/meetpateltech
simonw · 5 days ago
This demo is really impressive: https://huggingface.co/spaces/mistralai/Voxtral-Mini-Realtim...

Don't be confused if it says "no microphone", the moment you click the record button it will request browser permission and then start working.

I spoke fast and dropped in some jargon and it got it all right - I said this and it transcribed it exactly right, WebAssembly spelling included:

> Can you tell me about RSS and Atom and the role of CSP headers in browser security, especially if you're using WebAssembly?

Oras · 5 days ago
Thank you for the link! Their playground in Mistral does not have a microphone. it just uploads files, which does not demonstrate the speed and accuracy, but the link you shared does.

I tried speaking in 2 languages at once, and it picked it up correctly. Truly impressive for real-time.

Oras commented on Anthropic Claude Max $200/mo: They claim 99% uptime, I calculated 84% Loss: $780   gist.github.com/LEX8888/0... · Posted by u/Nerios
Oras · 5 days ago
Before anyone jumps to conclusions and wastes time, the OP's account is new, github repo is empty https://github.com/LEX8888

All gists smell like AI-generated.

You're _probably_ going to reply to a bot.

Sad to see this on the HN front page.

Oras commented on Xcode 26.3 – Developers can leverage coding agents directly in Xcode   apple.com/newsroom/2026/0... · Posted by u/davidbarker
Oras · 5 days ago
As MKBHD would say, welcome to 2026, Apple.
Oras commented on Ask HN: How much does ATS parsing penalize modern CV layouts?    · Posted by u/ATSPASSKIT
Oras · 6 days ago
Depends on which ATS and what you mean by `preserve information`.

Sophisticated ATSs use CV parsers such as Text Kernel, Rchili, and Dextra.

They don't just parse; they also return structured data from the CV, such as personal information, skills, work history, and dates.

Even for LLMs, I wrote a CV parser that uses Mistral OCR to extract the text and an LLM to structure the data, with great success, even for multilingual CVs.

Oras commented on Claude Code is suddenly everywhere inside Microsoft   theverge.com/tech/865689/... · Posted by u/Anon84
kemotep · 7 days ago
Microsoft really needs to get a better handle with the naming conventions.

There is Microsoft Copilot, which replaced Bing Chat, Cortana and uses OpenAI’s GPT-4 and 5 models.

There is Github Copilot, the coding autocomplete tool.

There is Microsoft 365 Copilot, what they now call Office with built in GenAI stuff.

There is also a Copilot cli that lets you use whatever agent/model backend you want too?

Everything is Copilot. Laptops sell with Copilot buttons now.

It is not immediately clear what version of Copilot someone is talking about. 99% of my experience is with the Office and it 100% fails to do the thing it was advertised to do 2 years ago when work initially got the subscription. Point it a SharePoint/OneDrive location, a handful of excel spreadsheets and pdfs/word docs and tell it to make a PowerPoint presentation based on that information.

It cannot do this. It will spit out nonsense. You have to hold it by the hand tell it everything to do step by step to the point that making the PowerPoint presentation yourself is significantly faster because you don’t have to type out a bunch of prompts and edit it’s garbage output.

And now it’s clear they aren’t even dogfooding their own LLM products so why should anyone pay for Copilot?

Oras · 6 days ago
You need to see how many times they changed AI related services in Azure. It’s a shit show.

u/Oras

KarmaCake day1775January 24, 2014View Original