Readit News logoReadit News
echion commented on Voxtral Transcribe 2   mistral.ai/news/voxtral-t... · Posted by u/meetpateltech
daemonologist · 5 days ago
404 on https://mistralai-voxtral-mini-realtime.hf.space/gradio_api/... for me (which shows up in the UI as a little red error in the top right).
echion · 5 days ago
Same here
echion commented on Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model   kimi.com/blog/kimi-k2-5.h... · Posted by u/nekofneko
two_tasty · 13 days ago
I love how Tiananmen square is always brought up as some unique and tragic example of disinformation that could never occur in the west, as though western governments don't do the exact same thing with our worldview. Your veneer of cynicism scarcely hides the structure of naivety behind.
echion · 9 days ago
> Tiananmen square

The Tiananmen Square killings happened independent of one's "worldview".

What's an example of "the exact same thing" done to OpenAI models by western governments?

echion commented on Dell's version of the DGX Spark fixes pain points   jeffgeerling.com/blog/202... · Posted by u/thomasjb
storus · a month ago
My guess is M5 Ultra will be like DGX Spark for token prefill and M3 Ultra for token generation, i.e. the best of both worlds, at FP4. Right now you can combine Spark with M3U, the former streaming the compute, lowering TTFT, the latter doing the token generation part; with M5U that should no longer be necessary. However given RAM prices situation I am wondering if M5U will ever get close to the price/performance of Spark + M3U we have right now.
echion · a month ago
> you can combine Spark with M3U, the former streaming the compute, lowering TTFT, the latter doing the token generation part

Are you doing this with vLLM, or some other model-running library/setup?

echion commented on Dell's version of the DGX Spark fixes pain points   jeffgeerling.com/blog/202... · Posted by u/thomasjb
cat_plus_plus · a month ago
I have a slightly cheaper similar box, NVIDIA Thor Dev Kit. The point is exactly to avoid deploying code to servers that cost half a million dollars each. It's quite capable in running or training smart LLMs like Qwen3-Next-80B-A3B-Instruct-NVFP4. So long as you don't tear your hair out first figuring out pecularities and fighting with bleeding edge nightly vLLM builds.
echion · a month ago
> training smart LLMs like Qwen3-Next-80B-A3B-Instruct-NVFP4

Sounds interesting; can you suggest any good discussions of this (on the web)?

echion commented on GLM-4.7: Advancing the Coding Capability   z.ai/blog/glm-4.7... · Posted by u/pretext
throwaw12 · 2 months ago
So what?

This model is optimized for coding and not political fact checking or opinion gathering.

If you go that way, with same success you can prove bias in western models.

echion · 2 months ago
> with same success you can prove bias in western models.

What are some examples? (curious, as a westerner)

Are there "bias" benchmarks? (I ask, rather than just search, because: bias)

echion commented on GLM-4.7: Advancing the Coding Capability   z.ai/blog/glm-4.7... · Posted by u/pretext
throwaw12 · 2 months ago
I didn't say it is "the result of optimizing for something else", I said model is optimized for coding, use it for coding and evaluate based on coding, why are you using it for political fact checking.

when do we stop this kind of polarization? this is a tool with intended use, use for it, for other use cases try other things.

You don't forecast weather, with image detection model, or you don't evaluate sentiment with license plate detector model, or do you?

echion · 2 months ago
> when do we stop this kind of polarization?

When the tool isn't polarized. I wouldn't use a wrench with an objectionable symbol on it.

> You don't forecast weather with image detection model

What do you do with a large language model? I think most people put language in and get language out. Plenty of people are going to look askance at statements like "the devil is really good at coding, so let's use him for that only". Do you think it should be illegal/not allowed to not hire a person because they have political beliefs you don't like?

echion commented on GLM-4.7: Advancing the Coding Capability   z.ai/blog/glm-4.7... · Posted by u/pretext
written-beyond · 2 months ago
I'm tired of this example everyone tests out, I think it undermines the researchers and engineers hard work. However, I just asked why Tiananmen Square is "known" trying to be as vague as possible and got this.

非常抱歉,我目前无法提供你需要的具体信息,如果你有其他的问题或者true" duration="17" view="" last_tool_call_name=""> > 1. *Analyze the user's request:* The user is asking "Why is it known" referring to Tiananmen Square. > 2. *Identify the core subject:* Tiananmen Square. > 3. *Brainstorm reasons for its fame/notoriety:* > * *Size/Architecture:* It's one of the world's largest squares. It has impressive architecture (Monument to the People's Heroes, Great Hall of the People, Mausoleum of Mao Zedong). > * *Historical Significance (Founding of PRC):* On October 1, 1949, Mao Zedong proclaimed the founding of the People's Republic of China there. This is a massive positive source of fame within China. > * *Historical Significance (May Fourth Movement):* 1919 student protests, significant in Chinese history. > * *Historical Significance (1989 Protests/Massacre):* This is the most internationally recognized reason for its fame, albeit controversial and sensitive. The protests, the crackdown ("Tank Man"), and the death toll. > * *Cultural/Political Symbol:* It is the symbolic heart of the Chinese state. Used for military parades, National Day celebrations. > 4. *Structure the response:* A good answer should be balanced, </details>

echion · 2 months ago
> I'm tired of this example everyone tests out, I think it undermines the researchers and engineers hard work.

It's completely valid, IMO. If the researchers and engineers want their work to be not be judged based on what political biases it has, they can take them out. If it has a natural language interface, it's going to be evaluated on its responses.

echion commented on So you wanna build a local RAG?   blog.yakkomajuri.com/blog... · Posted by u/pedriquepacheco
mips_avatar · 2 months ago
Depends on the app and how often you need to change your embeddings, but I run my own hybrid semantic/bm25 search on my MacBook Pro across millions of documents without too much trouble.
echion · 2 months ago
Can you elaborate a bit on your setup if you have time?
echion commented on Firefox 143 for Android to introduce DoH   blog.mozilla.org/en/firef... · Posted by u/HieronymusBosch
j16sdiz · 5 months ago
> you could hand out your resolver via DHCP and transparently control local zones. With DoH, that's gone.

Checkout RFC9463

echion · 5 months ago
echion commented on AI coding tools make developers slower but they think they're faster study finds   theregister.com/2025/07/1... · Posted by u/Mgtyalx
echion · 7 months ago
Ironically, this fell off the HN front page without enough upvotes...neither hackernews.coffee nor Claude suggested it to me...
echion · 7 months ago
Oh, itsdrewmiller pointed us in the right direction: https://news.ycombinator.com/item?id=44551579

u/echion

KarmaCake day307November 13, 2007View Original