Readit News
shekhar101 commented on Nvidia's new 'robot brain' goes on sale for $3,499   cnbc.com/2025/08/25/nvidi... · Posted by u/tiahura
shekhar101 · 8 days ago
I was reading the Xiaomi YU7 marketing page[0] yesterday and the NVIDIA AGX Thor stood out (it says: NVIDIA DRIVE AGX Thor). I was wondering what it was, and then this showed up! Looks like it (or a Drive variant of it) is already being used in newer cars for self-driving and such. [0] https://www.mi.com/global/discover/article?id=5174
shekhar101 commented on Gemma 3 270M re-implemented in pure PyTorch for local tinkering   github.com/rasbt/LLMs-fro... · Posted by u/ModelForge
shekhar101 · 13 days ago
Can someone (or OP) point me to a recipe to fine-tune a model like this for natural language tasks like complicated NER or similar workflows? I tried fine-tuning Gemma3 270M when it came out last week without any success. A lot of tutorials are geared towards chat applications and role playing, but I feel this model could be great for use cases like mine, where I am trying to clean up and extract data from PDFs with entity identification and such.
shekhar101 commented on Show HN: OWhisper – Ollama for realtime speech-to-text   docs.hyprnote.com/owhispe... · Posted by u/yujonglee
yujonglee · 19 days ago
Happy to answer any questions!

Here is the list of local models it supports:

- whisper-cpp-base-q8

- whisper-cpp-base-q8-en

- whisper-cpp-tiny-q8

- whisper-cpp-tiny-q8-en

- whisper-cpp-small-q8

- whisper-cpp-small-q8-en

- whisper-cpp-large-turbo-q8

- moonshine-onnx-tiny

- moonshine-onnx-tiny-q4

- moonshine-onnx-tiny-q8

- moonshine-onnx-base

- moonshine-onnx-base-q4

- moonshine-onnx-base-q8

shekhar101 · 19 days ago
FYI: "owhisper pull whisper-cpp-large-turbo-q8" fails with "Failed to download model.ggml: Other error: Server does not support range requests. Got status: 200 OK"

But the base-q8 works (and works quite well!). The TUI is really nice. Speaker diarization would make it almost perfect for me. Thanks for building this.
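
For anyone wondering what that error means: a client that resumes or chunks downloads usually sends a Range header and expects "206 Partial Content" back. A plain "200 OK" means the server ignored the range and is sending the whole file. Below is a minimal sketch of that check in Python. It is not how owhisper does it internally (I have not read its source); supports_range and probe are hypothetical helper names, and the URL you pass in is up to you.

```python
import urllib.request


def supports_range(status: int, headers: dict[str, str]) -> bool:
    """Return True if a response to a ranged request honored the range.

    Hypothetical helper: a server that honors ranges answers with
    206 Partial Content and a Content-Range header. A 200 OK means
    the Range header was ignored and the full body is being sent.
    """
    normalized = {key.lower(): value for key, value in headers.items()}
    return status == 206 and "content-range" in normalized


def probe(url: str) -> bool:
    """Ask for the first byte only and report whether the range was honored."""
    request = urllib.request.Request(url, headers={"Range": "bytes=0-0"})
    with urllib.request.urlopen(request) as response:
        return supports_range(response.status, dict(response.headers.items()))
```

Calling probe on the model URL would show whether the host (or a proxy or CDN in front of it) drops range support for that particular file.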

shekhar101 commented on Show HN: I built a playground to showcase what Flux Kontext is good at   fluxkontextlab.com... · Posted by u/Zephyrion
shekhar101 · 2 months ago
I tried a picture with instructions and it says "something went wrong". I would love to try and see how well it works for my use case.
shekhar101 commented on Show HN: Index – New Open Source browser agent   github.com/lmnr-ai/index... · Posted by u/skull8888888
skull8888888 · 4 months ago
Gemini 2.5 pro is available. Is it missing on your side? Do you run index via CLI?
shekhar101 · 4 months ago
Yes, it is. However, API keys from aistudio only allow the pro-experimental model. So if I select gemini-pro, I see this: "Gemini 2.5 Pro Preview doesn't have a free quota tier. Please use Gemini 2.5 Pro Experimental (models/gemini-2.5-pro-exp-03-25) instead". Can I choose the exact model somewhere in the CLI?
shekhar101 commented on Show HN: Index – New Open Source browser agent   github.com/lmnr-ai/index... · Posted by u/skull8888888
shekhar101 · 4 months ago
Can you open up the option to use other models/versions, especially the Gemini 2.5 Pro experimental models available through aistudio? Would love to try this, but Gemini Flash fails at even simple tasks. Example: I asked it to extract all the links from the comment section of a Hacker News post, and it just scrolled all the way to the end and then did nothing. Maybe the pro models can do it better.
shekhar101 commented on Show HN: A website that heatmaps your city based on your housing preferences   theretowhere.com/... · Posted by u/WiggleGuy
shekhar101 · 7 months ago
This is really cool (and timely for me). Lovely work with the UX. No accounts, no nonsense. Kudos.

u/shekhar101

Karma: 734 · Cake day: July 24, 2013