Readit News logoReadit News
techwizrd commented on Intellectual Junkyards   forester-notes.org/QHXS/i... · Posted by u/ysangkok
Analemma_ · a month ago
I'm also not sure if I fully get what the author is going on about, but at least part of it seems to be "don't over-taxonomize and over-architect your note-taking and knowledge management systems, locking yourself into an inflexible format/schema too early just kills it in the long run."

If I'm correct that that's part of the thrust of the article (and I may not be), then I definitely agree with the author. The first time I tried to use Obsidian I burned out because I went all-in on the bi-di linking, tagging, knowledge graph, etc., and it quickly killed my motivation. Now I just dump text in and rely on search to find what I need, only adding links in retrospect once they are needed, and now I actually use it and get value from it.

techwizrd · a month ago
I had this same issue early on when trying to adopt Obsidian. I was overwhelmed by all the "systems" and I was worried I was creating a headache for myself later on. Now I just focus on dumping text in, using search, and linking only as needed. Basically don't overdo it.
techwizrd commented on Volvo Centum is Dalton Maag's new typeface for Volvo   wallpaper.com/design-inte... · Posted by u/ohjeez
herpdyderp · 2 months ago
This is not a legible font. You can clearly see they did not distinguish uppercase o and 0 (zero) at all. Uppercase i and lowercase L are barely distinguishable. Classic font blunders.
techwizrd · 2 months ago
That makes sense for code or technical text, but it is less relevant for car UIs. In an infotainment system you almost never see ambiguous strings where O vs 0 or I vs l matters. Everything is highly contextual, short, and glance-based. These fonts are tuned for distance, motion, glare, and quick recognition, not for reading arbitrary identifiers. If it tested poorly in real driving conditions that would be a real problem, but judging it by programmer font rules feels like the wrong yardstick.
techwizrd commented on A guide to local coding models   aiforswes.com/p/you-dont-... · Posted by u/mpweiher
kristopolous · 2 months ago
I use local models + openrouter free ones.

My monthly spend on ai models is < $1

I'm not cheap, just ahead of the curve. With the collapse in inference cost, everything will be this eventually

I'll basically do

    $ man tool | <how do I do this with the tool>
or even

    $ cat source | <find the flags and give me some documentation on how to use this>
Things I used to do intensively I now do lazily.

I've even made a IEITYuan/Yuan-embedding-2.0-en database of my manpages with chroma and then I can just ask my local documentation how I do something conceptually, get the man pages, inject them into local qwen context window using my mansnip llm preprocessor, forward the prompt and then get usable real results.

In practice it's this:

    $ what-man "some obscure question about nfs" 
    ...chug chug chug (about 5 seconds)...

    <answer with citations back to the doc pages>
Essentially I'm not asking the models to think, just do NLP and process text. They can do that really reliably.

It helps combat a frequent tendency for documentation authors to bury the most common and useful flags deep in the documentation and lead with those that were most challenging or interesting to program instead.

I understand the inclination it's just not all that helpful for me

techwizrd · 2 months ago
Have you looked at tldr/tealdeer[0]? It may do much of what you're looking for, albeit without LLM assistance.

0: https://tealdeer-rs.github.io/tealdeer/

techwizrd commented on Making Google Sans Flex   design.google/library/goo... · Posted by u/meetpateltech
the_gipsy · 2 months ago
Cool work on a font, but this page is proof that google is turning the web into some kind of JSON for their app, Chrome.

Extremely sluggish on non-Chrome. Starts with a black blank empty page. Fans spinning. Takes way too long to load for just some text and some videos. Clicking a link does some SPA magic that takes me to another black blank page, and takes ages to load. Clicking back doesn't work anymore. I need to reload the entire page, again blank and waiting. Once done loading, scrolling is extremely sluggish.

Yes, there are probably some interactive widgets in there, but all that and much more has been done without bogging down the browser like you're running a 3D game on WebGL.

Oh, and of course reader mode doesn't work.

techwizrd · 2 months ago
I had no issues using Firefox on a 2021 M1 Pro or my Framework 13. Reader mode does not work, however.
techwizrd commented on Minimum Viable Arduino Project: Aeropress Timer   netninja.com/2025/12/01/m... · Posted by u/surprisetalk
esperent · 2 months ago
The idea is that the more pressure you use, the more unwanted materials like fines and oils will get though the paper. So if you press slowly and stop when you hear a hiss, you should have a better brew.

It does make sense, if imagine pressing through in 5 seconds vs 30 seconds, that the paper filtration would work better in the slower press. But I'm not sure if anyone has scientifically measured this.

Actually wait, it's coffee. Someone has definitely scientifically measured it and probably published a two hour YouTube video with their results.

techwizrd · 2 months ago
I've had good results from the James Hoffman recipe [0], although I brew inverted. You can push the plunger down with just the weight of resting your arm on the plunger. For something very different, you can brew something not-quite-espresso using the Fellow Prismo cap for the Aeropress.

0: https://www.youtube.com/watch?v=j6VlT_jUVPc

techwizrd commented on My 2.5 year old laptop can write Space Invaders in JavaScript now (GLM-4.5 Air)   simonwillison.net/2025/Ju... · Posted by u/simonw
genewitch · 7 months ago
I'll bite. How do i train/make and/or use LoRA, or, separately, how do i fine-tune? I've been asking this for months, and no one has a decent answer. websearch on my end is seo/geo-spam, with no real instructions.

I know how to make an SD LoRA, and use it. I've known how to do that for 2 years. So what's the big secret about LLM LoRA?

techwizrd · 7 months ago
We have been fine-tuning models using Axolotl and Unsloth, with a slight preference for Axolotl. Check out the docs [0] and fine-tune or quantize your first model. There is a lot to be learned in this space, but it's exciting.

0: https://axolotl.ai/ and https://docs.axolotl.ai/

techwizrd commented on GitDroid: A third party Android app manager for apps uploaded to GitHub releases   github.com/TechnicJelle/G... · Posted by u/amadeuspagel
Cyphase · 9 months ago
Unobtanium supports more sources, has more features, and is actively developed.

https://obtainium.imranr.dev/

https://github.com/ImranR98/Obtainium

techwizrd · 9 months ago
This is what I use, and it works well. It's very straightforward to add apps and automatically update them as new releases are pushed to Github or wherever they are hosted.
techwizrd commented on Notetime: Minimalistic notes where everything is timestamped   notetimeapp.com... · Posted by u/gohberg
techwizrd · a year ago
This idea feels a little like bullet journaling or logseq [0] to me. For what it's worth, I do this in Obsidian and clean-up my thoughts on a regular basis. It hits the right balance of minimalism and usefulness for me.

0: https://logseq.com/

techwizrd commented on Ladder: Self-improving LLMs through recursive problem decomposition   arxiv.org/abs/2503.00735... · Posted by u/fofoz
isaacfrond · a year ago
Reminds me of a quote by famous number theoretic mathematician Hendrik Lenstra:

For every problem you can't solve, there's a simpler problem that you also can't solve.

techwizrd · a year ago
Is this quote real? I'm familiar with George Pólya's, "If you cannot solve the proposed problem, try to solve first a simpler related problem" but I cannot find any source for the Lenstra quote.
techwizrd commented on Mistral OCR   mistral.ai/fr/news/mistra... · Posted by u/littlemerman
kbyatnal · a year ago
We're approaching the point where OCR becomes "solved" — very exciting! Any legacy vendors providing pure OCR are going to get steamrolled by these VLMs.

However IMO, there's still a large gap for businesses in going from raw OCR outputs —> document processing deployed in prod for mission-critical use cases. LLMs and VLMs aren't magic, and anyone who goes in expecting 100% automation is in for a surprise.

You still need to build and label datasets, orchestrate pipelines (classify -> split -> extract), detect uncertainty and correct with human-in-the-loop, fine-tune, and a lot more. You can certainly get close to full automation over time, but it's going to take time and effort. But the future is on the horizon!

Disclaimer: I started a LLM doc processing company to help companies solve problems in this space (https://extend.app/)

techwizrd · a year ago
The challenge I have is how to get bounding boxes for the OCR, for things like redaction/de-identification.

u/techwizrd

KarmaCake day1184September 25, 2013
About
AI/ML, Aviation Safety, and Aeromedical Safety at MITRE

[ my public key: https://keybase.io/ksarkhel; my proof: https://keybase.io/ksarkhel/sigs/2C5BZyjsFh9eLr4vNe6-e6IVbpXEE0Edm4UuJ1rBcAA ]

Fediverse/ActivityPub: Mastodon: @kksarkhel@mastodon.social Lemmy: @techwizrd@programming.dev

View Original