As for Datastar, all the signal and state stuff seems to me like a step in the wrong direction.
None of this could be justified without the war on drugs. Plus we'd get rid of the prime motivator for a lot of the world's crime.
Dynamic typing is why I eventually switched. Haskell scratches the same itches that Clojure did, but the compiler and type system are immensely helpful, and keep saving me from tripping over my own feet.
Assuming that everybody has the luxury of offering additional office hours is a bit removed from reality, in my opinion.
I mean, if you can offer office hours for an open source project, you're probably already popular enough to work on it full time, right? And if you're not that popular, you can't offer office hours because of your day job; you'd have to cut into what little free time remains for sleeping and eating.
Are you certain? I'd think most open source development is done by paid developers, on company time.
And people will no longer need a duplicate set of libraries on their machines, all the way down to the gfx drivers. New software, and old stuff with 64-bit support, should have far fewer compatibility issues.
As much as I'd love true competition in the GPU/CPU space, it doesn't exist. AMD's cards simply cannot compete with Nvidia in GPGPU scenarios, and even in basic use they often have known heat/perf issues. That may be worth it for now to make a statement against the telemetry, but what if (less "if" and more "when", IMO) AMD then adds driver telemetry? And then Intel?
These domains (chip manufacture, GPU driver writing) are so advanced at this point that I don't see how a competitor could reasonably disrupt an incumbent over anything less than a Samsung-grade failure (and even then, probably not). I'm concerned about the long term, in which the producers realize this and, through a combination of boiling the frog slowly and leaving consumers no other choice, put themselves in a position of "pragmatic monopoly" with free rein over our machines. (I've always wondered what would happen in an antitrust sense if it's "we're the only producer not because we WANT to be, but because we're the only ones who CAN.")
We've certainly seen this happening with OSes, as well as in some attempts by PC OEMs. I've always thought, unfortunately, that it was just a matter of time until the more irreplaceable components got into the game too. I'd love some creative thinking on how to actually stop the trend rather than just stand in its way, because I'm not sure we'll win that latter battle.
EDIT: as a child post points out, I completely forgot to mention drivers, which strongly support my "we don't have many options" thesis. AMD's Linux support has historically lagged NVIDIA's, which makes it a non-starter in many cases.
As for drivers, AMD has pledged to open source their Vulkan and OpenCL implementations. While that release has been pending "legal review" forever now, alternative open source drivers are making great progress thanks to Vulkan's simpler driver model. NVIDIA has generally had the "better" driver, both from adhering less strictly to the spec and from having the manpower to routinely fix application bugs in-driver, but that's changing now that Vulkan/DX12 sit significantly closer to the hardware.
I'd say things are looking up for market balance.
I have a 16GB GPU as well, but have never run a local model so far. According to the table in the article, 9B at 8-bit (13 GB) and 27B at 3-bit both seem to fit in memory. Or is there more space required for context etc.?
Inference engines like llama.cpp will offload model and context to system RAM for you, at the cost of performance. A MoE like 35B-A3B might serve you better than the ones mentioned, even if it doesn't fit entirely on the GPU. I suggest testing all three. Perhaps even 122-A10B if you have plenty of system RAM.
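For a quick sanity check on whether the weights alone fit, a back-of-the-envelope sketch (my own, not from the article; the 10% overhead factor is a guess to cover embeddings and compute buffers, and real GGUF files vary by quant scheme):

```python
def weights_gb(params_billion, bits_per_weight, overhead=1.1):
    """Rough weight size in GB: params * bits/8, times an assumed
    ~10% overhead for embeddings and buffers (a guess, not exact)."""
    return params_billion * bits_per_weight / 8 * overhead

print(round(weights_gb(27, 3), 1))  # 27B at 3 bpw, weights only
print(round(weights_gb(9, 8), 1))   # 9B at 8 bpw, weights only
```

Note this comes out a couple of GB under the article's table for the 9B/8-bit case, which is consistent with context and buffers taking up the difference.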
Q4 is a common baseline for simple tasks on local models. I like to step up to Q5/Q6 for anything involving tool use on the smallish models I can run (9B and 35B-A3B).
Larger models tolerate lower quants better than small ones: 27B might be usable at 3 bpw where 9B or 4B wouldn't be. You can also quantize the context. On llama.cpp you'd set the flags -fa on, -ctk x, and -ctv y; run with -h to see valid values. K is more sensitive to quantization than V, so don't bother lowering it past q8_0. KV quantization is allegedly broken for Qwen 3.5 right now, but I can't tell.
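For the "context etc." question upthread, the KV cache is the main extra cost, and quantizing it is exactly what those flags trade on. A sketch of the standard formula (the layer/head numbers below are hypothetical placeholders for a 9B-class model; check your model's actual config):

```python
def kv_cache_gb(n_layers, n_kv_heads, head_dim, ctx_len, bytes_per_elt=2.0):
    """KV cache size: K and V each store n_layers * n_kv_heads * head_dim
    elements per token. bytes_per_elt is 2 for f16, roughly 1 for q8_0."""
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elt / 1e9

# Hypothetical config: 42 layers, 8 KV heads, head_dim 128, 8k context.
full = kv_cache_gb(42, 8, 128, 8192)        # f16 cache
quant = kv_cache_gb(42, 8, 128, 8192, 1.0)  # both K and V at ~q8_0
print(round(full, 2), round(quant, 2))
```

So q8_0 on both halves the cache, and longer contexts scale it linearly, which is why the cache starts to matter once you're near the VRAM limit.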