bluejay2387 (u/bluejay2387)

bluejay2387 commented on Vibe coding kills open source arxiv.org/abs/2601.15494... · Posted by u/kgwgk

bluejay2387 · 2 months ago

Does it seem to anyone else that author's have created a definition for 'vibe coding' that is specifically designed to justify their paper? Also that their premise is based on the assumption that developers will be irresponsible about the use of these tools ("often without users directly reading documentation, reporting bugs, or otherwise engaging with maintainers") so that it would actually be people killing open source not 'Vibe Coding'? Just a guess on my part, but once developers learn to use these tools and we get over the newness I think this will be great for open source. With these tools open source projects can compete with an army of corporate developers while alleviating some of the pressure on overworked under-rewarded maintainers.

bluejay2387 commented on Show HN: Gemini Pro 3 imagines the HN front page 10 years from now dosaygo-studio.github.io/... · Posted by u/keepamovin

jll29 · 3 months ago

AI professor here. I know this page is a joke, but in the interest of accuracy, a terminological comment: we don't call it a "hallucination" if a model complies exactly with what a prompt asked for and produces a prediction, exactly as requested.

Rater, "hallucinations" are spurious replacements of factual knowledge with fictional material caused by the use of statistical process (the pseudo random number generator used with the "temperature" parameter of neural transformers): token prediction without meaning representation.

[typo fixed]

bluejay2387 · 3 months ago

Want to second this. Asking the model to create a work of fiction and it complying isn't a pathology. Mozart wasn't "hallucinating" when he created "The Marriage of Figaro".

bluejay2387 commented on Show HN: Gemini Pro 3 imagines the HN front page 10 years from now dosaygo-studio.github.io/... · Posted by u/keepamovin

bluejay2387 · 3 months ago

THIS is probably the moment that the AI naysayers on this board wake up to the potential of current AI...

bluejay2387 commented on Ask HN: What Does Your Self-Hosted LLM Stack Look Like in 2025? · Posted by u/anditherobot

bluejay2387 · 9 months ago

2x 3090's running Ollama and VLLM... Ollama for most stuff and VLLM for the few models that I need to test that don't run on Ollama. Open Web UI as my primary interface. I just moved to Devstral for coding using the Continue plugin in VSCode. I use Qwen 3 32b for creative stuff and Flux Dev for images. Gemma 3 27b for most everything else (slightly less smart than Qwen, but its faster). Mixed Bread for embeddings (though apparently NV-Embed-v2 is better?). Pydantic as my main utility library. This is all for personal stuff. My stack at work is completely different and driven more by our Legal teams than technical decisions.

bluejay2387 commented on Reasoning models are just LLMs antirez.com/news/146... · Posted by u/rognjen

iknownothow · a year ago

Is there a terminology battle happening in some circles? And if so, what are the consequences of being wrong and using the wrong terminology?

I follow the rnd and progress in this space and I haven't heard anyone make a fuss about it. They are all LLMs or transformers or neural nets but they can be trained or optimized to do different things. For sure, there's terms like Reasoning models or Chat models or Instruct models and yes they're all LLMs.

But you can now start combining them to have hybrid models too. Are Omni models that handle audio and visual data still "language" models? This question is interesting in its own right for many reasons, but not to justify or bemoan the use of term LLM.

LLM is a good term, it's a cultural term too. If you start getting pedantic, you'll miss the bigger picture and possibly even the singularity ;)

bluejay2387 · a year ago

So there is a language war going on in the industry and some of its justified and some of its not. Take 'agents' as an example. I have seen an example of where a low code / no code service dropped in a LLM node in a 10+ year old product, started calling themselves an 'agent platform' and jacked up their price by a large margin. This is probably a case where a debate as to what qualifies as an 'agent' is appropriate.

Alternatively I have seen debates as to what counts as a 'Small Language Model' that probably are nonsensical. Particularly because in my personal language war the term 'small language model' shouldn't even exist (no one knows that the threshold is, and our 'small' language models are bigger than the 'large' language models from just a few years ago).

This is fairly typical of new technology. Marketing departments will constantly come up with new terms or try to take over existing terms to push agendas. Terms with defined meaning will get abused by casual participants and loose all real meaning. Individuals new to the field will latch on to popular misuses of terms as they try to figure out what everyone is talking about and perpetuate definition creep. Old hands will overly focus on hair splitting exercises that no one else really cares about and sigh in dismay as their carefully cultured taxonomies collapse under expansion of interest in their field.

It will all work itself out in 10 years or so.

bluejay2387 commented on Ask HN: Examples of agentic LLM systems in production? · Posted by u/SebaSeba

bluejay2387 · a year ago

As a side note, while I know of several language model based systems that have been deployed in companies, some companies don't want to talk about it:

1. Its still perceived as an issue of competitive advantage

2. There is a serious concern about backlash. The public's response to finding out that companies have used AI has often not been good (or even reasonable) -- particularly if there was worker replacement related to it.

It's a bit more complicated with "agents" as there are 4 or 5 competing definitions for what that actually means. No one is really sure what an 'agentic' system is right now.

bluejay2387 commented on Study: Dark matter doesn't exist, the universe is 27B years old earth.com/news/study-dark... · Posted by u/msolujic

bluejay2387 · a year ago

Could somebody else take a turn posting the related XKCD comic? I did it last time.

bluejay2387 commented on Reflection 70B, the top open-source model twitter.com/mattshumer_/s... · Posted by u/GavCo

rgbrgb · 2 years ago

> Personally, the comparison to Claude and 4o doesn't quite seem apples-to-apples. If you were to have 4o/Claude take multiple rounds to review and reflect on their initial drafts, would we see similar gains? I suspect they would improve massively as well.

They may already implement this technique, we can't know.

bluejay2387 · 2 years ago

I am not sure, but you seem to be implying that the Reflection model is running through multiple rounds? If so, that is not what is happening here. The token generation is still linear next token prediction. It does not require multiple rounds to generate the chain of thought response. It does that in one query pass.

I have been testing the model for the last few hours and it does seem to be an improvement on LLAMA 3.1 upon which it is based. I have not tried to compare it to Claude or GPT4o because I don't expect a 70b model to outperform models of that class no matter how good it is. I would happy to be wrong though...

bluejay2387 commented on Microsoft Account to local account conversion guide erased from Windows 11 guide tomshardware.com/software... · Posted by u/belter

CoastalCoder · 2 years ago

FWIW, I found Pop_OS to provide pretty (very?) up-to-date Nvidia drivers via the pop_os repos.

So I haven't found any reason to use Nvidia's installers.

bluejay2387 · 2 years ago

Yeah I didn't have any problems with this on Pop or Ubuntu. I guess its a Debian issue. I might end up moving over to a different distro at some point, but I finally have everything working on Debian so I am hesitant to switch right now.

bluejay2387 commented on Microsoft Account to local account conversion guide erased from Windows 11 guide tomshardware.com/software... · Posted by u/belter

neilv · 2 years ago

Debian Stable (Linux) welcomes escapees from lobster boiling:

https://cdimage.debian.org/debian-cd/current/amd64/iso-dvd/

bluejay2387 · 2 years ago

Due to all this mess with Recall, force account log-ins, spamming my desktop with crap all the time, not being able to to disable telemetry, confusing privacy settings spread all through out the OS... I finally made the call two weeks ago to get rid of Windows after decades of using the Microsoft OS. I tried Pop, Ubuntu, Mint... all decent options but settled on Debian for now. It's been a slog of a two weeks and one massive learning curve, but everything is now setup and working great. I have 100% parity with my previous Windows install and it was a freeing experience being able to delete my Windows partition. My biggest problem was with video drivers. I have some utilities that require at least Nvidia 535 and Debian for some reason I can't fathom only supports 525 (obsolete by Nvidia's indication). All of the advice in Debian related forums was "don't go with proprietary install scripts" which was flat out wrong. I don't know what is causing this brain failure on the part of the entire Debian community when it comes to drivers, but they need to fix that. No need to run the latest and greatest, but when the only option is a driver that is marked as obsolete and won't run a lot of software, it needs to be addressed.

The pleasant surprise has been games. I thought I would have to abandon gaming or keep a second Windows partition, but so far all the games I have tried have run 100% -- even though it took some minimal tweaking in some cases. V-Rising, Elder Scrolls Online, New World, RimWorld... all work as well as on Windows thanks to Steam and Proton. (Rimworld required one change in the config .ini to support my super ultra wide monitor, ESO had to be manually imported into Steam. V-Rising required installing the Proton-GE version to address a problem with cut scenes). It's a bit tedious to have to address small problems like that, but more than worth it to get rid of an OS that I feel is constantly trying to attack me.

I am moving my wife to Zorin next. I can't recommend Debian to most people that just want to use a desktop. It was difficult for me and I have decades of experience in running Linux servers. I will probably stick with Debian as its working great now, but too many things were too hard to make it an option for most desktop users I would imagine. I can recommend ditching Windows for some flavor of Linux.