Metricon (u/Metricon)

Metricon commented on Wikipedia loses challenge against Online Safety Act bbc.com/news/articles/cjr... · Posted by u/phlummox

extraisland · 23 days ago

People constantly cite this poll as it is proof that British people want this.

You cannot trust the YouGov polling. It is flawed.

> Despite the sophisticated methodology, the main drawback faced by YouGov, Ashcroft, and other UK pollsters is their recruitment strategy: pollsters generally recruit potential respondents via self-selected internet panels. The American Association of Public Opinion Research cautions that pollsters should avoid gathering panels like this because they can be unrepresentative of the electorate as a whole. The British Polling Council’s inquiry into the industry’s 2015 failings raised similar concerns. Trying to deal with these sample biases is one of the motivations behind YouGov and Ashcroft’s adoption of the modelling strategies discussed above.

https://theconversation.com/its-sophisticated-but-can-you-be...

Even if the aforementioned problems didn't exist with the polling. It has been known for quite a while that how you ask a question changes the results. The question you linked was the following.

> From everything you have seen and heard, do you support or oppose the recent rules requiring age verification to access websites that may contain pornographic material?

Most people would think "age verification to view pornography". They won't think about all the other things that maybe caught in that net.

Metricon · 23 days ago

It seems like some things always remain the same: https://www.youtube.com/watch?v=G0ZZJXw4MTA

Metricon commented on ChatGPT agent: bridging research and action openai.com/index/introduc... · Posted by u/Topfi

polytely · 2 months ago

Is there anyone trying to solve OCR, I often think of that annas-archive blog about how we basically just have to keep shadow libraries alive long enough until the conversion from pdf to plaintext is solved.

https://annas-archive.org/blog/critical-window.html

I hope one of these days one of these incredibly rich LLM companies accidentally solves this or something, would be infinitely more beneficial to mankind than the awful LLM products they are trying to make

Metricon · 2 months ago

You may want to have a look at Mistral OCR: https://mistral.ai/news/mistral-ocr

Metricon commented on Orpheus-3B – Emotive TTS by Canopy Labs canopylabs.ai/model-relea... · Posted by u/Zetaphor

thot_experiment · 5 months ago

got it, so inference in llama.cpp server won't actually get me any audio directly

Metricon · 5 months ago

If you run the `gguf_orpheus.py` file in that repository, it will capture the audio tokens and convert them to a .wav file. With a little more work, you can feed the streaming audio directly using `sounddevice` and `OutputStream`

On a Nvidia 4090, it's producing:

  prompt eval time =      17.93 ms /    24 tokens (    0.75 ms per token,  1338.39 tokens per second)

         eval time =    2382.95 ms /   421 tokens (    5.66 ms per token,   176.67 tokens per second)

        total time =    2400.89 ms /   445 tokens

*A Correction to the llama.cpp server command above, there are 29 layers so it should read "-ngl 29" to load all the layers to the GPU.

Metricon commented on Orpheus-3B – Emotive TTS by Canopy Labs canopylabs.ai/model-relea... · Posted by u/Zetaphor

Metricon · 5 months ago

GGUF version created by "isaiahbjork" which is compatible with LM Studio and llama.cpp server at: https://github.com/isaiahbjork/orpheus-tts-local/

To run llama.cpp server: llama-server -m C:\orpheus-3b-0.1-ft-q4_k_m.gguf -c 8192 -ngl 28 --host 0.0.0.0 --port 1234 --cache-type-k q8_0 --cache-type-v q8_0 -fa --mlock

Metricon commented on Automated reasoning to remove LLM hallucinations aws.amazon.com/blogs/aws/... · Posted by u/rustastra

nl · 9 months ago

I don't see why this is hilarious at all.

The problem with expert systems (and most KG-type applications) has always been that translating unconstrained natural language into the system requires human-level intelligence.

It's been completely obvious that LLMs are a technology that let us bridge that gap for years, and many of the best applications of LLMs are doing exactly that (eg code generation)

Metricon · 9 months ago

To be clear, my amusement isn't that I find this technique to not be useful for the purpose it was created, but that 40 years later, we find ourselves in pursuit for the advancement of AI to be somewhat back where we already were; albeit, in a more semi-automated fashion as someone still has to create the underlying rule-set.

I do feel that the introduction of generative neural network models in both natural language and multi-media creation has been a tremendous boon for the advancement of AI, it just amuses me to see that which was old is new again.

Metricon commented on Automated reasoning to remove LLM hallucinations aws.amazon.com/blogs/aws/... · Posted by u/rustastra

Metricon · 9 months ago

This amuses me tremendously. I began programming in the early 1980s and quickly developed an interest in Artificial Intelligence. At the time there was a great interest in the advancement of AI by the introduction of "Expert Systems" (which would later play a part in the ‘Second AI Winter’).

What Amazon appears to have done here is use a transformers based neural network (aka LLM) to translate natural language into symbolic logic rules which are collectively used together in what could be identified as an Expert System.

Full Circle. Hilarious.

For reference to those on the younger side: The Computer Chronicles (1984) https://www.youtube.com/watch?v=_S3m0V_ZF_Q

Metricon commented on Nearly all of the Google images results for "baby peacock" are AI generated twitter.com/notengoprisa/... · Posted by u/jsheard

Metricon · a year ago

There are a number of ways this might get solved, but I would speculate that it will generally be solved by adding image metadata that is signed by a certificate authority similar to the way SSL certificates are assigned to domains.

I think eventually all digital cameras and image scanners will securely hash and sign images just as forensic cameras do to certify that an image was "captured" instead of generated.

Of course this leaves a grey area for image editing applications such as Photoshop, so there may also need to be some other level of certificate base signing introduced there as well.

Metricon commented on The lifecycle of a code AI completion sourcegraph.com/blog/the-... · Posted by u/tosh

ForHackernews · a year ago

> Congratulations, you just wrote a code completion AI!

> In fact, this is pretty much how we started out with Cody autocomplete back in March!

Am I wrong in thinking that there's only like 3(?) actual AI companies and everything else is just some frontend to ChatGPT/LLama/Claude?

Is this sustainable? I guess the car industry is full of rebadged models with the same engines and chassis. It's just wild that we keep hearing about the AI boom as though there's a vibrant competitive ecosystem and not just Nvidia, a couple of software partners and then a sea of whiteboxers.

Metricon · a year ago

For those who might not be aware of this, there is also an open source project on GitHub called "Twinny" which is an offline Visual Studio Code plugin equivalent to Copilot: https://github.com/rjmacarthy/twinny

It can be used with a number of local model services. Currently for my setup on a NVIDIA 4090, I'm running both the base and instruct model for deepseek-coder 6.7b using 5_K_M Quantization GGUF files (for performance) through llama.cpp "server" where the base model is for completions and the instruct model for chat interactions.

llama.cpp: https://github.com/ggerganov/llama.cpp/

deepseek-coder 6.7b base GGUF files: https://huggingface.co/TheBloke/deepseek-coder-6.7B-base-GGU...

deepseek-coder 6.7b instruct GGUF files: https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct...

Metricon commented on 40 Years of Programming liw.fi/40/... · Posted by u/janvdberg

Metricon · a year ago

Started 1982 on Tandy Color Computer. Still at it.

#1 Advice - Focus on getting things done (as many will not) and Lego build interconnected/isolated simplicity as much as possible.

Metricon commented on ChatGPT for Teams openai.com/chatgpt/team... · Posted by u/szermer

yawnxyz · 2 years ago

Looks like the $20/month PLUS plan DOES use your data to train the model now... (they seem to have removed that "feature" from the list in the side-by-side comparison)

Metricon · 2 years ago

Currently, if you disable chat history, you'll see this message:

Chat History is off for this browser. When history is turned off, new chats on this browser won't appear in your history on any of your devices, be used to train our models, or stored for longer than 30 days. This setting does not sync across browsers or devices.