Readit News logoReadit News
dgut commented on Asking 60 LLMs a set of 20 questions   benchmarks.llmonitor.com... · Posted by u/vincelt
typpo · 2 years ago
In case anyone's interested in running their own benchmark across many LLMs, I've built a generic harness for this at https://github.com/promptfoo/promptfoo.

I encourage people considering LLM applications to test the models on their _own data and examples_ rather than extrapolating general benchmarks.

This library supports OpenAI, Anthropic, Google, Llama and Codellama, any model on Replicate, and any model on Ollama, etc. out of the box. As an example, I wrote up an example benchmark comparing GPT model censorship with Llama models here: https://promptfoo.dev/docs/guides/llama2-uncensored-benchmar.... Hope this helps someone.

dgut · 2 years ago
This is impressive. Good work.

Deleted Comment

dgut commented on Positive association between altitude and suicide in U.S. counties   ncbi.nlm.nih.gov/pmc/arti... · Posted by u/KoftaBob
boesboes · 2 years ago
Yeah, ppl on the ring road of Amsterdam live 10 years shorter afaik, for instance
dgut · 2 years ago
Source? As a counter anecdote: Madrid, where most people live near traffic, has the highest life expectancy in the EU at 85.4 years.
dgut commented on Sweden launches research rocket, accidentally hits Norway   reuters.com/world/europe/... · Posted by u/perihelions
perihelions · 2 years ago
If the US military invades IKEA they will *never* figure out how to exit.
dgut · 2 years ago
I'm pretty sure IKEA has been legally Dutch for a while now.
dgut commented on Now ChatGPT is being (mis)used to do PeerReview   mstdn.science/@ukrio/1101... · Posted by u/doener
dgut · 2 years ago
Peer-reviews have been broken way before GPT. People are expected to do it for free in a publish or perish academia...
dgut commented on The grotesque side of Leonardo da Vinci   theguardian.com/artanddes... · Posted by u/prismatic
IgorPartola · 2 years ago
By comparison, when I saw a few of Donatello’s sculptures I was surprised by how poorly he was able to convey faces compares to other artists. I am no art historian, I am just a big fan of TMNT :)
dgut · 2 years ago
Compared to his contemporaries? He is possibly the most influential sculptor of the Renaissance. Michelangelo wasn't even born yet when Donatello died.
dgut commented on We Found an Neuron in GPT-2   clementneo.com/posts/2023... · Posted by u/todsacerdoti
IIAOPSW · 3 years ago
That is not what universal grammar was about. Literally read even the wikipedia page or watch like any lecture on the topic.
dgut · 3 years ago
What exactly did I write that is wrong? UG argues a universal grammar exists, and that humans innately posses knowledge about this grammar. It's this grammar that enables humans to learn a language (according to UG). They (UG linguists) have created a system of syntax rules that attempts to describe any language, but failed at doing so once they step outside of Indo-European languages. This is partly because Chomsky was hired by MIT to solve machine translation and putting language into a set of neat boxes was his best idea and partly because Chomsky himself had little knowledge of other languages. It's pseudoscience.
dgut commented on We Found an Neuron in GPT-2   clementneo.com/posts/2023... · Posted by u/todsacerdoti
LudwigNagasena · 3 years ago
It’s notable how successful LLMs despite the lack of any linguistic tools in their architectures. It would be interesting to know how different a model would be if it operated on eg dependency trees instead of the linear list of tokens. Surely, the question of “a/an” would be solved with ease as the model would be required to come up with a noun token before choosing its determiner. I wonder if the developers of LLMs explored those approaches but found them infeasible due to large preprocessing times, immaturity of such tools and/or little benefit.
dgut · 3 years ago
grammar as we know it was devised for the Latin language and linguists spend most of the time attempting to fit other languages into neat boxes that the Latin grammar wasn't designed for. This of course leads to absurdity. Chomsky attempted to solve this problem with his universal grammar, but that too stops working quickly once you get outside of European languages. That is, ignoring linguistic tools is one of the reasons GPT is successful.

u/dgut

KarmaCake day389January 21, 2017
About
hi (at) davidgut (dot) com
View Original