Readit News logoReadit News
aksophist commented on Nobody cares   grantslatton.com/nobody-c... · Posted by u/fzliu
aksophist · a year ago
This post is angry detritus. I’m sorry someone upset you recently Grant, but seriously?

Billions of people care. And if you bother looking for them, you’ll find them. Most of the problems he describes result from complex systems being challenging and individuals having limited ability both to comprehend and influence them.

And no I don’t mean “this software module is complex” complex. I mean, “this social problem has hundreds of interacting incentives, changing any of them in isolation makes things worse, and it will take years and millions of dollars to change things, all while political winds of change are trying to blow down the consensus to tackle the problem.”

aksophist commented on Nobody cares   grantslatton.com/nobody-c... · Posted by u/fzliu
FigurativeVoid · a year ago
This is a statement of privilege: find a doctor who cares and stick with them.

I'm T1 diabetic, and it took me a long time to find an endo and a PCP that care. I have long since moved away from their offices, but I still make the drive because they are worth it.

My tip on finding good providers is basically to get lucky and find a good one. Then you should ask who they recommend. They know who the bad ones are.

aksophist · a year ago
It’s a statement of privilege to believe (and say) that there are hundreds of good doctors per handful of bad ones? It sounds to me like a statement of fact. And that you dispute the fact. What does privilege have to do with it?
aksophist commented on Launch HN: GPT Driver (YC S21) – End-to-end app testing in natural language    · Posted by u/cschiller
chrtng · a year ago
Thank you for your question! While we haven't published a formal evaluation yet, it's something we are working toward. Currently, we rely mostly on human reviews to monitor and assess LLM outputs. We also maintain a golden test suite that is run against every release to ensure consistency and quality over time, using regex-based evaluations.

Our key metrics include the time and cost per agentic loop, as well as the false positive rate for a full end-to-end test. If you have any specific benchmarks or evaluation metrics you'd suggest, we'd be happy to hear them!

aksophist · a year ago
What is a false positive rate? Is it when the agent falsely passes or falsely “finds a bug”? And regardless of which: why don’t you include the other as a key metric?

I’m not aware of any evals or shared metrics. But measuring a testing agents performance seems pretty important.

What is your tool’s FPR on your golden suite?

aksophist commented on Launch HN: GPT Driver (YC S21) – End-to-end app testing in natural language    · Posted by u/cschiller
aksophist · a year ago
how do you evaluate your tool, and have you published your evaluation along with the metrics?
aksophist commented on Launch HN: GPT Driver (YC S21) – End-to-end app testing in natural language    · Posted by u/cschiller
101008 · a year ago
Still interesting how a lot of companies offer a LLM (non-deterministic) solution for deterministic problems.
aksophist · a year ago
It’s only deterministic for each version of the app. Versions change: UI elements move, change their title slightly. Irrelevant promo popups appear, etc. For a deterministic solution, someone has to go and update the tests to handle all of that. Good ‘accessibility hygiene’ can help, but many apps lack that.

And then there are truly dynamic apps like games or simulators. There may be no accessibility info to deterministically code to.

aksophist commented on Former Google CEO Eric Schmidt's Leaked Stanford Talk   github.com/ociubotaru/tra... · Posted by u/gregzeng95
aksophist · 2 years ago
I read up to where he started taking questions (less than half the transcript or so?) and these were the interesting quotes that stood out to me:

So imagine a non-arrogant programmer that actually does what you want and you don't have to pay all that money to and there's infinite supply of these programs. That's all within the next year or two.

Google decided that work life balance and going home early and working from home was more important than winning.

But certainly in your lifetimes, the battle between the US and China for knowledge supremacy is going to be the big fight.

And one of the things to know about war is that the offense always has the advantage because you can always overwhelm the defensive systems. And so you're better off as a strategy of national defense to have a very strong offense that you can use if you need to.

And the systems that I and others are building will do that. Because of the way the system works, I am now a licensed arms dealer, a computer scientist, businessman, and an arms dealer. Is that a progression? I don't know. I do not recommend this in your group.

And if anyone knows Marjorie Taylor Greene, I would encourage you to delete her from your contact list because she's the one, a single individual is blocking the provision of some number of billions of dollars to save an important democracy.

aksophist commented on Tell HN: Bypass Paywalls repository is gone    · Posted by u/sogen
aksophist · 2 years ago
Now I wish I had bookmarked the distributed (peer to peer?) github replacement I’ve seen trend on HN a couple of times. It seems like a good place to host something like this. Anyone remember which tool I’m talking about?
aksophist commented on His Job Was to Make Instagram Safe for Teens. His 14-Year-Old Showed Otherwise   wsj.com/tech/instagram-fa... · Posted by u/antiviral
RicoElectrico · 2 years ago
> One in eight users under the age of 16 said they experienced unwanted sexual advances on the platform over the previous seven days.

I wonder what % of these is "show bobs and vegana" which is more of a nuisance than a threat.

aksophist · 2 years ago
What world are you living in where asking children under 16 to share pictures of their private sexual body parts is a “nuisance”? Epstein, is that you?
aksophist commented on HTTPS Watch   httpswatch.com/... · Posted by u/kingkilr
aksophist · 11 years ago
Where is the line item for "prevents downgrade of HTTPS connections to vulnerable protocols"?
aksophist commented on H.R.4681 - Intelligence Authorization Act for Fiscal Year 2015   congress.gov/bill/113th-c... · Posted by u/jborden13
frostmatthew · 11 years ago
> Voting has been useless in the many elections I have participated in. When will people wake up and demand change?

If voting is "useless" it makes it impossible to demand change since voting is how we enact change in a democracy.

aksophist · 11 years ago
> how we enact change in a democracy

You're either thinking of another country or have it wrong. The way change (legislation) gets enacted in the US is through lobbying.

u/aksophist

KarmaCake day66March 15, 2014View Original