Readit News logoReadit News
winddude commented on Self-taught engineers often outperform (2024)   michaelbastos.com/blog/wh... · Posted by u/mbastos
winddude · a month ago
As a self taught engineer who hasn't read the article or done any research I can confirm.
winddude commented on Show HN: Defuddle, an HTML-to-Markdown alternative to Readability   github.com/kepano/defuddl... · Posted by u/kepano
tmpfs · 3 months ago
Interesting as I was researching this recently and certainly not impressed with the quality of the Readability implementations in various languages. Although Readability.js was clearly the best, it being Javascript didn't suit my project.

In the end I found the python trifatura library to extract the best quality content with accurate meta data.

You might want to compare your implementation to trifatura to see if there is room for improvement.

winddude · 3 months ago
It's a bit old, but I bench marked a number of the web extraction tools years ago, https://github.com/Nootka-io/wee-benchmarking-tool, resiliparse-plain was my clear winner at the time.
winddude commented on “The Mind in the Wheel” lays out a new foundation for the science of mind   experimental-history.com/... · Posted by u/CharlesW
winddude · 3 months ago
I'm being pydantic, "So unlike the thermostat in your house, which doesn’t have to contend with any other control systems, all of the governors of the mind have to fight with each other constantly.", but what about automated blinds, self tinting windows, automatic skylights, humidistats, and humans open and closing windows.

It's also important to note that other control systems in the body that affect control systems in the mind, eg. endocrine.

winddude commented on Car companies are in a billion-dollar software war   insideevs.com/features/75... · Posted by u/rntn
winddude · 4 months ago
as a car guy and software engineer I just want to say car's need way less software, way more separation of concerns, more standardisation and more open platforms, but most of the money is made on service, so the manufactures are incentivized to make closed systems.
winddude commented on Show HN: BemiDB – Postgres read replica optimized for analytics   github.com/BemiHQ/BemiDB... · Posted by u/exAspArk
winddude · 10 months ago
difference to something like duckdb?
winddude commented on Diabetes risk soars for adults who had a sweet tooth as kids   nature.com/articles/d4158... · Posted by u/rntn
winddude · 10 months ago
this is for type 2 diabetes, a VERY IMPORT distinction. So it probably just effects eating habits. <https://www.theguardian.com/society/2024/oct/31/less-sugar-i...
winddude commented on ElasticSearch and many other repos are gone   github.com/elastic/elasti... · Posted by u/tison
parsimo2010 · 10 months ago
"As part of an internal change task" is the justification listed. Maybe this is a genuine accident.

Someone paranoid might think that the for-profit management at Elastic is trying to pull some of their previously free software behind a paid-for product. Perhaps they accidentally marked all repos private when they only intended to make a few of them private. They have had beef with AWS in the past where they changed their licensing due to things AWS was doing. So I'll fully believe that it was a genuine accident if all the formerly public repos become public again.

winddude · 10 months ago
unlikely, over the summer they announced that they were going to be more opensource, <https://www.elastic.co/blog/elasticsearch-is-open-source-aga...>
winddude commented on Before you buy a domain name, first check to see if it's haunted   bryanbraun.com/2024/10/25... · Posted by u/bryanbraun
romanhn · 10 months ago
Another "haunted domain" check is by trying to post about it on social media. I ran into this with my current project's domain name. After building an MVP and trying to test the social sharing functionality, I found that Facebook was blocking the domain outright. Turns out there was some spamming from it years ago. Getting it unblocked was extra fun, as the page to request manual review was itself broken! Thankfully I knew someone on the inside who alerted the relevant team, but the whole experience was quite the novel speedbump.
winddude · 10 months ago
I had that one happen as well, after launching a project. I could even post in a messages to friends.
winddude commented on Do AI companies work?   benn.substack.com/p/do-ai... · Posted by u/herbertl
winddude · a year ago
probably why OpenAI wants to build there own 5gw nuclear plant.
winddude commented on DoNotPay has to pay $193K for falsely touting untested AI lawyer, FTC says   arstechnica.com/tech-poli... · Posted by u/Brajeshwar
winddude · a year ago
'''"None of the Service’s technologies has been trained on a comprehensive and current corpus of federal and state laws, regulations, and judicial decisions or on the application of those laws to fact patterns," the FTC found'''

Wow!! That seems so simple, and literally a few weeks to do in today's ecosystem, now thoroughly testing make take a little more time, but wow, I wonder if it was evening attempting to do RAG.

u/winddude

KarmaCake day270July 22, 2019
About
Python Developer(primarily) & Founder of startups. Engineer of distributed systems, playing with machine learning, processing big data, information extraction, and crawling all things.

Currently working on Nootka.io & money.sexy

Previously CEO of AutoMudo.io

Passionate about cars, windsurfing, and technology. Canadian.

View Original