Readit News logoReadit News
gregw134 commented on US will ban Wall Street investors from buying single-family homes   reuters.com/world/us/us-w... · Posted by u/kpw94
rgmerk · 2 months ago
It will make very little difference in the end.

Australia's land tax system makes it effectively impossible for large corporations to own large chunks of residential property, but our real estate is amongst the world's most expensive and landlords are still awful - it's just that the landlords are hundreds of thousands of dentists and, yes, software engineers rather than corporate entities.

If you want housing to be cheaper and renters to be better treated, increase supply. Everything else is window-dressing.

gregw134 · 2 months ago
It's a good start. Would be nice to add a gradually increasing tax on multiple home purchases, e.g. buying a third home has a 5% tax, 4th 15% tax, fifth 25%, and if you want a sixth home you have to build it yourself. Would prevent regular people from buying up dozens of homes to rent.
gregw134 commented on Deaths are projected to exceed births in 2031   cbo.gov/publication/61390... · Posted by u/johntfella
gregw134 · 6 months ago
https://en.wikipedia.org/wiki/Doomsday_argument

Statistically, we're most likely to be born when the world population is at its peak.

gregw134 commented on GLP-1s are breaking life insurance   glp1digest.com/p/how-glp-... · Posted by u/alexslobodnik
loeg · 8 months ago
They aren't even that awful in maintenance -- just expensive. The unpleasant part is when you're increasing the dose. After a while at the same dose, it's more or less unnoticeable IMO.
gregw134 · 8 months ago
Unnoticable meaning doesn't have any effect at all, or just no bad side effects?
gregw134 commented on I decided to pay off a school’s lunch debt   huffpost.com/entry/utah-s... · Posted by u/dredmorbius
staplers · 10 months ago

  I felt like a big part of my childhood had been sold to the lowest bidder.
It was. Most local procurement laws enforce this.

gregw134 · 10 months ago
In our case, sold to the bidder who gave kickbacks to the district supervisor.
gregw134 commented on Google is illegally monopolizing online advertising tech, judge rules   nytimes.com/2025/04/17/te... · Posted by u/IdealeZahlen
barkerja · a year ago
Source? I don't see any mention of that here: https://help.kagi.com/kagi/search-details/search-sources.htm...
gregw134 · a year ago
Search "chili recipes" on bing. Now search kagi or any other alternative search engine. They all show the exact same recipes because they're using the bing API.
gregw134 commented on Google is illegally monopolizing online advertising tech, judge rules   nytimes.com/2025/04/17/te... · Posted by u/IdealeZahlen
barkerja · a year ago
I use Kagi ¯\_(ツ)_/¯
gregw134 · a year ago
Which uses Bing as its primary index
gregw134 commented on Google is illegally monopolizing online advertising tech, judge rules   nytimes.com/2025/04/17/te... · Posted by u/IdealeZahlen
guywithahat · a year ago
I've always been somewhat opposed to this, because there's already like 10 different search alternatives, and now AI is taking over, which will further weaken their grip.

Google is on top because they do the best job; I use Yandex primarily, but I switch back to google all the time for coding related questions. In terms of advertising, there's billions of views on Facebook/Instagram/X to get, in addition to all the other sites. I get they're a big player, but I worry we're just beating Google because they're down, not because it's good for the consumer.

gregw134 · a year ago
There's really only one alternative, Bing. Virtually all the other western search engines are using the Bing api, and just slightly modify the results.
gregw134 commented on Long Read: Lessons from Building Semantic Search for GitHub and Why I Failed   tzx.notion.site/What-I-Le... · Posted by u/zxt_tzx
whakim · a year ago
I was the first employee at a company which uses RAG (Halcyon), and I’ve been working through issues with various vector store providers for almost two years now. We’ve gone from tens of thousands to billions of embeddings in that timeframe - so I feel qualified to at least offer my opinion on the problem.

I agree that starting with pgvector is wise. It’s the thing you already have (postgres), and it works pretty well out of the box. But there are definitely gotchas that don’t usually get mentioned. Although the pgvector filtering story is better than it was a year ago, high-cardinality filters still feel like a bit of an afterthought (low-cardinality filters can be solved with partial indices even at scale). You should also be aware that the workload for ANN is pretty different from normal web-app stuff, so you probably want your embeddings in a separate, differently-optimized database. And if you do lots of updates or deletes, you’ll need to make sure autovacuum is properly tuned or else index performance will suffer. Finally, building HNSW indices in Postgres is still extremely slow (even with parallel index builds), so it is difficult to experiment with index hyperparameters at scale.

Dedicated vector stores often solve some of these problems but create others. Index builds are often much faster, and you’re working at a higher level (for better or worse) so there’s less time spent on tuning indices or database configurations. But (as mentioned in other comments) keeping your data in sync is a huge issue. Even if updates and deletes aren’t a big part of your workload, figuring out what metadata to index alongside your vectors can be challenging. Adding new pieces of metadata may involve rebuilding the entire index, so you need a robust way to move terabytes of data reasonably quickly. The other challenge I’ve found is that filtering is often the “special sauce” that vector store providers bring to the table, so it’s pretty difficult to reason about the performance and recall of various types of filters.

gregw134 · a year ago
What would you recommend for billions of embeddings?
gregw134 commented on Jeep owners fed up with in-car pop-up ads   kbb.com/car-news/jeep-own... · Posted by u/spudlyo
gregw134 · a year ago
> Last year, Ford filed a patent for an in-car advertising system that would use the car’s speakers and display screen to serve ads to drivers and passengers

I've owned Ford stock for 10 years, I think I'll finally sell it today

gregw134 commented on When imperfect systems are good: Bluesky's lossy timelines   jazco.dev/2025/02/19/impe... · Posted by u/cyndunlop
ChuckMcM · a year ago
As a systems enthusiast I enjoy articles like this. It is really easy to get into the mindset of "this must be perfect".

In the Blekko search engine back end we built an index that was 'eventually consistent' which allowed updates to the index to be propagated to the user facing index more quickly, at the expense that two users doing the exact same query would get slightly different results. If they kept doing those same queries they would eventually get the exact same results.

Systems like this bring in a lot of control systems theory because they have the potential to oscillate if there is positive feedback (and in search engines that positive feedback comes from the ranker which is looking at which link you clicked and giving it a higher weight) and it is important that they not go crazy. Some of the most interesting, and most subtle, algorithm work was done keeping that system "critically damped" so that it would converge quickly.

Reading this description of how user's timelines are sharded and the same sorts of feedback loops (in this case 'likes' or 'reposts') sounds like a pretty interesting problem space to explore.

gregw134 · a year ago
Would you be willing to share more about how you guys did click ranking at Blekko? It's an interesting problem.

u/gregw134

KarmaCake day1245December 5, 2012View Original