1) ($$m total, $$k/mo now) Server hosting! This started off as hosting game servers for friends in early high school. It expanded to friends of friends for a while before I pivoted more toward crypto mining (yes, I know, young kid with dubious ambitions). A lot of my first software experience came from here: I wrote some software for switching processes between mining and game hosting across the different boxes (raw processes on servers, the horror). Nowadays it's winding down to just game hosting and some scientific computing rentals to a few universities and their robotics clubs. It's still slightly profitable, but I have no interest in updating the servers (CPUs from 2013-2015, GPUs mostly resold except for what a few people requested), and it runs everything via containers now.
2) ($$k/mo) Sports film review. I wrote the first version of this my first year of college: a way to keep the stat book for basketball and football games and stitch the recorded actions together with the video footage. We had customers throughout ~20 states, primarily high schools but some colleges as well. In fact it still runs at a lot of them, but I'm not really connected with the management of it anymore. My co-founder still runs it, and we rotate a few students from our alma mater in as interns and occasionally juniors.
This became the basis for an esports version of the software that I created a few years ago, this time using computer vision to gather all the stats, letting users jump around in videos and analyze overall stats from the output. It started with Call of Duty for their then-new professional league but has since expanded to Halo, Rocket League, and Valorant. I still do some occasional retraining of the models, but the product itself got acqui-hired by a larger company, for which I still "consult".
EDIT: I’ve also had many more that cost me more money than they ever made, but I’m a big proponent of failing fast and iterating
An example from a previous job where we used a hand-tooled NLP system was querying for doctors/dentists/optometrists: taking something like "dentist near me who is available in the afternoons and speaks Spanish", we would parse the user query into a few different queries that ran against a search cluster and database to return the filtered result set, or the closest output.
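To make the shape of that hand-tooled parse concrete, here's a minimal sketch of the kind of keyword/rule extraction such a system might do. Everything here (`ParsedQuery`, the `SPECIALTIES`/`LANGUAGES` vocabularies) is illustrative, not from the actual system:

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical vocabularies the rules match against.
SPECIALTIES = {"dentist", "doctor", "optometrist"}
LANGUAGES = {"spanish", "mandarin", "vietnamese"}

@dataclass
class ParsedQuery:
    specialty: Optional[str] = None
    language: Optional[str] = None
    near_me: bool = False
    time_of_day: Optional[str] = None

def parse(query: str) -> ParsedQuery:
    q = query.lower()
    parsed = ParsedQuery()
    # Keyword matching against the fixed vocabularies.
    for word in re.findall(r"[a-z]+", q):
        if word in SPECIALTIES:
            parsed.specialty = word
        elif word in LANGUAGES:
            parsed.language = word
    # Phrase-level rules for geography and availability.
    parsed.near_me = "near me" in q
    for tod in ("morning", "afternoon", "evening"):
        if tod in q:
            parsed.time_of_day = tod
    return parsed

print(parse("dentist near me who is available in the afternoons and speaks spanish"))
```

Each populated field then fans out to the corresponding search-cluster or database filter.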
What would be the ideal way to prepare or tokenize this data for querying with an LLM? It's partially text (matching "dentist", "speaks Spanish"), partially geographic ("near me", i.e. a geo radius of N miles from the provider's location), and partially filtering (who meets all those criteria and has availability in a time frame). Is this a use case for large context windows, to be able to take in all possible providers? Or for parsing the query from human language -> SQL/other data store query language? Or perhaps for figuring out another way to encode this data?
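One common shape for the second option (human language -> query language) is to have the LLM emit JSON against a fixed schema, validate it, and compile it to SQL yourself, rather than letting the model write SQL directly. A rough sketch, where the schema fields, the canned `model_output`, and the `ST_DWithin`-style geo predicate are all assumptions standing in for whatever your store actually supports:

```python
import json

# Fields the model is allowed to emit; anything else is rejected.
SCHEMA_FIELDS = {"specialty", "language", "radius_miles", "available"}

# Stand-in for what an LLM would be prompted to return for:
# "dentist near me who is available in the afternoons and speaks spanish"
model_output = """
{"specialty": "dentist", "language": "spanish",
 "radius_miles": 10, "available": "afternoon"}
"""

def to_sql(raw: str):
    parsed = json.loads(raw)
    unknown = set(parsed) - SCHEMA_FIELDS
    if unknown:
        raise ValueError(f"model emitted unexpected fields: {unknown}")
    clauses, params = [], []
    if parsed.get("specialty"):
        clauses.append("specialty = ?")
        params.append(parsed["specialty"])
    if parsed.get("language"):
        clauses.append("? = ANY(languages)")
        params.append(parsed["language"])
    if parsed.get("available"):
        clauses.append("? = ANY(availability)")
        params.append(parsed["available"])
    # Geo radius delegated to a PostGIS-style predicate; "<user_point>" is a
    # placeholder for the caller's location, miles converted to meters.
    if parsed.get("radius_miles"):
        clauses.append("ST_DWithin(location, ?, ?)")
        params += ["<user_point>", parsed["radius_miles"] * 1609.34]
    return "SELECT * FROM providers WHERE " + " AND ".join(clauses), params

sql, params = to_sql(model_output)
print(sql)
```

The appeal of this over stuffing all providers into a large context window is that the data never leaves your database: the model only sees the query, and the validation step keeps a hallucinated field from ever reaching the SQL layer.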