Readit News logoReadit News
cweill commented on DuckDB: Querying JSON files as if they were tables   duckdb.org/2023/03/03/jso... · Posted by u/cmdlineluser
qolop · 3 years ago
This isn't unique to duckdb. Almost all databases allow for sorting and joins of large tables that don't fit into memory.
cweill · 3 years ago
Yes but if you're in a Jupyter notebook, you may not be directly connected to a DB. If you're using pandas, this unlocks some scalability before needing dask and a cluster.
cweill commented on DuckDB: Querying JSON files as if they were tables   duckdb.org/2023/03/03/jso... · Posted by u/cmdlineluser
cweill · 3 years ago
If you ever need to join two large dataframes, but are OOMing on the join, write them to disk as parquet files then use DuckDB to do the join. It's amazing what you can do on one machine thanks to DuckDB.
cweill commented on Andrej Karpathy is joining OpenAI again   twitter.com/karpathy/stat... · Posted by u/birriel
TigeriusKirk · 3 years ago
He barely lasted 6 months as a gentleman of leisure.
cweill · 3 years ago
Do you think he was on garden leave? Some kind of non-compete?
cweill commented on To build truly intelligent machines, teach them cause and effect (2018)   quantamagazine.org/to-bui... · Posted by u/rzk
jakeinspace · 3 years ago
Isn’t this just basic reinforcement learning? We’re not too many steps away from having an AI equipped with a good language model and a reinforcement learning mechanism to be let loose on the internet.
cweill · 3 years ago
I also have this question. Is the RL MDP actually encoding cause and effect? Or just learning (bidirectional) correlations between states and actions?

I wonder if Pearl thinks that RL replicates his do-calculus under the hood, or if that's an innovation we're missing.

cweill commented on My Youtube earnings   brickexperimentchannel.wo... · Posted by u/tpmx
cweill · 3 years ago
For reference, most gaming channels in the US have an RPM of $4-$5. I'm guessing their RPM is $1 because the channel owner is based in Europe. I imagine toy channels have higher RPMs in the US.
cweill commented on Ask HN: What's a good program for calisthenics for beginners that you have tried    · Posted by u/sergiotapia
cweill · 3 years ago
Look into Coach Sommer and https://www.gymnasticbodies.com/. Probably the best guide from beginner to advanced out there. I've been following their program for over 15 years, and there's something for every level of strength and flexibility. It's a program you can follow your entire life.

The big thing with calisthenics/gymnastics is developing the tendon and ligament strength in your joints which take several months/years, but is the foundation to doing the advanced stuff like planches and levers. Good luck!

cweill commented on Show HN: Twitter Graph - Like Google Trends but for Twitter   twit.deta.dev... · Posted by u/noelblanc
cweill · 3 years ago
How far back can this go? Would you be able to do 1-3 years of daily data?
cweill commented on Ask HN: Anyone tired of everything being a subscription now?    · Posted by u/CM30
cweill · 3 years ago
Having been on both sides as a customer and foundee, I love subscriptions.

Every business needs a business model. Subscribers is a very clear one. I'm untrusting of any service that's free and doesn't rely on ad-revenue, because they have incentive to make money in other shady ways like selling your data to third-parties who will use it against your best interests.

Subscriptions are a nice business model because they are predictable costs/revenue to both customers and the business.

What I don't like are subscriptions that don't grandfather you into higher pricing, and pay-as-you-go plans, because it's so easy to forget about then and their rules and get charged a nasty bill later.

As a customer IMO subscriptions are the lesser of all the evils, and the best alternative to ads, eg YouTube Premium.

cweill commented on Show HN: I fine-tuned Flan-T5. Can it cook?   abuqader.substack.com/p/i... · Posted by u/aqader
spion · 3 years ago
For some reason, I find it really grating how most deep learning stuff are pretty much impossible to do without paying quite a bit to a 3rd party service by the hour. Its one of the rare things in computing that you can't get started with unless you have some significant resources - most other things you can theoretically do with any old laptop and internet connection (ok - you may need to pay once you get users, but not just to get started)
cweill · 3 years ago
Google colab is $49/mo to get an A100. Trust me when I say you can build multimillion-dollar ML companies with just that (and maybe $100 extra dollars per month of spot credits).

u/cweill

KarmaCake day182March 6, 2013
About
I used to do Machine Learning at Google Research. Now I make YouTube videos. Cornell '12.

CreatorML (Startup): https://creatorml.com

YouTube: https://www.youtube.com/CharlesWeill

GitHub: https://github.com/cweill

Twitter: https://twitter.com/CharlesWeill

LinkedIn: https://www.linkedin.com/in/cweill

Website: https://charlesweill.com

View Original