zi_ (u/zi_) - Readit News

zi_ commented on Machine Learning in Production (CMU Course) mlip-cmu.github.io/s2025/... · Posted by u/azhenley

golly_ned · a year ago

It depends on your starting point. A baseline level of ML is needed. Otherwise ML platforms account for three basic functions: features/data, model training, and model hosting.

So do an end-to-end project where you:

- start from a CSV dataset, with the goal of predicting some output column. A classic example is predicting whether a household's income is >$50K or not from census information.

- transform/clean the data in a jupyter notebook and engineer features for input into a model. Export the features to disk into a format suitable for training.

- train a simple linear model using a chosen framework: a regressor if you're predicting a numerical field, a classifier if its categorical.

- iterate on model evaluation metrics through more feature engineering, scoring the model on unseen data to see its actual performance.

- export the model in such a way it can be loaded or hosted. The format largely depends on the framework.

- construct a docker container that exposes the model over HTTP and a handler for receiving prediction requests and transforming them for input into the model, and a client that sends requests to that model.

That'll basically get an entire end-to-end run the entire MLE lifecycle. Every other part of development is a series of concentric loop between these steps, scaled out to ridiculous scale in several dimensions: number of features, size of dataset, steps in a data/feature processing pipeline to generate training datasets, model architecture and hyperparameters, latency/availability requirements for model servers...

For bonus points:

- track metrics and artifacts using a local mlflow deployment.

- compare performance for different models.

- examine feature importance to remove unnecessary (or net-negative) features.

- use a NN model and train on GPU. Use profiling tools (depends on the framework) and Nvidia NSight to examine performance. Optimize.

- host a big model on GPU. Profile and optimize.

IMO: the biggest missing piece for ML systems/platform engineers is how to feed GPUs. If you can right-size workloads and feed a GPU with MLE workloads you'll get hired. MLE workloads vary wildly (ratio of data volume in vs. compute; size of model; balancing CPU compute for feature processing with GPU compute for model training). We're all working under massive GPU scarcity.

zi_ · a year ago

this is unbelievably helpful, wow -- thank you!

curious: which part of the pipeline does the majority of 'business' value come from?

zi_ commented on YC Graveyard: 821 inactive Y Combinator startups ycgraveyard.iamwillwang.c... · Posted by u/memalign

iceman_w · a year ago

I've also been tracking the 'path to graveyard' for the startups from the last 3 years here https://pivots.fyi/

zi_ · a year ago

this is very interesting!

though i noticed some outdated info for most companies (last updated >2 months ago).

zi_ commented on OK, I can partly explain the LLM chess weirdness now dynomight.net/more-chess/... · Posted by u/dmazin

furyofantares · a year ago

LLMs are fundamentally text-completion. The Chat-based tuning that goes on top of it is impressive but they are fundamentally text-completion, that's where most of the training energy goes. I keep this in mind with a lot of my prompting and get good results.

Regurgitating and Examples are both ways to lean into that and try to recover whatever has been lost by Chat-based tuning.

zi_ · a year ago

what else do you think about when prompting, which you've found to be useful?

zi_ commented on Chinese app design: weird, but it works [video] youtube.com/watch?v=WSMFn... · Posted by u/zi_

neduma · 2 years ago

Great Info. Culture is everything.

zi_ · 2 years ago

totally agree. culture is prevalent yet it is also difficult to learn & understand it well.

have you found reliable ways to learn about other cultures?

zi_ commented on Show HN: I made a web game that makes practicing basic arithmetic fun worchle.com/mathic/... · Posted by u/dylanhouli

zi_ · 2 years ago

Love this!

I also made a math game 3 years ago.

https://github.com/zineanteoh/clean-the-river

I called it “Clean the river”. It is a web-based game that lets children practice forming and solving basic mathematical expressions by "cleaning the river".