Readit News logoReadit News
jchandra commented on Hyperparameter Tuning Is a Resource Scheduling Problem   jchandra.com/posts/hyperp... · Posted by u/jchandra
vivahir215 · 4 months ago
I dont think its just a resource scheduling problem, there are more to it as well. The goal is Model Performance, not just efficient resource use.

Nice article but I can see your point.

jchandra · 4 months ago
Totally fair point — at the end of the day, it's all about getting the best model performance. I was mostly trying to highlight how, under the hood, a lot of modern HPO algos really boil down to smart scheduling decisions.
jchandra commented on AI Supply Chain Attack: How Malicious Pickle Files Backdoor Models   jchandra.com/posts/python... · Posted by u/jchandra
zahlman · 5 months ago
I have to agree with Chris Angelico there:

> Then the obvious question is: Why? Why use pickle? The most likely answer is “because <X> can’t represent what I need to transmit”, but for that to be at all useful to your proposal, you need to show examples that won’t work in well-known safe serializers.

jchandra · 5 months ago
Pickle still is good for custom objects (JSON loses methods and also order), Graphs & circular refs (JSON breaks), Functions & lambdas (Essential for ML & distributed systems) and is provided out of box.
jchandra commented on How Pickle Files Backdoor AI Models   jchandra.com/posts/python... · Posted by u/jchandra
vivahir215 · 6 months ago
ah okay. Didnt know this. I generally use pytorch save models for my workflow.
jchandra · 6 months ago
pytorch save/load still are pickle based models. Its fine for trusted sources but when you start using from untrusted sources then there is always a risk of ACE. If you want to execute it, would suggest to try it in a sandbox env like docker, VM or online notebooks envs or other option is to inspect the model file.

As Open source AI booms, the risk of supply chain attacks also increases.

jchandra commented on How Pickle Files Backdoor AI Models   jchandra.com/posts/python... · Posted by u/jchandra
vivahir215 · 6 months ago
Nice read !

You could also use joblib format as well.

jchandra · 6 months ago
joblib is not fully secure because it still relies on Pickle internally. The reason it is slightly better in pickle is due to fact that pickle file gets immediately executed when it gets imported whereas joblib doesn’t execute code just by being imported.
jchandra commented on We built a modern data stack from scratch and reduced our bill by 70%   jchandra.com/posts/data-i... · Posted by u/jchandra
throwaway7783 · 6 months ago
.. how many engineers?
jchandra · 6 months ago
2
jchandra commented on We built a modern data stack from scratch and reduced our bill by 70%   jchandra.com/posts/data-i... · Posted by u/jchandra
snake_doc · 6 months ago
These just seems like over engineered solutions trying to guarantee their job security. When the dataflows are so straight forward, just replicate into pick your OLAP, and transform there.
jchandra · 6 months ago
our approach wasn’t about over-engineering, we were trying to leverage our existing investments (like Confluent BYOC) while optimizing for flexibility, cost, and performance. We wanted to stay loosely coupled to adapt to cloud restrictions across multiple geographic deployments.
jchandra commented on We built a modern data stack from scratch and reduced our bill by 70%   jchandra.com/posts/data-i... · Posted by u/jchandra
jchandra · 6 months ago
We did have a discussion on Self vs Managed and TCOs associated with it. 1> We have multi regional setup so it came up with Data Sovereignty requirements. 2> Vendor Lock ins - Few of the services were not available in that geographic region 3> With managed services, you often pay for capacity you might not always use. our workloads were often consistent and predictable, so self managed solutions helped in fine tuning our resources. 4> One og the goal was to keep our storage and compute loosely coupled while staying Iceberg-compatible for flexibility. Whether it’s Trino today or Snowflake/Databricks tomorrow, we aren’t locked in.

u/jchandra

KarmaCake day80March 17, 2020
About
[ my public key: https://keybase.io/jchandra; my proof: https://keybase.io/jchandra/sigs/CZWAMEfii33T_DFtePEne5Zksf74ElLdHSgYnh7FOxw ]
View Original