Readit News logoReadit News
stygiansonic commented on Google Engineer Found Guilty of Sending AI Secrets to China   justice.gov/opa/pr/former... · Posted by u/jedixit
stygiansonic · 9 days ago

  The jury found that Ding stole trade secrets relating to the hardware infrastructure and software platforms that allow Google’s supercomputing data center to train and serve large AI models. The trade secrets contained detailed information about the architecture and functionality of Google’s custom Tensor Processing Unit chips and systems and Google’s Graphics Processing Unit systems, the software that allows the chips to communicate and execute tasks, and the software that orchestrates thousands of chips into a supercomputer capable of training and executing cutting-edge AI workloads. The trade secrets also pertained to Google’s custom-designed SmartNIC, a type of network interface card used to facilitate high speed communication within Google’s AI supercomputers and cloud networking products.

stygiansonic commented on Sampling at negative temperature   cavendishlabs.org/blog/ne... · Posted by u/ag8
stygiansonic · a month ago
Neat experiment that gives a mechanistic interpretation of temperature. I liked the reference to the "anomalous" tokens being near the centroid, and thus having very little "meaning" to the LLM.
stygiansonic commented on We bought the whole GPU, so we're damn well going to use the whole GPU   hazyresearch.stanford.edu... · Posted by u/sydriax
woadwarrior01 · 4 months ago
Yeah, I only posted two links from my notes, from when I was looking at this a few months ago. Here's one on MIG.

https://arxiv.org/abs/2207.11428

stygiansonic · 4 months ago
That paper doesn’t seem to be about security vulnerabilities in MiG but rather using it to improve workload efficiency
stygiansonic commented on Pentagon Pizza Index   pizzint.watch/... · Posted by u/exiguus
stygiansonic · 6 months ago
Wonder why they haven’t gotten an in house pizzeria yet to reduce the signal on this side channel leak
stygiansonic commented on Gemma 3n preview: Mobile-first AI   developers.googleblog.com... · Posted by u/meetpateltech
krackers · 9 months ago
What is "Per Layer Embeddings"? The only hit I can find for that term is the announcement blogpost.

And for that matter, what is

>mix’n’match capability in Gemma 3n to dynamically create submodels

It seems like mixture-of-experts taken to the extreme, where you actually create an entire submodel instead of routing per token?

stygiansonic · 9 months ago
From the article it appears to be something they invented:

> Gemma 3n leverages a Google DeepMind innovation called Per-Layer Embeddings (PLE) that delivers a significant reduction in RAM usage.

Like you I’m also interested in the architectural details. We can speculate but we’ll probably need to wait for some sort of paper to get the details.

stygiansonic commented on Reservoir Sampling   samwho.dev/reservoir-samp... · Posted by u/chrisdemarco
fanf2 · 9 months ago
That paper says “Algorithm R (which is a reservoir algorithm due to Alan Waterman)” but it doesn’t have a citation. Vitter’s previous paper https://dl.acm.org/doi/10.1145/358105.893 cites Knuth TAOCP vol 2. Knuth doesn’t have a citation.
stygiansonic · 9 months ago
Interesting! If Knuth is not the original author then they’ve been lost to the sands of time
stygiansonic commented on Reservoir Sampling   samwho.dev/reservoir-samp... · Posted by u/chrisdemarco
stygiansonic · 9 months ago
Great article and nice explanation. I believe this describes “Algorithm R” in this paper from Vitter, who was probably the first to describe it: https://www.cs.umd.edu/~samir/498/vitter.pdf

u/stygiansonic

KarmaCake day3683August 1, 2012
About
https://peterchng.com

Email: 01endive-thunder@icloud.com

View Original