Readit News logoReadit News
jdeaton commented on Attention Wasn't All We Needed   stephendiehl.com/posts/po... · Posted by u/mooreds
jdeaton · 9 months ago
First four things on the list are attention
jdeaton commented on Programming languages should have a tree traversal primitive   blog.tylerglaiel.com/p/pr... · Posted by u/azhenley
jdeaton · 9 months ago
Its called recursion

> Doesn't for_tree(...) look a lot nicer and simpler and less error prone than needing to implement a recursive function for each operation you would want to do on a tree?

No it does not

Deleted Comment

jdeaton commented on Sapphire: Rust based package manager for macOS   github.com/alexykn/sapphi... · Posted by u/adamnemecek
whywhywhywhy · 10 months ago
Consider rebranding to a 4 letter name or even better a 3 letter one.

I know it sounds dumb but uv was smart to go shorter than pip and sapphire feels heavier than brew no matter what it does after typing that.

jdeaton · 10 months ago
Yeah i vote it should be rebranded “why”
jdeaton commented on The effect of deactivating Facebook and Instagram on users' emotional state   nber.org/papers/w33697... · Posted by u/imakwana
jdeaton · 10 months ago
0.061 standard deviations? Thats like almost nothing?
jdeaton commented on Nvidia adds native Python support to CUDA   thenewstack.io/nvidia-fin... · Posted by u/apples2apples
jdeaton · 10 months ago
Pytorch???
jdeaton commented on Are noise-cancelling headphones to blame for young people's hearing problems?   bbc.com/news/articles/cgk... · Posted by u/vinni2
jdeaton · a year ago
Maybe shes wearing those noise canceling headphones because of her auditory processing condition and not the other way around??

This seems like basic speculative attribution error- no research here.

jdeaton commented on How to scale your model: A systems view of LLMs on TPUs   jax-ml.github.io/scaling-... · Posted by u/mattjjatgoogle
hustwindmaple1 · a year ago
there is limited TPU support in pytorch via torch_xla
jdeaton · a year ago
Sounds limited
jdeaton commented on How to scale your model: A systems view of LLMs on TPUs   jax-ml.github.io/scaling-... · Posted by u/mattjjatgoogle
Scene_Cast2 · a year ago
I'm curious, what does paradigmatic JAX look like? Is there an equivalent of picoGPT [1] for JAX?

[1] https://github.com/jaymody/picoGPT/blob/main/gpt2.py

jdeaton · a year ago
yeah it looks exactly like that file but replace "import numpy as np" with "import jax.numpy as np" :)
jdeaton commented on How to scale your model: A systems view of LLMs on TPUs   jax-ml.github.io/scaling-... · Posted by u/mattjjatgoogle
jdeaton · a year ago
Something nice about this guide is that it generally transfers to GPU directly thanks to JAX/XLA.

u/jdeaton

KarmaCake day355June 9, 2018View Original