Readit News
Overview
Stories
Comments
Posted by
u/limoce
13 days ago
Operation Costs in CPU Clock Cycles (2016)
ithare.com/infographics-o...
Posted by
u/limoce
14 days ago
Principles and Methodologies for Serial Performance Optimization
usenix.org/conference/osd...
Posted by
u/limoce
14 days ago
The Koala Benchmarks for the Shell
kben.sh/...
Posted by
u/limoce
a month ago
SmallThinker: A Family of Efficient LLMs Natively Trained for Local Deployment
arxiv.org/abs/2507.20984...
Posted by
u/limoce
a month ago
Step3 Technical Report [pdf]
github.com/stepfun-ai/Ste...
Posted by
u/limoce
a month ago
FP8 is ~100 tflops faster when the kernel name has "cutlass" in it
twitter.com/cis_female/st...
Posted by
u/limoce
2 months ago
Polaris: A Post-training recipe for scaling RL on Advanced Reasoning models
hkunlp.github.io/blog/202...
Posted by
u/limoce
2 months ago
Overclocking LLM Reasoning: Monitoring and Controlling LLM Thinking Path Lengths
royeisen.github.io/Overcl...
Posted by
u/limoce
2 months ago
Neutrino: Probing-Based eBPF-Like GPU Kernel Profiling
github.com/open-neutrino/...
u/limoce
Karma: 870
Cake day: April 3, 2021