Readit News
pumanoir commented on Flunking my Anthropic interview again   taylor.town/flunking-anth... · Posted by u/surprisetalk
pumanoir · 15 days ago
Why is Anthropic hiring developers? Amodei said that AI will be generating all the code by the end of the year.
pumanoir commented on Trump wins presidency for second time   thehill.com/homenews/camp... · Posted by u/koolba
Muromec · 10 months ago
You don't need a degree to get a working visa in the Netherlands (source: I don't have a degree). You can even use the Dutch-US treaty to basically hire yourself.
pumanoir · 10 months ago
(Honest question) Can you share how to use the Dutch-US treaty to basically hire oneself?
pumanoir commented on Fisker EVs Hired an IT Spy Who Funneled Millions to N. Korea's Missile Program   torquenews.com/1084/fiske... · Posted by u/rmason
Maarten88 · 10 months ago
With all the anger over illegal immigrants taking US jobs, as a European it surprises me that nobody in the US seems to even mention the idea of punishing the employer for employing illegal workers.

If I want to hire someone (local or remote) as an employer here, I better make sure the worker has a valid working permit. Fines for non-compliance towards the employer are huge, even for a single day of work. All paperwork has to be complete before any work is done. Even when hiring through intermediary companies who guarantee it's all legal, liability and fines remain in place for the ultimate employer if it turns out to be not so.

pumanoir · 10 months ago
The scammers in question use stolen US citizens' identities. Same thing happens in Europe to a lesser degree.
pumanoir commented on Improving Accessibility Using Vision Models   myswamp.substack.com/p/im... · Posted by u/bearjaws
pumanoir · a year ago
I've had great success converting math pics to LaTeX using qwen2-vl.
pumanoir commented on The Emmy Computer Algebra System   github.com/mentat-collect... · Posted by u/tosh
pumanoir · 2 years ago
Which books teach how to build your own computer algebra system? (The only one I've heard of that does is PAIP.)
pumanoir commented on Llemma: An Open Language Model for Mathematics   arxiv.org/abs/2310.10631... · Posted by u/AlphaWeaver
pumanoir · 2 years ago
I skimmed (ctrl-f) the paper and didn't see a comparison against GPT-4. Does anybody know how they'd compare?
pumanoir commented on Render a neural network into CUDA/HIP code   github.com/facebookincuba... · Posted by u/fzliu
stevenwliao · 2 years ago
How much faster is it?
pumanoir · 2 years ago
Depends on the model and GPU. Here is an example of almost 2x on a 3060 for StableDiffusion: https://www.youtube.com/watch?v=_6BsUijOWoM
pumanoir commented on MITx Differential Equations starts May 31   mitxonline.mit.edu/course... · Posted by u/drhodes
qumpis · 2 years ago
Are there any interesting modern applications of diff. equations in computer science outside physics simulators, and 3d vision? Or some adjacent areas that would benefit from skillset of working with diff. equations?
pumanoir · 2 years ago
Optimization by gradient descent is what does the learning in deep learning. For example, diff eqs are used to design optimizers that improve on the classic 'Adam', such as the new 'Sophia' [1].
[1] https://arxiv.org/abs/2305.14342
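One way to see the link between diff eqs and gradient descent: a plain gradient-descent step is the forward-Euler discretization of the gradient-flow ODE dx/dt = -grad f(x). The sketch below only illustrates that connection; it is not Adam or Sophia, and the quadratic objective and the function names are assumptions picked for the example.

```python
import math


def grad_f(x):
    # Gradient of the assumed toy objective f(x) = 0.5 * x**2.
    return x


def gradient_descent(x0, lr, steps):
    # Each update x <- x - lr * grad_f(x) is one forward-Euler step
    # of size lr applied to the ODE dx/dt = -grad_f(x).
    x = x0
    for _ in range(steps):
        x = x - lr * grad_f(x)
    return x


def gradient_flow_exact(x0, t):
    # Closed-form solution of dx/dt = -x, valid for this toy objective only.
    return x0 * math.exp(-t)


if __name__ == "__main__":
    # 1000 steps of size 0.001 cover total "time" t = 1.0.
    print(gradient_descent(1.0, 0.001, 1000))
    print(gradient_flow_exact(1.0, 1.0))
```

With a small learning rate the discrete iterates track the continuous solution closely, which is why tools for analyzing ODEs can say something about optimizer behavior.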
pumanoir commented on 4-Bit Quantization and QLoRA   huggingface.co/blog/4bit-... · Posted by u/tosh
ChrisRackauckas · 2 years ago
There is an example of using this with gradient-based optimization here: https://docs.sciml.ai/SciMLSensitivity/dev/tutorials/data_pa....

As an ODE solver, you wouldn't do nanoGPT with it though, you'd need to go back to KernelAbstractions and write a nanoGPT based on that same abstraction layer. Again, this is a demonstration of the cross-GPU tools for ODEs, but for LLMs you'd need to take these tools and implement an LLM.

pumanoir · 2 years ago
Thanks!
pumanoir commented on 4-Bit Quantization and QLoRA   huggingface.co/blog/4bit-... · Posted by u/tosh
ChrisRackauckas · 2 years ago
We recently showed DiffEqGPU.jl generating customized ODE solver kernels for NVIDIA CUDA, AMD GPUs, Intel OneAPI, and Apple Metal, where for CUDA it matches the state of the art (MPGOS), which is about 10x-100x faster than something like JAX/PyTorch (the performance difference comes from the inefficiencies of using vmap vs. actually writing and calling a kernel). It's all in https://arxiv.org/abs/2304.06835. So this stuff exists and people are using it. Of course, the caveat is that this is in the context of engineering applications, so someone would need to do something similar for LLMs to fully relate back to the article, but it shows the tools are largely ready for someone to step up in the ML space.
pumanoir · 2 years ago
Are there examples of using it for SGD? Like "Here is a tutorial on how to do a nanoGPT using DiffEqGPU.jl"?

u/pumanoir

Karma: 146 · Cake day: November 19, 2017