I made a 4B Qwen3 distill of this model (and a synthetic dataset created with it) a while back. Both can be found here: https://huggingface.co/flashresearch
Remote: Yes
Willing to relocate: Yes (US)
Technologies: Python, PyTorch, HuggingFace, C++
Résumé/CV: https://drive.google.com/file/d/1qY-m1tKz4_QpHgxaGryC2vk-DGs...
Email: sumo43@proton.me
ML Engineer & Research Scientist. Ex AI grant startup, hedge fund. I've previously worked on inference for LLMs and vision models, and I have experience with data curation and multinode training. Looking for summer internships or part-time positions.
We are a small team associated with EleutherAI, looking to push the frontier of open-source language models through self-play. So far we have implemented SPIN. Compute is included.
Email: tyoma9k@gmail.com
edit: formatting
Constraints are the fun part here. I know this isn't the 8x Blackwell Lamborghini; that's the point. :)