Readit News
crowwork commented on Open ABI and FFI for Machine Learning Systems   github.com/apache/tvm-ffi... · Posted by u/crowwork
crowwork · 5 months ago
The goal of the project is to bring open ABI and FFI for machine learning systems.

- Stable, minimal C ABI designed for kernels, DSLs, and runtime extensibility.
- Zero-copy interop across PyTorch, JAX, and CuPy using DLPack protocol.
- Compact value and call convention covering common data types for ultra low-overhead ML applications.
- Multi-language support out of the box: Python, C++, and Rust (with a path towards more languages).

crowwork commented on LLM Microserving: a new RISC-style approach to design LLM serving API   blog.mlc.ai/2025/01/07/mi... · Posted by u/jinhongyii
crowwork · a year ago
Scale LLM serving with programmable cross-engine serving patterns, all in a few lines of Python
crowwork commented on     · Posted by u/crowwork
crowwork · a year ago
XGrammar is an open-source library for efficient, flexible, and portable structured generation. It brings a 2x-10x speedup in grammar-guided (JSON and CFG) LLM serving.

Deleted Comment

crowwork commented on In-browser LLM inference engine with WebGPU and OpenAI API   blog.mlc.ai/2024/06/13/we... · Posted by u/CharlieRuan
crowwork · 2 years ago
Comes with the ability to do full structured generation with JSON schema.

There is also an in-browser demo: https://chat.webllm.ai/

u/crowwork

Karma: 167 · Cake day: September 30, 2016