February 11, 2025
2025
Our paper, FlexInfer: Flexible LLM Inference with CPU Computations, has been accepted to MLSys 2025.