Publications

(2024). Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization. In ICML 2024 (Spotlight Paper).
(2023). Training Transformers with 4-bit Integers. In NeurIPS 2023.