Boxun Xu
Ph.D. Candidate, UC Santa Barbara
I am a final-year Ph.D. candidate in Electrical and Computer Engineering at UC Santa Barbara, advised by Prof. Peng Li (IEEE Fellow). My research focuses on efficient generative models, multimodal content generation, and ML systems & hardware co-design, building toward scalable, real-time multimodal and world models. I received consecutive William J. McCalla Best Paper Award nominations at ICCAD 2024 and ICCAD 2025.
I interned at Meta (2024) and Meta Superintelligence Labs (2025), where I integrated Video Sparse Attention into MovieGen-30B, delivering 1.55× tuning-free end-to-end speedup, and extending it from inference to sparse finetuning across 256 H100s.
Prior to UCSB, I received my M.S. in Electrical and Computer Engineering from the University of Michigan, Ann Arbor, advised by Prof. David Blaauw (IEEE Fellow) and Prof. Dennis Sylvester (IEEE Fellow), and my B.S. in Electronic Engineering from the University of Electronic Science and Technology of China.
Research Focus
- Efficient Generative Modeling and Multimodal & Interactive World Modeling
- Hardware / Algorithm Co-design & ML Systems & Electronic Design Automation
News
| Feb 15, 2026 | Paper on VLM hallucination mitigation (VEGAS) accepted at CVPR 2026 Findings. |
|---|---|
| Nov 15, 2025 | Papers on adaptive KV caching for visual autoregressive models and KAN-based graph contrastive learning accepted at AAAI 2026. |
| Oct 26, 2025 | 🏆 Paper on 3D MoE spiking transformers nominated for the William J. McCalla Best Paper Award at ICCAD 2025 — second consecutive year. |
| Jun 30, 2025 | Paper on 3D MoE spiking transformer acceleration accepted at ICCAD 2025. |
| May 23, 2025 | Paper on transfer learning for Vmin prediction in advanced nodes accepted at ITC 2025. |
| Apr 29, 2025 | Paper on heterogeneous quantization for spiking vision transformers accepted at ASAP 2025. |
| Mar 21, 2025 | Paper on heterogeneous-core acceleration of spiking transformers with error-constrained pruning accepted at ISCA 2025. |
| Jan 18, 2025 | Paper on network-hardware co-optimization for sparse SNN accelerators accepted at TCAD as a long paper. |
| Jan 03, 2025 | Joining Meta Superintelligence Labs this summer in Seattle, working on efficient movie generation. |
| Oct 26, 2024 | 🏆 Paper on 3D spiking transformer accelerators nominated for the William J. McCalla Best Paper Award at ICCAD 2024. |
| Jul 01, 2024 | Papers on 3D spiking transformer accelerators and LLM-guided analog design accepted at ICCAD 2024. |
| Jun 24, 2024 | Started summer internship at Meta, working on knowledge distillation of multi-modal foundation models. |
| May 25, 2024 | Paper on a multi-modal IoT SoC with on-chip MRAM accepted at JSSC. |
Selected Publications
I have published papers in top conferences in machine learning / computer architecture / design automation, including ISCA, AAAI, CVPR, ICCV, ICCAD, TCAD and JSSC.
Efficient Generative Modeling
- AAAI’26
★ AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive TransformersIn AAAI Conference on Artificial Intelligence (main track)(Acceptance Rate: 17.6%) , 2026First efficient KV-caching design tailored for multi-scale visual AR transformers. - Preprint
★ Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Video Generation2025First native trainable sparse-attention framework enabling real-time autoregressive video generation.Work done during internship at Meta Superintelligence Labs. - ICCV’25
VAR-Q: Tuning-free Quantized KV Caching for Visual Autoregressive ModelsIn IEEE/CVF International Conference on Computer Vision (ICCV) Workshop on Binary and Extreme Quantization for Computer Vision, 2025 - CVPR’26
VEGAS: Mitigating Hallucinations in Large Vision-Language Models via Vision-Encoder Attention Guided Adaptive SteeringIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026
Hardware/Algorithm Co-design and EDA
- ISCA’25
★ Bishop: Sparsified Bundling Spiking Transformers on Heterogeneous Cores with Error-Constrained PruningIn International Symposium on Computer Architecture (ISCA)(Acceptance Rate: 22.2%) , 2025First SW/HW co-design framework for neuromorphic transformers. - ICCAD’25
🏆 Nominated as William J. McCalla Best Paper Award in 2025★ 3D Acceleration for Mixture-of-Experts and Multi-Head Attention Spiking Transformers with Dynamic Head PruningIn ACM/IEEE International Conference on Computer-Aided Design (ICCAD)(Acceptance Rate: 24.7%) , 2025First 3D-integrated accelerator for Mixture-of-Experts spiking transformers with dynamic head pruning. - ICCAD’24
🏆 Nominated as William J. McCalla Best Paper Award in 2024★ Spiking Transformer Hardware Accelerators in 3D IntegrationIn ACM/IEEE International Conference on Computer-Aided Design (ICCAD)(Acceptance Rate: 24%) , 2024First 3D-integrated hardware accelerator for spiking transformers. - TCAD’25
SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural NetworksIn IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems(TCAD), 2025 - ASAP’25
Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization SearchIn IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), 2025 - TMLR
DS2TA: Denoising Spiking Transformer with Attenuated Spatiotemporal AttentionIn Transactions on Machine Learning Research (TMLR, under review), 2024 - ICCAD’24
ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language ModelsIn ACM/IEEE International Conference on Computer-Aided Design (ICCAD), 2024First work to bring LLMs into analog circuit design, pairing in-context priors with Bayesian optimization for sample-efficient sizing. - COLM’26
LASER: Language Model Regression for Semi-Structured Workflow Resource and Runtime EstimationIn Conference on Language Modeling (COLM, under review), 2026 - ITC’25
Transfer Learning for Minimum Operating Voltage Prediction in Advanced Technology Nodes: Leveraging Legacy Data and Silicon Odometer SensingIn ACM/IEEE International Test Conference (ITC), 2025 - JSSC’24
AIMMI: Audio and Image Multi-Modal Intelligence via a Low-Power SoC With 2-MByte On-Chip MRAM for IoT DevicesIn IEEE Journal of Solid-State Circuits(JSSC), 2024 - VLSI’22
Audio and Image Cross-Modal Intelligence via a 10TOPS/W 22nm SoC with Back-Propagation and Dynamic Power GatingIn 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI-Symposium), 2022
Other Publications
- AAAI’26
Khan-GCL: Kolmogorov-Arnold Network Based Graph Contrastive Learning with Hard NegativesIn AAAI Conference on Artificial Intelligence (main track)(Acceptance Rate: 17.6%) , 2026