Publications
You can also find my articles on my Google Scholar profile.
Conference Papers
FastTTS: Accelerating Test-Time Scaling for Edge LLM Reasoning
ASPLOS, 2026
(Invited Paper) Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges
ASP-DAC, 2026
Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling
Preprints
Efficient and Flexible FP-INTx Accelerator for Weight-only Quantized LLM Inference
Preprint, 2025
