Fall 2024: I am looking for one or two Ph.D. students to join my research group in Fall 2024. If you are interested, please drop me a brief email with your CV.
10/2023. One paper accepted at HPCA 2024.
09/2023. One paper accepted at ICCD 2023.
08/2023. NSF funding received on efficient online learning. Thanks to NSF!
07/2023. Two papers accepted at MICRO 2023.
02/2023. Three papers accepted at DAC 2023.
01/2023. One paper accepted as Spotlight at ICLR 2023.
10/2022. Three papers accepted at HPCA 2023.
Bingyao Li (Ph.D. candidate)
Sheng Li (Ph.D. candidate)
Yue Dai (Ph.D. candidate, co-advised with Dr. Youtao Zhang)
Mehrnoosh Raoufi (Ph.D. candidate, co-advised with Dr. Youtao Zhang)
Yueqi Wang (Ph.D. candidate)
Tianyu Wang (Ph.D. candidate)
Yingheng Li (Ph.D. candidate)
Zewei Mo (Ph.D. candidate, co-advised with Dr. Youtao Zhang)
NSF CNS CSR Award. PI. Thanks to NSF!
NSF CCF SHF Award. PI. Thanks to NSF!
NSF FoMR Award. Co-PI. Thanks to NSF and Intel!
(HPCA 2024) GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement (To appear).
(MICRO 2023) IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations (To appear).
(MICRO 2023) SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices (To appear).
(ICLR 2023 Spotlight) SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing.
(NeurIPS 2022) Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training.
(ECCV 2022) You Already Have It: A Generator-Free Low-Precision DNN Training Framework using Stochastic Rounding.
(SIGMETRICS 2021) Mix and Match: Reorganizing Tasks for Enhancing Data Locality.
(PLDI 2021) Distance-in-Time versus Distance-in-Space.
(AAAI 2021) YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design.
(PPoPP 2021) Compiler Support for Near Data Computing.
(ISCA 2019) Opportunistic Computing in GPU Architectures.
(SIGMETRICS 2019) Architecture-Aware Approximate Computing.
(SIGMETRICS 2019) Quantifying Data Locality in Dynamic Parallelism in GPUs.
(SIGMETRICS 2019) Computing with Near Data.
(MICRO 2017) Data Movement Aware Computation Partitioning.
(PLDI 2015) Optimizing Off-Chip Accesses in Manycores.
(SIGMETRICS 2015) Memory Row Reuse Distance and its Role in Optimizing Application Performance.
CS 2410 Computer Architecture, Spring 2023
CS 3410 Advanced Topics in Computer Architecture, Fall 2022
CS 2410 Computer Architecture, Spring 2022
CS 2410 Computer Architecture, Spring 2021
CS 1541 Introduction to Computer Architecture, Spring 2021
CS 2210 Compiler Design, Spring 2020
Zhongxuan Song (Master)
Weizheng Xu (Master)
Zachary Michael Smith (Master)
Thomas Matthew Dicarlo (Master)
Antony Paul (Master)
Yuan Yao (Master, Next move: Bloomberg LP)
Ziyu Zhang (Master, Next move: Apple)
Yilun Zhao (Ph.D. candidate, Next move: Ph.D.@ICT-CAS)
Qi Xue (Undergrad, Next move: MS at UPenn)
Updated 10/2023