README
I am an assistant professor at School of Cyber Science and Technology, Shandong University. I work broadly on high performance computing related research.
Before that, I obtained my Ph.D. from The University of Hong Kong under the supervision of Prof. Cho-Li Wang in 2022 and received my B.S. degree in Computer Science from University of Hong Kong in 2018.
招收大模型训练推理系统(特别是RLHF和多模态推理);隐私计算(同态加密、零知识证明)GPU加速、隐私计算框架设计、零知识证明示例应用等方向的硕士研究生。Research Interests
- GPU Algorithm Design
- GPU Compiler
- GPU Multitasking System
- Deep Learning & Big Data
- High Performance Computing
Publications (仅列出一作,或学生一作、本人通信的文章)
- Zhiyuan Zhang, Yanxin Cai, Wenhao Yin, Xueyu Wu, Yi Wang, Lei Ju, Zhuoran Ji*. Pipelonk: Accelerating End-to-End Zero-Knowledge Proof Generation on GPUs for PLONK-based Protocols. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP, CCF-A), 2025
- Xiangkai Yin, Shuoyu Wang, Zhaorui Zhang, Zimeng Zhou, Lei Ju, Zhuoran Ji*. TensorNTT: Architecture-Aware Optimizations for Number-Theoretic Transform on Tensor Core Unit. IEEE International Conference on Big Data (BigData, CCF-C), 2025
- Yifeng Tang, Huaman Zhou, Zhuoran Ji*, Cho-Li Wang. Cube-fx: Mapping Taylor Expansion Onto Matrix Multiplier-Accumulators of Huawei Ascend AI Processors. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025
- Zhuoran Ji, Jianyu Zhao, Peimin Gao, Xiangkai Yin, Lei Ju. Accelerating Number Theoretic Transform with Multi-GPU Systems for Efficient Zero Knowledge Proof. International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025
- Zhuoran Ji, Jianyu Zhao, Zhaorui Zhang, Jiming Xu, Shoumeng Yan, Lei Ju. A Compiler-Like Framework for Optimizing Cryptographic Big Integer Multiplication on GPUs. IEEE/ACM International Symposium on Microarchitecture (MICRO), 2024
- Zhuoran Ji, Zhiyuan Zhang, Jiming Xu, Lei Ju. Accelerating Multi-Scalar Multiplication for Efficient Zero Knowledge Proofs with Multi-GPU Systems. International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024
- Zhuoran Ji, Zhaorui Zhang, Jiming Xu, Lei Ju. POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2024
- Zhuoran Ji, Cho-Li Wang. Optimizing Aggregate Computation of Graph Neural Networks with a New Style of GPU Programming. International Conference on Parallel Architectures and Compilation Techniques (PACT), 2022
- Zhuoran Ji, Cho-Li Wang. Efficient Exact K-Nearest Neighbor Graph Construction for Billion-Scale Datasets using GPUs with Tensor Cores. ACM International Conference on Supercomputing (ICS), 2022
- Zhuoran Ji, Cho-Li Wang. Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022
- Zhuoran Ji, Cho-Li Wang. Accelerating DBSCAN Algorithm with AI Chips for Large Datasets. International Conference on Parallel Processing (ICPP), 2021
- Zhuoran Ji, Cho-Li Wang. Collaborative GPU Preemption via Spatial Multitasking for Efficient GPU Sharing. European Conference on Parallel Processing (EuroPar), 2021
- Zhuoran Ji, Cho-Li Wang. CTXBack: Enabling low latency GPU context switching via context flashback. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021
- Zhuoran Ji. Introduction to Mali GPU Driver gitbook, open access 2019
Honors and Awards
- Hong Kong PhD Fellowship (Top 3% among more than 4000 applications) 2018 - 2022
- Research Grants Council Student Research Supporting Funding 2022
- Y S and Christabel Lung Postgraduate Scholarship 2018
- Institute of Electrical and Electronics Engineers (IEEE) Prize, Hong Kong Section 2017
- Rosita King Ho Scholarship 2017
- Undergraduate Research Fellowship 2017
- The HKU Worldwide Scholarship 2016
- Dean’s Honours List 2015 - 2018
