publications | Jia Wei

2025

Under Review

Dual-pronged deep learning preprocessing on heterogeneous platforms with CPU, GPU and CSD

Jia Wei, Xingjun Zhang, Witold Pedrycz, and 2 more authors

arXiv preprint arXiv:2407.00005, 2024

BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion

Jia Wei, Xingjun Zhang, and Witold Pedrycz

arXiv preprint arXiv:2403.15766, 2024

TACO

Fastensor: Optimise the tensor I/O path from SSD to GPU for deep learning training

Jia Wei, Xingjun Zhang, Longxiang Wang, and 1 more author

ACM Transactions on Architecture and Code Optimization, 2023

IS

Leader population learning rate schedule

Jia Wei, Xingjun Zhang, Zhimin Zhuo, and 4 more authors

Information Sciences, 2023

Revisit and Benchmarking of Automated Quantization Toward Fair Comparison

Zheng Wei, Xingjun Zhang, Zeyu Ji, and 2 more authors

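IEEE Transactions on Computers, 2023

Status, challenges, and future development trends of data-intensive supercomputing

Jia Wei, Mo Chen, Longxiang Wang, and 8 more authors

Frontiers of Data and Computing (数据与计算发展前沿), 2023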

ICDE

How much storage do we need for high performance server

Jia Wei and Xingjun Zhang

In 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2022

GARLSched: Generative adversarial deep reinforcement learning task scheduling optimization for large-scale high performance computing systems

Jingbo Li, Xingjun Zhang, Jia Wei, and 2 more authors

Future Generation Computer Systems, 2022

EP4DDL: addressing straggler problem in heterogeneous distributed deep learning

Zeyu Ji, Xingjun Zhang, Jingbo Li, and 2 more authors

The Journal of Supercomputing, 2022

Status, challenges and trends of data-intensive supercomputing

Jia Wei, Mo Chen, Longxiang Wang, and 8 more authors

CCF Transactions on High Performance Computing, 2022

BenQ: Benchmarking automated quantization on deep neural network accelerators

Zheng Wei, Xingjun Zhang, Jingbo Li, and 2 more authors

In 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2022

Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems

Jingbo Li, Xingjun Zhang, Zheng Wei, and 2 more authors

CCF Transactions on High Performance Computing, 2021

A tile-fusion method for accelerating Winograd convolutions

Zeyu Ji, Xingjun Zhang, Zheng Wei, and 2 more authors

Neurocomputing, 2021

Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype system

Jia Wei, Xingjun Zhang, Zeyu Ji, and 2 more authors

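Scientific Reports, 2021

Performance evaluation and tuning of distributed parallel deep neural networks on the Tianhe-3 prototype

Jia Wei, Xingjun Zhang, Zeyu Ji, and 2 more authors

Computer Engineering & Science (计算机工程与科学), 2021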