publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- TFSDynamic Fuzzy Sampler for Graph Neural NetworksIEEE Transactions on Fuzzy Systems, 2025
2024
- Under ReviewDual-pronged deep learning preprocessing on heterogeneous platforms with CPU, GPU and CSDarXiv preprint arXiv:2407.00005, 2024
- BEND: Bagging Deep Learning Training Based on Efficient Neural Network DiffusionarXiv preprint arXiv:2403.15766, 2024
2023
- TACOFastensor: Optimise the tensor I/O path from SSD to GPU for deep learning trainingACM Transactions on Architecture and Code Optimization, 2023
- ISLeader population learning rate scheduleInformation Sciences, 2023
- Revisit and Benchmarking of Automated Quantization Toward Fair ComparisonIEEE Transactions on Computers, 2023
- 数据密集型超算现状, 挑战以及未来发展趋势数据与计算发展前沿, 2023
2022
- ICDEHow much storage do we need for high performance serverIn 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2022
- GARLSched: Generative adversarial deep reinforcement learning task scheduling optimization for large-scale high performance computing systemsFuture Generation Computer Systems, 2022
- EP4DDL: addressing straggler problem in heterogeneous distributed deep learningThe Journal of Supercomputing, 2022
- Status, challenges and trends of data-intensive supercomputingCCF Transactions on High Performance Computing, 2022
- BenQ: Benchmarking automated quantization on deep neural network acceleratorsIn 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2022
2021
- Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systemsCCF transactions on high performance computing, 2021
- A tile-fusion method for accelerating Winograd convolutionsNeurocomputing, 2021
- Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype systemScientific Reports, 2021
- 天河三号原型机分布式并行深度神经网络性能评测及调优计算机工程与科学, 2021