Follow
Zhen Dong
Zhen Dong
PhD & Postdoc at Berkeley AI Research
Verified email at berkeley.edu - Homepage
Title
Cited by
Cited by
Year
A survey of quantization methods for efficient neural network inference
A Gholami*, S Kim*, Z Dong*, Z Yao*, MW Mahoney, K Keutzer
Book of Low-Power Computer Vision, 2022, 2022
12622022
Q-bert: Hessian based ultra low precision quantization of bert
S Shen*, Z Dong*, J Ye*, L Ma, Z Yao, A Gholami, MW Mahoney, ...
Conference on Artificial Intelligence (AAAI), 2020, 2020
5952020
Hawq: Hessian aware quantization of neural networks with mixed-precision
Z Dong, Z Yao, A Gholami, MW Mahoney, K Keutzer
International Conference on Computer Vision (ICCV), 2019, 2019
5642019
Zeroq: A novel zero shot quantization framework
Y Cai*, Z Yao*, Z Dong*, A Gholami, MW Mahoney, K Keutzer
Computer Vision and Pattern Recognition (CVPR), 2020, 2020
4512020
Hawq-v2: Hessian aware trace-weighted quantization of neural networks
Z Dong, Z Yao, D Arfeen, A Gholami, MW Mahoney, K Keutzer
Advances in Neural Information Processing Systems (NeurIPS), 2020, 2020
2922020
Hawq-v3: Dyadic neural network quantization
Z Yao, Z Dong, Z Zheng, A Gholami, J Yu, E Tan, L Wang, Q Huang, ...
International Conference on Machine Learning (ICML), 2021, 2021
2692021
SqueezeLLM: Dense-and-sparse quantization
S Kim, C Hooper, A Gholami, Z Dong, X Li, S Shen, MW Mahoney, ...
International Conference on Machine Learning (ICML), 2024, 2023
1442023
Q-diffusion: Quantizing diffusion models
X Li, Y Liu, L Lian, H Yang, Z Dong, D Kang, S Zhang, K Keutzer
International Conference on Computer Vision (ICCV), 2023, 2023
1302023
Hessian-aware pruning and optimal neural implant
S Yu, Z Yao, A Gholami, Z Dong, S Kim, MW Mahoney, K Keutzer
Winter Conference on Applications of Computer Vision (WACV), 2022, 2022
652022
Applications and techniques for fast machine learning in science
AMC Deiana, N Tran, J Agar, M Blott, G Di Guglielmo, J Duarte, P Harris, ...
Frontiers in Big Data, 2022, 2022
642022
Codenet: Efficient deployment of input-adaptive object detection on embedded fpgas
Z Dong*, D Wang*, Q Huang*, Y Gao, Y Cai, T Li, B Wu, K Keutzer, ...
International Symposium on Field-Programmable Gate Arrays (FPGA), 2021, 2021
612021
A novel convolution computing paradigm based on NOR flash array with high computing speed and energy efficiency
R Han, P Huang, Y Xiang, C Liu, Z Dong, Z Su, Y Liu, L Liu, X Liu, J Kang
IEEE Transactions on Circuits and Systems (TCAS), 2019, 2019
572019
Hao: Hardware-aware neural architecture optimization for efficient inference
Z Dong, Y Gao, Q Huang, J Wawrzynek, HKH So, K Keutzer
Field-Programmable Custom Computing Machines (FCCM), 2021, 2021
512021
Convolutional neural networks based on RRAM devices for image recognition and online learning tasks
Z Dong, Z Zhou, Z Li, C Liu, P Huang, L Liu, X Liu, J Kang
IEEE Transactions on Electron Devices (TED), 2018, 2018
492018
A survey of quantization methods for efficient neural network inference. arXiv preprint
A Gholami, S Kim, Z Dong, Z Yao, MW Mahoney, K Keutzer
47*2021
LLM inference unveiled: survey and roofline model insights
Z Yuan, Y Shang, Y Zhou, Z Dong, C Xue, B Wu, Z Li, Q Gu, YJ Lee, ...
arXiv preprint arXiv:2402.16363, 2024
432024
PB-LLM: Partially binarized large language models
Y Shang, Z Yuan, Q Wu, Z Dong
International Conference on Learning Representations (ICLR), 2024, 2023
432023
Cross-domain sentiment classification with contrastive learning and mutual information maximization
T Li, X Chen, S Zhang, Z Dong, K Keutzer
International Conference on Acoustics, Speech and Signal Processing (ICASSP …, 2021
422021
HAWQ-V2: Hessian aware trace-weighted quantization of neural networks (2019)
Z Dong, Z Yao
arXiv preprint arXiv:1911.03852, 2019
39*2019
NoisyQuant: Noisy bias-enhanced post-training activation quantization for vision transformers
Y Liu, H Yang, Z Dong, K Keutzer, L Du, S Zhang
Computer Vision and Pattern Recognition (CVPR), 2023, 2023
382023
The system can't perform the operation now. Try again later.
Articles 1–20