Abstract
Quantization-Aware Training (QAT) uses batch normalization (BN) folding during fine-tuning, so it may not use the normalization effects of the BN layer. HAWQ, which achieved SOTA with QAT, has significant accuracy degradation as the training becomes longer. In this paper, we apply Adaptive Gradient Clipping (AGC) to stable quantization-aware training and improve accuracy by adding Dropout. Moreover, we have ablation studies about AGC.
Original language | English |
---|---|
Title of host publication | 2023 International Conference on Electronics, Information, and Communication, ICEIC 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350320213 |
DOIs | |
State | Published - 2023 |
Event | 2023 International Conference on Electronics, Information, and Communication, ICEIC 2023 - Singapore, Singapore Duration: 5 Feb 2023 → 8 Feb 2023 |
Publication series
Name | 2023 International Conference on Electronics, Information, and Communication, ICEIC 2023 |
---|
Conference
Conference | 2023 International Conference on Electronics, Information, and Communication, ICEIC 2023 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 5/02/23 → 8/02/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.