Lut-gemm: Quantized matrix multiplication based on luts for efficient inference in large-scale generative language models G Park, B Park, M Kim, S Lee, J Kim, B Kwon, SJ Kwon, B Kim, Y Lee, ... arXiv preprint arXiv:2206.09557, 2022 | 116 | 2022 |
Refining generative process with discriminator guidance in score-based diffusion models D Kim, Y Kim, SJ Kwon, W Kang, IC Moon arXiv preprint arXiv:2211.17091, 2022 | 86 | 2022 |
Memory-efficient fine-tuning of compressed large language models via sub-4-bit integer quantization J Kim, JH Lee, S Kim, J Park, KM Yoo, SJ Kwon, D Lee Advances in Neural Information Processing Systems 36, 2024 | 84 | 2024 |
Measurement of effectiveness for an anti-torpedo combat system using a discrete event systems specification-based underwater warfare simulator KM Seo, HS Song, SJ Kwon, TG Kim The Journal of Defense Modeling and Simulation 8 (3), 157-171, 2011 | 69 | 2011 |
Structured compression by weight encryption for unstructured pruning and quantization SJ Kwon, D Lee, B Kim, P Kapoor, B Park, GY Wei Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 52 | 2020 |
Maximum likelihood training of implicit nonlinear diffusion model D Kim, B Na, SJ Kwon, D Lee, W Kang, IC Moon Advances in neural information processing systems 35, 32270-32284, 2022 | 48 | 2022 |
Alphatuning: Quantization-aware parameter-efficient adaptation of large-scale pre-trained language models SJ Kwon, J Kim, J Bae, KM Yoo, JH Kim, B Park, B Kim, JW Ha, N Sung, ... arXiv preprint arXiv:2210.03858, 2022 | 35 | 2022 |
Biqgemm: matrix multiplication with lookup table for binary-coding-based quantized dnns Y Jeon, B Park, SJ Kwon, B Kim, J Yun, D Lee SC20: International Conference for High Performance Computing, Networking …, 2020 | 35 | 2020 |
Extremely low bit transformer quantization for on-device neural machine translation I Chung, B Kim, Y Choi, SJ Kwon, Y Jeon, B Park, S Kim, D Lee arXiv preprint arXiv:2009.07453, 2020 | 30 | 2020 |
Flexround: Learnable rounding based on element-wise division for post-training quantization JH Lee, J Kim, SJ Kwon, D Lee International Conference on Machine Learning, 18913-18939, 2023 | 29 | 2023 |
No token left behind: Reliable kv cache compression via importance-aware mixed precision quantization JY Yang, B Kim, J Bae, B Kwon, G Park, E Yang, SJ Kwon, D Lee arXiv preprint arXiv:2402.18096, 2024 | 25 | 2024 |
Learning low-rank approximation for cnns D Lee, SJ Kwon, B Kim, GY Wei arXiv preprint arXiv:1905.10145, 2019 | 22 | 2019 |
Simulation‐Based Optimization on the System‐of‐Systems Model via Model Transformation and Genetic Algorithm: A Case Study of Network‐Centric Warfare BG Kang, SH Choi, SJ Kwon, JH Lee, TG Kim Complexity 2018 (1), 4521672, 2018 | 22 | 2018 |
Effectiveness analysis of anti-torpedo warfare simulation for evaluating mix strategies of decoys and jammers SJ Kwon, KM Seo, B Kim, TG Kim Advanced Methods, Techniques, and Applications in Modeling and Simulation …, 2012 | 17 | 2012 |
Flexor: Trainable fractional quantization D Lee, SJ Kwon, B Kim, Y Jeon, B Park, J Yun Advances in neural information processing systems 33, 1311-1321, 2020 | 15 | 2020 |
Rethinking channel dimensions to isolate outliers for low-bit weight quantization of large language models JH Heo, J Kim, B Kwon, B Kim, SJ Kwon, D Lee arXiv preprint arXiv:2309.15531, 2023 | 12 | 2023 |
Modeling and simulation methodology for defense systems based on concept of system of systems TG Kim, SJ Kwon, B Kang Journal of Korean Institute of Industrial Engineers 39 (6), 450-460, 2013 | 11 | 2013 |
Adaptive discrete event simulation systems to embrace changes of requirements using event control models SJ Kwon, B Kang, C Choi, TG Kim IEEE Transactions on Systems, Man, and Cybernetics: Systems 50 (3), 1147-1160, 2017 | 7 | 2017 |
Design and implementation of event-based DEVS execution environment for faster execution of iterative simulation. SJ Kwon, TG Kim SpringSim (TMS-DEVS), 14, 2012 | 7 | 2012 |
Network pruning for low-rank binary indexing D Lee, SJ Kwon, B Kim, P Kapoor, GY Wei arXiv preprint arXiv:1905.05686, 2019 | 6 | 2019 |