What is wrong with scene text recognition model comparisons? dataset and model analysis
J Baek, G Kim, J Lee, S Park, D Han, S Yun, SJ Oh, H Lee
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
Cited by: 587; Year: 2019

OCR-Free Document Understanding Transformer
G Kim, T Hong, M Yim, JY Nam, J Park, J Yim, W Hwang, S Yun, D Han, ...
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Cited by: 190*; Year: 2022

Cost-effective End-to-end Information Extraction for Semi-structured Document Images
W Hwang, H Lee, J Yim, G Kim, M Seo
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
Cited by: 25; Year: 2021

Graph embedding with shifted inner product similarity and its improved approximation capability
A Okuno, G Kim, H Shimodaira
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 10; Year: 2019

Representation learning with weighted inner product for universal approximation of general similarities
G Kim, A Okuno, K Fukui, H Shimodaira
Proceedings of the Twenty-Eighth International Joint Conference on …, 2019
Cited by: 9; Year: 2019

Word-like character n-gram embedding
G Kim, K Fukui, H Shimodaira
Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User …, 2018
Cited by: 6; Year: 2018

On text localization in end-to-end OCR-Free document understanding transformer without text localization supervision
G Kim, S Yokoo, S Seo, A Osanai, Y Okamoto, Y Baek
International Conference on Document Analysis and Recognition, 215-232, 2023
Cited by: 4; Year: 2023

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
G Kim, H Lee, D Kim, H Jung, S Park, Y Kim, S Yun, T Kil, B Lee, S Park
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
Cited by: 4*; Year: 2023

Segmentation-free Compositional n-gram Embedding
G Kim, K Fukui, H Shimodaira
Proceedings of the 2019 Conference of the North American Chapter of the …, 2019
Cited by: 4; Year: 2019

On Web-based Visual Corpus Construction for Visual Document Understanding
D Kim, T Hong, M Yim, Y Kim, G Kim
Proceedings of the International Conference on Document Analysis and …, 2023
Cited by: 2*; Year: 2023

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
D Kim, Y Kim, DH Kim, Y Lim, G Kim, T Kil
2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Cited by: 1; Year: 2023

Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings
M Naito, S Yokoi, G Kim, H Shimodaira
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021
Cited by: 1; Year: 2021

Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model
S Park, G Kim, J Lee, J Cha, JH Kim, H Lee
Proceedings of the 28th International Conference on Computational …, 2020
Cited by: 1; Year: 2020

CREPE: Coordinate-Aware End-to-End Document Parser
Y Okamoto, Y Baek, G Kim, R Nakao, DH Kim, MB Yim, S Park, B Lee
arXiv preprint arXiv:2405.00260, 2024
Year: 2024

HyperCLOVA X Technical Report
HyperCLOVA
arXiv preprint arXiv:2404.01954, 2024
Year: 2024

Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
S Lee, S Kim, SH Park, G Kim, M Seo
arXiv preprint arXiv:2401.06591, 2024
Year: 2024

Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
G Kim, W Hwang, M Seo, S Park
Proceedings of the AAAI-22 Workshop on Knowledge Discovery from Unstructured …, 2022
Year: 2022

Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization
M Mizutani, A Okuno, G Kim, H Shimodaira
arXiv preprint arXiv:2005.00670, 2020
Year: 2020

Free Donut: … Using Attention in E2E Document Understanding Models
S Yokoo, G Kim, S Seo, A Osanai, Y Okamoto, Y Baek