Deokhyung Kang

Ph.D Student. POSTECH

prof_pic.jpg

I’m a Ph.D. student in the Natural Language Processing (NLP) group, working with Prof. Gary Geunbae Lee at POSTECH, South Korea. My primary research interest lies in multilingual language processing, with a special focus on semantic parsing. I aim to expand semantic parsing systems across multiple languages while maintaining their reasoning capabilities.

Beyond semantic parsing, my research interests include multilingual NLP, question answering, and information retrieval. I am particularly fascinated by the challenges of enabling robust reasoning across diverse linguistic landscapes.

Previously, I completed my B.S.E. in Computer Science and Engineering at POSTECH.

news

Aug 21, 2025 Two papers accepted at EMNLP 2025! “MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries” (Main), “Self-Correcting Code Generation Using Small Language Models” (Findings)
May 16, 2025 Two papers accepted at ACL 2025! “Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation” (Main), “EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance” (Findings)
Feb 25, 2025 Release new preprint! “Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation” explores generating Ladder Programs using LLMs. Check it out here!
Sep 21, 2024 Our paper, “Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing,” has been accepted at EMNLP 2024—my first EMNLP paper of my PhD!
Feb 20, 2024 “Denoising Table-Text Retrieval for Open-Domain Question Answering,” has been accepted to LREC-COLING 2024.

selected publications

  1. EMNLP 2025
    MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
    Jonghwi Kim, Deokhyung Kang, Seonjeong Hwang, and 3 more authors
    2025
  2. ACL 2025
    Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation
    Deokhyung Kang*, Jeonghun Cho*, Yejin Jeon, and 4 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
  3. EMNLP 2024
    Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing
    Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  4. LREC-COLING 2024
    Denoising Table-Text Retrieval for Open-Domain Question Answering
    Deokhyung Kang, Baikjin Jung, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024