Deokhyung Kang

Ph.D Student. POSTECH

prof_pic.jpg

I am a Ph.D. student in the Natural Language Processing (NLP) group at POSTECH, South Korea, advised by Prof. Gary Geunbae Lee. My primary research interest lies in multilingual language processing, with a special focus on reasoning — particularly on building language models that can reason robustly across languages. More broadly, I am also interested in multilingual NLP, information retrieval, and code generation.

Previously, I completed my B.S.E. in Computer Science and Engineering at POSTECH.

news

Oct 31, 2025 Release new preprint! “Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?” explores why RLM exhibit multilingual gap in reasoning. Check it out here!
Oct 01, 2025 “GuRE:Generative Query REwriter for Legal Passage Retrieval” has been accepted to NLLP 2025 Workshop co-located with the EMNLP 2025.
Aug 21, 2025 Two papers accepted at EMNLP 2025! “MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries” (Main), “Self-Correcting Code Generation Using Small Language Models” (Findings)
May 16, 2025 Two papers accepted at ACL 2025! “Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation” (Main), “EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance” (Findings)
Sep 21, 2024 Our paper, “Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing,” has been accepted at EMNLP 2024—my first EMNLP paper of my PhD!

selected publications

  1. EMNLP 2025
    MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
    Jonghwi Kim, Deokhyung Kang, Seonjeong Hwang, and 3 more authors
    2025
  2. ACL 2025
    Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation
    Deokhyung Kang*, Jeonghun Cho*, Yejin Jeon, and 4 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
  3. EMNLP 2024
    Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing
    Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  4. LREC-COLING 2024
    Denoising Table-Text Retrieval for Open-Domain Question Answering
    Deokhyung Kang, Baikjin Jung, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024