Deokhyung Kang

Ph.D. Student, POSTECH

I am a Ph.D. student in the Natural Language Processing (NLP) group at POSTECH, South Korea, advised by Prof. Gary Geunbae Lee. My primary research interest is multilingual language processing, with a special focus on reasoning: in particular, building language models that can reason robustly across languages. More broadly, I am also interested in information retrieval and code generation.

Previously, I completed my B.S.E. in Computer Science and Engineering at POSTECH.

news

May 12, 2026 “Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model” has been accepted to SurGeLLM@ACL 2026! 🎉
Apr 07, 2026 “Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?” has been accepted to ACL 2026 Findings! 🎉
Oct 01, 2025 “GuRE:Generative Query REwriter for Legal Passage Retrieval” has been accepted to the NLLP 2025 Workshop, co-located with EMNLP 2025.
Aug 21, 2025 Two papers accepted at EMNLP 2025! “MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries” (Main) and “Self-Correcting Code Generation Using Small Language Models” (Findings).
May 16, 2025 Two papers accepted at ACL 2025! “Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation” (Main) and “EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance” (Findings).

selected publications

  1. EMNLP 2025
    MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
    Jonghwi Kim, Deokhyung Kang, Seonjeong Hwang, and 3 more authors
    2025
  2. ACL 2025
    Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation
    Deokhyung Kang*, Jeonghun Cho*, Yejin Jeon, and 4 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
  3. EMNLP 2024
    Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing
    Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  4. LREC-COLING 2024
    Denoising Table-Text Retrieval for Open-Domain Question Answering
    Deokhyung Kang, Baikjin Jung, Yunsu Kim, and 1 more author
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024