About
Hi, I’m Haneul Yoo, a Ph.D. candidate advised by Alice Oh at the Users & Information Lab, School of Computing, KAIST.
As a researcher in machine learning (ML) and natural language processing (NLP), my ultimate goal is to bridge communication gaps across cultures and languages with the help of large language models (LLMs).
In particular, I am dedicated to (1) data-centric and multilingual NLP and (2) LLM-driven innovation in education.
I believe many languages are still underrepresented in NLP, and our efforts on low-resource languages can build more culturally diverse language models.
I also believe recent advances in AI can assist language learners, and researchers can leverage them to enhance NLP data and techniques as well.
Education
- Ph.D. in School of Computing, KAIST (Sep 2022 - current)
- M.S. in School of Computing, KAIST (Sep 2020 - Aug 2022)
- B.S. in Computer Science and Engineering, Ajou University (Mar 2017 - Aug 2020)
- B.S. in Digital Media, Ajou University (Mar 2017 - Aug 2020)
- Semester Abroad at University of Wisconsin-Stout, WI, USA (Winter 2018)
Work Experiences
- Research Intern at Naver AI Lab, Sungnam, Korea (Mar 2024 - May 2024)
  - Development of multilingual LLM using code-switching corpora
  - Advisor: Hwaran Lee
- Data Management Intern at Upstage, Remote (Aug 2023 - Feb 2024)
  - Construction and management of Korean evaluation benchmark for LLMs
  - Advisors: Taehwan Oh, Jiyoon Han
- Visiting Student at Naver AI Lab, Sungnam, Korea (Mar 2023 - Jul 2023)
  - Construction and management of KoBBQ, Korean bias benchmark for question answering
  - Advisor: Hwaran Lee
- Research Intern at LAMDA Lab, Suwon, Korea (Mar 2020 - Aug 2020)
  - Study of deep learning models for bio-signal (EEG) analysis
  - Advisors: Jungryul Seo, Kyung-Ah Sohn
- Research Intern at KEPRI, Daejeon, Korea (Sep 2019 - Feb 2020)
  - Development of application services utilizing electric power usage data
  - Advisor: Moonsuk Choi
- Research Intern at CSIRO, QLD, Australia (Jul 2019 - Aug 2019)
  - Development of convolutional deep neural network for detecting cattle in images/videos
  - Advisor: Brano Kusy
- Software Engineer and Co-founder at Indielist, Suwon, Korea (Jul 2018 - Jun 2019)
  - Establishment of an independent music platform and development of iOS application services
Publications
* denotes equal contributions
Under Review / Preprints
- Haneul Yoo, Jieun Han, So-Yeon Ahn, Alice Oh, “DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing” (under review)
- Jieun Han*, Haneul Yoo*, Junho Myung, Minsun Kim, Hyunseung Lim, Yoonsu Kim, Tak Yeon Lee, Hwajung Hong, Juho Kim, So-Yeon Ahn, Alice Oh, “FABRIC: Automated Scoring and Feedback Generation for Essays” (preprint)
International Journals / Conferences
- Jieun Han*, Haneul Yoo*, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh, “RECIPE4U: Exploring Student-ChatGPT Dialogue in EFL Writing Education” (LREC-COLING 2024)
- Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh, “CLIcK: Evaluation of Cultural and Linguistic Intelligence in Korean” (LREC-COLING 2024)
- Jiho Jin*, Jiseon Kim*, Nayeon Lee*, Haneul Yoo*, Alice Oh, Hwaran Lee, “KoBBQ: Korean Bias Benchmark for Question Answering” (TACL 2024)
- Haneul Yoo, Rifki Afina Putri, Changyoon Lee, Youngin Lee, So-Yeon Ahn, Dongyeop Kang, Alice Oh, “Rethinking Annotation: Can Language Learners Contribute?” (ACL 2023)
- Juhee Son*, Jiho Jin*, Haneul Yoo, JinYeong Bak, Kyunghyun Cho, Alice Oh, “Translating Hanja historical documents to understandable Korean and English” (EMNLP 2022, Findings)
- Haneul Yoo, Jiho Jin, Juhee Son, JinYeong Bak, Kyunghyun Cho, Alice Oh, “HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea” (NAACL 2022, Findings)
- Yohan Jo, Haneul Yoo, JinYeong Bak, Alice Oh, Chris Reed, Eduard Hovy, “Knowledge-Enhanced Evidence Retrieval for Counterargument Generation” (EMNLP 2021, Findings)
Workshops / Posters
- Jieun Han*, Haneul Yoo*, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh, “Exploring Student-ChatGPT Dialogue in EFL Writing Education” (NeurIPS 2023 Workshop)
- Jieun Han*, Haneul Yoo*, Yoonsu Kim, Junho Myung, Minsun Kim, Hyunseung Lim, Juho Kim, Tak Yeon Lee, Hwajung Hong, So-Yeon Ahn, Alice Oh, “RECIPE: How to Integrate ChatGPT into EFL Writing Education” (L@S 2023, Work in Progress)
Domestic Journals / Conferences
- Juyoung Suk*, Eunsu Kim*, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh, “CLIcK: Evaluation of Cultural and Linguistic Intelligence in Korean” (JKAIA 2023)
- Haneul Yoo, Jungryul Seo, Kyung-Ah Sohn, “Image-based Deep Learning Approach for EEG Signal Classification” (KICS 2020)
- Moonsuk Choi, Inji Choi, Minhae Jang, Haneul Yoo, “Proposal and Simulation of Optimal Electric Vehicle Routing Algorithm”, KEPCO Journal on Electric Power and Energy Volume 6, Number 1, March 2020, pp.59-64 (KEPCO 2020)
Talks
- Guest Lecturer, AI Tech Boostcamp
  - “Some Points that We Should Consider about Data-Centric NLP” (May 2023)
  - “The Data You Created, Are You Sure It’s Okay?” (Dec 2022)
Teaching Experiences
- Teaching Assistant, NAVER Connect Foundation
- Teaching Assistant, KAIST
  - The Many Voices of ChatGPT: Exploring Linguistic Diversity (Spring - Fall 2023)
  - ML for NLP (Fall 2023)
  - AI Ethics (Spring 2023)
  - AI and Its Social Impact (Spring 2023)
  - KAIST SoC Colloquium (Fall 2021)
  - Machine Learning (Fall 2022, Spring 2021)
- Lecturer, SWeat
  - Python at Suwon Information Science High School (Spring 2020)
  - Java at Maetan High School (Spring 2019)
Academic Services
Honors & Awards
- KAIST Support Scholarship at KAIST (Sep 2020 - current)
- Graduation with Honors, Summa Cum Laude (1st place) at Ajou University (Aug 2020)
- Outstanding Volunteer Award at Gyeonggi Volunteer Center (Jun 2020)
- Special Prize at BIXPO 2019 (Nov 2019)
- Excellence Award from Korea Invention Promotion Association at Creative Start-up Competition (Nov 2019)
- Academic Excellence Scholarship at Asan Foundation (Mar 2017 - Aug 2020)
Reference
- Alice Oh, Professor in School of Computing, KAIST