HKU-IDS Scholar
Professor Tao YU
Assistant Professor
HKU Musketeers Foundation Institute of Data Science and
Department of Computer Science, School of Computing and Data Science, HKU
Department of Computer Science, School of Computing and Data Science, HKU
Key expertise
Natural Language Processing, Artificial Intelligence
About me
Dr. Tao Yu is an Assistant Professor in the Computer Science Department of the University of Hong Kong. He is also a Postdoctoral Research Fellow in the Department of Computer Science and Engineering at University of Washington and a co-director of the NLP group at the University of Hong Kong. His research interest is in Natural Language Processing and Deep Learning, with a focus on designing and building conversational natural language interfaces that can help humans explore and reason over data in any application (e.g., relational databases and mobile apps) in a robust and trusted manner. He has published and served in the program committee at ACL, EMNLP, ICLR, NAACL, etc. He co-organized the Interactive and Executable Semantic Parsing workshop at EMNLP 2020.
Current Research Project
The current project Dr Yu is involved in is titled “Democratizing data science via conversational executable natural language understanding: building AI collaborators for everyone including laypeople via a natural language interface to coding, databases, and apps.”
Selected Publications
- Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A Smith, Tao Yu. Selective Annotation Makes Language Models Better Few-Shot Learners. (2022)
- Tianbao Xie*, Chen Henry Wu*, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu. UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models. (2022)
- Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong. GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. ICLR 2021. (2021)
- Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li, Qingning Yao, Shanelle Roman, Zilin Zhang and Dragomir Radev. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task. (2019)
Research Interests
Natural Language Processing, Deep Learning, Dialog Systems, Natural Language Interfaces
Awards
Seminar