HKU-IDS Scholar
Department of Computer Science, School of Computing and Data Science, HKU
Natural Language Processing, Artificial Intelligence
Professor Tao Yu is an Assistant Professor in the Computer Science Department of the University of Hong Kong. He is also a Postdoctoral Research Fellow in the Department of Computer Science and Engineering at University of Washington and a co-director of the NLP group at the University of Hong Kong. His research interest is in Natural Language Processing and Deep Learning, with a focus on designing and building conversational natural language interfaces that can help humans explore and reason over data in any application (e.g., relational databases and mobile apps) in a robust and trusted manner. He has published and served in the program committee at ACL, EMNLP, ICLR, NAACL, etc. He co-organized the Interactive and Executable Semantic Parsing workshop at EMNLP 2020.
The current project Professor Yu is involved in is titled “Democratizing data science via conversational executable natural language understanding: building AI collaborators for everyone including laypeople via a natural language interface to coding, databases, and apps.”
- Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A Smith, Tao Yu. Selective Annotation Makes Language Models Better Few-Shot Learners. (2022)
- Tianbao Xie*, Chen Henry Wu*, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu. UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models. (2022)
- Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong. GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. ICLR 2021. (2021)
- Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li, Qingning Yao, Shanelle Roman, Zilin Zhang and Dragomir Radev. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task. (2019)