Tenure-Track Assistant Professor
Department:Department of Translation
Educational Background & Work Experience
2021 - now; Assistant Professor; School of Foreign Languages; Shanghai Jiao Tong University
2015 - 2021; PhD in Computational Linguistics; Department of Linguistics, Indiana University Bloomington, USA
2012 - 2015; MA in English Linguistics; Renmin University of China
2008 - 2012; BA in English Language and Literature; Renmin University of China
2013 - 2014; Exchange student; Tuebingen University, Germany
Teaching and Research
Research Interests:
- Computational Linguistics; Natural Language Understanding; Natural Language Inference; Treebanks; Corpus Translation Studies; Digital Humanities
Research Projects:
- Building a treebank for translated Chinese literature (ongoing): Renmin University of China–Indiana University Joint Funding Program
- Original Chinese Natural Language Inference (OCNLI) corpus. link.
- Chinese Language Understanding Evaluation (CLUE) benchmark. link.
Recent Publications:
【自然语言理解/自然语言推理; natural language understanding/natural language inference】
-
Hu, Hai, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Ma, Yanting Li, Yixin Nie, Kyle Richardson (2021). Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference. In: Findings of ACL. paper. code.
-
Xu, Liang, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, and Zhenzhong Lan (2020). CLUE: A Chinese Language Understanding Evaluation Benchmark. In Proceedings ofthe 28th International Conference on Computational Linguistics (COLING). pp. 4762–4772. paper. website. github page
-
Hu, Hai, Kyle Richardson, Liang Xu, Lu Li, Sandra Kuebler, and Larry Moss. (2020). OCNLI: Original Chinese Natural Language Inference. In: Findings of the Association for Computational Linguistics: EMNLP 2020. pp. 3512–3526. paper. code and data. leaderboard.
-
Richardson, Kyle, Hai Hu, Larry Moss, and Ashish Sabharwal. (2020). Probing Natural Language Inference Models through Semantic Fragments. In: Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence. pp. 8713-8721. paper. code and data.
-
Hu, Hai, Qi Chen, Kyle Richardson, Atreyee Mukherjee, Lawrence S Moss, and Sandra Kuebler. (2020). MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity. In: Proceedings of the Society for Computation in Linguistics 2020. pp. 319-329. paper. poster. code.
-
Hu, Hai, Qi Chen and Larry Moss. (2019). Natural Language Inference with Monotonicity. In Proceedings of the 13th International Conference on Computational Semantics (IWCS 2019), pp. 8–15. Gothenburg, Sweden. paper.
-
Hu, Hai, and Lawrence S. Moss. (2018). Polarity Computations in Flexible Categorial Grammar. In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics: *SEM, pp. 124–129. New Orleans, Louisiana, USA. paper. poster. code.
【语义变迁; semantic change】
-
Amaral, Patrícia, Hai Hu and Sandra Kübler (accepted). “Tracing semantic change with distributional methods: The contexts of algo”. Diachronica. (SSCI).
-
Hu, Hai, Patrícia Amaral and Sandra Kübler (2021). “Word Embeddings and Semantic Shifts in Historical Spanish: Methodological Considerations”. Digital Scholarship in the Humanities. (SSCI) paper. code
【语料库翻译研究/翻译汉语树库建设; corpus translation studies/treebank construction】
-
Hu, Hai and Sandra Kübler. (2021). Investigating Translated Chinese and Its Variants Using Machine Learning. In Natural Language Engineering. Volume 27, Issue 3 , May 2021 , pp. 339 - 372. (SCI/SSCI/AHCI) paper. code.
-
Hu, Hai, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Sandra Kübler, and Chien-Jer Charles Lin (2020). “Building a Literary Treebank for Translation Studies in Chinese”. In: Proceedings of 19th International Workshop on Treebanks and Linguistic Theories (TLT). pp. 18-31. paper.
-
Hu, Hai, Wen Li, and Sandra Kübler. (2018). Detecting Syntactic Features of Translated Chinese. In Proceedings of the 2nd Workshop on Stylistic Variation, pp. 20-28. New Orleans, Louisiana, USA. paper. slides. video presentation.
【其它; others】
-
Lin, Chien-Jer Charles, & Hu, Hai. (2018). Linking comprehension and production: Frequency distribution of Chinese relative clauses in the Sinica Treebank. In Chu-Ren Huang, Shukai Hsieh, & Peng Jin (eds.) Text, Speech, and Language Technology Series. Springer. pp. 1-21.
-
Hu, Hai and Yiwen Zhang. (2017). Path of Vowel Raising in Chengdu Dialect of Mandarin. In Proceedings of the 29th North America Conference on Chinese Linguistics. Rutgers, NJ. pp. 481-498. paper. abstract.
-
Hu, Hai, Daniel Dakota, and Sandra Kübler. (2017). Non-Deterministic Segmentation for Chinese Lattice Parsing. In Proceedings of Recent Advances of Natural Language Processing 2017, pp. 316–324. Varna, Bulgaria. paper. bib.
For all publications, see https://huhailinguist.github.io/publications/
Translation:
- Surfaces and Essences: Analogy as the Fuel and Fire of Thinking《表象与本质——类比,思考之源和思维之火》Translated by 刘健、胡海、陈祺;Written by Douglas Hofstadter and Emmanuel Sander; 2018
Professional Service
Workshop Organization
-
NAtural LOgic meets MAchine learning (NALOMA) workshop; Workshop at WESSLLI 2020. webpage
Reviewing for
- ACL, EMNLP, NAACL, CCL and other CL/NLP conferences
- Natural Language Engineering and other journals