Le Sun 2019-08-19T12:55:53+00:00

孙乐(Le Sun)

Distinguished Professor

Phone:   +86-10-62561512

Email:    lesunle{at}163{dot}com

Address: 4# South Fourth Street, Zhong Guan Cun, Haidian District, Beijing

BIOGRAPHY

In 1998, I received my doctoral degree from Nanjing University of Science & Technology.
From 1998 to 2000, Post-Doc Researcher at the Chinese Information Processing Center, Institute of Software, Chinese Academy of Sciences.
Since 2001, Associate Professor at the Chinese Information Processing Center, Institute of Software, Chinese Academy of Sciences.
From March 2003 to September 2003, Visiting Senior Research Fellow at the Centre for Corpus Linguistics, Department of English, University of Birmingham, UK.
From Dec. 2004 to Dec. 2005, Visiting Scholar at RALI, University of Montreal, Canada.

RESEARCH INTERESTS

My major research interests include Knowledge-based Natural Language Understanding (K-NLU) and Chinese Information Processing.

AWARDS & ACHIEVEMENTS

  • Excellent tutor at the University of Chinese Academy of Sciences

SELECTED PUBLICATIONS

Translated Books

  1. Zhiwei Feng, Le Sun. 自然语言处理综论 (2nd edition) [M]. Beijing: Publishing House of Electronics Industry, March 2018.

Academic Papers

  1. Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun, Bin Dong, Shanshan Jiang. Gazetteer-Enhanced Attentive Neural Networks for Named Entity Recognition. In: 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019, CCF-B).
  2. Lingyong Yan, Xianpei Han, Le Sun and Ben He. Learning to Bootstrap for Entity Set Expansion. In: 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019, CCF-B).
  3. Hao Nie, Xianpei Han, Ben He, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, Hao Kong. Deep Sequence-to-Sequence Entity Matching for Heterogeneous Entity Resolution. In: Proceedings of The 28th ACM Conference on Information and Knowledge Management (CIKM 2019, CCF-B), Beijing, China, November 3-7, 2019.
  4. Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun. Sequence-to-Nuggets: Nested Entity Mention Detection via Anchor-Region Networks. In: the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019, CCF-A).
  5. Yaojie Lu, Hongyu Lin, Xianpei Han and Le Sun. Distilling Discrimination and Generalization Knowledge for Event Detection via ∆-Representation Learning. In: the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019, CCF-A).
  6. Jialong Tang, Ziyao Lu, Jinsong Su, Yubin Ge, Linfeng Song, Le Sun, Jiebo Luo. Progressively Self-Supervised Attention Learning for Aspect-Level Sentiment Analysis. In: the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019, CCF-A).
  7. Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun. Cost-sensitive Regularization for Label Confusion-aware Event Detection. In: the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019, CCF-A).
  8. Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu and Hao Kong. End-to-End Multi-Perspective Matching for Entity Resolution. In: the 28th International Joint Conference on Artificial Intelligence (IJCAI 2019, CCF-A).
  9. Bo Chen, Le Sun and Xianpei Han. Sequence-to-Action: End-to-End Semantic Graph Generation for Semantic Parsing. In: the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018, CCF-A).
  10. Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun. Nugget Proposal Networks for Chinese Event Detection. In: the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018, CCF-A).
  11. Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun. Adaptive Scaling for Sparse Detection in Information Extraction. In: the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018, CCF-A).
  12. Bo An, Xianpei Han and Le Sun. Accurate Text-Enhanced Knowledge Graph Representation Learning. In: The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018, CCF-B).
  13. Bo Chen, Le Sun and Xianpei Han. Semi-Supervised Lexicon Learning for Wide-Coverage Semantic Parsing. In: The 27th International Conference on Computational Linguistics (COLING 2018, CCF-B).
  14. Bo An, Xianpei Han and Le Sun. Model-Free Context-Aware Word Composition. In: The 27th International Conference on Computational Linguistics (COLING 2018, CCF-B).
  15. Hongyu Lin, Le Sun, Xianpei Han. Reasoning with Heterogeneous Knowledge for Commonsense Machine Comprehension. In: Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017, CCF-B).
  16. Xianpei Han and Le Sun. Distant Supervision via Prototype-based Global Representation Learning. In: the Thirty-First AAAI Conference (AAAI-17,CCF-A).
  17. Xianpei Han, Le Sun. Context-Sensitive Inference Rule Discovery: A Graph-based Method. In: the 26th International Conference on Computational Linguistics (COLING 2016,CCF-B).
  18. Bo Chen, Le Sun, Xianpei Han, Bo An. Sentence Rewriting for Semantic Parsing. In: The 54th annual meeting of the Association for Computational Linguistics (ACL 2016,CCF-A)
  19. Xianpei Han, Le Sun. Global Distant Supervision for Relation Extraction. The Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16,CCF-A).
  20. Zhenzhong Zhang, Le Sun, Xianpei Han. A Joint Model for Entity Set Expansion and Attribute Extraction from Web Search Queries. The Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16, CCF-A). 2016.
  21. Bei Shi, Zhenzhong Zhang, Le Sun, Xianpei Han. A Probabilistic Co-Bootstrapping Method for Entity Set Expansion. The 25th International Conference on Computational Linguistics (COLING 2014). 2014.
  22. Zhenzhong Zhang, Le Sun and Xianpei Han. Learning to Mine Query Subtopics from Query Log. In: The 53rd annual meeting of the Association for Computational Linguistics (ACL 2015, CCF-A).
  23. Xianpei Han and Le Sun. Semantic Consistency: A Local Subspace Based Method for Distant Supervised Relation Extraction. In: The 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014,CCF-A), Baltimore, Maryland, 2014. pp. 718-724.
  24. Le Sun and Xianpei Han. A Feature-Enriched Tree Kernel for Relation Extraction. In: The 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014,CCF-A), Baltimore, Maryland, 2014. pp. 61-67.
  25. Zhenzhong Zhang, Le Sun and Xianpei Han. Learning to detect task boundaries of query session. In: The 22nd ACM International Conference on Information & Knowledge Management (CIKM 2013, CCF-B).
  26. Xianpei Han and Le Sun. An Entity-Topic Model for Entity Linking. In: Conference on Empirical Methods in Natural Language Processing and Natural Language Learning (EMNLP-CoNLL 2012,CCF-B)
  27. Xianpei Han, Le Sun and Jun Zhao. Collective Entity Linking in Web Text: A Graph-Based Method. In: The 34th Annual ACM SIGIR Conference (SIGIR 2011,CCF-A), Beijing, China, July 24-28, 2011.
  28. Xianpei Han and Le Sun. A Generative Entity-Mention Model for Linking Entities with Knowledge Base. In: The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT 2011,CCF-A), Portland, Oregon, USA, June 19-24, 2011.
  29. Zhenzhong Zhang and Le Sun. Improving Word Sense Induction by Exploiting Semantic Relevance. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP 2011).Chiang Mai, Thailand. Nov. 8-13, 2011.
  30. Xue Jiang, Xianpei Han and Le Sun. ISCAS at Subtopic Mining Task in NTCIR9. In Proceedings of NII Test Collection for IR Systems (NTCIR 2011). Tokyo, Japan. December 6-9, 2011.
  31. Yunping Huang, Le Sun. Query Model Refinement Using Word Graphs. In Proceedings of the 19th International Conference on Information and Knowledge Management (CIKM 2010).
  32. Yunping Huang, Le Sun. A Unified Iterative Optimization Algorithm for Query Model and Ranking Refinement. The Sixth Asia Information Retrieval Societies Conference (AIRS 2010).
  33. Wenbo Li, Le Sun, Zhenzhong Zhang, Xue Jiang, Weiru Zhang. TC-DCA: A System for Text Classification Based on Document’s Content Allocation. In Proceedings of the 19th International Conference on Information and Knowledge Management (CIKM 2010).
  34. Dakun Zhang, Le Sun, Wenbo Li. Improving Phrase-based SMT Model with Flattened Bilingual Parse Tree. In Proceedings of the 6th IEEE International Conference on Natural Language Processing and Knowledge Engineering(NLPKE 2010)
  35. Zhenzhong Zhang, Le Sun, Wenbo Li. ISCAS: A System of Chinese Word Sense Induction Based on K-means Algorithm. In Proceedings of the 1st CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP 2010)
  36. Zhenzhong Zhang, Le Sun, Qiang Dong. Overview of Chinese Word Sense Induction at Task-4 at CLP2010. In Proceedings of the 1st CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP 2010)
  37. Yunping Huang, Le Sun, Jian-Yun Nie. Smoothing Document Language Model with Local Word Graph. In Proceedings of the 18th International Conference on Information and Knowledge Management (CIKM 2009) (accepted, short paper).
  38. Yunping Huang, Le Sun, Zhe Wang. A Unified Graph-Based Iterative Reinforcement Approach to Personalized Search. The Fifth Asia Information Retrieval Symposium (AIRS 2009).
  39. Wenbo Li, Le Sun, et al. Smoothing LDA Model for Text Categorization. 4th Asia Information Retrieval Symposium (AIRS 2008), LNCS 4993, pp. 83–94, Harbin, 2008
  40. Jing Li, Le Sun. A Lexical Chain Approach for Update-style Query-focused Multi-document Summarization. In the Proceedings of AIRS. Harbin, Jan 2008
  41. Ruihong Huang, Le Sun, Yuanyong Feng. Study of Kernel-based Methods for Chinese Relation Extraction(poster). In the LNCS, Springer, AIRS 08
  42. Dakun Zhang, Le Sun and Wenbo Li. A Structured Prediction Approach for Statistical Machine Translation, International Joint Conference on Natural Language Processing (IJCNLP 2008) (poster) Hyderabad, India, 2008
  43. Yuanyong Feng, Ruihong Huang, Le Sun. Two-Step Chinese NER Based on CRF. The Fourth SIGHAN Bakeoff: the First CIPSC. Hyderabad, India. January 7-12, 2008
  44. Yunping Huang, Yulin Wang, Le Sun. ISCAS at Multilingual Opinion Analysis Task. NTCIR 7,Tokyo,Japan, 2008
  45. Le Sun. Introduction of the HTRDP Chinese IR Evaluation. The First International Workshop on Evaluating Information Access ( EVIA 2007), May, 2007, Tokyo
  46. Le Sun. A User Adaptive Framework for Computer-aided Translation System, Chapter 9 in book Computer-aided Translation: Theory and Practice, 2007
  47. LIU Qun, WANG Xiangdong, LIU Hong, SUN Le, TANG Sheng, XIONG Deyi, HOU Hongxu, LV Yuanhua, LI Wenbo, LIN Shouxun, QIAN Yueliang. Introduction to HTRDP evaluations on Chinese information processing and intelligent human-machine interface, Frontiers of Computer Sciences in China, Vol.1, No.1, Feb.2007
  48. Jing Li, Le Sun, Chunyu Kit, Jonathan Webster. A Query-Focused Multi-Document Summarizer Based on Lexical Chains, Proceeding of the Document Understanding Conferences (DUC) 2007
  49. Huang Rui-hong, Sun Le, Li Jing, Pan Long-xi and Zhang Junlin. ISCAS in CLIR at NTCIR-6: Experiments with MT and PRF, NTCIR-6, Tokyo, Japan, 2007
  50. Huang Rui-hong, Sun Le, Pan Long-xi. ISCAS in Opinion Analysis Pilot Task: Experiments with sentimental dictionary based classifier and CRF model, NTCIR-6, Tokyo, Japan, 2007
  51. Yuanhua Lv, Le Sun, et al. An Iterative Implicit Feedback Approach to Personalized Search. Proceeding of COLING/ACL 2006, Sydney (Regular paper)
  52. Yuanyong Feng, Le Sun, Yuanhua Lv. Chinese Word Segmentation and Named Entity Recognition Based on Conditional Random Fields Models, Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, 2006, Sydney
  53. Quan Zhou, Le Sun, Yuanhua Lv. ISCAS at DUC06, Proceeding of the Document Understanding Conferences (DUC) 2006
  54. Jinming Min, Le Sun and Junlin Zhang. ISCAS in English-Chinese CLIR at NTCIR-5. Proceedings of the Fifth NTCIR Workshop on Research in Information Access Technologies Information Retrieval, Question Answering and Summarization, Tokyo Japan, 2005
  55. Quan Zhou, Le Sun, Jian-Yun Nie. IS_SUM: A Multi-Document Summarizer based on Document Index Graphic and Lexical Chains, Proceeding of the Document Understanding Conferences (DUC) 2005,10
  56. Junlin Zhang, Le Sun. Using the Web Corpus to Translate the Queries in Cross-Lingual Information Retrieval, 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering. Oct., 2005
  57. Yuanyong Feng, Le Sun and Junlin Zhang. Early Results for Chinese Named Entity Recognition Using Conditional Random Fields Model, HMM and Maximum Entropy, 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering. Oct., 2005
  58. Junlin Zhang, Le Sun, Quan Zhou. A Cue-based Hub-Authority Approach for Multi-Document Text Summarization, 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering. Oct., 2005
  59. Zhang Junlin, Sun Le, Lv Yuanhua, Zhang Wei. Relevance Feedback by Exploring the Different Feedback Source and Collection Structure, Proceeding of the Text REtrieval Conference (TREC). TREC 2005
  60. Zhang Junlin, Sun Le, Qu Weimin, Du Lin, Sun Yufang. A Three Level Cache-based Adaptive Chinese Language Model, Lecture Notes in Computer Science (Springer) 2004.3
  61. Qu Wei-min, Zhang Jun-lin, Sun Le, Sun Yu-fang. An Efficient Indexing and Querying Algorithm for Large-scale XML Data, Journal of Software (软件学报), 2003, Vol.14 (Suppl.), p97~104
  62. Sun Le, Zhang Junlin, Sun Yufang. ISCAS at TREC2004: HARD Track. Proceeding of the Text REtrieval Conference (TREC). TREC 2004
  63. Zhang Junlin, Sun Le, Qu Weimin, Sun Yufang. A Trigger Language Model-based IR system, The 20th International Conference on Computational Linguistics (COLING2004). Geneva, Switzerland, Vol.1, pp. 680-686
  64. Zhang Junlin, Sun Le, Yongchen Zhang. Applying Language Model into IR Task, NTCIR Workshop Fourth Meeting, 2004
  66. Zeng Wu, Lin Du, Le Sun, Shiwei Ye. TREC12 HARD Track at ISCAS. In Proceeding of the Text REtrieval Conference (TREC) TREC 2003
  67. Le Sun, Wei-min Qu, Song Xue. Constructing of a Large-Scale Chinese-English Parallel Corpus, In Coling2002, The 3rd Workshop on Asian Language Resources and International Standardization, Taiwan, 2002
  68. Jun-lin Zhang, Le Sun, Wei-min Qu, Lin Du, Song Xue. ISCAS IN NTCIR-3, NTCIR-3, Tokyo, Japan, 2002
  69. Du Lin, Zhang Yibo, Sun Le, Sun Yufang. The Application of the Comparable Corpora in the Chinese-English Cross-Lingual Information Retrieval, Journal of Computer Science and Technology, Vol. 16 , No. 4, p351~358.2001
  70. Sun Le, Zhang YiBo, Zhang JunLin, Sun YuFang. PECAT: A Computer-Aided Translation Tool Based On Bilingual Corpora. Proceeding of the IEEE SMC 2001, Tucson, Arizona, USA, Oct. 7-10, 2001, p927~932
  71. Sun Le, Zhang Junlin, Qu Weiming, Sun Yufang. Evaluation of an English-Chinese CLIR Experimental System Based on Bilingual Dictionary. International Conference on Chinese Computing, Singapore, Nov. 2001
  72. Zhang Yibo, Sun Le, Du Lin, Jin Youbing, Sun Yufang. ISCAS Text Retrieval in NTCIR Workshop II. In Proceedings of the Second NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization, Tokyo, Japan, pp.146-153,Mar. 7-9, 2001
  73. Du Lin, Zhang Yibo, Sun Le, Sun Yufang, Han Jie. PM-based indexing for Chinese text retrieval. In Proceedings of the Fifth International Workshop on Information Retrieval with Asian Languages (IRAL), Nov, 2000
  74. Sun Le, Jin Youbin, Du Lin, Sun Yufang. Automatic Extraction of English-Chinese Translation Lexicons from Noisy Bilingual Corpora. Second International Conference On Language Resources and Evaluation, Athens, Greece, 31 May - 2 June 2000, vol. II, p751~756
  76. Sun Le, Jin Youbin, Du Lin, Sun Yufang. Word Alignment of English-Chinese Bilingual Corpus Based on Chunks. In Proceeding of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Oct. 3-6, 2000, Hong Kong, p110~116
  77. Zhang Yibo, Sun Le, Sun Yufang. Query Translation in Chinese-English Cross-language Information Retrieval. In Proceeding of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Oct. 3-6, 2000, Hong Kong, p104~109
  78. Sun Le, Du Lin, Sun Yufang, Jin Youbin. Sentence Alignment of English-Chinese Complex Bilingual Corpora. Proceeding of the workshop Multi-lingual Information Processing and Asia Language Processing (MAL 1999), Beijing, Nov. 5, 1999, p135~139

    My Google Citations

Current Research Projects

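  • National Key Research and Development Program of China: "Big-Data-Based Open-Domain Intelligent Question Answering Technology" (Open domain Question Answering) (No. 2017YFB1002104; 2017.10-2021.09)
  • National High Technology Research and Development Program of China (863 Program): "Key Technologies and Systems for Solving Knowledge-Association and Reasoning Problems in Basic Education" (Educational Question Answering) (No. 2015AA015405; 2015.01-2017.12)
  • National Natural Science Key Project: "The Research of Cognitive Processing and Computational Model of Chinese" (2015.01-2019.12)
  • Key Research Project of the State Language Commission: "Research on Knowledge Graph Construction Technology for Classical Chinese Poetry" (2017-2020)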