重要论文列表
Preprint
- Ning Bian, Peilin Liu, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun. A Drop of Ink Makes a Million Think: The Spread of False Information in Large Language Models.
- Ruoxi Xu, Hongyu Lin, Xinyan Guan, Xianpei Han, Yingfei Sun, Le Sun. DLUE: Benchmarking Document Language Understanding.
- Qiaoyu Tang, Ziliang Deng, Hongyu Lin, Xianpei Han, Qiao Liang, Le Sun. ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases.
- Tianshu Wang, Hongyu Lin†, Xianpei Han†, Le Sun, Xiaoyang Chen, Hao Wang, Zhenyu Zeng, DBCopilot: Scaling Natural Language Querying to Massive Databases
- Ruoxi Xu, Hongyu Lin†, Xianpei Han, Le Sun, Yingfei Sun. Academically intelligent LLMs are not necessarily socially intelligent.
- Qiaoyu Tang*, Jiawei Chen*, Bowen Yu, Yaojie Lu, Cheng Fu, Haiyang Yu, Hongyu Lin†, Fei Huang, Ben He, Xianpei Han, Le Sun, Yongbin Li†. Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
- Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun. Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation.
2024
- Jiawei Chen, Hongyu Lin†, Xianpei Han†, Le Sun. Benchmarking Large Language Models in Retrieval-Augmented Generation. The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024, CCF-A)
- Xinyan Guan*, Yanjiang Liu*, Hongyu Lin, Yaojie Lu†, Ben He, Xianpei Han, Le Sun†. Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-based Retrofitting. The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024, CCF-A)
- Ruoxi Xu, Yingfei Sun, Mengjie Ren, Shiguang Guo, Ruotong Pan, Hongyu Lin†, Le Sun, Xianpei Han. AI for social science and social science of AI: A survey. Information Processing & Management. (IP&M 2024)
- Ying Zhou, Ben He†, and Le Sun. Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Ning Bian, Xianpei Han†, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang, Bin Dong. ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Boxi Cao, Qiaoyu Tang, Hongyu Lin†, Xianpei Han†, Jiawei Chen, Tianshu Wang, Le Sun. Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Chunlei Xin, Yaojie Lu†, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun. Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han, Le Sun†. Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, Short Paper)
- Xin Zheng, Qiming Zhu, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun†. Executing Natural Language-Described Algorithms with Large Language Models: An Investigation.In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Jiawei Chen, Hongyu Lin†, Xianpei Han, Yaojie Lu, Shanshan Jiang, Bin Dong, Le Sun. Few-shot Named Entity Recognition via Superposition Concept Discrimination. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (COLING 2024, CCF-B)
- Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han†, Feng Zhang, Junfeng Zhan, Le Sun†. StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation. In Findings of the Association for Computational Linguistics: ACL 2024.
- Ying Zhou, Ben He†, Le Sun. Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Mengjie Ren, Boxi Cao, Hongyu Lin†, Cao Liu, Xianpei Han, Ke Zeng, Guanglu Wan, Xunliang Cai, Le Sun. Learning or Self-aligning? Rethinking Instruction Fine-tuning. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Shiguang Guo*, Ziliang Deng*, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun. Open Grounded Planning: Challenges and Benchmark Construction. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Ning Bian, Xianpei Han†, Hongyu Lin, Yaojie Lu, Ben He†, Le Sun. Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Xinran Chen, Xuanang Chen, Ben He†, Tengfei Wen, Le Sun. Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA. In Findings of the Association for Computational Linguistics: ACL 2024.
- Jian Luo, Xuanang Chen, Ben He†, Le Sun. PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Xiaoyang Chen, Ben He†, Hongyu Lin, Xianpei Han, Tianshu Wang, Boxi Cao, Le Sun†, Yingfei Sun. Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Question Answering. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. (ACL 2024, CCF A)
- Shu Chen, Xinyan Guan, Yaojie Lu†, Hongyu Lin, Xianpei Han†, Le Sun. Building Instruction Data from Unlabelled Corpus. In Findings of the Association for Computational Linguistics: ACL 2024.
- Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin†, Haiyang Yu, Le Sun, Xianpei Han, Yongbin Li†. SoFA: Shielded On-the-fly Alignment via Priority Rule Following. In Findings of the Association for Computational Linguistics: ACL 2024.
- Yanjiang Liu, Tianyun Zhong, Yaojie Lu†, Hongyu Lin, Ben He, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun. XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification. In Findings of the Association for Computational Linguistics: ACL 2024.
- Lvxue Li, Jiaqi Chen, Xinyu Lu, Yaojie Lu†, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun. Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations. In Findings of the Association for Computational Linguistics: ACL 2024.
2023
- Chunlei Xin, Hongyu Lin†, Shan Wu, Xianpei Han, Bo Chen, Wen Dai, Shuai Chen, Bin Wang, and Le Sun†. Dialogue rewriting via skeleton-guided generation. In Proceedings of the 37th AAAI Conference on Artificial Intelligence. (AAAI 2023, CCF-A)
- Jie Lou*, Yaojie Lu*, Dai Dai†, Wei Jia, Hongyu Lin, Xianpei Han†, Le Sun, and Hua Wu. Universal information extraction as unified semantic matching. In Proceedings of the 37th AAAI Conference on Artificial Intelligence. (AAAI 2023, CCF-A)
- Jiawei Chen, Yaojie Lu†, Hongyu Lin, Jie Lou, Wei Jia, Dai Dai, Hua Wu, Boxi Cao, Xianpei Han†, and Le Sun. Learning In-context Learning for Named Entity Recognition. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. (ACL 2023, CCF-A)
- Shan Wu*, Chunlei Xin*, Hongyu Lin†, Xianpei Han, Cao Liu, Jiansong Chen, Fan Yang, Guanglu Wan, and Le Sun†. Ambiguous Learning from Retrieval: Towards Zero-shot Semantic Parsing. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. (ACL 2023, CCF-A)
- Xiaoyang Chen, Yanjiang Liu, Ben He†, Le Sun†, and Yingfei Sun†. Understanding Differential Search Index for Text Retrieval. In Findings of the Association for Computational Linguistics: ACL 2023.
- Xuanang Chen, Ben He†, Zheng Ye†, Le Sun†, and Yingfei Sun†. Towards Imperceptible Document Manipulations against Neural Ranking Models. In Findings of the Association for Computational Linguistics: ACL 2023.
- Peilin Liu, Hongyu Lin†, Meng Liao, Hao Xiang, Xianpei Han†, and Le Sun. WebDP: Understanding Discourse Structures in Semi-Structured Web Documents. In Findings of the Association for Computational Linguistics: ACL 2023.
- Xueru Wen, Xiaoyang Chen, Xuanang Chen, Ben He†, and Le Sun†. Offline Pseudo Relevance Feedback for Efficient and Effective Single-pass Dense Retrieval. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2023, short paper)
- Boxi Cao, Hongyu Lin, Xianpei Han†, and Le Sun. The Life Cycle of Knowledge in Big Language Models: A Survey. Machine Intelligence Research. (MIR 2023)
- Xinlin Peng*, Ying Zhou*, Ben He†, Le Sun†, and Yingfei Sun†. Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2023, CCF-B)
- Ning Bian, Hongyu Lin†, Xianpei Han†, Ben He, and Le Sun. Contrastive Distant Supervision for Debiased and Denoised Machine Reading Comprehension. In Findings of the Association for Computational Linguistics: EMNLP 2023.
- Boxi Cao*, Qiaoyu Tang*, Hongyu Lin†, Xianpei Han†, and Le Sun. Does the Correctness of Factual Knowledge Matter for Factual Knowledge-Enhanced Pre-trained Language Models?. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2023, CCF-B)
- Yiran Wang, Xuanang Chen, Ben He†, and Le Sun†. Contextual Interaction for Argument Post Quality Assessment. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2023, CCF-B)
- Xin Zheng, Hongyu Lin†, Xianpei Han†, and Le Sun. 2023. Toward Unified Controllable Text Generation via Regular Expression Instruction. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics. (AACL 2023)
2022
- Shan Wu, Chunlei Xin, Bo Chen, Xianpei Han†, and Le Sun. Semantic-aware Contrastive Learning for More Accurate Semantic Parsing. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2022, CCF-B)
- Xuanang Chen, Jian Luo, Ben He†, Le Sun, Yingfei Sun†. Towards Robust Dense Retrieval via Local Ranking Alignment. In Proceedings of the 31th International Joint Conference on Artificial Intelligence (IJCAI 2022, CCF-A).
- Tianshu Wang, Hongyu Lin†, Cheng Fu, Xianpei Han†, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu. Bridging the Gap between Reality and Ideality of Entity Matching: A Revisting and Benchmark Re-Construction. In Proceedings of the 31th International Joint Conference on Artificial Intelligence (IJCAI 2022, CCF-A).
- Ying Zhou, Xuanang Chen, Ben He†, Zheng Ye†, Le Sun. Re-thinking Knowledge Graph Completion Evaluation from an Information Retrieval Perspective. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022, CCF-A).
- Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin†, Xianpei Han, Le Sun†, Hua Wu. Unified Structure Generation for Universal Information Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022, CCF-A).
- Fangchao Liu, Hongyu Lin†, Xianpei Han, Boxi Cao, Le Sun. Pre-training to Match for Unified Low-shot Relation Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022, CCF-A).
- Boxi Cao, Hongyu Lin, Xianpei Han†, Fangchao Liu, Le Sun†. Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022, CCF-A).
- Jiawei Chen*, Qing Liu*, Hongyu Lin†, Xianpei Han†, Le Sun. Few-shot Named Entity Recognition with Self-describing Networks. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022, CCF-A).
- Ruoxi Xu, Hongyu Lin†, Meng Liao†, Xianpei Han, Jin Xu, Wei Tan, Yingfei Sun, Le Sun. Towards Event-Centric Opinion Mining. In Findings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022).
- Jialong Tang, Hongyu Lin,Meng Liao†,Yaojie Lu, Xianpei Han, Le Sun†, Wenli Yu, Jin Xu. Procedural Text Understanding via Scene-wise Evolution. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022, CCF-A).
- Xiaoyang Chen, Kai Hui, Ben He†, Xianpei Han, Le Sun, Zheng Ye†. Incorporating Ranking Context for End-to-End BERT Re-ranking. In Proceedings of the 44th European Conference on Information Retrieval (ECIR 2022, CCF-C).
- Yaojie Lu, Hongyu Lin, Jialong Tang, Xianpei Han, Le Sun. End-to-End Neural Event Coreference Resolution. Artificial Intelligence. (CCF-A)
2021
- Lingyong Yan, Xianpei Han, and Le Sun†. Progressive Adversarial Learning for Bootstrapping: A Case Study on Entity Set Expansion. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2021, CCF-B)
- Qing Liu, Hongyu Lin†, Xinyan Xiao, Xianpei Han†, Le Sun, and Hua Wu. Fine-grained Entity Typing via Label Reasoning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2021, CCF-B)
- Jiawei Chen, Hongyu Lin†, Xianpei Han†, and Le Sun. Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2021, CCF-B)
- Zhi Zheng, Kai Hui, Ben He, Xianpei Han†, Le Sun, Andrew Yates. Contextualized Query Expansion via Unsupervised Chunk Selection for Text Retrieval. In Information Processing and Management. (IPM 2021,CCF-B)
- Yaojie Lu, Hongyu Lin, Jin Xu†, Xianpei Han†, Jialong Tang, Annan Li, Le Sun, Meng Liao, and Shaoyi Chen. Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Fangchao Liu, Lingyong Yan, Hongyu Lin†, Xianpei Han†, and Le Sun. Element Intervention for Open Relation Extraction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun†, Weijian Xie, and Jin Xu†. From Discourse to Narrative: Knowledge Projection for Event Relation Extraction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Shan Wu, Bo Chen, Chunlei Xin, Xianpei Han†, Le Sun†, Weipeng Zhang, Jiansong Chen, Fan Yang, and Xunliang Cai. From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Boxi Cao, Hongyu Lin†, Xianpei Han†, Le Sun, Lingyong Yan, Meng Liao, Tong Xue, and Jin Xu. Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Wenkai Zhang, Hongyu Lin†, Xianpei Han†, and Le Sun. De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing.(ACL 2021, CCF-A)
- Wenkai Zhang*, Hongyu Lin*, Xianpei Han†, Le Sun†, Huidan Liu, Jing Yuan, Zhicheng Wei. Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021, CCF-A).
- Ning Bian, Xianpei Han†, Bo Chen, Le Sun†. Benchmarking Knowledge-enhanced Commonsense Question Answering via Knowledge-to-Text Transformation. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021, CCF-A).
- Xuanang Chen†, Ben He†, Kai Hui, Yiran Wang, Le Sun, Yingfei Sun†. Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2021, short paper, Best Short Paper Award).
2020
- Hongyu Lin, Yaojie Lu, Jialong Tang, Xianpei Han†, Le Sun†, Zhicheng Wei, and Nicholas Jing Yuan. A Rigorous Study on Named Entity Recognition: Can Fine-tuning Pretrained Model Lead to the Promised Land? In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2020, CCF-B).
- Jialong Tang, Yaojie Lu, Hongyu Lin, Xianpei Han†, Le Sun†, Xinyan Xiao, and Hua Wu. Syntactic and Semantic-driven Learning for Open Information Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2020.
- Lingyong Yan, Xianpei Han†, Ben He, and Le Sun. Global Bootstrapping Neural Network for Entity Set Expansion. In Findings of the Association for Computational Linguistics: EMNLP 2020.
- Zhi Zheng, Kai Hui, Ben He†, Xianpei Han, Le Sun†, and Andrew Yates. BERT-QE: Contextualized Query Expansion for Document Re-ranking. In Findings of the Association for Computational Linguistics: EMNLP 2020.
- Hao Nie, Xianpei Han†, Le Sun, Chi Man Wong, Qiang Chen, Wei Zhang, and Suhui Wu. Global Structure and Local Semantics-Preserved Embeddings for Entity Alignment. In Proceedings of the 29th International Joint Conference on Artificial Intelligence. (IJCAI 2020, CCF-A).
- Cheng Fu†, Xianpei Han†, Jiaming He, and Le Sun. Hierarchical Matching Network for Heterogeneous Entity Resolution. In Proceedings of the 29th International Joint Conference on Artificial Intelligence. (IJCAI 2020, CCF-A).
- Lingyong Yan, Xianpei Han†, Ben He†, and Le Sun. End-to-End Bootstrapping Neural Network for Entity Set Expansion. In Proceedings of the 34th AAAI Conference on Artificial Intelligence. (AAAI 2020, CCF-A).
- Bo Chen, Xianpei Han†, Ben He†, and Le Sun. Learning to Map Frequent Phrases to Sub-Structures of Meaning Representation for Neural Semantic Parsing. In Proceedings of the 34th AAAI Conference on Artificial Intelligence. (AAAI 2020, CCF-A).
- Le Wang, Ze Luo, Canjia Li, Ben He†, Le Sun, Hao Yu, and Yingfei Sun†. An End-to-end Pseudo Relevance Feedback Framework for Neural Document Retrieval. In Information Processing and Management. (IPM 2020, CCF-B).
2019
- Hongyu Lin, Yaojie Lu, Xianpei Han†, Le Sun, Bin Dong, and Shanshan Jiang. 2019. Gazetteer-Enhanced Attentive Neural Networks for Named Entity Recognition. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Association for Computational Linguistics. (EMNLP 2019, CCF-B).
- Lingyong Yan, Xianpei Han†, Le Sun, and Ben He. 2019. Learning to Bootstrap for Entity Set Expansion. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics. (EMNLP 2019, CCF-B).
- Bo An, Chen Bo, Xianpei Han, and Le Sun. 2019. EUSP: An Easy-to-Use Semantic Parsing PlatForm. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations. Association for Computational Linguistics. (EMNLP 2019).
- Hao Nie, Xianpei Han, Ben He†, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, Hao Kong. Deep Sequence-to-Sequence Entity Matching for Heterogeneous Entity Resolution. In Proceedings of the 28th ACM Conference on Information and Knowledge Management (CIKM 2019, CCF-B).
- Hongyu Lin, Yaojie Lu, Xianpei Han†, and Le Sun. 2019. Sequence-to-Nuggets: Nested Entity Mention Detection via Anchor-Region Networks. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics. (ACL 2019, CCF-A).
- Yaojie Lu, Hongyu Lin, Xianpei Han†, and Le Sun. 2019. Distilling Discrimination and Generalization Knowledge for Event Detection via Delta-Representation Learning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics. (ACL 2019, CCF-A).
- Jialong Tang,* Ziyao Lu*, Jinsong Su†, Yubin Ge, Linfeng Song, Le Sun, and Jiebo Luo. 2019. Progressive Self-Supervised Attention Learning for Aspect-Level Sentiment Analysis. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics. (ACL 2019, CCF-A).
- Hongyu Lin, Yaojie Lu, Xianpei Han†, and Le Sun. 2019. Cost-sensitive Regularization for Label Confusion-aware Event Detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics. (ACL 2019, CCF-A).
- Cheng Fu†, Xianpei Han†, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, Hao Kong. End-to-End Multi-Perspective Matching for Entity Resolution. In Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI 2019, CCF-A).
- Jinsong Su, Jialong Tang, Ziyao Lu, Xianpei Han, Haiying Zhang. A Neural Image Captioning Model with Caption-to-images Semantic Constructor. In Neurocomputing.
2018
- Bo Chen, Le Sun, and Xianpei Han. 2018. Sequence-to-Action: End-to-End Semantic Graph Generation for Semantic Parsing. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. (ACL 2018, CCF-A)
- Hongyu Lin, Yaojie Lu, Xianpei Han, and Le Sun. 2018. Nugget Proposal Networks for Chinese Event Detection. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. (ACL 2018, CCF-A)
- Hongyu Lin, Yaojie Lu, Xianpei Han, and Le Sun. 2018. Adaptive Scaling for Sparse Detection in Information Extraction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. (ACL 2018, CCF-A)
- Cancan Jin, Ben He, Kai Hui, and Le Sun. 2018. TDNN: A Two-stage Deep Neural Network for Prompt-independent Automated Essay Scoring. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. (ACL 2018, CCF-A)
- Bo An, Bo Chen, Xianpei Han, and Le Sun. 2018. Accurate Text-Enhanced Knowledge Graph Representation Learning. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics. (NAACL 2018, CCF-B)
- Bo Chen, Bo An, Le Sun, and Xianpei Han. 2018. Semi-Supervised Lexicon Learning for Wide-Coverage Semantic Parsing. In Proceedings of the 27th International Conference on Computational Linguistics. (COLING 2018, CCF-B)
- Bo An, Xianpei Han, and Le Sun. 2018. Model-Free Context-Aware Word Composition. In Proceedings of the 27th International Conference on Computational Linguistics. (COLING 2018, CCF-B)
- Canjia Li, Yingfei Sun, Ben He, Le Wang, Kai Hui, Andrew Yates, Le Sun, and Jungang Xu. 2018. NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2018, CCF-B)
- Canjia Li, Ben He, Le Sun, Yingfei Sun. Neural Precision Medicine by Mining Implicit Treatment Concepts. In Proceedings of the 2018 IEEE International Conference on Bioinformatics and Biomedicine. (BIBM 2018, CCF-B)
- Yanhua Ran, Ben He, Kai Hui, Jungang Xu, Le Sun. Neural Relevance Model using Similarities with Elite Documents for Effective Clinical Decision Support. In International Journal of Data Mining and Bioinformatics.
2017
- Hongyu Lin, Le Sun, and Xianpei Han. 2017. Reasoning with Heterogeneous Knowledge for Commonsense Machine Comprehension. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2017, CCF-B)
- Xianpei Han, Le Sun. Distant Supervision via Prototype-based Global Representation Learning. In Proceedings of the 31th AAAI Conference on Artificial Intelligence (AAAI 2017, CCF-A).
2016
- Xianpei Han and Le Sun. Context-Sensitive Inference Rule Discovery: A Graph-Based Method. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (COLING 2016, CCF-B)
- Bo Chen, Le Sun, Xianpei Han, and Bo An. Sentence Rewriting for Semantic Parsing. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016, CCF-A)
- Xianpei Han, Le Sun. Global Distant Supervision for Relation Extraction. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI 2016, CCF-A).
- Zhenzhong Zhang, Le Sun, Xianpei Han. A Joint Model for Entity Set Expansion and Attribute Extraction from Web Search Queries. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI 2016, CCF-A).
2015
- Zhenzhong Zhang, Le Sun, and Xianpei Han. Learning to Mine Query Subtopics from Query Log. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. (ACL 2015, short paper)
- Jinsong Su, Deyi Xiong†, Yang Liu, Xianpei Han, Hongyu Lin, Junfeng Yao, and Min Zhang. A Context-Aware Topic Model for Statistical Machine Translation. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. (ACL 2015, CCF-A)
- Jinsong Su, Deyi Xiong†, Shujian Huang, Xianpei Han, and Junfeng Yao. Graph-Based Collective Lexical Selection for Statistical Machine Translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2015, CCF-B)
2014
- Bei Shi, Zhengzhong Zhang, Le Sun, Xianpei Han. A Probabilistic Co-Bootstrapping Method for Entity Set Expansion. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014, CCF-B).
- Xianpei Han, Le Sun. Semantic Consistency: A Local Subspace Based Method for Distant Supervised Relation Extraction. In Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (ACL 2014, CCF-A).
- Le Sun, Xianpei Han. A Feature-Enriched Tree Kernel for Relation Extraction. In Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (ACL 2014, CCF-A).
2013
- Zhenzhong Zhang, Le Sun, Xianpei Han. Learning to detect task boundaries of query session. In Proceedings of the 22th ACM Conference on Information and Knowledge Management (CIKM 2013,CCF-B).
2012
- Xianpei Han, Le Sun. An Entity-Topic Model for Entity Linking. In Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing (EMNLP 2012, CCF-B).
- Longlong Ma, Jian Wu. A Component-based On-line Handwritten Tibetan Character Recognition Method using Conditional Random Field. In Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR 2012).
- Longlong Ma, Jian Wu. On-line Handwritten Chinese Character Recognition based on Inter-radical Stochastic Context-free Grammar. In Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN 2012, CCF-C).
2011
- Xianpei Han, Le Sun, Jun Zhao. Collective Entity Linking in Web Text: A Graph-Based Method. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011, CCF-A).
- Xianpei Han, Le Sun. A Generative Entity-Mention Model for Linking Entities with Knowledge Base. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011, CCF-A).
- Zhenzhong Zhang, Le Sun. Improving Word Sense Induction by Exploiting Semantic Relevance. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP 2011, CCF-C).
- Xue Jiang, Xianpei Han, Le Sun. ISCAS at Subtopic Mining Task in NTCIR9. In Proceedings of the 9th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2011).
2010
- Yunping Huang, Le Sun. Query Model Refinement Using Word Graphs. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2010,CCF-B).
- Wenbo Li, Le Sun, Zhenzhong Zhang, Xue Jiang, Weiru Zhang. TC-DCA: A System for Text Classification Based on Document’s Content Allocation. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2010,CCF-B).
- Yunping Huang, Le Sun. A Unified Iterative Optimization Algorithm for Query Model and Ranking Refinement. In Proceedings of the 6th Asia Information Retrieval Societies Conference (AIRS 2010).
- Dakun Zhang, Le Sun, Wenbo Li. Improving Phrase-based SMT Model with Flattened Bilingual Parse Tree. In Proceedings of the 6th IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLPKE 2010).
- Zhenzhong Zhang, Le Sun, Wenbo Li. ISCAS: A System of Chinese Word Sense Induction Based on K-means Algorithm. In Proceedings of the 1st CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP 2010).
- Zhenzhong Zhang, Le Sun, Qiang Dong. Overview of Chinese Word Sense Induction at Task-4 at CLP2010. In Proceedings of the 1st CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP 2010).
2009
- Yunping Huang, Le Sun, Jian-Yun Nie. Smoothing Document Language Model with Local Word Graph. In Proceedings of the 18th International Conference on Information and Knowledge Management (CIKM 2009,CCF-B).
- Yunping Huang, Le Sun, Zhe Wang. A Unified Graph-Based Iterative Reinforcement Approach to Personalized Search. In Proceedings of the 5th Asia Information Retrieval Symposium (AIRS 2009).
2008
- Wenbo Li, Le Sun. Smoothing LDA Model for Text Categorization. In Proceedings of the 4th Asia Information Retrieval Symposium (AIRS 2008).
- Jing Li, Sun le. A Lexical Chain Approach for Update-style Query-focused Multi-document Summarization. In Proceedings of the 4th Asia Information Retrieval Symposium (AIRS 2008).
- Ruihong Huang, Le Sun, Yuanyong Feng. Study of Kernel-based Methods for Chinese Relation Extraction. In Proceedings of the 4th Asia Information Retrieval Symposium (AIRS 2008).
- Dakun Zhang, Le Sun, Wenbo Li. A Structured Prediction Approach for Statistical Machine Translation. In Proceedings of the 2th International Joint Conference on Natural Language Processing (IJCNLP 2008, CCF-C).
- Yuanyong Feng, Ruihong Huang, Le Sun. Two-Step Chinese NER Based on CRF. In Proceedings of the 6th SIGHAN Workshop on Chinese Language Processing (SIGHAN 2008).
- Yunping Huang, Yulin Wang, Le Sun. ISCAS at Multilingual Opinion Analysis Task. In Proceedings of the 7th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2007).
2007
- Le Sun. Introduction of the HTRDP Chinese IR Evaluation. In Proceedings of the 1st International Workshop on Evaluating Information Access (EVIA 2007).
- Le Sun. A User Adaptive Framework for Computer-aided Translation System, Chapter 9 in book Computer-aided Translation: Theory and Practice, 2007.
- Qun Liu, Xiangdong Wang, Hong Liu, Le Sun, Sheng Tang, Deyi Xiong, Hongxu Hou, Yuanhua Lv, Wenbo Li, Shouxun Lin, Yueliang Qian. Introduction to HTRDP evaluations on Chinese information processing and intelligent human-machine interface, Frontiers of Computer Sciences in China.
- Jing Li, Le Sun,Chunyu Kit, Jonathan Webster. A Query-Focused Multi-Document Summarizer Based on Lexical Chains. In Proceedings of the 2007 Document Understanding Conferences (DUC 2007).
- Ruihong Huang, Le Sun, Jing Li, Longxi Pan, Junlin Zhang. ISCAS in CLIR at NTCIR-6: Experiments with MT and PRF. In Proceedings of the 6th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2006).
- Ruihong Huan, Sun Le, Longxi Pan. ISCAS in Opinion Analysis Pilot Task: Experiments with Sentimental Dictionary based Classifier and CRF Model. In Proceedings of the 6th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2006).
2006
- Yuanhua Lv, Le Sun. An Iterative Implicit Feedback Approach to Personalized Search. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics (ACL 2006, CCF-A).
- Yuanyong Feng, Le Sun, Yuanhua Lv. Chinese Word Segmentation and Named Entity Recognition Based on Conditional Random Fields Models. In Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing (SIGHAN 2006).
- Quan Zhou, Le Sun, Yuanhua Lv. ISCAS at DUC06. In Proceedings of the 2006 Document Understanding Conferences (DUC 2006).
2005
- Jinming Min, Le Sun, Junlin Zhang. ISCAS in English-Chinese CLIR at NTCIR-5. In Proceedings of the 5th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2005).
- Quan Zhou, Le Sun, Jian-Yun Nie. IS_SUM: A Multi-Document Summarizer based on Document Index Graphic and Lexical Chains. In Proceedings of the 2005 Document Understanding Conferences (DUC 2005).
- Junlin Zhang, Le Sun. Using the Web Corpus to Translate the Queries in Cross-Lingual Information Retrieval. In Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLPKE 2005).
- Yuanyong Feng, Le Sun, Julin Zhang. Early Results for Chinese Named Entity Recognition Using Conditional Random Fields Model, HMM and Maximum Entropy. In Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLPKE 2005).
- Junlin Zhang, Le Sun, Quan zhou. A Cue-based Hub-Authority Approach for Multi-Document Text Summarization. In Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLPKE 2005).
- Zhang Junlin, Sun le, Lv Yuanhua, Zhang Wei. Relevance Feedback.by Exploring the Different Feedback Source and Collection Structure. In Proceedings of the 2005 Text REtrieval Conference (TREC 2005).
2004
- Junlin Zhang, Le Sun, Weimin Qu, Lin Du, Yufang Sun. A Three Level Cache-based Adaptive Chinese Language Model. In Lecture Notes in Computer Science.
- Le Sun, Junlin Zhang, Yufang Sun. ISCAS at TREC2004:HARD Track. In Proceedings of the 2004 Text REtrieval Conference (TREC 2004).
- Junlin Zhang, Le Sun, Weimin Qu, Yufang Sun. A Trigger Language Model-based IR system. In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004, CCF-B).
- Junlin Zhang, Le Sun, Yongchen Zhang. Applying Language Model into IR Task. In Proceedings of the 4th NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2004).
2003
- Weimin Qu, Junlin Zhang, Le Sun, Yufang Sun. An Efficient Indexing and Querying Algorithm for Large-scale XML Data. In 软件学报 2003.
- Zeng Wu,Lin Du,Le Sun,Shiwei Ye. TREC12 HARD Track at ISCAS. In Proceedings of the 2003 Text REtrieval Conference (TREC 2003).
2002
- Le Sun, Weimin Qu, Song Xue. Constructing of a Large-Scale Chinese-English Parallel Corpus. In Proceedings of the 3rd Workshop on Asian Language Resources and International Standardization.
- Jun-lin Zhang,Le Sun, Weimin Qu, Lin Du, Song Xue. ISCAS IN NTCIR-3. In Proceedings of the 3rd NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2002).
2001
- Lin Du, Yibo Zhang, Le Sun, Yufang Sun. The Application of the Comparable Corpora in the Chinese-English Cross-Lingual Information Retrieval. In Journal of Computer Science and Technology.
- Le Sun, Yibo Zhang, Junlin Zhang, Yufang Sun. PECAT: A Computer-Aided Translation Tool Based On Bilingual Corpora. In Proceeding of the IEEE International Conference on Systems, Man, and Cybernetics (SMC 2001, CCF-C).
- Le Sun, Junlin Zhang, Weiming Qu, Yufang Sun. Evaluation of an English-Chinese CLIR Experimental System Based on Bilingual Dictionary. In International Conference on Chinese Computing.
- Yibo Zhang, Le Sun, Lin Du, Youbing Jin, Yufang Sun. ISCAS Text Retrieval in NTCIR Workshop II. In Proceedings of the 2nd NTCIR Workshop Research in Chinese & Japanese Text Retrieval and Text Summarization (NTCIR 2001).
2000
- Lin Du, Yibo Zhang, Le Sun, Yufang Sun, Jie Han. PM-based indexing for Chinese text retrieval. In Proceedings of the 5th international workshop on on Information retrieval with Asian languages (IRAL 2000)
- Le Sun, Youbin Jin, Lin Du, Yufang Sun. Automatic Extraction of English-Chinese Translation Lexicons from Noisy Bilingual Corpora. In Proceedings of the 2nd International Conference On Language Resources and Evaluation.
- Le Sun, Youbin Jin, Lin Du, Yufang Sun. Word Alignment of English-Chinese Bilingual Corpus Based on Chunks. In Proceeding of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP 2000).
- Yibo Zhang, Le Sun, Yufang Sun. Query Translation in Chinese-English Cross-language Information Retrival. In Proceeding of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP 2000).
1999
- Le Sun, Lin Du, Yufang Sun, Youbin Jin. Sentence Alignment of English-Chinese Complex Bilingual Corpora. In Proceeding of the workshop Multi-lingual Information Processing and Asia Language Processing (MAL 1999).