Publications
Machine Learning
Solving Large Scale Linear SVM with Distributed Block Minimization [pdf]
Dmitry Pechyony, Libin Shen and Rosie Jones
NIPS 2011 Workshop on Big Learning: Algorithms, Systems, and Tools for Learning at Scale. Granada, Spain. Dec. 16 - 17, 2011
Understanding Exhaustive Pattern Learning [pdf]Libin Shen
arXiv1104.3929, April, 2011.
Ranking and Reranking with Perceptron [pdf]
Libin Shen and Aravind K. Joshi
Machine Learning Jounal, Volume 60, Numbers 1-3, pp 73-96, September 2005.
Machine Translation
String-to-Dependency Statistical Machine Translation [pdf]
Libin Shen, Jinxi Xu and Ralph Weischedel
Computational Linguistics Journal, Volume 36, Number 4, pp 649-671, December, 2010.
Statistical Machine Translation with a Factorized Grammar [pdf]
Libin Shen, Bing Zhang, Spyros Matsoukas, Jinxi Xu and Ralph Weischedel
in Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP). Cambridge, MA USA, Oct. 9 - 11, 2010.
Effect Use of Linguistic and Contextual Information for Statistical Machine Translation [pdf, slides]
Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas and Ralph Weischedel
in Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, Aug. 6 - 7, 2009.
A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model [pdf, slides]
Libin Shen, Jinxi Xu and Ralph Weischedel
in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL). Columbus, OH, USA, June 15 - 20, 2008.
Discriminative Reranking for Machine Translation [pdf, slides]
Libin Shen, Anoop Sarkar and Franz Josef Och
in Proceedings of Human Language Technology conference / North American chapter of the Association for Computational Linguistics annual meeting (HLT/NAACL). Boston, USA. May 2 - 7, 2004.
A Smorgasbord of Features for Statistical Machine Translation [pdf]
Franz Josef Och, Daniel Gildea, Sanjeev Khudanpur, Anoop Sarkar, Kenji Yamada, Alex Fraser, Shankar Kumar, Libin Shen, David Smith, Katherine Eng, Viren Jain, Zhen Jin and Dragomir Radev
in Proceedings of Human Language Technology conference / North American chapter of the Association for Computational Linguistics annual meeting (HLT/NAACL). Boston, USA. May 2 - 7, 2004.
Parsing
LTAG Dependency Parsing with Bidirectional Incremental Construction [pdf]
Libin Shen and Aravind K. Joshi
in Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP). Hawaii, USA. Oct. 25 - 27, 2008
Incremental LTAG parsing [pdf, slides]
Libin Shen and Aravind K. Joshi
in Proceedings of Human Language Technology Conference / Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP). Vancouver, Canada. Oct. 6 - 8, 2005.
Flexible Margin Selection for Reranking with Full Pairwise Samples [pdf, slides]
Libin Shen and Aravind K. Joshi
in Proceedings of the 1st International Joint Conference of Natural Language Processing (IJCNLP), LNAI Vol. 3248, pp 446-455. Hainan Island, China. March 22 - 24, 2004.
Using LTAG Based Features in Parse Reranking [pdf, slides]
Libin Shen, Anoop Sarkar and Aravind K. Joshi
in Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP). Sapporo, Japan. July 11 - 12, 2003.
An SVM Based Voting Algorithm with Application to Parse Reranking [pdf, slides]
Libin Shen and Aravind K. Joshi
in Proceedings of the 7th Conference on Computational Natural Language Learning (CoNLL).Edmonton, Canada, May 31- June 1, 2003.
Chunking and Labelling
Discriminative Learning of Supertagging
Libin Shen
Book chapter in Supertagging: Using Complex Lexical Descriptions in Natural Language Processing. MIT Press. 2010.
Guided Learning for Bidirectional Sequence Classification [pdf, slides]
Libin Shen, Giorgio Satta and Aravind K. Joshi
in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL). Prague, Czech, June 24 - 29, 2007.
A SNoW Based Supertagger with Application to NP Chunking [pdf, slides]
Libin Shen and Aravind K. Joshi
in Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL). Sapporo, Japan. July 7 - 12, 2003.
Chinese Word Segmentation as LMR Tagging [pdf, slides]
Nianwen Xue and Libin Shen
in Proceedings of the 2nd ACL SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan. July 11 - 12, 2003.
Language Resources
LTAG-spinal and the Treebank: A new resource for incremental, dependency and semantic parsing [springer online, draft]
Libin Shen, Lucas Champollion and Aravind K. Joshi
Language Resources and Evaluation, Volume 42, Number 1, pp 1-19, March, 2008.
Issues in Synchronizing the English Treebank and Propbank [pdf]
Olga Babko-Malaya, Ann Bies, Ann Taylor, Szuting Yi, Martha Palmer, Mitch Marcus, Seth Kulick and Libin Shen
in Preceedings of Coling/ACL Workshop on Frontiers in Linguistically Annotated Corpora, Sydney, Australia. July 22, 2006.
Extracting Deeper Information from Richer Resource: EM Models for LTAG Treebank Induction [pdf, slides]
Libin Shen and Aravind K. Joshi
in IJCNLP Workshop: Beyond Shallow Analyses - Formalisms and Statistical Modeling for Deep Analyses. Hainan Island, China. March 21, 2004.
LTAG Derivation Tree Extraction [pdf]
Libin Shen
in Proceedings of the 7th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG 7). Vancouver, Canada. May 20 - 22, 2004.
Dissertation
Statistical LTAG Parsing [pdf]
Software
The LTAG-spinal Treebank, the two LTAG parsers and the POS tagger are available at Penn's XTAG site.