|
Zhifei Li |
Ph.D. Student |
|
|
|
Office: CSEB 322 |
|
Email: zhifei.work@gmail.com |
I am interested in applying machine learning methods (mainly sequence/structured model) on large-scale NLP applications such as Machine Translation and Language Modeling. Due to the large scale of the training data and the model search space, compact data structures (for model representation) and efficient algorithms (for model inference and learning) are essential. Also, due to the complex dependency among the model parameters, a discriminative feature-based framework is more flexible than a generative model. Currently, I am working on large-scale discriminative training for machine translation systems.
I am also interested in system-oriented research on computer networks. Before coming to JHU, I have done some work on wireless networks. In particular, during 1999-2002, I have worked at MobileSoft for several commercial projects related to WAP and Bluetooth. During 2002-2004 at NTU, I have worked on Medium Access Control issues in IEEE 802.11-based wireless networks. Here are some of my publications.
I was very interested in applying algorithmic/theoretical methods in some fascinating applications. I found the following theoretical areas, Distributed Algorithms, Game Theory, Graph Algorithms, and Online Algorithms, are of great interests. The application problems that I was interested in are Wireless Networks, P2P, Web Search Ranking, and Reputation Systems.
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets. In Proceedings of COLING 2010.
First-
and Second-order Expectation Semirings with Applications to Minimum-Risk
Training on Translation Forests. In Proceedings of EMNLP
2009.
Slides: animated,
non-animated
Variational
Decoding for Statistical Machine Translation.
In Proceedings of ACL 2009. Nominated for Best Paper Award.
Slides: animated,
non-animated
Zhifei Li and Sanjeev Khudanpur. Efficient Extraction of Oracle-best Translations from Hypergraphs. In Proceedings of NAACL 2009 (short paper).
Slides: animated
Zhifei Li and Sanjeev Khudanpur. Forest Reranking for Machine Translation with the Perceptron Algorithm. To appear in the GALE book chapter on "MT from text", 2009.
Jason Smith, Damianos Karakos, Zhifei Li, Jason Eisner and Sanjeev Khudanpur. Novel System Combination Approaches for MT. To appear in the GALE book chapter on "MT from text", 2009.
Zhifei Li, Chris Callison-Burch,
Chris Dyer, Juri Ganitkevitch,
Sanjeev Khudanpur, Lane
Schwartz, Wren Thornton, Jonathan Weese and Omar Zaidan, 2009.
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine
Translation. To appear in ACL 2009 (demo).
Zhifei Li, Chris Callison-Burch,
Chris Dyer, Juri Ganitkevitch,
Sanjeev Khudanpur, Lane
Schwartz, Wren Thornton, Jonathan Weese and Omar Zaidan, 2009.
Joshua: An Open
Source Toolkit for Parsing-based Machine Translation. In Proceedings of the
Workshop on Statistical Machine Translation (WMT09).