Ph.D. Candidate, Johns Hopkins University
As of August 2013, I have moved to Philadelphia, following CCB as he takes his new job at Penn.
I'm a Ph.D. student in the Computer Science Department
at JHU. My main research interest is in statistical machine translation,
especially parsing-based methods that include syntactic information.
I am also interested in formal semantics and pragmatics.
The lab I work in is the Center for Language and Speech Processing, which is an inter-departmental center shared among the Computer Science, Electrical and Computer Engineering, and Cognitive Science Departments of JHU.
My advisor is Chris Callison-Burch.
I earned a B.A. in mathematics from The University of Chicago in 2008 and an M.S.E. in computer science from JHU in 2011.
My curriculum vitae (updated Nov. 26, 2013).
- PARADIGM: PARAphrase DIagnostics through Grammar Matching. Jonathan Weese, Juri Ganitkevitch, and Chris Callison-Burch. To appear in Proceedings of EACL.
- Learning to translate with products of novices: a suite of open-ended challenge problems for teaching MT. Adam Lopez, Matt Post, Chris Callison-Burch, Jonathan Weese, Juri Ganitkevitch, Narges Ahmidi, Olivia Buzek, Leah Hanson, Beenish Jamil, Matthias Lee, Ya-Ting Lin, Henry Pao, Fatima Rivera, Leili Shahriyari, Debu Sinha, Adam Teichert, Stephen Wampler, Michael Weinberger, Daguang Xu, Lin Yang, and Shang Zhao. In Transactions of ACL (TACL), Number 1.
- PARMA: A Predicate Argument Aligner. Travis Wolfe, Benjamin Van Durme, Mark Dredze, Nicholas Andrews, Charley Beller, Chris Callison-Burch, Jay DeYoung, Justin Snyder, Jonathan Weese, Tan Xu and Xuchen Yao. In Proceedings of ACL (short papers).
- UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems. Lushan Han, Abhay L. Kashyap, Tim Finin, Jonathan Weese, and James Mayfield. In Proceedings of *SEM.
- Joshua 4.0: Packing, PRO, and Paraphrases. Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, and Chris Callison-Burch. In Proceedings of WMT12.
- Using Categorial Grammar to Label Translation Rules. Jonathan Weese, Chris Callison-Burch, and Adam Lopez. In Proceedings of WMT12.
- Processing Informal, Romanized Pakistani Text Messages. Ann Irvine, Jonathan Weese, and Chris Callison-Burch. In the NAACL Workshop on Language in Social Media.
- Joshua 3.0: Syntax-based Machine Translation with the Thrax Grammar Extractor. Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, and Adam Lopez. In Proceedings of WMT11.
- Joshua 2.0: A Toolkit for Parsing-based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies. Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Lane Schwartz, Wren N.G. Thornton, Ziyuan Wang, Jonathan Weese and Omar F. Zaidan. In Proceedings of WMT10.
- cdec: A Decoder, Alignment, and Learning Framework for Finite-State and Context-Free Translation Models. Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman and Philip Resnik. In Proceedings of ACL 2010 (system demonstrations).
- Visualizing Data Structures in Parsing-based Machine Translation. Jonathan Weese and Chris Callison-Burch. In The Prague Bulletin of Mathematical Linguistics (PBML), Number 93, January 2010.
- The Machine Translation Toolpack for LoonyBin: Automated Management of Experimental Machine Translation HyperWorkflows. Jonathan H. Clark, Jonathan Weese, Byung Gyu Ahn, Andreas Zollmann, Qin Gao, Kenneth Heafield and Alon Lavie. In The Prague Bulletin of Mathematical Linguistics (PBML), Number 93, January 2010.
- Demonstration of Joshua: an open source toolkit for parsing-based machine translation. Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G. Thronton, Jonathan Weese, and Omar F. Zaidan. In Proceedings of ACL (software demonstrations).
- Joshua: an open source toolkit for parsing-based machine translation. Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G. Thronton, Jonathan Weese, and Omar F. Zaidan. In Proceedings of WMT09.
- Joshua: an open source toolkit for parsing-based machine translation. Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren Thronton, Jonathan Weese, and Omar Zaidan. In The Prague Bulletin for Mathematical Linguistics (PBML), Number 92, January 2009.
- Visualization tools for the Joshua MT decoder.
- Also in Joshua, an implementation of Chris Dyer's two-pass algorithm for synchronous parsing using SCFGs.
- Thrax, an extractor for synchronous context-free grammars for machine translation.
- I am a contributor to the cdec MT decoder.
Teaching (and assistant-teaching)
My office is in Hackerman 225. As noted at the top of the page, I'm in Philadelphia.
- In Spring semester 2014, I am the TA for Penn's CIS 526, Machine Translation.
- In Spring semester 2012, I was a TA for EN.600.468, Machine Translation.
- In Fall semester 2009, I was a TA for EN.600.465, Natural Language Processing.
jonny at cs dot jhu dot edu