Priors for the Estimation of Probabilistic Grammars
from Incomplete Natural Language Data (VIDI Project Sima'an)


Publications -- Since the start of the project in January 2007:

  1. Reut Tsarfaty and Khalil Sima'an. Evaluating an Alternative to Head-Driven Approaches to Parsing a (Relatively) Free Word-Order Language. To appear in Proceedings of the Conference on Empirical Methods in NLP (EMNLP'09), Singapore.
  2. Hany Hassan, Khalil Sima'an and Andy Way. A Syntactified Direct Translation Model with Linear-Time Decoding.
    To appear in Proceedings of the Conference on Empirical Methods in NLP (EMNLP'09), Singapore.
  3. Hany Hassan, Khalil Sima'an and Andy Way.  Lexicalized Semi-Incremental Dependency Parsing. 
    To appear in Proceedings of Recent Advances in NLP (RANLP'09), Borovets, Bulgaria.
  4. Khalil Sima'an and Markos Mylonakis. Better Statistical Estimation Can Benefit All Phrases in Phrase-Based Statistical Machine Translation. In Proceedings IEEE Workshop on Spoken Language Technology (SLT) 2008, Goa, India.
  5. Hany Hassan, Khalil Sima'an and Andy Way. A Syntactic Language Model based on Incremental CCG Parsing. In Proceedings IEEE Workshop on Spoken Language Technology (SLT) 2008, Goa, India.
  6. Markos Mylonakis and Khalil Sima'an. Phrase Translation Probabilities with ITG Priors and Smoothing as Learning Objective. In Proceedings Conf. on Empirical Methods in NLP (EMNLP'08), 2008.
  7. Barbara Plank and Khalil Sima'an. Parsing with Subdomain Instance Weighting from Raw Corpora. In proceedings Interspeech 2008, Australia, Sep. 2008.
  8. Reut Tsarfaty and Khalil Sima'an. Relational Realizational Parsing. In proceedings COLING 2008, Manchester, UK, August 2008.
  9. Hany Hassan, Khalil Sima'an and Andy Way. Syntactically Lexicalized Phrase-Based Statistical Translation. IEEE Transactions on Audio, Speech and Language Processing, 2008.
  10. Barbara Plank and Khalil Sima'an. Subdomain Sensitive Statistical Parsing using Raw Corpora. In Proceedings Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco.
  11. Roy Bar-Haim, Khalil Sima'an and Yoad Winter. Part-of-Speech Tagging of Modern Hebrew Text.  Journal of Natural Language Engineering (J-NLE), 14(2):223-251, 2008.
  12. M. Mylonakis, K. Sima'an and R. Hwa. Unsupervised Estimation for Noisy-Channel Models. In 24th Annual International Conference on Machine Learning (ICML 2007).
  13. Hany Hassan, Khalil Sima'an and Andy Way. Supertagged Phrase-Based Statistical Machine Translation. In Proceedings of 45th Annual Meeting of the Association for Comp. Linguistics (ACL'07). 
  14. Reut Tsarfaty and Khalil Sima'an. Accurate Unlexicalized Parsing for Modern Hebrew. In Proceedings of Text, Speech and Dialog (TSD'07). Lecture Notes in Computer Science (LNCS). Pilsen, Czech Republic, September 2007.
  15. Reut Tsarfaty and Khalil Sima'an. Three-Dimensional Parametrization for Parsing Morphologically Rich Languages.  In Proceedings of the International Conference on Parsing Technologies (IWPT'07). Prague, Czech Republic, June 2007. 
  16. Saib Mansour, Khalil Sima'an and Yoad Winter. Smoothing a Lexicon-based POS tagger for Arabic and Hebrew. In Proceedings of ACL 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources. Prague, Czech Republic, 2007. Also presented as an extended abstract at Bar Ilan Symposium on Artificial Intelligence (BISFAI 2007).
  17. Markos Mylonakis and Khalil Sima'an. Translation Lexicon Estimates from Non-Parallel Corpora Pairs. In Proceedings Belgian-Netherlands AI Conference (BNAIC), Utrecht, 2007. BNAIC'07 Best Paper Award.

Invited Talks (Khalil Sima'an)