BEYOND CFG

Publications

To Appear

Andreas van Cranenburgh and Rens Bod (2017).A Data-Oriented Model of Literary Language.
To be presented at EACL 2017.

2016

Laura Kallmeyer (2016). On the Mild Context-Sensitivity of k-Tree Wrapping Grammar, in Annie Foret, Glyn Morrill, Reinhard Muskens, Rainer Osswald and Sylvain Pogodalla (eds) Formal Grammar. 20th and 21st International Conferences, FG 2015, Barcelona, Spain, August 2015, Revised Selected Papers. FG 2016, Bozen, Italy, August 2016, Proceedings, Volume 9804 of the series Lecture Notes in Computer Science, pp 77-93.

Laura Kallmeyer and Wolfgang Maier (2016). LR Parsing for LCFRS, Algorithms 9(3), 58..

Andreas van Cranenburgh, Remko Scha, Rens Bod (2016). Data-Oriented Parsing with Discontinuous Constituents and Function Tags. Journal of Language Modelling, vol. 4, no. 1, pp. 57-111. dx.doi.org/10.15398/jlm.v4i1.100

Kim Jautze, Andreas van Cranenburgh, Corina Koolen (2016). Topic Modeling Literary Quality. Digital Humanities 2016, Krakow, Poland, 11-16 July. http://andreasvc.github.io/dh2016.pdf 

Wolfgang Maier and Timm Lichte, “Discontinuous parsing with continuous trees,” in Proceedings of the Workshop on Discontinuous Structures in Natural Language Processing, San Diego, California, 2016, pp. 47-57.

Younes Samih and Wolfgang Maier, “An Arabic-Moroccan Darija Code-Switched Corpus,” in Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, 2016.

Younes Samih and Wolfgang Maier, “Detecting code-switching in Moroccan Arabic,” in Proceedings of SocialNLP @ IJCAI-2016, New York, 2016.

Younes Samih, Suraj Maharjan, Mohammed Attia, Laura Kallmeyer and Thamar Solorio (2016): Multilingual Code-switching Identification via LSTM Recurrent Neural Networks. In the Proceedings of the Second Workshop onComputational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016.

Younes Samih, Wolfgang Maier and Laura Kallmeyer (2016): SAWT: Sequence Annotation Web Tool. In the Proceedings of the Second Workshop on Computational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016.

Attia,M.,  Maharjan, S.,  Samih,Y.,  Kallmeyer, L., and Solorio, T.  Detecting Semantic Relations via Word Embeddings, in Proceedings of the  5th CogALex workshop at COLING 2016

2015

Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, and Josef van Genabith (2015): Arabic spelling error detection and correction. Natural Language Engineering. [doi]

Laura Kallmeyer and Wolfgang Maier (2015): LR Parsing for LCFRS. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, pages 1250-1255. [pdf]

Laura Kallmeyer (2015): On the mild context-sensitivity of k-Tree Wrapping Grammar. In Annie Foret, Glyn Morrill, Reinhard Muskens and Rainer Osswald (eds) Preproceedings of the 20th Conference on Formal Grammar, pages 72-88. [pdf]

Miriam Kaeshammer (2015): Hierarchical Machine Translation With Discontinuous Phrases. In Proceedings of the Tenth Workshop on Statistical Machine Translation (EMNLP), Lisbon, Portugal.

Wolfgang Maier (2015): Discontinuous Incremental Shift-reduce Parsing. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, pages 1202-1212. [pdf]

Simon Petitjean, Younes Samih, and Timm Lichte (2015): Une métagrammaire de l'interface morpho-sémantique dans les verbes en arabe. In Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles, Caen, France, pages 473-479.

2014

Miriam Kaeshammer and Anika Westburg (2014): On Complex Word Alignment Configurations. In Proceedings of the 9th Edition of the Language Resources and Evaluation Conference (LREC 2014), Reykjavik, May 2014. [pdf]

Levi King, Eric Baucom, Tilmur Gilmanov, Sandra Kübler, Daniel Whyatt, Wolfgang Maier, and Paul Rodrigues (2014): The IUCL+ System: Word-Level Language Identification via Extended Markov Models. In Proceedings of the First Workshop on Computational Approaches to Code Switching, Doha, Qatar. [pdf]

Wolfgang Maier and Carlos Gómez-Rodríguez (2014): Language variety identification in Spanish tweets. In Proceedings of the EMNLP'2014 Workshop on Language Technology for Closely Related Languages and Language Variants, Doha, Qatar. [pdf]

Wolfgang Maier, Sandra Kübler, Daniel Dakota, and Daniel Whyatt (2014). Parsing German: How Much Morphology Do We Need? In Proceedings of the First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages, Dublin, Ireland. [pdf]

Wolfgang Maier, Miriam Kaeshammer, Peter Baumann, Sandra Kübler (2014): Discosuite - A Parser Test Suite for German Discontinuous Structures. In Proceedings of the 9th Edition of the Language Resources and Evaluation Conference (LREC 2014), Reykjavik, May 2014. [pdf]

2013

Pierre Bourreau, Sylvain Salvati and Laura Kallmeyer (2013): On IO-copying and mildly-context sensitive formalisms. In Glyn Morrill and Mark-Jan Nederhof (eds) Formal Grammar. 17th and 18th International Conferences, FG 2012 Opole, Poland, August 2012, Revised Selected Papers. FG 2013 Düsseldorf, Germany, August 2013, Proceedings, LNCS 8036, pages 1-16, Springer.

Miriam Kaeshammer (2013): Synchronous Linear Context-Free Rewriting Systems for Machine Translation. In Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-7), NAACL-HLT 2013 Workshop. Atlanta, Georgia, USA. [pdf]

Laura Kallmeyer: Linear Context-Free Rewriting Systems. Language and Linguistics Compass 7(1), 22-38, January 2013. [doi]

Laura Kallmeyer, Rainer Osswald and Robert D. Van Valin, Jr. (2013): Tree Wrapping for Role and Reference Grammar. In Glyn Morrill and Mark-Jan Nederhof (eds) Formal Grammar. 17th and 18th International Conferences, FG 2012 Opole, Poland, August 2012, Revised Selected Papers. FG 2013 Düsseldorf, Germany, August 2013, Proceedings, LNCS 8036, pages 175-190, Springer.

Laura Kallmeyer and Wolfgang Maier (2013): Data-driven Parsing using Probabilistic Linear Context-Free Rewriting Systems. Computational Linguistics 39 (1). [pdf]

Sandra Kübler and Wolfgang Maier (2013): Über den Einfluss von Part-of-Speech-Tags auf Parsing-Ergebnisse. Journal for Language Technology and Computational Linguistics 28 (1). [pdf]

Wolfgang Maier (2013): Parsing Discontinuous Structures. PhD thesis, University of Tübingen. [pdf] [bib]

Wolfgang Maier (2013): LCFRS binarization and debinarization for directional parsing. In Proceedings of The 13th International Conference on Parsing Technologies, Nara, Japan, pages 113-119. [pdf]

Wolfgang Maier and Sandra Kübler (2013): Are All Commas Equal? Detecting Coordination in the Penn Treebank. The Twelfth Workshop on Treebanks and Linguistic Theories (TLT12), Sofia, Bulgaria, pages 121-133. [pdf]

Thomas Schoenemann (2013). Training Nondeficient Variants of IBM-3 and IBM-4 for Word Alignment. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, pages 22–31. [pdf]

Djamé Seddah, Reut Tsarfaty, Sandra Kübler, Marie Candito, Jinho D. Choi, Richard Farkas, Jennifer Foster, Iakes Goenaga, Koldo Gojenola Galletebeitia, Yoav Goldberg, Spence Green, Nizar Habash, Marco Kuhlmann, Wolfgang Maier, Yuval Marton, Joakim Nivre, Adam Przepiórkowski, Ryan Roth, Wolfgang Seeker, Yannick Versley, Veronica Vincze, Marcin Wolinski, and Alina Wróblewska (2013). Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages. In Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, Seattle, Washington, USA. [pdf]

2012

Miriam Kaeshammer and Vera Demberg (2012): German and English Treebanks and Lexica for Tree-Adjoining Grammars. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey[pdf]

Laura Kallmeyer and Marco Kuhlmann (2012): A Formal Model for Plausible Dependencies in Lexicalized Tree Adjoining Grammar. In Proceedings of the 11th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG+11), Paris, France, pages 108-116. [pdf]

Laura Kallmeyer and Rainer Osswald (2012): A frame-based semantics of the dative alternation in Lexicalized Tree Adjoining Grammars. In Christopher Pinón (ed.) Empirical Issues in Syntax and Semantics 9, December 2012, pages 167-184. ISSN 1769-7158.

Laura Kallmeyer and Rainer Osswald (2012): An Analysis of Directed Motion Expressions with Lexicalized Tree Adjoining Grammars and Frame Semantics. In L. Ong and R. de Queiroz (eds) WoLLIC 2012, LNCS 7456, 34-55, Springer.

Wolfgang Maier, Miriam Kaeshammer, and Laura Kallmeyer (2012): Data-Driven PLCFRS Parsing Revisited: Restricting the Fan-Out to Two. In Proceedings of the Eleventh International Conference on Tree Adjoining Grammars and Related Formalisms (TAG+11), Paris, France. [pdf] [code]

Wolfgang Maier, Sandra Kübler, Erhard Hinrichs, and Julia Krivanek (2012): Annotating Coordination in the Penn Treebank. In Proceedings of The 6th Linguistic Annotation Workshop (The LAW VI), Jeju, Korea. [pdf]

Thomas Schoenemann, Fredrik Kahl, Simon Masnou, Daniel Cremers, (2012): A linear framework for region-based image segmentation and inpainting involving curvature penalization. International Journal of Computer Vision (IJCV), 99(1), 53-68, August 2012. 

Thomas Schoenemann and Daniel Cremers (2012): A Coding Cost Framework for Super-resolution Motion Layer Decomposition. IEEE Transactions on Image Processing (TIP), 21(3), pp. 1097-1110, March 2012.

Thomas Schoenemann (2012): Comparing Linear and Convex Relaxations for Stereo and Motion. In International Conference on Pattern Recognition Applications and Methods (ICPRAM), Vilamoura, Portugal.

Yulia Zinova and Laura Kallmeyer (2012): A Frame-Based Semantics of Locative Alternation in LTAG. In Proceedings of the 11th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG+11), Paris, September 2012, pages 28-36.

2011

Kilian Evang and Laura Kallmeyer (2011): PLCFRS Parsing of English Discontinuous Constituents. Proceedings of the 12th International Conference on Parsing Technologies (IWPT), pages 104-116. October 5-7, Dublin. [pdf] [bib]

Miriam Kaeshammer and Dominikus Wetzel (2011): Enriching Phrase-Based Statistical Machine Translation with POS Information. Proceedings of the Student Research Workshop associated with RANLP 2011, pages 33-40, Hissar, Bulgaria, 13 September 2011. [pdf] [bib]

Wolfgang Maier and Timm Lichte (2011): Characterizing Discontinuity in Constituent Treebanks. In de Groote, P., Egg, M. and Kallmeyer, L., editors: Formal Grammar. 14th International Conference, FG 2009, Bordeaux, France, July 25-26, 2009, Revised Selected Papers, volume 5591 of Lecture Notes in Artificial Intelligence, pages 167-182. [url] (at Springer) 

Thomas Schoenemann (2011): Regularizing Mono- and Bi-word Models for Word Alignment. In International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand November 8-13 2011. [url]

Thomas Schoenemann (2011): Probabilistic Word Alignment under the L0-norm. In Computational Natural Language Learning (CoNLL), Portland, Oregon, June 23-24 2011. [url]

2010

Laura Kallmeyer (2010): On Mildly Context-Sensitive Non-Linear Rewriting. Research on Language and Computation. Volume 8, Number 4, 341-363, DOI: 10.1007/s11168-011-9081-6. [url] (at Springer)

Thomas Schoenemann (2010): Computing Optimal Alignments for the IBM-3 Translation Model. In Computational Natural Language Learning (CoNLL). Uppsala, Sweden, July 15-16 2010. [url]