rparse - a data-driven parser for Probabilistic Linear Context-Free Rewriting Systems
rparse is a data-driven parser for Probabilistic Linear Context-Free Rewriting Systems (PLCFRS). It has originally been developed at the Emmy Noether group of Prof. Dr. Laura Kallmeyer at the University of Tübingen, Germany. At this time, it is still under active development at the University of Düsseldorf, Germany. This work has been sponsored by the German Research Foundation (DFG), in particular by its Emmy Noether program.
The parser is described in the following publications:
- Laura Kallmeyer and Wolfgang Maier (2010): 'Data-Driven Parsing with Probabilistic Linear Context-Free Rewriting Systems'. In: Proceedings of The 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China.
- Wolfgang Maier (2010): 'Direct Parsing of Discontinuous Constituents in German'. In: Proceedings of the First NAACL/HLT Workshop on Morphologically-Rich Languages (SPRML2010), Los Angeles, CA.
- Wolfgang Maier and Laura Kallmeyer (2010): 'Discontinuity and Non-Projectivity: Using Mildly Context-Sensitive Formalisms for Data-Driven Parsing.' In: Proceedings of the 10th International Conference on Tree Adjoining Grammars and Related Formalisms (TAG+10), Yale University.
- Wolfgang Maier, Miriam Kaeshammer and Laura Kallmeyer (2012): 'Data-Driven PLCFRS Parsing Revisited: Restricting the Fan-Out to Two'. In: Proceedings of the Eleventh International Conference on Tree Adjoining Grammars and Related Formalisms (TAG+11), Paris, France.
rparse is available for download. It is released under the GNU General Public Licence 2.0 or higher. The parser is written in Java (Java 7 is required). See the README file in the package for information on how to run the parser.
In case of questions or comments, please contact Wolfgang Maier.



