This page provides information about the IRME Project, carried out at the Utrecht Institute of Linguistics OTS (UiL-OTS) and Alfa-informatica Groningen. The project began in June 2005 and ended on 31 August 2007.
The project was funded by STEVIN.
Internal reports
- Grégoire, Nicole, 2007. Report on integration of acquired lexical knowledge into ECM-based database. (PDF)
- Grégoire, Nicole, 2007. Report on the lexical representation of subclasses of MWEs. (PDF)
- Grégoire, Nicole, 2007. Report on formalizing and elaborating the parameterized Equivalence Class Method for Dutch. (PDF)
- Grégoire, Nicole, 2007. Report on integrating ECM Lexical database in Alpino system. (PDF)
- Grégoire, Nicole, 2007. Report on results of incorporating idiomatic expressions into Rosetta. (PDF)
- Villada Moirón, Begoña, 2007. Maximum Entropy modeling for MWE identification. Comparison with decision trees. (PDF)
- Villada Moirón, Begoña, 2007. Specifications of tools to acquire valence patterns and morphosyntactic restrictions. (PDF)
- Villada Moirón, Begoña, 2007. A task-based evaluation of the ECM database. Effect on parsing performance. (PDF)
- Villada Moirón, Begoña, 2006. Evaluation of a machine learning algorithm for MWE identification. Decision Trees. (PDF)
- Villada Moirón, Begoña, 2005. Log-linear models and latent semantic indexing applied to MWE identification. (PDF)
Publications
- Grégoire, Nicole, 2006, ‘Elaborating the Parameterized Equivalence Class Method for Dutch’, in Calzolari et al. (eds.), LREC2006: 5th International Conference on Language Resources and Evaluation: Proceedings, pp. 1894-1899, Genoa, Italy.
- Grégoire, Nicole, 2007, ‘Design and Implementation of a Lexicon of Dutch Multiword Expressions’, in (Grégoire et al., 2007), pp. 17-24. (PDF)
- Grégoire, Nicole, Stefan Evert and Su Nam Kim (eds.), 2007. ‘Proceedings of the Workshop on A Broader Perspective on Multiword Expressions’, ACL 2007, Prague, Czech Republic. June 28, 2007. (PDF)
- Grégoire, Nicole, Stefan Evert and Brigitte Krenn (eds.), 2008. ‘Proceedings of the Workshop Towards a Shared Task for Multiword Expressions’, LREC 2008, Marrakech, Morocco. June 1, 2008.
- Odijk, J., 2006, ‘IRME’, DIXIT 4.2, p. 23, December 2006. (PDF)
- Van de Cruys, T. and B. Villada Moirón, 2007, ‘Semantics-based Multiword Expression Extraction’. In (Grégoire et al., 2007), pp. 25-32. (PDF)
- Van de Cruys, T. and B. Villada Moirón, 2007, ‘Lexico-Semantic Multiword Expression Extraction’. In P. Dirix et al. (eds.), Computational Linguistics in the Netherlands 2006, pp. 175-190.
- Villada Moirón, Begoña, 2005, ‘Linguistically enriched corpora for establishing variation in support verb constructions’. In Proceedings of the 6th International Workshop on Linguistically Interpreted Corpora (Linc'05), held at the 2nd International Joint Conference on Natural Language Processing (IJCNLP-05). Republic of Korea. (PDF)
- Villada Moirón, Begoña and Jörg Tiedemann, 2006. Identifying idiomatic expressions using automatic word-alignment. Proceedings of the EACL 2006 Workshop on Multi-word-expressions in a multilingual context, pp. 33-40. Trento, Italy. (PDF)
- Villada Moirón, B., A. Villavicencio, D. McCarthy, S. Evert and S. Stevenson (eds.) (2006). Proceedings of the COLING/ACL Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties. Sydney, Australia. (PDF)