Access the full text.
Sign up today, get DeepDyve free for 14 days.
Eric Laporte, Takuya Nakamura, Stavroula Voyatzi (2008)
A French Corpus Annotated for Multiword Nouns
M. Moiron (2005)
University of Groningen Data-driven identification of fixed expressions and their modifiability
R. Moon (1998)
Fixed Expressions and Idioms in English: A Corpus-Based Approach
Carlos Ramisch, P. Schreiner, M. Idiart, Aline Villavicencio (2008)
An Evaluation of Methods for the Extraction of Multiword Expressions
Valentina Efrati, F. Masini (2011)
CoP-It. Towards the creation of an online database of Italian word combinations
(2009)
Special issue on Multiword Expressions
A. Michiels, Nicolas Dufour (1998)
DEFI, a tool for automatic multi-word unit recognition, meaning assignment and translation selection
(2000)
GRADIT, Grande dizionario Italiano dell'uso. UTET
Andrea Zaninello, M. Nissim (2010)
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian
M. Nissim, Andrea Zaninello (2011)
A quantitative study on the morphology of Italian multiword expressions
Aline Villavicencio, Francis Bond, A. Korhonen, Diana McCarthy (2005)
Introduction to the special issue on multiword expressions: Having a crack at a hard nutComput. Speech Lang., 19
Pavel Pecina (2008)
AMachine Learning Approach to Multiword Expression Extraction
Ann Copestake, Fabre Lambeau, Aline Villavicencio, Francis Bond, Timothy Baldwin, I. Sag, D. Flickinger (2002)
Multiword expressions: linguistic precision and reusability
E. Zanchetta, Marco Baroni (2005)
Morph-it! A free corpus-based morphological resource for the Italian language, 1
M. Constant, Anthony Sigogne (2011)
MWU-Aware Part-of-Speech Tagging with a CRF Model and Lexical Resources
Mark Finlayson, Nidhi Kulkarni (2011)
Detecting Multi-Word Expressions Improves Word Sense Disambiguation
G. Nunberg, I. Sag, T. Wasow (2015)
IdiomsLanguage, 70
Paul Rayson, D. Archer, S. Piao, A. McEnery (2004)
The UCREL Semantic Analysis System
Agata Savary (2009)
Computational Inflection of Multi-Word Units, a contrastive study of lexical approachesLinguistic Issues in Language Technology
Yi Zhang, Valia Kordoni, Aline Villavicencio, M. Idiart (2006)
Automated Multiword Expression Prediction for Grammar Engineering
Heiki-Jaan Kaalep, K. Muischnek (2008)
Multi-Word Verbs of Estonian : a Database and a Corpus
Marion Weller, U. Heid (2010)
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
Aline Villavicencio, Valia Kordoni, Yi Zhang, M. Idiart, Carlos Ramisch (2007)
Validation and Evaluation of Automatically Acquired Multiword Expressions for Grammar Engineering
C. Fellbaum, Alexander Geyken, Axel Herold, Fabian Körner, G. Neumann (2006)
Corpus-based Studies of German Idioms and Light VerbsInternational Journal of Lexicography, 19
K. Choukri, M. Nilsson (1998)
The european language resources association
Petter Haugereid, Francis Bond (2011)
Extracting Transfer Rules for Multiword Expressions from Parallel Corpora
N. Calzolari, C. Fillmore, R. Grishman, Nancy Ide, Alessandro Lenci, C. Macleod, A. Zampolli (2002)
Towards Best Practice for Multiword Expressions in Computational Lexicons
(2004)
Polirematiche
(1995)
The syntactic behaviour of idioms
Julia Miller (2010)
Review of Fellbaum, C., ed. (2007) Idioms and Collocations: Corpus-based Linguistic and Lexicographic StudiesAustralian Review of Applied Linguistics, 33
I. Sag, Timothy Baldwin, Francis Bond, Ann Copestake, D. Flickinger (2002)
Multiword Expressions: A Pain in the Neck for NLP
S. Evert, Brigitte Krenn (2005)
Using small random samples for the manual evaluation of statistical association measuresComput. Speech Lang., 19
O. Christ (1994)
A Modular and Flexible Architecture for an Integrated Corpus Query SystemArXiv, abs/cmp-lg/9408005
Tim Cruys, Begoña Moirón (2007)
Semantics-based Multiword Expression Extraction
(1994)
Idioms. Lang
(2013)
ACM Transactions on Speech and Language Processing
Carlos Ramisch, Aline Villavicencio, C. Boitet (2010)
mwetoolkit: a Framework for Multiword Expression Identification
E. Wehrli (1998)
Translating Idioms
A. Fazly, S. Stevenson (2006)
Automatically Constructing a Lexicon of Verb Phrase Idiomatic Combinations, 11
(2007)
Syntactic subcategorization of noun + verb multiwords : Description , classification and extraction from text corpora
(2004)
Polirematiche. Linguistica Pragensia
(2000)
GRADIT, Grande dizionario Italiano dell’uso
Aline Villavicencio, Ann Copestake, Benjamin Waldron, Fabre Lambeau (2004)
Lexical Encoding of MWEs
J. Odijk (2004)
A proposed standard for the lexical representation of idioms
(2007)
Parole sintagmatiche in italiano
(2007)
Collocations and Idioms: Corpus-Based Linguistic and Lexicographic Studies
Marco Baroni, Silvia Bernardini, Adriano Ferraresi, E. Zanchetta (2009)
The WaCky wide web: a collection of very large linguistically processed web-crawled corporaLanguage Resources and Evaluation, 43
Adele Goldberg (2003)
Constructions: a new theoretical approach to languageTrends in Cognitive Sciences, 7
Marco Baroni, Silvia Bernardini, Federica Comastri, Lorenzo Piccioni, A. Volpi, G. Aston, Marco Mazzoleni (2004)
Introducing the La Repubblica Corpus: A Large, Annotated, TEI(XML)-compliant Corpus of Newspaper Italian
U. Heid, Marion Weller (2010)
Corpus-derived data on German multiword expressions for lexicography
(2014)
The Boundaries of the Lexicon
S. Piao, Guangfan Sun, Paul Rayson, Q. Yuan (2006)
Automatic Extraction of Chinese Multiword Expressions with a Statistical Tool
Paul Cook, A. Fazly, S. Stevenson (2007)
Pulling their Weight: Exploiting Syntactic Forms for the Automatic Identification of Idiomatic Expressions in Context
C. Bannard (2007)
A Measure of Syntactic Flexibility for Automatically Identifying Multiword Expressions in Corpora
(2004)
The UCREL semantic analysis system Beyond Named Entity Recognition Semantic Labelling for NLP Tasks
Nicole Grégoire (2010)
DuELME: a Dutch electronic lexicon of multiword expressionsLanguage Resources and Evaluation, 44
S. Banerjee, Ted Pedersen (2003)
The Design, Implementation, and Use of the Ngram Statistics Package
(2011)
CoP-It
Hassan Al-Haj, S. Wintner (2010)
Identifying Multi-word Expressions by Leveraging Morphological and Syntactic Idiosyncrasy
Jacob Cohen (1960)
A Coefficient of Agreement for Nominal ScalesEducational and Psychological Measurement, 20
(2008)
Statistical methods for corpus exploitation
Modeling the Internal Variability of Multiword Expressions through a Pattern-Based Method MALVINA NISSIM, University of Bologna ANDREA ZANINELLO, Zanichelli editore, Bologna The issue of internal variability of multiword expressions (MWEs) is crucial towards their identification and extraction in running text. We present a corpus-supported and computational study on Italian MWEs, aimed at defining an automatic method for modeling internal variation, exploiting frequency and part-of-speech (POS) information. We do so by deriving an XML-encoded lexicon of MWEs based on a manually compiled dictionary, which is then projected onto a a large corpus. Since a search for fixed forms suffers from low recall, while an unconstrained flexible search for lemmas yields a loss in precision, we suggest a procedure aimed at maximizing precision in the identification of MWEs within a flexible search. Our method builds on the idea that internal variability can be modelled via the novel introduction of variation patterns, which work over POS patterns, and can be used as working tools for controlling precision. We also compare the performance of variation patterns to that of association measures, and explore the possibility of using variation patterns in MWE extraction in addition to identification. Finally, we suggest that corpus-derived, pattern-related
ACM Transactions on Speech and Language Processing (TSLP) – Association for Computing Machinery
Published: Jun 1, 2013
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.