Access the full text.
Sign up today, get DeepDyve free for 14 days.
References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.
An Information-Theoretic Measure to Evaluate Parsing Difficulty Across Treebanks ` ANNA CORAZZA, Universita di Napoli "Federico II" ALBERTO LAVELLI, FBK-irst ` GIORGIO SATTA, Universita di Padova With the growing interest in statistical parsing, special attention has recently been devoted to the problem of comparing different treebanks to assess which languages or domains are more difficult to parse relative to a given model. A common methodology for comparing parsing difficulty across treebanks is based on the use of the standard labeled precision and recall measures. As an alternative, in this article we propose an information-theoretic measure, called the expected conditional cross-entropy (ECC). One important advantage with respect to standard performance measures is that ECC can be directly expressed as a function of the parameters of the model. We evaluate ECC across several treebanks for English, French, German, and Italian, and show that ECC is an effective measure of parsing difficulty, with an increase in ECC always accompanied by a degradation in parsing accuracy. Categories and Subject Descriptors: I.2.7 [Artificial Intelligence]: Natural Language Processing General Terms: Experimentation, Performance Additional Key Words and Phrases: Natural language parsing, probabilistic context-free grammars ACM Reference Format: Corazza, A., Lavelli, A., and Satta, G. 2013.
ACM Transactions on Speech and Language Processing (TSLP) – Association for Computing Machinery
Published: Jan 1, 2013
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.