The role of data in NLP: the case for dataset profiling

De Roeck, Anne (2007). The role of data in NLP: the case for dataset profiling. In: Nicolov, Nicolas; Mitkov, Ruslan and Angelova, Galia eds. Recent Advances in Natural Language Processing IV. Current issues in linguistic theory (292). Amsterdam: John Benjamin Publishing Company, pp. 259–266.




[About the book]

This volume brings together selected and revised papers from the international conference on “Recent Advances in Natural Language Processing”, held in Borovets, Bulgaria, in September 2005. The best papers have been selected for this volume with the aim to reflect the most promising and significant trends in natural language processing. The volume covers a wide variety of topics in Natural Language Processing, including information extraction, indexing, latent semantic analysis, dependency parsing, anaphora and referring expressions, spam analysis, document classification, rhetorical relations, textual entailment, question answering, ontologies, word sense disambiguation, machine translation, treebanks and corpora.

Viewing alternatives


Public Attention

Altmetrics from Altmetric

Number of Citations

Citations from Dimensions
No digital document available to download for this item

Item Actions