Williams, Sandra (2013). An Analysis of POS Tag Patterns in Ontology Identifiers and Labels. Technical Report 2013/02; Department of Computing, The Open University.
DOI: https://doi.org/10.21954/ou.ro.000160c3
Abstract
I describe an analysis of the syntax of identifier names found in a corpus of over 500 ontologies. The analysis was performed in six steps: (i) extraction of identifier names from the corpus; (ii) construction of dummy sentences containing the identifiers; (iii) part-of-speech (POS) tagging; (iv) extraction of POS tag strings; (v) POS string frequency analysis; and (vi) general syntactic pattern analysis. The analysis found that identifier names follow simple syntactic patterns, that each type of identifier can be expressed through relatively few patterns, and that the syntax of identifiers differs from natural English in consistent ways.
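The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the report's implementation, and it makes several assumptions: the identifier-splitting rule, the carrier words used for the dummy sentence, and the tiny `TOY_LEXICON` that stands in for a real POS tagger are all hypothetical, chosen only so that the example runs on its own. The tags follow Penn Treebank conventions.

```python
import re
from collections import Counter

# Hypothetical toy lexicon standing in for a real POS tagger (assumption).
TOY_LEXICON = {
    "has": "VBZ", "is": "VBZ", "part": "NN", "of": "IN",
    "author": "NN", "topping": "NN", "red": "JJ", "wine": "NN",
}

# Hypothetical carrier words placed before the identifier to form a dummy sentence.
CARRIER_PREFIX = ["It", "is"]


def split_identifier(name):
    """Split a camelCase, underscore, or hyphen identifier into lowercase words."""
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", name)
    return [w.lower() for w in re.split(r"[\s_\-]+", spaced) if w]


def tag_tokens(tokens):
    """Tag each token by lexicon lookup; unknown words default to NN."""
    return [(t, TOY_LEXICON.get(t.lower(), "NN")) for t in tokens]


def pos_pattern(name):
    """Build a dummy sentence, tag it, and return the identifier's POS tag string."""
    words = split_identifier(name)
    sentence = CARRIER_PREFIX + words + ["."]
    tagged = tag_tokens(sentence)
    # Keep only the tags that belong to the identifier's own words.
    start = len(CARRIER_PREFIX)
    own = tagged[start:start + len(words)]
    return " ".join(tag for _, tag in own)


def pattern_frequencies(names):
    """Count how often each POS tag string occurs across a list of identifiers."""
    return Counter(pos_pattern(n) for n in names)


if __name__ == "__main__":
    names = ["hasAuthor", "hasTopping", "RedWine", "is_part_of"]
    for pattern, count in pattern_frequencies(names).most_common():
        print(count, pattern)
```

In practice the lexicon lookup would be replaced by a trained tagger, which is where the dummy sentence matters: it gives the tagger sentence context that a bare identifier lacks.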