An Analysis of POS Tag Patterns in Ontology Identifiers and Labels

Williams, Sandra (2013). An Analysis of POS Tag Patterns in Ontology Identifiers and Labels. Technical Report 2013/02; Department of Computing, The Open University.

DOI: https://doi.org/10.21954/ou.ro.000160c3

Abstract

I describe an analysis of the syntax of identifier names found in a corpus of over 500 ontologies. The analysis was performed in six steps: (i) extraction of identifier names from the corpus; (ii) construction of dummy sentences containing the identifiers; (iii) part-of-speech (POS) tagging; (iv) extraction of POS tag strings; (v) POS string frequency analysis; and (vi) general syntactic pattern analysis. The analysis found that identifier names follow simple syntactic patterns, that each type of identifier can be expressed through relatively few patterns, and that the syntax of identifiers differs from natural English in consistent ways.
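The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the report's implementation: the abstract does not name the tagger or the dummy-sentence template, so the toy lexicon `TOY_LEXICON`, the frame `FRAME_PREFIX`, the sample identifiers, and all function names below are hypothetical stand-ins chosen for illustration. A real study would replace `tag_tokens` with a trained POS tagger.

```python
import re
from collections import Counter

# Hypothetical toy lexicon standing in for a real POS tagger
# (Penn Treebank-style tags). Unknown words default to NN.
TOY_LEXICON = {
    "it": "PRP", "is": "VBZ", "has": "VBZ", "of": "IN",
    "part": "NN", "author": "NN", "topping": "NN",
    "red": "JJ", "wine": "NN", ".": ".",
}

# Hypothetical dummy-sentence frame; the report's actual template
# is not given in the abstract.
FRAME_PREFIX = ["it", "is"]


def split_identifier(name):
    """Split a camelCase, snake_case or hyphenated identifier into lowercase words."""
    spaced = re.sub(r"[_\-]+", " ", name)
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", spaced)
    return [word.lower() for word in spaced.split()]


def tag_tokens(tokens):
    """Toy tagger: dictionary lookup with NN as the fallback tag."""
    return [TOY_LEXICON.get(token, "NN") for token in tokens]


def pos_pattern(name):
    """Embed the identifier in a dummy sentence, tag it, and keep only its own tags."""
    words = split_identifier(name)
    sentence = FRAME_PREFIX + words + ["."]
    tags = tag_tokens(sentence)
    start = len(FRAME_PREFIX)
    return " ".join(tags[start:start + len(words)])


def pattern_frequencies(names):
    """Count how often each POS tag string occurs across the identifiers."""
    return Counter(pos_pattern(name) for name in names)


if __name__ == "__main__":
    sample = ["hasAuthor", "hasTopping", "RedWine", "is_part_of"]
    for pattern, count in pattern_frequencies(sample).most_common():
        print(count, pattern)
```

With a lookup tagger the dummy sentence has no effect on the tags; it is kept here only to show where sentence context would be supplied to a context-sensitive tagger.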
