INVocD: Identifier Name Vocabulary Dataset

Butler, Simon; Wermelinger, Michel; Yu, Yijun and Sharp, Helen (2013). INVocD: Identifier Name Vocabulary Dataset. In: 10th Working Conference on Mining Software Repositories, 18-19 May 2013, San Francisco.



INVocD is a dataset of the identifier name declarations and vocabulary found in 60 FLOSS Java projects. It records the source code structure and makes the identifier name vocabulary directly available, which offers advantages for identifier name research over conventional source code models. The dataset has been used to support a range of research projects, from identifier name analysis to concept location, and provides many opportunities to researchers.
