The Open University

INVocD: Identifier Name Vocabulary Dataset

Butler, Simon; Wermelinger, Michel; Yu, Yijun and Sharp, Helen (2013). INVocD: Identifier Name Vocabulary Dataset. In: 10th Working Conference on Mining Software Repositories, 18-19 May 2013, San Francisco.

Full text available as:
ZIP archive (Version of Record), 604MB
PDF (Accepted Manuscript), 218kB


INVocD is a dataset of the identifier name declarations and vocabulary found in 60 FLOSS Java projects. It records the source code structure and makes the identifier name vocabulary directly available, which offers advantages for identifier name research over conventional source code models. The dataset has been used to support a range of research projects, from identifier name analysis to concept location, and provides many opportunities to researchers.

Item Type: Conference or Workshop Item
Copyright Holders: 2013 IEEE
Extra Information: The zip file includes the dataset and instructions on how to use it.
Keywords: identifier names; source code model; source code mining
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications
Research Group: Centre for Research in Computing (CRC)
Item ID: 36992
Depositing User: Michel Wermelinger
Date Deposited: 09 Apr 2013 08:19
Last Modified: 07 Dec 2018 23:02