An open framework for semantic code queries on heterogeneous repositories

Zhang, Tian; Pan, Minxue; Zhao, Jizhou; Yu, Yijun and Li, Xuandong (2015). An open framework for semantic code queries on heterogeneous repositories. In: Proceedings of the 2015 International Symposium on Theoretical Aspects of Software Engineering (Sun, Jun ed.), IEEE, pp. 39–46.



To help developers understand and reuse programs, semantic queries on the source code itself are attractive. Although programs in heterogeneous languages are placed under version control for collaborative software development, most queries supported by source code repositories are based either on repository metadata or on indexed identifiers and method signatures. Few fully support searching for structures that are common across different programming languages and different viewpoints (hence heterogeneous). To facilitate understanding and reuse, in this paper we propose a novel source code query framework that (1) transforms source code into a unified abstract syntax format and handles heterogeneity (non-isomorphism) at the abstract syntax level; (2) stores source code in cloud-based NoSQL storage using MongoDB; and (3) rewrites semantic query patterns into the NoSQL form. The efficiency of the framework has been evaluated in supporting several open-source hosting platforms.
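The three steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the unified abstract syntax format or the rewriting rules, so the dictionary layout and the names `to_unified`, `rewrite_pattern`, and `find` are hypothetical. Python's standard `ast` module stands in for a language front end, and a small in-memory matcher stands in for MongoDB so that the example runs without a database.

```python
import ast


def to_unified(node):
    """Step 1 (assumed format): turn an AST node into a nested dict document."""
    doc = {"type": type(node).__name__}
    name = getattr(node, "name", None)
    if isinstance(name, str):  # e.g. function and class definitions
        doc["name"] = name
    doc["children"] = [to_unified(c) for c in ast.iter_child_nodes(node)]
    return doc


def rewrite_pattern(node_type, name=None):
    """Step 3 (assumed rule): rewrite a simple pattern into a
    MongoDB-style equality filter document."""
    flt = {"type": node_type}
    if name is not None:
        flt["name"] = name
    return flt


def find(doc, flt):
    """Step 2 stand-in: walk the stored document in pre-order and yield
    every node whose fields equal all fields of the filter."""
    if all(doc.get(key) == value for key, value in flt.items()):
        yield doc
    for child in doc["children"]:
        yield from find(child, flt)


source = (
    "def area(w, h):\n"
    "    return w * h\n"
    "\n"
    "def main():\n"
    "    print(area(2, 3))\n"
)

tree = to_unified(ast.parse(source))

# Query: all function definitions, regardless of name.
matches = [d["name"] for d in find(tree, rewrite_pattern("FunctionDef"))]
print(matches)
```

With a real MongoDB deployment, the filter returned by `rewrite_pattern` would be passed to a collection query instead of the in-memory `find`; how the nested syntax trees would be laid out across documents is a design decision the abstract leaves open.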
