Copy the page URI to the clipboard
Zhou, Deyu; He, Yulan and Kwoh, Chee Keong
(2007).
DOI: https://doi.org/10.1007/978-3-540-72586-2_54
URL: http://www.springerlink.com/content/c458254741383w...
Abstract
Since most knowledge about protein-protein interactions still hides in biological publications, there is an increasing focus on automatically extracting information from the vast amount of biological literature. Existing approaches can be broadly categorized as rule-based or statistically-based. Rule-based approaches require heavy manual effort. On the other hand, statistically-based approaches require large-scale, richly annotated corpora in order to reliably estimate model parameters. This is normally difficult to obtain in practical applications. We have proposed a hidden vector state (HVS) model for protein-protein interactions extraction. The HVS model is an extension of the basic discrete Markov model in which context is encoded as a stack-oriented state vector. State transitions are factored into a stack shift operation similar to those of a push-down automaton followed by the push of a new preterminal category label. In this paper, we propose a novel approach based on the k-nearest-neighbors classifier to automatically train the HVS model from un-annotated data. Experimental results show the improved performance over the baseline system with the HVS model trained from a small amount of the annotated data.
Viewing alternatives
Download history
Metrics
Public Attention
Altmetrics from AltmetricNumber of Citations
Citations from DimensionsItem Actions
Export
About
- Item ORO ID
- 23795
- Item Type
- Conference or Workshop Item
- Extra Information
-
Computational Science – ICCS 2007
7th International Conference, Beijing, China, May 27 - 30, 2007, Proceedings, Part II
Yong Shi, Geert Dick van Albada, Jack Dongarra and Peter M.A. Sloot
ISBN: 978-3-540-72585-5
Lecture Notes in Computer Science 4488 - Keywords
- information extraction; Hidden Vector State Model; protein-protein interactions
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Research Group
- Centre for Research in Computing (CRC)
- Copyright Holders
- © 2007 Springer-Verlag
- Depositing User
- Kay Dave