The Open UniversitySkip to content

Automatic extraction of knowledge from web documents

Alani, Harith; Kim, Sanghee; Millard, David E.; Weal, Mark J.; Lewis, Paul H.; Hall, Wendy and Shadbolt, Nigel R. (2003). Automatic extraction of knowledge from web documents. In: 2nd International Semantic Web Conference - Workshop on Human Language Technology for the Semantic Web and Web Services, 20-23 Oct 2003, Sanibel Island, Florida, USA.

Full text available as:
PDF (Accepted Manuscript) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (285kB)
Google Scholar: Look up in Google Scholar


A large amount of digital information available is written as text documents in the form of web pages, reports, papers, emails, etc. Extracting the knowledge of interest from such documents from multiple sources in a timely fashion is therefore crucial. This paper provides an update on the Artequakt system which uses natural language tools to automatically extract knowledge about artists from multiple documents based on a predefined ontology. The ontology represents the type and form of knowledge to extract. This knowledge is then used to generate tailored biographies. The information extraction process of Artequakt is detailed and evaluated in this paper.

Item Type: Conference or Workshop Item
Copyright Holders: 2003 The Authors
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Item ID: 20050
Depositing User: Harith Alani
Date Deposited: 15 Apr 2010 12:41
Last Modified: 11 Dec 2018 03:27
Share this page:

Download history for this item

These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU