Penev, Lyubomir; Lyal, Christopher H. C.; Weitzman, Anna; Morse, David R.; King, David; Sautter, Guido; Georgiev, Teodor; Morris, Robert A.; Catapano, Terry and Agosti, Donat
(2011).
XML schemas and mark-up practices of taxonomic literature.
ZooKeys, 150
pp. 89–116.
Abstract
We review the three most widely used XML schemas used to mark-up taxonomic texts, TaxonX, TaxPub and taXMLit. These are described from the viewpoint of their development history, current status, implementation, and use cases. The concept of “taxon treatment” from the viewpoint of taxonomy mark-up into XML is discussed. TaxonX and taXMLit are primarily designed for legacy literature, the former being more lightweight and with a focus on recovery of taxon treatments, the latter providing a much more detailed set of tags to facilitate data extraction and analysis. TaxPub is an extension of the National Library of Medicine Document Type Definition (NLM DTD) for taxonomy focussed on layout and recovery and, as such, is best suited for mark-up of new publications and their archiving in PubMedCentral. All three schemas have their advantages and shortcomings and can be used for different purposes.
| Item Type: |
Journal Article
|
| Copyright Holders: |
2011 Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License 3.0. |
| ISSN: |
1313-2989 |
| Funders: |
ViBRANT (Virtual Biodiversity Research and Access Network for Taxonomy, http://vbrant.eu) FP7 project |
| Keywords: |
mark-up; XML schema; taxonomy; TaxonX; TaxPub; taXMLit |
| Academic Unit/Department: |
Mathematics, Computing and Technology > Computing |
| Interdisciplinary Research Centre: |
Centre for Research in Computing (CRC) |
| Item ID: |
30401 |
| Depositing User: |
David Morse
|
| Date Deposited: |
05 Jan 2012 12:41 |
| Last Modified: |
23 Oct 2012 14:21 |
| URI: |
http://oro.open.ac.uk/id/eprint/30401 |
Actions (login may be required)