New thesaurus standard ISO 25964 Part 1 published
Monday, October 24th, 2011 by traugottkoch
After three and a half years of intensive work and a long formal process, ISO TC46/SC9 WG8 has finalized Part 1 of the new thesaurus standard. ISO published it on August 18, 2011.
ISO 25964-1 “Information and documentation — Thesauri and interoperability with other vocabularies — Part 1: Thesauri for information retrieval” is a standard for the development of thesauri and the exchange of thesaurus data, covering both mono- and multilingual thesauri.
It was developed (cf. [1]) as an extensive revision and extension of ISO 2788 and ISO 5964 (1985/6), based on BS 8723 (2005-2008), and contains the following main sections:
1 Scope
2 Terms and definitions
3 Symbols, abbreviated terms and other conventions
4 Thesaurus overview and objectives
5 Concepts and their scope in a thesaurus
6 Thesaurus terms
7 Complex concepts
8 The equivalence relationship, in a monolingual context
9 Equivalence across languages
10 Relationships between concepts
11 Facet analysis
12 Presentation and layout
13 Managing thesaurus construction and maintenance
14 Guidelines for thesaurus management software
15 Data model
16 Integration of thesauri with applications
17 Exchange formats
18 Protocols
Annex A (informative) Examples of displays found in published thesauri
Annex B (informative) XML Schema for data exchange
Bibliography
Index
Part 1 can be bought and downloaded from ISO [2] or ordered from national standards bodies such as BSI, DIN, ANSI and AFNOR. It is available as PDF or in print, in English only. Standards distributors may also sell it, and certain reference libraries include it in their holdings.
The XML schema developed for the exchange of thesaurus data (an annex to the standard) is, however, freely available, cf. [3]. It is underpinned by a UML data model that provides for all the features a thesaurus may include, e.g. terms, relations among terms, concepts, relations among concepts, arrays and concept groups. Some constraints have been incorporated in the schema to ensure the quality of the represented thesauri.
The NISO site [4] provides the schema without charge or password control, together with an introduction to it, an XML test example, HTML documentation files and an errata page for the standard.
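To make the idea of a concept-based data model with built-in constraints more concrete, here is a minimal Python sketch. It is not the ISO 25964 schema or its UML model: the class names (Term, Concept, Thesaurus), the fields, and the two checks in problems() are simplifications assumed for illustration, and arrays and concept groups are left out entirely.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Term:
    """A label for a concept in one language (hypothetical, simplified)."""
    label: str
    lang: str
    preferred: bool = False


@dataclass
class Concept:
    """A concept with its terms and links to other concepts by identifier."""
    identifier: str
    terms: list[Term] = field(default_factory=list)
    broader: set[str] = field(default_factory=set)  # hierarchical links
    related: set[str] = field(default_factory=set)  # associative links


@dataclass
class Thesaurus:
    concepts: dict[str, Concept] = field(default_factory=dict)

    def add(self, concept: Concept) -> None:
        self.concepts[concept.identifier] = concept

    def narrower(self, identifier: str) -> list[str]:
        """Derive narrower concepts from the stored broader links."""
        return sorted(
            c.identifier for c in self.concepts.values() if identifier in c.broader
        )

    def problems(self) -> list[str]:
        """Report violations of two illustrative quality constraints."""
        found = []
        for concept in self.concepts.values():
            # Assumed constraint: one preferred term per language per concept.
            preferred = Counter(t.lang for t in concept.terms if t.preferred)
            for lang in sorted({t.lang for t in concept.terms}):
                if preferred[lang] != 1:
                    found.append(
                        f"{concept.identifier}: expected 1 preferred term "
                        f"for '{lang}', found {preferred[lang]}"
                    )
            # Assumed constraint: every broader link points to a known concept.
            for target in sorted(concept.broader):
                if target not in self.concepts:
                    found.append(
                        f"{concept.identifier}: broader concept '{target}' is missing"
                    )
        return found


thesaurus = Thesaurus()
thesaurus.add(Concept("c1", [Term("animals", "en", preferred=True),
                             Term("Tiere", "de", preferred=True)]))
thesaurus.add(Concept("c2", [Term("cats", "en", preferred=True),
                             Term("felines", "en")], broader={"c1"}))
print(thesaurus.narrower("c1"))
print(thesaurus.problems())
```

Keeping relations between concepts rather than between terms mirrors the distinction the data model draws between terms and concepts. In the actual standard, constraints of this kind are expressed in the XML schema itself rather than in application code.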
A presentation at the recent NKOS 2011 workshop at TPDL in Berlin [5] focused on the XML schema and compared it with the SKOS schema, pointing out necessary SKOS extensions.
Part 2 of the new standard, “Interoperability with other vocabularies”, covers mapping between thesauri and other types of vocabulary, such as classification schemes, file plans for records management, taxonomies, subject heading schemes, ontologies, terminologies, name authority lists, and synonym rings.
As ISO DIS 25964-2 (Draft International Standard), it was submitted in mid-October for circulation among, and comments from, the member bodies of TC46/SC9 (Information and documentation / Identification and description).
Publication of Part 2 can be expected in 2012.
National standards bodies make the text available under differing policies. BSI will most probably make it available to everybody during a commenting period, free of charge and without a password (cf. [6]).
[1] http://metadaten-twr.org/2009/05/27/new-iso-thesaurus-standard-under-development/
[2] http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=53657
[3] http://metadaten-twr.org/2010/02/12/thesaurus-data-model-and-schema-in-isodis-25964-1-available-for-public-comment/
[4] http://www.niso.org/schemas/iso25964/
[5] http://www.comp.glam.ac.uk/pages/research/hypermedia/nkos/nkos2011/presentations/DextreClarke_DeSmedt_ISO25964.pdf
[6] http://metadaten-twr.org/2009/11/06/commenting-and-balloting-on-the-new-draft-international-thesaurus-standard-opened/