The KIM Technology Watch Report: http://metadaten-twr.org

Archive for February, 2010

Thesaurus data model and schema in ISO/DIS 25964-1 available for public comment

Friday, February 12th, 2010 by traugottkoch

ISO 25964 “Thesauri and interoperability with other vocabularies” is the first thesaurus standard explicitly featuring a data model for monolingual and multilingual thesauri, recommendations for exchange formats and protocols and an XML schema. Part 1: “Thesauri for information retrieval”, officially numbered ISO/DIS 25964-1, has been released as a draft available for public comment until the end of February 2010 (the ballot will end March 26).

Clause 15 describes the data model underpinning an XML schema, presented both as a UML diagram (Fig. 15) and in tabular format. The model accommodates the full range of options described in the standard. It captures the logical structure of thesaurus data and does not necessarily represent the way data is held within a given computer system. The five main classes are Thesaurus, ThesaurusConcept, ThesaurusTerm, ThesaurusArray and Note. How to view the text and comment on it has been described in an earlier blog message [1].
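To make the five classes more tangible, here is a rough Python sketch of how they might relate to one another. Only the class names come from the draft standard; every attribute name, the helper method preferred_label and the sample data are assumptions made for illustration, and the normative attributes and associations should be taken from Clause 15 itself.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Note:
    # Free-text note attached to a concept (attribute names are assumed).
    text: str
    lang: str = "en"


@dataclass
class ThesaurusTerm:
    # A term labels a concept in one language (attribute names are assumed).
    lexical_value: str
    lang: str = "en"


@dataclass
class ThesaurusConcept:
    identifier: str
    preferred_terms: List[ThesaurusTerm] = field(default_factory=list)
    non_preferred_terms: List[ThesaurusTerm] = field(default_factory=list)
    notes: List[Note] = field(default_factory=list)
    # Hierarchical link to broader concepts (assumed representation).
    broader: List["ThesaurusConcept"] = field(default_factory=list)


@dataclass
class ThesaurusArray:
    # A group of sibling concepts presented together (assumed representation).
    identifier: str
    members: List[ThesaurusConcept] = field(default_factory=list)


@dataclass
class Thesaurus:
    identifier: str
    concepts: List[ThesaurusConcept] = field(default_factory=list)
    arrays: List[ThesaurusArray] = field(default_factory=list)

    def preferred_label(self, concept_id: str, lang: str) -> Optional[str]:
        # Hypothetical helper: look up a concept's preferred term in one language.
        for concept in self.concepts:
            if concept.identifier == concept_id:
                for term in concept.preferred_terms:
                    if term.lang == lang:
                        return term.lexical_value
        return None


def build_example() -> Thesaurus:
    # Invented sample data: one bilingual concept with a broader concept.
    animals = ThesaurusConcept(
        identifier="c1",
        preferred_terms=[ThesaurusTerm("animals", "en"), ThesaurusTerm("Tiere", "de")],
    )
    cats = ThesaurusConcept(
        identifier="c2",
        preferred_terms=[ThesaurusTerm("cats", "en"), ThesaurusTerm("Katzen", "de")],
        non_preferred_terms=[ThesaurusTerm("felines", "en")],
        notes=[Note("Domestic cats only.", "en")],
        broader=[animals],
    )
    array = ThesaurusArray(identifier="a1", members=[cats])
    return Thesaurus(identifier="t1", concepts=[animals, cats], arrays=[array])


if __name__ == "__main__":
    thesaurus = build_example()
    print(thesaurus.preferred_label("c2", "de"))
```

Keeping terms separate from concepts, each with its own language tag, is what lets one concept carry labels in several languages, which matches the post's description of a model for both monolingual and multilingual thesauri.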

The XML schema for data exchange, derived from the data model, is included in the standard as an informative appendix (Annex B) and is available free of charge at [2]. The schema may be used for electronically transmitting a whole thesaurus or portions of one. Everybody is invited to give feedback on the draft schema: from that page, go to the comments page for the schema and click the “Add a Comment” link there. A test XML document using the schema is also available. All comments will be open for public viewing. Links to the schema are also provided at the ISO 25964 public project page [3].

One of the predecessors to the new thesaurus standard is BS 8723. A data model and XML schema developed in the context of that standard and documented in Part 5 (BS 8723-5), which deals with exchange formats and protocols for interoperability, are freely available for comparison at [4].

[1] http://metadaten-twr.org/2009/11/06/commenting-and-balloting-on-the-new-draft-international-thesaurus-standard-opened/
[2] http://www.niso.org/schemas/iso25964/
[3] http://www.niso.org/workrooms/iso25964/
[4] http://schemas.bs8723.org/Model.aspx



International Standard Name Identifier: An introduction

Wednesday, February 3rd, 2010 by Juha Hakala

Author: Juha Hakala

Director, IT development, The National Library of Finland

juha.hakala@helsinki.fi

Member of the ISNI working group

Abstract:
The International Organization for Standardization (ISO) [1] is developing the International Standard Name Identifier (ISO 27729) as a standard identifier for public identities of parties. At the time of writing (December 2009) the document has reached Draft International Standard status. Much information is available at http://www.isni.org/ [2], including a useful FAQ [3]. This article describes the standard and potential implementation challenges.
