ISO 25964

ISO 25964 is the international standard for thesauri, published in two parts as follows:

    ISO 25964  Information and documentation - Thesauri and interoperability with other vocabularies
         Part 1: Thesauri for information retrieval     [published August 2011]
         Part 2: Interoperability with other vocabularies     [published March 2013]

It was issued by ISO, the International Organization for Standardization. The standard's official website [1] is maintained by its secretariat at NISO, the US National Information Standards Organization. Each part of the standard can be purchased separately from ISO or from any of its national member bodies (such as ANSI, BSI, AFNOR, DIN, etc.). Some parts of it are available free of charge from the official website.

History

The first international standard for thesauri was ISO 2788, Guidelines for the establishment and development of monolingual thesauri, originally published in 1974 and updated in 1986. In 1985 it was joined by the complementary standard ISO 5964, Guidelines for the establishment and development of multilingual thesauri. Over the years ISO 2788 and ISO 5964 were adopted as national standards in several countries, for example Canada, France and the UK. In the UK they were given the alias numbers BS 5723 and BS 6723 respectively. Work to revise them for the networking needs of the new millennium began in the UK around the turn of the century. This resulted, during 2005-2008, in publication of the five-part British Standard BS 8723, as follows:

    BS 8723 Structured vocabularies for information retrieval - Guide
         Part 1: Definitions, symbols and abbreviations
         Part 2: Thesauri
         Part 3: Vocabularies other than thesauri
         Part 4: Interoperability between vocabularies
         Part 5: Exchange formats and protocols for interoperability

Even before the last part of BS 8723 was published, work began to adopt and adapt it as an international standard to replace ISO 2788 and ISO 5964. The project was led by a Working Group of ISO's Technical Committee 46 (Information and documentation) Subcommittee 9 (Identification and description) known as “ISO TC46/SC9/WG8 Structured Vocabularies”.

ISO 2788 and ISO 5964 were withdrawn in 2011, when they were replaced by the first part of ISO 25964. The second part of ISO 25964 was issued in March 2013, completing the project.

Aims and Scope

ISO 25964 is for thesauri intended to support information retrieval, and specifically to guide the choice of terms used in indexing, tagging and search queries.

The primary objective is thus summarised in the introduction to the standard as:

Whereas most of the applications envisaged for ISO 2788 and ISO 5964 were databases in a single domain, often in-house or for paper-based systems, ISO 25964 provides additional guidance for the new context of networked applications, including the Semantic Web. A thesaurus is one among several types of controlled vocabulary used in this context.

ISO 25964 Part 1

A thesaurus compliant with ISO 25964-1 (as Part 1 is known) lists all the concepts available for indexing in a given context, and labels each of them with a preferred term, as well as any synonyms that apply. Relationships between the concepts and between the terms are shown, making it easy to navigate around the field while building up a search query. The main types of relationship include:

  • equivalence (between synonyms and near-synonyms e.g. motor-bikes, motor-cycles and motorcycles)
  • hierarchical (between broader and narrower concepts e.g. flowers and roses)
  • associative (between concepts that are closely related in some non-hierarchical way, e.g. between a disease and the virus that causes that disease)
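
The three relationship types above can be sketched as a small in-memory structure. This is only an illustration in Python, not the data model defined in ISO 25964-1: the class and method names are hypothetical, the choice of "motorcycles" as the preferred term is an assumption, and the disease and virus terms are invented for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One concept, labelled by a preferred term plus any synonyms."""
    preferred_term: str
    non_preferred_terms: set[str] = field(default_factory=set)  # equivalence
    broader: set[str] = field(default_factory=set)              # hierarchical, upward
    narrower: set[str] = field(default_factory=set)             # hierarchical, downward
    related: set[str] = field(default_factory=set)              # associative


class Thesaurus:
    """Hypothetical container; concepts are keyed by their preferred term."""

    def __init__(self):
        self.concepts: dict[str, Concept] = {}
        self._entry_terms: dict[str, str] = {}  # any term -> preferred term

    def add_concept(self, preferred_term, synonyms=()):
        concept = Concept(preferred_term, set(synonyms))
        self.concepts[preferred_term] = concept
        for term in (preferred_term, *synonyms):
            self._entry_terms[term.lower()] = preferred_term
        return concept

    def add_hierarchy(self, broader, narrower):
        # Record the link in both directions so navigation works either way.
        self.concepts[broader].narrower.add(narrower)
        self.concepts[narrower].broader.add(broader)

    def add_association(self, a, b):
        # Associative links are symmetric.
        self.concepts[a].related.add(b)
        self.concepts[b].related.add(a)

    def lookup(self, term):
        """Return the preferred term for any entry term, or None if unknown."""
        return self._entry_terms.get(term.lower())


thesaurus = Thesaurus()
thesaurus.add_concept("motorcycles", synonyms=["motor-bikes", "motor-cycles"])
thesaurus.add_concept("flowers")
thesaurus.add_concept("roses")
thesaurus.add_concept("influenza")          # invented example terms
thesaurus.add_concept("influenza viruses")
thesaurus.add_hierarchy("flowers", "roses")
thesaurus.add_association("influenza", "influenza viruses")
```

With this structure, an indexer or searcher who enters "motor-bikes" is led to the preferred term through `thesaurus.lookup("motor-bikes")`, and can then follow the broader, narrower and related links from there.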

In multilingual thesauri equivalence also applies between corresponding terms in different natural languages. Establishing correspondence is not always easy, and the standard provides recommendations for handling the difficulties that commonly arise.

ISO 25964-1 explains how to build a monolingual or a multilingual thesaurus, how to display it, and how to manage its development. It includes a data model for handling thesaurus data (especially when exchanging data between systems) and an XML schema for encoding the data. Both the model and the schema are available free of charge on the official website hosted by NISO. The standard also sets out the features to look for when choosing software to manage the thesaurus.

ISO 25964 Part 2

ISO 25964-2 deals with the challenges of using one thesaurus in combination with another, and/or with some other type of controlled vocabulary or knowledge organization system (KOS). The types covered include classification schemes, taxonomies, subject heading schemes, ontologies, name authority lists, terminologies and synonym rings. Within a single organization it is common to find several different such KOSs used in contexts such as the records management system, the library catalogue, the corporate intranet, the research lab, etc. To help users with the challenge of running a single search across all the available collections, ISO 25964-2 provides guidance on mapping between the terms and concepts of one thesaurus and those of the other KOSs. Where mapping is not a sensible option, the standard recommends other forms of complementary vocabulary use.
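
As a rough illustration of how mappings support a single search across several collections, the following Python sketch translates one thesaurus term into the terms used by other KOSs. Everything in it is an assumption made for the example: the `Mapping` class, the `translate_query` function, the vocabulary names and the "equivalent" and "broader" labels are hypothetical and are not taken from the text of the standard.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mapping:
    source_term: str   # term in the source thesaurus
    target_vocab: str  # name of the target KOS (hypothetical)
    target_term: str   # term or notation used in the target KOS
    kind: str          # illustrative label, e.g. "equivalent" or "broader"


def translate_query(term, mappings, allowed_kinds=("equivalent",)):
    """Return {target_vocab: [target_term, ...]} for one search term."""
    result = {}
    for m in mappings:
        if m.source_term == term and m.kind in allowed_kinds:
            result.setdefault(m.target_vocab, []).append(m.target_term)
    return result


mappings = [
    Mapping("roses", "library_catalogue", "Roses", "equivalent"),
    Mapping("roses", "records_system", "Garden plants", "broader"),
    Mapping("roses", "intranet_taxonomy", "Plants/Flowers/Roses", "equivalent"),
]

# Strict search: only follow mappings labelled as equivalent.
strict = translate_query("roses", mappings)

# Relaxed search: also accept broader targets where no equivalent exists.
relaxed = translate_query("roses", mappings, allowed_kinds=("equivalent", "broader"))
```

Keeping the kind of mapping alongside each entry lets the search application decide how much imprecision to tolerate, instead of treating every mapped term as an exact match.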

Similarly on the Internet there is an opportunity to make a simultaneous search of repositories and databases that have been indexed with different KOSs, on an even wider scale. Interoperability between the different networks, platforms, software applications, and languages (both natural and artificial) is reliant on the adoption of numerous protocols and standards. ISO 25964-2 is the one to address interoperability between structured vocabularies, especially where a thesaurus is involved.

Related standards

Since Part 1 of ISO 25964 was published, it has been adopted by the national standards bodies in a number of countries. For example, the British Standards Institution (BSI) in the UK has adopted it unchanged under the label BS ISO 25964-1. At the time of writing, similar consideration is under way for Part 2.

The American standard ANSI/NISO Z39.19, Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies, covers some of the same ground as ISO 25964-1. It deals with monolingual lists, synonym rings and taxonomies as well as thesauri, but does not provide a data model, nor does it address multilingual vocabularies or other aspects of interoperability, such as mapping between KOSs. Where the two standards overlap, they are broadly compatible with each other. NISO is actively involved in both standards, having participated in the work of developing ISO 25964 as well as running its secretariat.

The W3C Recommendation SKOS (Simple Knowledge Organization System) has a close relationship with ISO 25964 in the context of the Semantic Web. SKOS applies to all sorts of “simple KOSs” that can be found on the Web, including thesauri and others. Whereas ISO 25964-1 advises on the selection and fitting together of concepts, terms and relationships to make a good thesaurus, SKOS addresses the next step: porting the thesaurus to the Web. And whereas ISO 25964-2 recommends the sort of mappings that can be established between one KOS and another, SKOS presents a way of expressing the mappings when published to the Web.
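
To make the division of labour with SKOS more concrete, the following Python sketch writes one thesaurus concept as SKOS in Turtle syntax. The properties used (skos:prefLabel, skos:altLabel, skos:broader, skos:exactMatch) belong to the SKOS vocabulary, but the helper function, the `ex:` namespace, the example URIs and the alternative label are assumptions made for illustration. The helper does no escaping, so labels must not contain double quotes.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."
EX_PREFIX = "@prefix ex: <http://example.org/thesaurus/> ."  # hypothetical namespace


def concept_to_turtle(local_id, pref_label, alt_labels=(), broader=(),
                      exact_match=(), lang="en"):
    """Serialise one concept as a Turtle statement block (illustrative only)."""
    lines = [f"ex:{local_id} a skos:Concept ;"]
    lines.append(f'    skos:prefLabel "{pref_label}"@{lang} ;')
    for label in alt_labels:
        lines.append(f'    skos:altLabel "{label}"@{lang} ;')
    for parent in broader:
        lines.append(f"    skos:broader ex:{parent} ;")
    for uri in exact_match:
        lines.append(f"    skos:exactMatch <{uri}> ;")
    # Turtle ends the final statement with "." instead of ";".
    lines[-1] = lines[-1][:-1] + "."
    return "\n".join(lines)


turtle = concept_to_turtle(
    "roses",
    "roses",
    alt_labels=["rose bushes"],                          # invented synonym
    broader=["flowers"],
    exact_match=["http://example.org/other-kos/R123"],  # invented target URI
)
document = "\n".join([SKOS_PREFIX, EX_PREFIX, "", turtle])
print(document)
```

Here the labels and the broader link correspond to the structure that ISO 25964-1 describes, while the skos:exactMatch statement expresses a mapping of the kind covered by ISO 25964-2.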

References

  1. ISO 25964 – the international standard for thesauri and interoperability with other vocabularies

ISO 2788

ISO 2788 was the ISO international standard for monolingual thesauri for information retrieval, first published in 1974 and revised in 1986. The official title of the standard was "Guidelines for the establishment and development of monolingual thesauri".

It was withdrawn in 2011 and replaced by ISO 25964-1.

ISO 5964

ISO 5964 was the ISO standard for the establishment and development of multilingual thesauri. Its full title was "Guidelines for the establishment and development of multilingual thesauri". It was withdrawn in 2011, when it was replaced by ISO 25964-1. Further explanation is available on the official website for ISO 25964.

Simple Knowledge Organization System

Simple Knowledge Organization System (SKOS) is a W3C recommendation designed for representation of thesauri, classification schemes, taxonomies, subject-heading systems, or any other type of structured controlled vocabulary. SKOS is part of the Semantic Web family of standards built upon RDF and RDFS, and its main objective is to enable easy publication and use of such vocabularies as linked data.

Thesaurus (information retrieval)

In the context of information retrieval, a thesaurus (plural: "thesauri") is a form of controlled vocabulary that seeks to dictate semantic manifestations of metadata in the indexing of content objects. A thesaurus serves to minimise semantic ambiguity by ensuring uniformity and consistency in the storage and retrieval of the manifestations of content objects. ANSI/NISO Z39.19-2005 defines a content object as "any item that is to be described for inclusion in an information retrieval system, website, or other source of information". The thesaurus aids the assignment of preferred terms to convey semantic metadata associated with the content object. A thesaurus serves to guide both an indexer and a searcher in selecting the same preferred term or combination of preferred terms to represent a given subject. ISO 25964, the international standard for information retrieval thesauri, defines a thesaurus as a “controlled and structured vocabulary in which concepts are represented by terms, organized so that relationships between concepts are made explicit, and preferred terms are accompanied by lead-in entries for synonyms or quasi-synonyms.”

A thesaurus is composed of at least three elements: (1) a list of words (or terms); (2) the relationships among the words (or terms), indicated by their relative position in the hierarchy (e.g. parent/broader term, child/narrower term, synonym, etc.); and (3) a set of rules on how to use the thesaurus.


This page is based on a Wikipedia article.
Text is available under the CC BY-SA 3.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.