Skip to end of metadata
Go to start of metadata

DDEX standards allow to include an XML attribute called LanguageAndScriptCode on most free-form string fields as well as composites that contain free-form string fields. The definition of this field is  "The Language and script […] as defined in IETF RfC 5646(*). The default is the same as indicated for the containing composite. Language and Script are provided as lang[-scipt][-region][-variant]. This is represented in an XML schema as an XML Attribute." The overall default is "en" (for English).


It is important to note that the language is independent from the territory for which information is provided. One example is the Swiss Army's main band. It can be communicated as a Resource Contributor as follows:

...
<SoundRecordingDetailsByTerritory>
  <TerritoryCode>Worldwide<TerritoryCode>
  ...
  <ResourceContributor>
    <PartyName LanguageAndScriptCode="en">Swiss Army Band Representation Orchestra</PartyName>
    ...
  <ResourceContributor>
  ...
</SoundRecordingDetailsByTerritory>
<SoundRecordingDetailsByTerritory>
  <TerritoryCode>CH<TerritoryCode>
  ...
    <ResourceContributor>
    <PartyName LanguageAndScriptCode="de"> 
       <FullName>Repräsentationsorchester Schweizer Armeespiel</FullName
    </PartyName>
    <PartyName LanguageAndScriptCode="fr"
       <FullName>L'orchestre de représentation de l'Armée suisse</FullName>
    </PartyName>
    <PartyName LanguageAndScriptCode="it">
        <FullName>Orchestra di rappresentanza</FullName
    </PartyName>
  <ResourceContributor>
  ...
</SoundRecordingDetailsByTerritory>
...

Note: the example is for ERN-3, but applies equally for ERN-4 and other standards.


Other examples for the the LanguageAndScriptCode attribute are listed on http://tools.ietf.org/html/rfc5646#page-80. These examples include:

  • ja (Japanese)
  • zh-Hant (Chinese written using the Traditional Chinese script)
  • zh-Hans (Chinese written using the Simplified Chinese script)
  • zh-Hans-CN (Chinese written using the Simplified script as used in mainland China)
  • sr-Cyrl (Serbian written using the Cyrillic script)
  • sr-Latn (Serbian written using the Latin script)
  • sl-rozaj-biske (San Giorgio dialect of Resian dialect of Slovenian)
  • de-CH-1901 (German as used in Switzerland using the 1901 variant)
  • es-419 (Spanish appropriate for the Latin America and Caribbean region using the UN region code)

 

Note that allowed value sets – such as role codes – are not affected by the LanguageAndScriptCode:

<DisplayArtist LanguageAndScriptCode="jp">
   <PartyName>
     <FullName>ホリエ ユイ</FullName>
   </PartyName>
   <ArtistRole>MainArtist</ArtistRole>
</DisplayArtist>


(*) RfC 5646 has replaced RfC 4646 which was used in older versions of DDEX standards. RfC 5646 specifically adds support for dialects.

  • No labels

1 Comment

  1. Anonymous

    Finding these codes with the ISO numbers is kind of hard to do. Here are some direct links to some lists to help you determine what ISO combinations you may need:

    ISO 639-1: https://en.wikipedia.org/wiki/ISO_639-1

    ISO 15924 : https://en.wikipedia.org/wiki/ISO_15924

    ISO 3166-1: https://en.wikipedia.org/wiki/ISO_3166-1