ARIB STD B24 character set
Volume 1 of the Association of Radio Industries and Businesses (ARIB) STD-B24 standard for Broadcast Markup Language specifies, among other details, a character encoding for use in Japanese-language broadcasting. The latest revision is version 6.3.
It includes a number of ARIB extended characters not found in the base standards. It was the source standard for many symbol characters which were added to Unicode, including portions of the Miscellaneous Symbols, Enclosed Alphanumeric Supplement and Enclosed Ideographic Supplement blocks. Its contributions partially overlap the Unicode emoji, but were added a year earlier, in Unicode 5.2.
The ARIB STD-B62 standard, published in 2014, defines Unicode mappings for a selection of the B24 extended characters, as well as a few extended Kanji. It also includes a mapping of utilised characters outside the Basic Multilingual Plane to the BMP's private use area.
Sets and codes
The ARIB STD B24 standard defines multiple character sets and a method of switching between them. These include a Kanji set, an Alphanumeric set, a Hiragana set, Katakana sets of two distinct layouts and four mosaic sets. The sets are selected using ISO 2022 mechanisms for 94-sets, using the following codes:
Set | Type | Code (column/row) | Code (hex) | Code (ASCII) | Comments |
Kanji | 2-byte | 4/2 | 42 | B | The escape code B used for the ARIB Kanji set is used for the 1983 version of JIS C 6226 in ISO-2022-JP. |
Alphanumeric | 1-byte | 4/10 | 4A | J | JIS_C6220-ro. Similar to ASCII, with two assignments differing. Escape code J matches usage in ISO-2022-JP. |
Proportional alphanumeric | 1-byte | 3/6 | 36 | 6 | Same layout as the Alphanumeric set (JIS_C6220-ro): similar to ASCII, with two assignments differing. |
Hiragana | 1-byte | 3/0 | 30 | 0 | Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation. |
Proportional Hiragana | 1-byte | 3/7 | 37 | 7 | Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation. |
Katakana | 1-byte | 3/1 | 31 | 1 | Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation. |
Proportional Katakana | 1-byte | 3/8 | 38 | 8 | Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation. |
JIS X 0201 Katakana | 1-byte | 4/9 | 49 | I | JIS_C6220-jp. Escape code matches usage in ISO-2022-JP-3. |
Mosaic A | 1-byte | 3/2 | 32 | 2 | Pseudographics |
Mosaic B | 1-byte | 3/3 | 33 | 3 | Pseudographics |
Mosaic C | 1-byte | 3/4 | 34 | 4 | Non-spacing pseudographics |
Mosaic D | 1-byte | 3/5 | 35 | 5 | Non-spacing pseudographics |
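The three code columns in the table are the same byte written three ways: ISO 2022 column/row notation, hexadecimal, and the corresponding ASCII character. The following is a minimal sketch of that relationship, not an implementation of the standard. The names `SETS`, `column_row_to_byte`, `final_byte_forms` and `designate_g0` are hypothetical, and `designate_g0` assumes the generic ISO 2022 form ESC 2/8 F for designating a 1-byte 94-set to G0; the text above does not spell out the escape sequences B24 actually uses.

```python
ESC = 0x1B

# (bytes per character, final byte in column/row notation), copied from the table.
SETS = {
    "Kanji": (2, "4/2"),
    "Alphanumeric": (1, "4/10"),
    "Proportional alphanumeric": (1, "3/6"),
    "Hiragana": (1, "3/0"),
    "Proportional Hiragana": (1, "3/7"),
    "Katakana": (1, "3/1"),
    "Proportional Katakana": (1, "3/8"),
    "JIS X 0201 Katakana": (1, "4/9"),
    "Mosaic A": (1, "3/2"),
    "Mosaic B": (1, "3/3"),
    "Mosaic C": (1, "3/4"),
    "Mosaic D": (1, "3/5"),
}


def column_row_to_byte(notation: str) -> int:
    """Convert column/row notation such as '4/10' to a byte value (0x4A)."""
    column_text, row_text = notation.split("/")
    column, row = int(column_text), int(row_text)
    if not (0 <= column <= 15 and 0 <= row <= 15):
        raise ValueError(f"column and row must each be in 0..15: {notation!r}")
    return column * 16 + row


def final_byte_forms(set_name: str) -> tuple[str, str]:
    """Return the hex and ASCII forms of a set's final byte."""
    value = column_row_to_byte(SETS[set_name][1])
    return f"{value:02X}", chr(value)


def designate_g0(set_name: str) -> bytes:
    """Assumption: generic ISO 2022 'ESC 2/8 F' designation of a 1-byte 94-set to G0."""
    width, notation = SETS[set_name]
    if width != 1:
        raise ValueError("this sketch only covers 1-byte sets")
    return bytes([ESC, column_row_to_byte("2/8"), column_row_to_byte(notation)])
```

For example, `final_byte_forms("Alphanumeric")` gives the hex and ASCII cells of the Alphanumeric row, and `designate_g0("Hiragana")` yields the three-byte sequence ESC, `(`, `0` under the stated assumption. The 2-byte Kanji set is deliberately rejected by `designate_g0`, since multi-byte sets use a different escape form.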