CJKV VBA IME for MS Word v.3.2 English page

Google+

CJKV VBA IME for MS Word
version 3.2

Instructions for use

With these macros you may enter any double-byte Asian characters and make simple conversion of texts for East Asian languages. These macros are includes: Chinese, Japanese, Korean, and Vietnamese units in the dot-format files (MS Word templates).

Technical requirements

For properly work of these macros you need the MS Word software installed on your PC with MS Office version not 2000 earlier. These macros have no digital signature, so after installation, you need change to low security of your MS Word for work with VBA-macros, for example: (Tools => Macro => Security => Low). After run of MS Word or after open any of the dot-files, you will see a new toolbars in MS Word application. The binary files and conversion tables program path is fixed and determined as C:\ CJKV_VBA.

1. Chinese Input Methods for MS Word.

1.1 List of IME

Input methods (character set) for Simplified Chinese (GB2312):

Phonetic (Mandarin)

GuoBiao (JianTi) PinYin - standard Pinyin with using digitals for tones and input letter «v» instead of «ü». Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is Latin letters. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

For the Pinyin IMEs (GB and Big5) few variants of Latinized Chinese transcription systems were added as a button control for a switching. Five transcription systems variants were included for Standard Chinese phonemes text input. By the cyclic switching of control button at the right side of text edit field you can choice appropriate transcription system what you need. After each clicking the color of button will be change accordingly with follows systems list:

•    Standard Pinyin (gray)
•    Wade-Giles Romanization (green)
•    MPS II (red)
•    Yale Romanization (yellow)
•    Tongyong PY (purple)
•    Gwoyeu Romatzyh (blue), after phoneme input, you need to press space bar.

The without tones words input is available only in Standard Pinyin mode (gray button color).

GuoBiao (JianTi) ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first key press is corresponds to PinYin initial. The second key press is corresponds to PinYin medial+final. Character set is GB2312. Simplified Chinese characters are available. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

GuoBiao (JianTi) ZhuYin - input based on Zhuyin Zimu alphabet. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is single ideographic characters only.

GuoBiao (FanTi) PinYin - standard Pinyin, with using digitals for tones and input letter «v» instead of «ü». Character set is GB2312. Traditional Chinese characters are available. Keyboard layout is Latin letters. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

GuoBiao (FanTi) ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first key press is corresponds to PinYin initial. The second key press is corresponds to PinYin medial+final. Character set is GB2312. Traditional Chinese characters are available. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

GuoBiao (FanTi) ZhuYin - input based on Zhuyin Zimu alphabet. Character set is GB2312. Traditional Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is single ideographic characters only.

Structural methods

GuoBiao (JianTi) CangJie - input with CangJie system. The maximum length of the input string is 5 characters. Character set - GB2312. Simplified Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is single input of ideographic character only.

GuoBiao (JianTi) CangJie (exact) - input with CangJie system. The maximum length of the input string is 5 characters. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only.

GuoBiao (JianTi) WuBi - input with Wubi system. The maximum length of the input string is 4 characters. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard WuBi. Input modes is type of single ideographic character only.

Phonetic (Dialects)

GuoBiao (JT/FT) Shanghai IME – this is an input with standard Latin transcription system for Shanghainese with the numerical tones designation (0-5). Character set is GB2312. In the ideographs choice list you can see both simplified and traditional hanzi. For this input method the keyboard layout is standard English. The input mode is type of single ideographic character only.

GuoBiao (JT/FT) Wu IME – this is an input with generalized Latin transcription system for Wu dialects without any numerical tones designation. Character set is GB2312. In the ideographs choice list you can see both simplified and traditional hanzi. For this input method the keyboard layout is standard English. The input mode is type of single ideographic character only. For activation the list of choice you need just press space bar after input of Wu phoneme.

Input methods (character set) for Traditional Chinese:

Phonetic (Mandarin)

Big5 PinYin - standard Pinyin, with using digitals for tones and input letter «v» instead of «ü». Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Latin letters. Text input mode is typing of single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

Big5 ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first keypress is corresponds to PinYin initial. The second keypress is corresponds to PinYin medial+final. Character set is BIG5. Traditional Chinese characters are available. Text input mode is typing of single ideographic characters and you can also input some Chinese words (tone digits is not use for it).

Big5 ZhuYin - input based on Zhuyin Zimu alphabet. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is type of single ideographic character only.

Structural methods

Big5 CangJie - input with CangJie system. The maximum length of the input string is 5 characters. Character set - BIG5. Traditional Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is type of single ideographic character only.

Big5 CangJie (exact) - input with CangJie system. The maximum length of the input string is 5 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only.

Big5 Array30 - input with Array30 system. The maximum length of the input string is 4 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Array30. The selection of characters for the input list is performs by matching of the entered part of the coding Array30 stroke. The input mode is type of single ideographic character only.

Big5 Boshiamy - input with Boshiamy system. The maximum length of the input string is 4 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Boshiamy. The selection of characters for the input list is performs by matching of the entered part of the coding Boshiamy stroke. The input mode is type of single ideographic character only.

Big5 DaYi (4) - input with DaYi system. The maximum length of the input string is 4 characters. Character set - BIG5. Traditional Chinese characters are available. Keyboard layout is DaYi. The selection of characters for the input list is performs by matching of the entered part of the coding DaYi stroke. The input mode is type of single ideographic character only.

Phonetic methods (Chinese dialects)

Big5 Cantonese PY - input with Cantonese transcription without tone digits. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Latin. The input mode is type of single ideographic character only.

Big5 Hakka PY - input with transcription for Hakka dialects, with digital marking of tones (1-6). Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Latin. The input mode is type of single ideographic character only.

Input methods for UNICODE CJK extensions

PinYin IME (CJK Unified + ExtA) - input with Standard Pinyin with digital Chinese tones designation (1-5) and with letter «v» instead of «ü». A set of characters for enter is Unicode: CJK Unified + CJK ExtA. Keyboard layout is standard Latin. The input mode is type of single ideographic character only.

JyutPing IME (CJK Unified + ExtA) - input with Standard Jyutping with digital Chinese tones designation (1-6). A set of characters for enter is Unicode: CJK Unified + CJK ExtA. Keyboard layout is standard Latin. The input mode is type of single ideographic character only.

CangJie IME (CJK Unified + ExtA) - input with CangJie system. The maximum length of the input string is 5 characters. A set of characters for enter is Unicode: CJK Unified + CJK ExtA. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is type of single ideographic character only.

CangJie IME (CJK Unified + ExtA) exact - input with CangJie system. The maximum length of the input string is 5 characters. A set of characters for enter is Unicode: CJK Unified + CJK ExtA. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only.

CangJie IME (ExtB) - input with CangJie system. The maximum length of the input string is 5 characters. A set of characters to enter is Unicode CJK ExtB. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is type of single ideographic character only.

CangJie IME (ExtB) exact - input with CangJie system. The maximum length of the input string is 5 characters. A set of characters to enter - Unicode CJK ExtB. Keyboard layout - standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only.

1.2 Conversion of text

For conversion you need just select the text and press the button with macro.

Mandarin

HanZi to PinYin (CJK Unified + ExtA) - selected characters will be converted to Pinyin transcription with digital marking of Chinese tones and with text separator such as space, for example:

[HZ1] [HZ2] [HZ3] -> pinyin1 pinyin2 pinyin3

If the Chinese character has more than one reading, the alternate versions of the transcription will be enclosed in parentheses with comma separator inside, for example:

[HZ1] [HZ2] [HZ3] -> pinyin1 (pinyin2_1, pinyin2_2, pinyin2_3) pinyin3
For conversion of Chinese ideographs are available following Unicode ranges: CJK Unified + CJK ExtA.
For example:
我喜欢电脑游戏
wo3 xi3 huan1 dian4 nao3 you2 (xi4 , hu1 , hui1)

HZ Colored Tones – this text function is converts uncolored selected hanzi into colored same string with specified color for each ideograph. A color corresponded with one of the four main Putonghua tones. Black color is the first tone, red is the second tone, blue is third tone and green color is fourth tone. Blinking pink color is means that the ideograph has a more one tone in readings. Purple color is means that was occurred sandhi conversion of two ideographs reading repeatedly with third tones both (without words analysis).

For example:
我喜欢电脑游戏
我喜欢电脑游戏

1.2.1 Conversion with other Pinyin standards

PinYin to…, …to PinYin – this is functions for direct and inverse conversion of standard Pinyin phoneme and other Pinyin standard transcriptions. There are few standards in macros:

•    Wade-Giles (WD)
•    Yale Romanization (YL)
•    MPS II (MPS)
•    TongYong Pinyin (TY)
•    Gwoyeu Romatzyh (GW)

To conversion you need select phonemes in text and with a space divider, after you need to press one of the buttons with macros on the submenu.

For example (PY->Yale Romanization):
wo3 xue2 xi2 zhong1 wen2
wo3 sywe2 syi2 jung1 wen2

PinYin to IPA - this function is converts Pinyin phonemes with a digital Chinese tones designation to the symbols of IPA system.

For example:
wo3 xi3 huan1 dian4 nao3 you2 xi4
wuɔ²¹⁴ ɕʅ²¹⁴ huɑn⁵⁵ tiɛn⁵¹ nɑʊ²¹⁴ jɤʊ¹⁵ ɕʅ⁵¹

PY to ZY - this function is converts Pinyin phoneme with digital Chinese tones designation in the stroke of Zhuyin Zimu alphabet.

For example:
wo3 xi3 huan1 dian4 nao3 you2 xi4
ㄨㄛˇ ㄒㄧˇ ㄏㄨㄢㄉㄧㄢˋ ㄋㄠˇ ㄧㄡˊ ㄒㄧˋ

PY to ZY (fixed) - this function is converts Pinyin phoneme with digital Chinese tones designation in the stroke of Zhuyin Zimu alphabet with fixed positions for four character cells in resulting string. I.e. if the medial phoneme zero, it will be inserted instead by the Asian monospace gap for aligning.

For example:
wo3 xi3 huan1 dian4 nao3
|　ㄨㄛˇ|ㄒㄧ　ˇ| ㄏㄨㄢ　|ㄉㄧㄢˋ| ㄋ　ㄠˇ|

PY to RU - this function is converts Pinyin phonemes with a digital Chinese tones designation in the Palladius Cyrillic transcription system stroke with digital symbolic for tones designation.

For example:
wo3 xi3 huan1 dian4 nao3 you2 xi4
во3 си3 хуань1 дянь4 нао3 ю2 си4

PY to PY (tone marks) - this function is converts Pinyin phonemes with the digital designation of Chinese tones in the stroke of standard Putonghua Pinyin with diacritical tone symbols placed under the vowels.

For example:
wo3 xi3 huan1 dian4 nao3 you2 xi4
wǒ xǐ huān diàn nǎo yóu xì

PY (tone marks) to PY - this function is converts string of standard Putonghua Pinyin with diacritical tone symbols placed under the vowels in the equivalent Pinyin stroke with a digital designation of Chinese tone.

For example:
wǒ xǐ huān diàn nǎo yóu xì
wo3 xi3 huan1 dian4 nao3 you2 xi4

ZY to PY (only one phoneme) - this function is converts a single phoneme text typed in Zhuyin Zimu alphabet into a phoneme typed with a standard Pinyin and with digital designation of Chinese tone.

For example:
ㄉㄧㄢˋ => dian4

Kill Tones - this function is removes symbols indicated the tone designations used in transcription system PinYin or ZhuYin.

For example:
wo3 xi3 huan1 dian4 nao3 you2 xi4
(or)
wǒ xǐ huān diàn nǎo yóu xì
---------------------------------
wo xi huan dian nao you xi

Cantonese

HanZi to JyutPing (CJK Unified + ExtA) - selected characters will be converted to Jyutping transcription with digital designation of Cantonese tone and with text separator such as space, for example:

[HZ1] [HZ2] [HZ3] -> jyutping1 jyutping2 jyutping3

If the character has more than one reading, the alternate versions of the transcription will be enclosed in parentheses with comma separator inside, for example:

[HZ1] [HZ2] [HZ3] -> jyutping1 (jyutping2_1, jyutping2_2) jyutping3
Characters that are available for conversion are in Unicode: CJK Unified + CJK ExtA.
For example:
我喜欢电脑游戏
ngo5 hei2 fun1 din6 nou5 jau4 (fu1, hei3)

JyutPing to IPA (GD) - this function converts a digital transcription Jyutping with digital designation of Cantonese tone into IPA string (Guandong)

For example:
baak1 ging1 si6 zung1 gwok3 dik1 sau2 dou1
pa:k⁵ kiŋ⁵⁵ si¹¹ tɕʊŋ⁵⁵ kʷɔk³ tik⁵ sɐu²⁵ tou⁵⁵

JyutPing to IPA (HK) - this function converts a digital transcription Jyutping with digital designation of Cantonese tone into IPA string (HK)

For example:
baak1 ging1 si6 zung1 gwok3 dik1 sau2 dou1
pa:k⁵ kiŋ⁵⁵ si¹¹ tsʊŋ⁵⁵ kʷɔk³ tik⁵ sɐu²⁵ tou⁵⁵

JyutPing to RU - this function converts Jyutping with digital indication of Cantonese tone into Cyrillic transcription for Cantonese.

For example:
ngo5 hei2 fun1 din6 nou5 jau4 fu1
нго5 хэй2 фунь1 тинь6 ноу5 яу4 фу1

Ideographs conversion

FanTi to JianTi (GuoBiao) - this function is converts the traditional Chinese characters into simplified form (GB2312 charset only).

For example:
我喜歡電腦游戲
我喜欢电脑游戏

JianTi to FanTi (GuoBiao) - this function is converts the simplified Chinese characters into traditional form (GB2312 charset only).

For example:
我喜欢电脑游戏
我喜歡電腦游戲

HZ to Radical (CJK Unified + ExtA) - this function is converts Chinese characters from the ranges Unicode: CJK Unified + CJK ExtA into Bushou character from the list of 214 Kang Xi radicals. The radical characters of Chinese ideographs are determined in accordance with sorting within the Unicode standard.

For example:
北京是中国的首都
匕亠日丨囗白首邑

HZ to Semantic Variant - this function is converts a Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into semantic variant.

For example:
简 => 耕

HZ to JianTi Variant - this function is converts Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into a simplified form.

For example:
簡體漢字 => 简体汉字

HZ to SpecSimpl Variant - this function is converts Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into a special simplified form (for example: Japanese shintaiji, ryakuji).

For example:
龍 => 竜

HZ to FanTi Variant - this function is converts Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into a traditional form.

For example:
简体汉字 =>簡體漢字

HZ to Z-Variant - this function is converts Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into a Z-variant form.

For example:
繁 => 緐

CJK HZ to Compatibility Char - this function is converts Chinese character from the ranges of Unicode: CJK Unified + CJK ExtA into a compatibility form.

Compatibility Char to CJK HZ - this function is converts compatibility form of Chinese character into a usual character form from Unicode ranges: CJK Unified + CJK ExtA.

Conversions for CangJie

ABC to CJ Code - this text function is converts the Latin alphabet characters into the hieroglyphic key symbols of CangJie input system.

For example:
LMP YRF
中一心卜口火

CJ code to ABC - this text function is converts the hieroglyphic key symbols of CangJie input system in the Latin alphabet characters.

For example:
中一心卜口火
LMP YRF

HZ to CJ Code (ABC) (CJK Unified + ExtA) - this text function is converts the Chinese characters from the ranges Unicode: CJK Unified + CJK ExtA into CangJie system encoding string using Latin alphabet symbols.

For example:
北京 => LMP YRF

Other

HanZi to Unicode (Dec) - this function is converts selected Chinese characters in decimal code. After conversion between obtained numerical values the space separators will be placed.

For example:
北京是中国的首都
21271 20140 26159 20013 22269 30340 39318 37117

HanZi to Unicode (Hex) - this function is converts selected Chinese characters in hexadecimal code. After conversion between obtained numerical values the space separators will be placed.

For example:
北京是中国的首都
5317 4EAC 662F 4E2D 56FD 7684 9996 90FD

1.3 CJK-decomposition module

As a main base for determining of Chinese ideographs decomposition the Ideographic Description Sequence (U+2FF0 – U+2FFB) was adopted. However for some decomposition types the interpretation was a little bit changed. It was cased due a strongly binary conception for Chinese Ideographs composition schemes which historically were formed as a simple formula: radical + phonetic. And now decomposition must be described by following components:

•    first symbol – radical (or more complex symbol with radical);
•    ideographic decomposition symbol;
•    second symbol – usually phonetic (or just second component);
•    radical order in ideograph (first or last);
•    extended decomposition for first symbol (if absent in Unicode);
•    extended decomposition for second symbol (if absent in Unicode);

Extended decomposition is if the first or the second symbols there are not included in Unicode standard.

CJK Full Decomposition – text conversion function which performs full decomposition of Chinese ideograph. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
愂=((((十⿱冖)⿱子)⿰力)⿱心)
惕=忄⿰(日⿱(勹⿹(丿⿰丿)))

CJK Full Decomposition (rad simpl) – text conversion function which performs full decomposition of Chinese ideograph. Graphical variants of any radicals will be simplified to the main radical form. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
惕=心⿰(日⿱(勹⿹(丿⿰丿)))

CJK Full Decomposition (prefix) – text conversion function which performs full decomposition of Chinese ideograph with classical (IDS) prefix format of resulting decomposition sequence string. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
惕=⿰忄⿱日⿹勹⿰丿丿

CJK Full Decomposition (prefix) (rad simpl) – text conversion function which performs full decomposition of Chinese ideograph with classical (IDS) prefix format of resulting decomposition sequence string. Graphical variants of any radicals will be simplified to the main radical form. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
惕=⿰心⿱日⿹勹⿰丿丿

CJK Ideographic Decomposition – text conversion function which performs one act decomposition of Chinese ideograph in format [HZ]=[C1]{decomp}[C2]. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
湕=氵⿰建

CJK Ideographic Decomposition (rad simpl) – text conversion function which performs one act decomposition of Chinese ideograph in format [HZ]=[C1]{decomp}[C2]. Graphical variants of any radicals will be simplified to the main radical form. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro.

For example:
湕=水⿰建

CJK Ideographic Decomposition (...) – text conversion function which performs one act decomposition of Chinese ideograph in format ([C1]{decomp}[C2]).Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>(氵⿰建)

CJK Ideographic Decomposition (...) (rad simpl) – text conversion function which performs one act decomposition of Chinese ideograph in format ([C1]{decomp}[C2]). Graphical variants of any radicals will be simplified to the main radical form. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>(水⿰建)

CJK Ideographic Decomposition (prefix) – text conversion function which performs one act decomposition of Chinese ideograph in format [HZ]={decomp}[C1][C2]. In the resulting string the ”=” sign will be added after each decomposing ideograph. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=⿰氵建

CJK Ideographic Decomposition (prefix) (rad simpl) – text conversion function which performs one act decomposition of Chinese ideograph in format [HZ]={decomp}[C1][C2]. In the resulting string the ”=” sign will be added after each decomposing ideograph. Graphical variants of any radicals will be simplified to the main radical form. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=⿰水建

CJK Ideographic Decomposition Determinative – text conversion function which returns determinative (radical) of Chinese ideograph. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>氵

CJK Ideographic Decomposition Determinative (rad simpl) – text conversion function which returns determinative (radical) of Chinese ideograph. Graphical variants of any radicals will be simplified to the main radical form. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>水

CJK Ideographic Decomposition Phonetic – text conversion function which returns phonetic part of Chinese ideograph. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>建

CJK Radical and Strokes Count – text conversion function which returns radical sign (according to Unicode standard), total strokes and strokes for additional component of Chinese ideograph. Just select text for decomposition and after press button with macro. Resulting string will type instead the selected ideograph.

For example:
湕=>水(11/9)

CJK Radicals Composition (form) – two components input method.

1 – combo box to selecting of 214 Kang Xi radicals
2 – text box for input of radical English names (if the name is correct, then radical will be selected automatically in combo box 1). The English names of radicals see in Appendix.
3 – radio button enables the choice for the composition components getting from Unicode CJK Ext A diapason only.
4 – radio button enables the choice for the composition components getting from Unicode CJK Unifed Ideographs diapason only.
5 - select the character for the composition from list generated by selected radical after applying filter for the number of strokes
6 - filter works only with total number of strokes in character
7 - filter works only with number of strokes for additional non-radical part
8 - number of strokes (total or additional) for filter to generate characters list in 5
9 - set the threshold for number of strokes (total or additional) for filter to generate characters list in 5
10 - button for input the first character from element 5 to the composition
11 - button for input the second character from element 5 to the composition
12 – text label displaying the first character for composition (double click - type into the text)
13 - text label displaying second character for composition (double click - type into the text)
14 - text label displaying Unicode decomposition symbol (double click - type into the text )
15 - button for input whole composition into the text
16 - button for input "=" sign into the text
17 - button to enter in text the resulting one ideograph from the source two characters composition (activated if ideograph is present in Unicode standard)
18 – combo box to select the symbols of Unicode Ideographs Description Sequence (about mnemonics see in Appendix)

For input ⺼“meet” left-position or ⻏ “city” right position radicals you will choose these components from CJK supplementary radicals (U+2E80 – U+2EF3).

CJK Radicals Compose (3 symbols) – text conversion function which converts three symbols in string as [C1]{compos}[C2] into resulting ideograph (like the CJK Radicals Composition (form)). Just select text (always – three characters) before composition and after press button with macro. The ideograph will be just typed instead of selected text if result of such composition included in Unicode standard.

For example:
氵⿰龍
瀧

CJK Radicals Compose Prefix (3 symbols) - Text conversion function which converts three symbols in string as {compos}[C1][C2] into resulting ideograph (like the CJK Radicals Composition (form)). Just select text (always – three characters) before composition and after press button with macro. The ideograph will be just typed instead of selected text if result of such composition included in Unicode standard.

For example:
⿰氵龍
瀧

Decomposition Symbol to Mnemonics – CJK Decomposition Description Unicode symbols (U+2FF0 – U+2FFB) will be converted into text mnemonics (see Appendix).

For example:
湕=水⿰建
湕=水[LR]建

Delete Decomposition Symbols – CJK Decomposition Description Unicode symbols (U+2FF0 – U+2FFB) will be converted into commas and all parentheses will be deleted from selected text.

For example:
腍=(⺼⿰((亼⿱乛)⿱心))
腍=⺼,亼,乛,心
It will be useful for displaying of radicals set in the decomposed ideograph.

Decomposition Symbols Count – in the selected text all CJK Decomposition Description Unicode symbols (U+2FF0 – U+2FFB) will be counted. The resulting number will be typed instead of selected text.

For example:
腍=(⺼⿰((亼⿱乛)⿱心))
腍=3
It will be useful to counting and displaying of logical complexity for one ideograph. Other words, this is a total number of binary composition operations among radical graphemes.

1.4 Select fonts

Font - this function is allows to select fonts to display the input text and text in the input form.

Selecting a font character set for GB2312

Selecting a font character set for BIG5

Selecting a font to display text in the input form

1.5 Input form

1 - short name of active IME
2 - one-char mode indicator
3 - words-mode indicator (only for Pinyin) - b)
4 - indicator of the input form position on the screen
5 - input text box
6 - reset the selection list after closing the related words form
7 - close the form with related words after the insertion char in the text
8 - just insert into the text, if the list of candidate has only one char
9 - disable the input of related words
10 - insert into the text first character from selection candidates list
11 - button to activate the LianXiang-mode for entering related words
12 - selection candidates list (words or characters)
13 - character typed in the text
14 - enable LianXiang-mode indicator
15 – PinYin Latin transcription systems switcher (for PinYin IMEs only)
a) - main input form, c) - related words input form.

2. Japanese Input Methods for MS Word.

2.1 List of IME

Japanese Romaji - this is kana input by using Latin transcription (Romaji). Character input modes (Hira, Kata, HW-Kata, Kanji) can be switched by appropriate buttons on the input form. Default kana input is hiragana mode. Note, that sometimes to enter the closing syllable symbol ん (-n) you need to double press N. Reset input buffer by pressing Esc when you want. You can also activate the kanji input mode with the selection candidates list displayed on the input form.

Japanese Hiragana - this is kana input by using Latin transcription (Romaji). Character input modes (Hira, Kata, HW-Kata, Kanji) can be switched by appropriate buttons on the input form. Default kana input is hiragana mode. Reset input buffer by pressing Esc when you want. You can also activate the kanji input mode with the selection candidates list displayed on the input form.

Japanese FourCorner - this is the input of kanji with 4Corner system (Four Corners). The maximum length of the input string is 5 characters. Characters set for input is (0-9). Additional input also is available for the Japanese kana characters layout. You can input kanji from the selection list by pressing a number key. But it will be available when the length of the 4Corner encoding string will reach 5 characters. If the length of 4Corner encoding string is less than 5, you can input kanji by mouse click.

2.2 Conversion of text

You need just select the text and press the button with macro.

Hiragana to Romaji (simple) - this text function is converts the hiragana text into romaji text string (without analysis of kana characters neighboring in the converted text)
For example:
ぼくのせんせいがです
bokunosenseigadesu

Katakana to Hiragana - this text function is converts hiragana text string into katakana text string.
For example:
ボクノセンセイガ
ぼくのせんせいが

Hiragana to Katakana - this text function is converts katakana text string into hiragana text string.
For example:
ぼくのせんせいが
ボクノセンセイガ

Hiragana to HalfW Katakana - this text function is converts hiragana text string into half-width katakana text string.
For example:
ぼくのせんせいが
ﾎﾞｸﾉｾﾝｾｲｶﾞ

Kanji to Radical - this text function is converts kanji into the radical char from the list of 214 Kang Xi radicals. The radical characters are determined in accordance with sorting into the Unicode standard. Available char set for conversion is CJK Unifed Ideographs.
For example:
著書南嶋探験
艸曰十山手馬

Kyutaiji to Sintaiji - this text function is converts kanji from traditional style to simplified Japanese kanji style.
For example:
驗=>験

Sintaiji to Kyutaiji - this text function is converts kanji from simplified Japanese Kanji style to traditional style.
For example:
験=>驗

Kanji to Unicode (Dec) - this function is converts selected kanji into decimal code. After conversion the space separators will be placed among obtained numerical values.
For example:
著書南嶋探験
33879 26360 21335 23947 25506 39443

Kanji to Unicode (Hex) - this function is converts selected Chinese characters in hexadecimal code. After conversion the space separators will be placed among obtained numerical values.
For example:
著書南嶋探験
8457 66F8 5357 5D8B 63A2 9A13

2.3 Select fonts

Font - this function is allows to select fonts to display the input text and text in the input form.

2.4 Input form

1 - indicator of input form position on the screen
2 - indicator for enabling kanji mode
3 - indicator for enabling half-width katakana mode
4 - indicator for enabling katakana mode
5 - indicator for enabling hiragana mode
6 - text input
7 - list of candidates for single kanji or words
8 - short name of the input method.

3. Korean Input Methods for MS Word.

3.1 List of IME

Hangul (BeolSik) - input Hangul phonemes with a standard Korean keyboard layout (Beol Sik). You can reset the input buffer by pressing Esc. It is possible to activate the hanja input mode from the selection list (press appropriate button). You can activate the input for simple words by the entered first Hangeul syllable. Also you can enter all Hangul phonemes included in the Unicode standard. Button on the input form may enable KSC-only mode to filtering Hangeul phonemes input. Thus it will be input with Hangul phonemes only included into Korean national standard (KSC).

Hangul (LatinABC) - input Hangul phonemes with a simplified Korean romanization scheme (like for isolated phonemes) by Latin keyboard layout. You can reset the input buffer by pressing Esc. It is possible to activate the hanja input mode from the selection list (press appropriate button). You can activate the input for simple words by the entered first Hangeul syllable. Also you can enter all Hangul phonemes included in the Unicode standard. Button on the input form may enable KSC-only mode to filtering Hangeul phonemes input. Thus it will be input with Hangul phonemes only included into Korean national standard (KSC).

3.2 Conversion of text

For conversion you need just select the text and press a button with macro.

Hangul to String - this text conversion function is splits Hangeul phonemes into string of Hangul Jamo letters that included in this phoneme.

For example:
곺 => ㄱ ㅗ ㅍ

Hangul (Isolated) to New Romanization 2000 – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into string of New Korean Romanization standard without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글 자모
han geul ja mo

Hangul (Isolated) to Yale Romanization – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into string of Yale Romanization standard without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글 자모
han kul ca mo

Hangul (Isolated) to McCune-Reischauer – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into string of McCune-Reischauer standard without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글 자모
han kŭl cha mo

Hangul (Isolated) to Joseon Gwahagwon – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into string of Joseon Gwahagwon (North Korea) standard without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글 자모
han kŭr tsa mo

Hangul (Isolated) to Cyrillization – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into string of Cyrillic transcription (based on Kontzevich system) standard without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글 자모
хан кыль ча мо

Latin IME String to Hangul – text function which performs conversion from Latin IME-type text to the Hangeul text.

For example:
HAN GEUL JA MO=>한글자모
HAN KEUR CA MO=>한글자모
HAN KEUL CA MO=>한글자모

All three source variants of input strings after conversion will be the same as Korean text. It was very useful for realization of Romanized IME input.

Hangul to Latin IME String – simplified text conversion function for isolated Hangeul phonemes. One Hangeul phoneme will be converted into Latin IME string without any analysis of neighbor phonemes and position in words. After conversion the space will be divider between phonemes Romanized words.

For example:
한글자모
HAN GEUR JA MO

Type All 11172 Hangul chars - test for the output of all Hangeul phonemes included in the Unicode standard (this function will take few seconds)

For example:
가, 각, 갂, 갃, 간, 갅, 갆, 갇, 갈 ... (output in one column)

Hangul to Unicode (Dec) - this function is converts selected Hangeul phonemes into decimal code. After conversion the space separators will be placed among obtained numerical values.

For example:
한글 자모
54620 44544 51088 47784

Hangul to Unicode (Hex) - this function is converts selected Hangeul phonemes into hexadecimal code. After conversion the space separators will be placed among obtained numerical values.

For example:
한글 자모
D55C AE00 C790 BAA8

Hanja to Unicode (Dec) - this function is converts selected hanja into decimal code. After conversion the space separators will be placed among obtained numerical values.

For example:
韓契字母
38867 63753 23376 27597

Hanja to Unicode (Hex) - this function is converts selected hanja into hexadecimal code. After conversion the space separators will be placed among obtained numerical values.

For example:
韓契字母
97D3 F909 5B50 6BCD

Hanja to Hangul - selected hanja will be converted to Hangeul phoneme transcription with text separator such as space. If the hanja has more than one Hangeul reading, the alternate versions of the transcription will be enclosed in parentheses with comma separator inside. Available charsets for conversion are: CJK Unifed Ideographs + CJK Ext A.

For example:
韓契字母
한글 자모

Hanja to Radical (Unicode) - this text function is converts hanja to the radical character from the list of 214 Kang Xi radicals. The radical characters are determined in accordance with sorting into the Unicode standard. Available charset for conversion is CJK Unifed Ideographs.

For example:
韓契字母
韋大子毋

3.3 Select fonts

Font - this function is allows to select fonts to display the input text and text in the input form.

3.4 Input form

1 - button to activate the KSC-only mode filtration
2 - button to activate the input of simple words
3 - button to activate the Hanja input mode
4 - indicator of input form position on the screen
5 - typed Hangeul phoneme
6 - the list of Hanja candidates (or Hangeul words) for input

4. Vietnamese Input Methods for MS Word.

4.1 List of IME

Telex (VISCII) - the basis of the layout is standard Latin. You can input Vietnamese vowels with the Telex system. Tone modifiers of vowels are keys: F, S, X, J, R. Vowel modifiers for Â, Ê, Ô - respectively second pressing A, E, O. The modifier for Ư, Ơ - W letter for both, after O and U press respectively. The modifier for Đ - double press of D key. To correctly display characters you required a font with VISCII single-byte characters encoding (this font is attached to the macro script C:\CJKV_VBA\VIETNAMESE\Viet.ttf). Chinese ideographs character input mode is not available.

VIQR fast (VISCII) - the basis of the layout is standard Latin. You can input Vietnamese vowels with the VIQR system (fast variant, max. two clicks). To correctly display characters you required a font with VISCII single-byte characters encoding (this font is attached to the macro script C:\CJKV_VBA\VIETNAMESE\Viet.ttf). Chinese ideographs character input mode is not available.

Telex (Unicode) - the basis of the layout is standard Latin. You can input Vietnamese vowels with the Telex system. Tone modifiers of vowels are keys: F, S, X, J, R. Vowel modifiers for Â, Ê, Ô - respectively second pressing A, E, O. The modifier for Ư, Ơ - W letter for both, after O and U press respectively. The modifier for Đ - double press of D key. To correctly display characters you required a Unicode font with Extended Latin symbols (Times New Roman). Chinese ideographs character input mode is available.

VIQR fast (Unicode) - the basis of the layout is standard Latin. You can input Vietnamese vowels with the VIQR system (fast variant, max. two clicks). Also you need to have any Unicode fonts with extended Latin characters (Times New Roman). Chinese ideographs character input mode is available.

4.2 Conversion of text

For the conversion you need just select the text (except the test output functions) and press a button with macro.

Type VISCII vowels LowCase - test output of all Vietnamese vowels list in all tone cases with VISCII single-byte encoding (lower case characters).

For example:
a    à    á    ạ    ã    ả
â    ầ    ấ    ậ    ẫ    ẩ
ă    ằ    ắ    ặ    ẵ    ẳ
e    è    é    ẹ    ẽ    ẻ
ê    ề    ế    ệ    ễ    ể
o    ò    ó    ọ    õ    ỏ
ô    ồ    ố    ộ    ỗ    ổ
ơ    ờ    ớ    ợ    ỡ    ở
u    ù    ú    ụ    ũ    ủ
ư    ừ    ứ    ự    ữ    ử
i    ì    í    ị    ĩ    ỉ
y    ỳ    ý    ỵ    ỹ    ỷ

Type VISCII vowels UpCase - test output of all Vietnamese vowels list in all tone cases with VISCII single-byte encoding (upper case characters).

For example:
A    À    Á    Ạ    Ã    Ả
Â    Ầ    Ấ    Ậ    Ẫ    Ẩ
Ă    Ằ    Ắ    Ặ    Ẵ    Ẳ
E    È    É    Ẹ    Ẽ    Ẻ
Ê    Ề    Ế    Ệ    Ễ    Ể
O    Ò    Ó    Ọ    -    Ỏ
Ô    Ồ    Ố    Ộ    Ỗ    Ổ
Ơ    Ờ    Ớ    Ợ    Ỡ    Ở
U    Ù    Ú    Ụ    Ũ    Ủ
Ư    Ừ    Ứ    Ự    Ữ    Ử
I    Ì    Í    Ị    Ĩ    Ỉ
Y    Ỳ    Ý    -    -    -

Type Unicode vowels LowCase - test output of all Vietnamese vowels list in all tone cases with Unicode double-byte encoding (Extended Latin symbols, lower case characters).

Type Unicode vowels UpCase - test output of all Vietnamese vowels list in all tone cases with Unicode double-byte encoding (Extended Latin symbols, upper case characters).

For example:
A    À    Á    Ạ    Ã    Ả
Â    Ầ    Ấ    Ậ    Ẫ    Ẩ
Ă    Ằ    Ắ    Ặ    Ẵ    Ẳ
E    È    É    Ẹ    Ẽ    Ẻ
Ê    Ề    Ế    Ệ    Ễ    Ể
O    Ò    Ó    Ọ    Õ    Ỏ
Ô    Ồ    Ố    Ộ    Ỗ    Ổ
Ơ    Ờ    Ớ    Ợ    Ỡ    Ở
U    Ù    Ú    Ụ    Ũ    Ủ
Ư    Ừ    Ứ    Ự    Ữ    Ử
I    Ì    Í    Ị    Ĩ    Ỉ
Y    Ỳ    Ý    Ỵ    Ỹ    Ỷ

VIQR to VISCII - this function performs conversion of text strings encoded by VIQR sequence into Vietnamese characters single-byte encoding VISCII. To work correctly you need to have a font with VISCII single-byte character encoding (viet.ttf).

For example:
Tie^'ng Vie^.t
Tiếng Việt

VIQR to Unicode - this function performs conversion of text strings encoded by VIQR sequence into Unicode Vietnamese letters. To work correctly you need to have any Unicode font with extended Latin characters (Times New Roman).

For example:
Tie^'ng Vie^.t
Tiếng Việt

VISCII to Unicode - this text function is converts VISCII encoded string to a string with Unicode Vietnamese letters.

Unicode to VISCII - this text function is converts Unicode strings into a string encoded by VISCII single-byte character set. For correctly working of this function you need to have a font with single-byte character encoding VISCII (viet.ttf).

VISCII to VIQR - this text function is performs the conversion of single byte Vietnamese text (VISCII) into the VIQR encoding text.

For example:
Tiếng Việt
Tie^'ng Vie^.t

Unicode to VIQR - this text function is performs the conversion of Vietnamese text (Unicode) into the VIQR encoding text.
For example:
Tiếng Việt
Tie^'ng Vie^.t

HanNom to VIET string - this text function is converts the selected Chinese characters from the ranges of Unicode: CJK Unified + CJK ExtA into corresponding Vietnamese reading. If the Chinese character has more than one reading, the alternate versions of the Vietnamese transcription will be enclosed in parentheses with separator by comma inside.
For example:
越南 => việt nam

4.3 Input form

1 - indication of characters number displayed on the selection list
2 - name of input method (or input text field)
3 - button for activate of Chinese ideographs input mode
4 - Chinese ideographs selection list
5 - indicator of characters encoding used for input.

APPENDIX

English names of Chinese Kang Xi radicals
Unicode decomposition symbols and mnemonics.
Extended variants of alternative CJK decomposition interpretation accordingly with binary conception
Standard Chinese phonemes with various transcription systems

WERTA(C), 2000-2014, werta666@pisem.net