is

Interscript

Interoperable
script conversion systems

System Name: moct-kor-Hang-Latn-2000

Authority IDmoct
System ID2000
LanguageKorean
Source ScriptHang
Destination ScriptLatn
NameMinistry of Culture and Tourism Korean Romanization System (2000)
URL
DescriptionGeneration of Jamo from Hangul This is how the Hangul-to-Jamo maps are generated. Please refer to this page for details about Korean text handling in Unicode. http://gernot-katzers-spice-pages.com/var/korean_hangul_unicode.html This formula copied from the page above is used: [stem] ==== tail = mod (Hangul codepoint − 44032, 28) vowel = 1 + mod (Hangul codepoint − 44032 − tail, 588) / 28 lead = 1 + int [ (Hangul codepoint − 44032)/588 ] ==== [source,python] ---- import pandas as pd import re import math leadjamo = [chr(0x1100+i) for i in range(0,19)] # ᄀᄁᄂᄃᄄᄅᄆᄇᄈᄉᄊᄋᄌᄍᄎᄏᄐᄑᄒ voweljamo = [chr(0x1161+i) for i in range(0,21)] # ᅡᅢᅣᅤᅥᅦᅧᅨᅩᅪᅫᅬᅭᅮᅯᅰᅱᅲᅳᅴᅵ tailjamo = ['']+[chr(0x11A8+i) for i in range(0,27)] # ᆨᆩᆪᆫᆬᆭᆮᆯᆰᆱᆲᆳᆴᆵᆶᆷᆸᆹᆺᆻᆼᆽᆾᆿᇀᇁᇂ hanguls = [chr(i) for i in range(44032,55204)] tails = [tailjamo[(i-44032) % 28] for i in range(44032,55204)] vowels = [voweljamo[((i-44032-((i-44032) % 28)) % 588) // 28] for i in range(44032,55204)] leads = [leadjamo[math.floor((i-44032)// 588)] for i in range(44032,55204)] kr_df = pd.DataFrame({'Hangul':hanguls, 'Lead':leads,'Vowel':vowels, 'Tail':tails}) ---- Hangul Lead Vowel Tail 0 가 ᄀ ᅡ 1 각 ᄀ ᅡ ᆨ 2 갂 ᄀ ᅡ ᆩ 3 갃 ᄀ ᅡ ᆪ 4 간 ᄀ ᅡ ᆫ 5 갅 ᄀ ᅡ ᆬ 6 갆 ᄀ ᅡ ᆭ 7 갇 ᄀ ᅡ ᆮ 8 갈 ᄀ ᅡ ᆯ 9 갉 ᄀ ᅡ ᆰ
Hang
Latn
Condition
  • none
    space
    after: digit, not before: any (digit, space)
  • 10031
    c77c
  • 20032
    c774
  • 30033
    c0bc
  • 40034
    c0ac
  • 50035
    c624
  • 60036
    c721
  • 70037
    ce60
  • 80038
    d314
  • 90039
    ad6c
  • none
    -002d
    after: any (b3c4, c2dc, ad70, ad6c, c74d, ba74, b9ac, b3d9, ac00) + line end, not before: line start
  • Run var-kor-Hang-Hang-jamo
  • line start
    space
  • line end
    space
  • 11a9110b
    11a81101
  • 11a9
    11a8
  • 11aa110b
    11a81109
  • 11aa
    11a8
  • 11ac110b
    11ab110c
  • 11ac
    11ab
  • 11ad1100
    11ab110f
  • 11ad1103
    11ab1110
  • 11ad1107
    11ab1111
  • 11ad110c
    11ab110e
  • 11ad
    11ab
  • 11ae
    11ba
    after: any (1100, 1101, 1102, 1103, 1104, 1105, 1106, 1107, 1108, 1109, 110a, 110c, 110d, 110e, 110f, 1110, 1111, 1112)
  • 11b3110b
    11af1109
  • 11b3
    11af
  • 11b4110b
    11af1110
  • 11b4
    11af
  • 11b5110b
    11af1111
  • 11b5
    11af
    after: any (1103, 1104, 1110)
  • 11b5
    1107
  • Parallel
  • Parallel
  • 1100
    g0067
    before: any (alpha, digit, jamo vowel, -002d)
  • 1102
    n006e
    before: any (alpha, digit, jamo vowel, -002d)
  • 1103
    d0064
    before: any (alpha, digit, jamo vowel, -002d)
  • 1105
    r0072
    before: any (alpha, digit, jamo vowel, -002d)
  • 1106
    m006d
    before: any (alpha, digit, jamo vowel, -002d)
  • 1107
    b0062
    before: any (alpha, digit, jamo vowel, -002d)
  • 1109
    s0073
    before: any (alpha, digit, jamo vowel, -002d)
  • 110b
    before: any (alpha, digit, jamo vowel, -002d)
  • 110c
    j006a
    before: any (alpha, digit, jamo vowel, -002d)
  • 110e
    c0063h0068
    before: any (alpha, digit, jamo vowel, -002d)
  • 110f
    k006b
    before: any (alpha, digit, jamo vowel, -002d)
  • 1110
    t0074
    before: any (alpha, digit, jamo vowel, -002d)
  • 1111
    p0070
    before: any (alpha, digit, jamo vowel, -002d)
  • 1112
    h0068
    before: any (alpha, digit, jamo vowel, -002d)
  • 1101
    k006bk006b
    before: any (alpha, digit, jamo vowel)
  • 1104
    t0074t0074
    before: any (alpha, digit, jamo vowel)
  • 1108
    p0070p0070
    before: any (alpha, digit, jamo vowel)
  • 110a
    s0073s0073
    before: any (alpha, digit, jamo vowel)
  • 110d
    j006aj006a
    before: any (alpha, digit, jamo vowel)
  • Parallel
  • 1100
    g0067
    before: space
  • 1102
    n006e
    before: space
  • 1103
    d0064
    before: space
  • 1105
    before: space, after: any (1163, 1164, 1167, 1168, 116d, 1172)
  • 1105
    n006e
    before: space
  • 1106
    m006d
    before: space
  • 1107
    b0062
    before: space
  • 1109
    s0073
    before: space
  • 110b
    before: space
  • 110c
    j006a
    before: space
  • 110e
    c0063h0068
    before: space
  • 110f
    k006b
    before: space
  • 1110
    t0074
    before: space
  • 1111
    p0070
    before: space
  • 1112
    h0068
    before: space
  • 1101
    k006bk006b
    before: space
  • 112d
    k006bk006b
    before: space
  • 1104
    t0074t0074
    before: space
  • 112f
    t0074t0074
    before: space
  • 1108
    p0070p0070
    before: space
  • 1132
    p0070p0070
    before: space
  • 110a
    s0073s0073
    before: space
  • 110d
    j006aj006a
    before: space
  • 1136
    j006aj006a
    before: space
  • Parallel
  • 11a8
    k006b
    after: any (space, -002d)
  • 11ab
    n006e
    after: any (space, -002d)
  • 11ae
    t0074
    after: any (space, -002d)
  • 11af
    l006c
    after: any (space, -002d)
  • 11b7
    m006d
    after: any (space, -002d)
  • 11b8
    p0070
    after: any (space, -002d)
  • 11ba
    t0074
    after: any (space, -002d)
  • 11bc
    n006eg0067
    after: any (space, -002d)
  • 11bd
    t0074
    after: any (space, -002d)
  • 11be
    t0074
    after: any (space, -002d)
  • 11bf
    k006b
    after: any (space, -002d)
  • 11c0
    t0074
    after: any (space, -002d)
  • 11c1
    p0070
    after: any (space, -002d)
  • 11b0
    k006b
    after: any (space, -002d)
  • 11b2
    p0070
    after: any (space, -002d)
  • line start + space
    none
  • space + line end
    none
  • Title case