1
0
mirror of synced 2026-03-15 14:47:09 +00:00

Compare commits

...

5 Commits

Author SHA1 Message Date
rmkaplan
f5205e23c6 Update unicode mapping tables (#440)
* Updated all ISO8859 and Vendor Unicode mappings from current Unicode.org information

* John Cowan says that East Asia is obsolete

* Delete obsolete file
2021-08-28 09:01:27 -07:00
Larry Masinter
b57438983b modify release notes / scripts for separate medley maiko releases (#436)
* modify release notes / scripts for separate medley maiko releases

* Tweak release notes

* more edits release notes
2021-08-28 08:41:33 -07:00
rmkaplan
f4951abf4d Merge pull request #442 from Interlisp/Adjust-TTYIN-and-LLKEY-for-character-specifications
Adjust ttyin and llkey for character specifications
2021-08-27 21:29:08 -07:00
rmkaplan
d1fb141fa1 TTYIN: Remove indirects to generic char io functions, abstract to function for TTYIN.RESTORE.BUF.CODES
Local BOUTCCODE macro replaced by call to \OUTCHAR etc.  Previously it called PRINTCCODE probably because that was the encapsulation of NSIN.  Now avoid the user-level function call, so it can be broken, go directly to the generic.
2021-08-27 17:05:55 -07:00
rmkaplan
66624477f9 LLKEY: Do the Meta/Function abstraction for \ORIGKEYACTIONS and \MAIKOKEYACTIONST4
With all the back and forth in the last branch I forgot to commit these particular changes
2021-08-27 16:20:48 -07:00
146 changed files with 317298 additions and 61672 deletions

View File

@@ -1,25 +1,24 @@
Each release should have a subset of the medley repo in a file
We've moved to separate releases of medley and maiko.
Just get the latest version of each.
Or, you could pick up the medley release and build your own maiko.
Medley release is here:
`medley-`releasename`.tgz`
and at least one
`maiko-`releasename`.`osname`.`arch`.tgz`
e.g.,
`maiko-$tag.linux.x86_64.tgz`
for each os/arch pair for which we have GitHub "action" runners.
Maiko relese is [here](https://github.com/Interlisp/maiko/releases)
To use (from a shell/terminal window):
1. Unpack the medley tar file
```
tar -xvfz medley-$tag.tgz
tar -xvfz $tag.tgz
```
and the maiko file for your os.arch
```
tar -xvfz maiko-$tag.linux.x86_64.tgz
```
this should leave you with two new directories, `medley` and `maiko`.
x and the maiko file for your os.arch, e.g.,
```
tar -xvfz maiko-210823.linux.x86_64.tgz
```
This should leave you with two directories, `medley` and `maiko`.
Then you can
```
cd medley

13
scripts/loadup-and-release.sh Executable file
View File

@@ -0,0 +1,13 @@
#!/bin/sh
export MEDLEYDIR=`pwd`
if [ ! -x run-medley ] ; then
echo run from MEDLEYDIR
exit 1
fi
./scripts/loadup-all.sh && \
./scripts/copy-all.sh && \
./scripts/release-medley.sh

View File

@@ -8,26 +8,30 @@ fi
tag=$1
if [ -z "$tag" ] ; then
tag=nightly-`date +%y%m%d`
tag=medley-`date +%y%m%d`
fi
cd ..
echo making medley zip
echo making medley zip $tag
tar cfz medley/tmp/medley-$tag.tgz \
tar cfz medley/tmp/$tag.tgz \
--exclude-backups --exclude="*.PDF" \
medley/docs/dinfo \
medley/greetfiles/SIMPLE-INIT \
medley/run-medley \
medley/scripts \
medley/loadups \
medley/loadups/lisp.sysout \
medley/loadups/full.sysout \
medley/loadups/whereis.hash \
medley/fonts/displayfonts medley/fonts/altofonts \
medley/fonts/postscriptfonts medley/fonts/ipfonts \
medley/fonts/postscriptfonts \
medley/library/ \
medley/internal/library \
medley/lispusers/ \
medley/sources/
# medley/fonts/big medley/fonts/other \
# medley/sources/
# medley/internal/library \
cd medley
@@ -35,7 +39,6 @@ echo making release
sed s/'$tag'/$tag/g < release-notes.md > tmp/release-notes.md
gh release create $tag -F tmp/release-notes.md -p -t $tag
echo uploaded tmp/medley-$tag.tgz
gh release upload $tag tmp/medley-$tag.tgz --clobber
echo uploaded $tag.tgz
gh release upload $tag tmp/$tag.tgz --clobber
./scripts/release-one.sh $tag

View File

@@ -1,10 +1,8 @@
(DEFINE-FILE-INFO PACKAGE "INTERLISP" READTABLE "INTERLISP" BASE 10)
(FILECREATED "24-Aug-2021 16:54:52" {DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>LLKEY.;6 199507
(FILECREATED "27-Aug-2021 16:18:18" {DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>LLKEY.;9 199890
changes to%: (FNS KEYACTION)
previous date%: "24-Aug-2021 16:43:30"
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>LLKEY.;5)
previous date%: "24-Aug-2021 16:54:52"
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>LLKEY.;7)
(* ; "
@@ -1582,29 +1580,29 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(RPAQQ \ORIGKEYACTIONS
((0 (53 "%%" NOLOCKSHIFT))
(1 (52 "$" NOLOCKSHIFT))
(2 (54 "~" NOLOCKSHIFT))
((0 ("5" "%%" NOLOCKSHIFT))
(1 ("4" "$" NOLOCKSHIFT))
(2 ("6" "~" NOLOCKSHIFT))
(3 ("e" "E" LOCKSHIFT))
(4 (55 "&" NOLOCKSHIFT))
(4 ("7" "&" NOLOCKSHIFT))
(5 ("d" "D" LOCKSHIFT))
(6 ("u" "U" LOCKSHIFT))
(7 ("v" "V" LOCKSHIFT))
(8 (48 ")" NOLOCKSHIFT))
(8 ("0" ")" NOLOCKSHIFT))
(9 ("k" "K" LOCKSHIFT))
(10 ("-" "-" NOLOCKSHIFT))
(11 ("p" "P" LOCKSHIFT))
(12 ("/" "?" NOLOCKSHIFT))
(13 ("\" "|" NOLOCKSHIFT))
(14 (10 96 NOLOCKSHIFT))
(15 (8 8 NOLOCKSHIFT))
(16 (51 "#" NOLOCKSHIFT))
(17 (50 "@" NOLOCKSHIFT))
(14 ("LF" "`" NOLOCKSHIFT))
(15 ("Bs" "Bs" NOLOCKSHIFT))
(16 ("3" "#" NOLOCKSHIFT))
(17 ("2" "@" NOLOCKSHIFT))
(18 ("w" "W" LOCKSHIFT))
(19 ("q" "Q" LOCKSHIFT))
(20 ("s" "S" LOCKSHIFT))
(21 ("a" "A" LOCKSHIFT))
(22 (57 "(" NOLOCKSHIFT))
(22 ("9" "(" NOLOCKSHIFT))
(23 ("i" "I" LOCKSHIFT))
(24 ("x" "X" LOCKSHIFT))
(25 ("o" "O" LOCKSHIFT))
@@ -1612,11 +1610,11 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(27 ("," "<" NOLOCKSHIFT))
(28 ("'" "%"" NOLOCKSHIFT))
(29 ("]" "}" NOLOCKSHIFT))
(30 (194 194 NOLOCKSHIFT))
(31 (193 193 NOLOCKSHIFT))
(32 (49 "!" NOLOCKSHIFT))
(33 (27 27 NOLOCKSHIFT))
(34 (9 9 NOLOCKSHIFT))
(30 ("#B" "#B" NOLOCKSHIFT))
(31 ("#A" "#A" NOLOCKSHIFT))
(32 ("1" "!" NOLOCKSHIFT))
(33 ("Esc" "Esc" NOLOCKSHIFT))
(34 ("Tab" "Tab" NOLOCKSHIFT))
(35 ("f" "F" LOCKSHIFT))
(36 CTRLDOWN . CTRLUP)
(37 ("c" "C" LOCKSHIFT))
@@ -1626,24 +1624,24 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(41 1SHIFTDOWN . 1SHIFTUP)
(42 ("." ">" NOLOCKSHIFT))
(43 (";" ":" NOLOCKSHIFT))
(44 (13 13 NOLOCKSHIFT))
(44 ("CR" "CR" NOLOCKSHIFT))
(45 ("_" "^" NOLOCKSHIFT))
(46 (127 535 NOLOCKSHIFT))
(46 ("Del" "Function,^W" NOLOCKSHIFT))
(47 ("(" "[" NOLOCKSHIFT))
(48 ("r" "R" LOCKSHIFT))
(49 ("t" "T" LOCKSHIFT))
(50 ("g" "G" LOCKSHIFT))
(51 ("y" "Y" LOCKSHIFT))
(52 ("h" "H" LOCKSHIFT))
(53 (56 "*" NOLOCKSHIFT))
(53 ("8" "*" NOLOCKSHIFT))
(54 ("n" "N" LOCKSHIFT))
(55 ("m" "M" LOCKSHIFT))
(56 LOCKDOWN . LOCKUP)
(57 (32 32 NOLOCKSHIFT))
(57 ("Sp" "Sp" NOLOCKSHIFT))
(58 ("[" "{" NOLOCKSHIFT))
(59 ("=" "+" NOLOCKSHIFT))
(60 2SHIFTDOWN . 2SHIFTUP)
(61 (195 195 NOLOCKSHIFT))
(61 ("#C" "#C" NOLOCKSHIFT))
(63 (")" "]" NOLOCKSHIFT))
(77 EVENT . EVENT)
(78 EVENT . EVENT)
@@ -1792,50 +1790,50 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(105 ("\" "|" NOLOCKSHIFT))))
(RPAQQ \MAIKOKEYACTIONST4
((61 ("^E" "^G" NOLOCKSHIFT))
(91 ("2,10" "2,50" NOLOCKSHIFT))
(92 ("2,1" "2,41" NOLOCKSHIFT))
(30 ("2,1" "2,41" NOLOCKSHIFT))
(109 ("2,25" "2,65" NOLOCKSHIFT))
(63 ("2,4" "2,44" NOLOCKSHIFT))
((61 ("^E" "Bell" NOLOCKSHIFT))
(91 ("Function,Bs" "Function,(" NOLOCKSHIFT))
(92 ("Function,^A" "Function,!" NOLOCKSHIFT))
(30 ("Function,^A" "Function,!" NOLOCKSHIFT))
(109 ("Function,^U" "Function,65" NOLOCKSHIFT))
(63 ("Function,^D" "Function,$" NOLOCKSHIFT))
(14 METADOWN . METAUP)
(93 ("2,24" "2,64" NOLOCKSHIFT))
(93 ("Function,^T" "Function,64" NOLOCKSHIFT))
(62)
(111 ("1,o" "1,O" NOLOCKSHIFT))
(111 ("Meta,o" "Meta,O" NOLOCKSHIFT))
(89)
(90 ("2,3" "2,43" NOLOCKSHIFT))
(73 ("2,11" "2,11" NOLOCKSHIFT))
(74 ("2,12" "2,12" NOLOCKSHIFT))
(90 ("Function,^C" "Function,#" NOLOCKSHIFT))
(73 ("Function,Tab" "Function,Tab" NOLOCKSHIFT))
(74 ("Function,LF" "Function,LF" NOLOCKSHIFT))
(75 ("^B" "^B" NOLOCKSHIFT))
(81 ("2,14" 55 NOLOCKSHIFT))
(82 (173 56 NOLOCKSHIFT))
(83 ("2,15" 57 NOLOCKSHIFT))
(84 (172 52 NOLOCKSHIFT))
(85 ("2,16" 53 NOLOCKSHIFT))
(87 (174 54 NOLOCKSHIFT))
(94 ("2,17" 49 NOLOCKSHIFT))
(69 (175 50 NOLOCKSHIFT))
(70 ("2,20" 51 NOLOCKSHIFT))
(98 ("2,21" 48 NOLOCKSHIFT))
(76 ("2,13" "2,13" NOLOCKSHIFT))
(110 ("2,53" "2,53" NOLOCKSHIFT))
(81 ("Function,FF" "7" NOLOCKSHIFT))
(82 ("#-" "8" NOLOCKSHIFT))
(83 ("Function,CR" "9" NOLOCKSHIFT))
(84 ("#," "4" NOLOCKSHIFT))
(85 ("Function,^N" "5" NOLOCKSHIFT))
(87 ("#." "6" NOLOCKSHIFT))
(94 ("Function,^O" "1" NOLOCKSHIFT))
(69 ("#/" "2" NOLOCKSHIFT))
(70 ("Function,^P" "3" NOLOCKSHIFT))
(98 ("Function,^Q" "0" NOLOCKSHIFT))
(76 ("Function,^K" "Function,^K" NOLOCKSHIFT))
(110 ("Function,+" "Function,+" NOLOCKSHIFT))
(72 LOCKTOGGLE)
(97 ("2,101" "2,141" NOLOCKSHIFT))
(99 ("2,102" "2,142" NOLOCKSHIFT))
(100 ("2,103" "2,143" NOLOCKSHIFT))
(67 ("2,104" "2,144" NOLOCKSHIFT))
(68 ("2,105" "2,145" NOLOCKSHIFT))
(101 ("2,106" "2,146" NOLOCKSHIFT))
(66 ("2,107" "2,147" NOLOCKSHIFT))
(104 ("2,110" "2,150" NOLOCKSHIFT))
(80 ("2,111" "2,151" NOLOCKSHIFT))
(106 ("2,113" "2,153" NOLOCKSHIFT))
(107 ("2,114" "2,154" NOLOCKSHIFT))
(108 ("2,115" "2,155" NOLOCKSHIFT))
(97 ("Function,A" "Function,a" NOLOCKSHIFT))
(99 ("Function,B" "Function,b" NOLOCKSHIFT))
(100 ("Function,C" "Function,c" NOLOCKSHIFT))
(67 ("Function,D" "Function,d" NOLOCKSHIFT))
(68 ("Function,E" "Function,e" NOLOCKSHIFT))
(101 ("Function,F" "Function,f" NOLOCKSHIFT))
(66 ("Function,G" "Function,g" NOLOCKSHIFT))
(104 ("Function,H" "Function,h" NOLOCKSHIFT))
(80 ("Function,I" "Function,i" NOLOCKSHIFT))
(106 ("Function,K" "Function,k" NOLOCKSHIFT))
(107 ("Function,L" "Function,l" NOLOCKSHIFT))
(108 ("Function,M" "Function,m" NOLOCKSHIFT))
(13 ("^W" "^U" NOLOCKSHIFT))
(33 ("ESC" "ESC" NOLOCKSHIFT))
(33 ("Esc" "Esc" NOLOCKSHIFT))
(64 IGNORE . IGNORE)
(65 (27 27 NOLOCKSHIFT))
(65 ("Esc" "Esc" NOLOCKSHIFT))
(95 IGNORE . IGNORE)
(96 IGNORE . IGNORE)
(102 IGNORE . IGNORE)
@@ -1845,8 +1843,8 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(56 LOCKTOGGLE . IGNORE)
(45 ("`" "~" NOLOCKSHIFT))
(31 METADOWN . METAUP)
(71 (10 10 NOLOCKSHIFT))
(47 ("2,22" "2,62" NOLOCKSHIFT))
(71 ("LF" "LF" NOLOCKSHIFT))
(47 ("Function,^R" "Function,62" NOLOCKSHIFT))
(86 IGNORE . IGNORE)
(88 IGNORE . IGNORE)
(105 ("\" "|" NOLOCKSHIFT))))
@@ -3924,33 +3922,33 @@ Copyright (c) 1982-1990, 1992, 1999, 1920, 2000, 2018, 2021 by Venue & Xerox Cor
(PUTPROPS LLKEY COPYRIGHT ("Venue & Xerox Corporation" 1982 1983 1984 1985 1986 1987 1988 1989 1990
1992 1999 1920 2000 2018 2021))
(DECLARE%: DONTCOPY
(FILEMAP (NIL (14733 22049 (BKSYSCHARCODE 14743 . 15092) (\CLEARSYSBUF 15094 . 15652) (\GETKEY 15654
. 16829) (\NSYSBUFCHARS 16831 . 17573) (\SAVESYSBUF 17575 . 19184) (\SYSBUFP 19186 . 19490) (
\GETSYSBUF 19492 . 19672) (\PUTSYSBUF 19674 . 20887) (\PEEKSYSBUF 20889 . 22047)) (23346 60180 (
\KEYBOARDINIT 23356 . 25076) (\KEYBOARDEVENTFN 25078 . 29778) (\ALLOCLOCKED 29780 . 30370) (
\SETIOPOINTERS 30372 . 34908) (\KEYBOARDOFF 34910 . 35324) (\KEYBOARDON 35326 . 35705) (\KEYHANDLER
35707 . 35838) (\KEYHANDLER1 35840 . 43286) (\RESETKEYBOARD 43288 . 44936) (\DOMOUSECHORDING 44938 .
48758) (\DOTRANSITIONS 48760 . 49437) (\DECODETRANSITION 49439 . 56128) (MOUSECHORDWAIT 56130 . 56794)
(\TRACKCURSOR 56796 . 60178)) (95464 117014 (KEYACTION 95474 . 96327) (KEYACTIONTABLE 96329 . 97511)
(KEYBOARDTYPE 97513 . 98615) (RESETKEYACTION 98617 . 100376) (\KEYBOARD.MACHINE-SPECIFIC-KEYACTIONS
100378 . 102280) (\KEYACTION1 102282 . 112398) (KEYDOWNP 112400 . 112735) (KEYNUMBERP 112737 . 112935)
(\KEYNAMETONUMBER 112937 . 113631) (\KEYNUMBERTONAME 113633 . 113823) (MODIFY.KEYACTIONS 113825 .
114686) (METASHIFT 114688 . 115632) (SHIFTDOWNP 115634 . 117012)) (117077 117373 (
SETUP.OFFICE.KEYBOARD 117087 . 117371)) (120076 121788 (\INIT.KEYBOARD.STREAM 120086 . 121786)) (
122053 138430 (\DOBUFFEREDTRANSITIONS 122063 . 137493) (\TIMER.INTERRUPTFRAME 137495 . 138220) (
\PERIODIC.INTERRUPTFRAME 138222 . 138428)) (138684 142761 (\HARDCURSORUP 138694 . 140576) (
\HARDCURSORPOSITION 140578 . 142614) (\HARDCURSORDOWN 142616 . 142759)) (142762 166822 (CURSOR.INIT
142772 . 146472) (\CURSORDESTINATION 146474 . 148792) (\SOFTCURSORUP 148794 . 154048) (
\SOFTCURSORUPCURRENT 154050 . 161086) (\SOFTCURSORPOSITION 161088 . 161853) (\SOFTCURSORDOWN 161855 .
162563) (CURSORPROP 162565 . 162907) (GETCURSORPROP 162909 . 163097) (PUTCURSORPROP 163099 . 164254) (
\CURSORBITSPERPIXEL 164256 . 166372) (\CURSORIMAGEPROPNAME 166374 . 166598) (\CURSORMASKPROPNAME
166600 . 166820)) (166823 184773 (CURSORCREATE 166833 . 169508) (CURSOR 169510 . 171322) (
\CURSOR-VALID-P 171324 . 172411) (\CURSORUP 172413 . 174128) (\CURSORPOSITION 174130 . 176658) (
\CURSORDOWN 176660 . 176893) (ADJUSTCURSORPOSITION 176895 . 177473) (CURSORPOSITION 177475 . 179017) (
CURSORSCREEN 179019 . 179675) (CURSOREXIT 179677 . 181068) (FLIPCURSOR 181070 . 182196) (FLIPCURSORBAR
182198 . 183178) (LASTMOUSEX 183180 . 183434) (LASTMOUSEY 183436 . 183690) (CREATEPOSITION 183692 .
183898) (POSITIONP 183900 . 184184) (CURSORHOTSPOT 184186 . 184771)) (186011 187559 (GETMOUSESTATE
186021 . 186680) (\EVENTKEYS 186682 . 187557)) (193986 194782 (MACHINETYPE 193996 . 194396) (
SETMAINTPANEL 194398 . 194780)) (194812 195951 (BEEPON 194822 . 195475) (BEEPOFF 195477 . 195949)) (
196402 196665 (WITHOUT-INTERRUPTS 196412 . 196663)))))
(FILEMAP (NIL (14696 22012 (BKSYSCHARCODE 14706 . 15055) (\CLEARSYSBUF 15057 . 15615) (\GETKEY 15617
. 16792) (\NSYSBUFCHARS 16794 . 17536) (\SAVESYSBUF 17538 . 19147) (\SYSBUFP 19149 . 19453) (
\GETSYSBUF 19455 . 19635) (\PUTSYSBUF 19637 . 20850) (\PEEKSYSBUF 20852 . 22010)) (23309 60143 (
\KEYBOARDINIT 23319 . 25039) (\KEYBOARDEVENTFN 25041 . 29741) (\ALLOCLOCKED 29743 . 30333) (
\SETIOPOINTERS 30335 . 34871) (\KEYBOARDOFF 34873 . 35287) (\KEYBOARDON 35289 . 35668) (\KEYHANDLER
35670 . 35801) (\KEYHANDLER1 35803 . 43249) (\RESETKEYBOARD 43251 . 44899) (\DOMOUSECHORDING 44901 .
48721) (\DOTRANSITIONS 48723 . 49400) (\DECODETRANSITION 49402 . 56091) (MOUSECHORDWAIT 56093 . 56757)
(\TRACKCURSOR 56759 . 60141)) (95847 117397 (KEYACTION 95857 . 96710) (KEYACTIONTABLE 96712 . 97894)
(KEYBOARDTYPE 97896 . 98998) (RESETKEYACTION 99000 . 100759) (\KEYBOARD.MACHINE-SPECIFIC-KEYACTIONS
100761 . 102663) (\KEYACTION1 102665 . 112781) (KEYDOWNP 112783 . 113118) (KEYNUMBERP 113120 . 113318)
(\KEYNAMETONUMBER 113320 . 114014) (\KEYNUMBERTONAME 114016 . 114206) (MODIFY.KEYACTIONS 114208 .
115069) (METASHIFT 115071 . 116015) (SHIFTDOWNP 116017 . 117395)) (117460 117756 (
SETUP.OFFICE.KEYBOARD 117470 . 117754)) (120459 122171 (\INIT.KEYBOARD.STREAM 120469 . 122169)) (
122436 138813 (\DOBUFFEREDTRANSITIONS 122446 . 137876) (\TIMER.INTERRUPTFRAME 137878 . 138603) (
\PERIODIC.INTERRUPTFRAME 138605 . 138811)) (139067 143144 (\HARDCURSORUP 139077 . 140959) (
\HARDCURSORPOSITION 140961 . 142997) (\HARDCURSORDOWN 142999 . 143142)) (143145 167205 (CURSOR.INIT
143155 . 146855) (\CURSORDESTINATION 146857 . 149175) (\SOFTCURSORUP 149177 . 154431) (
\SOFTCURSORUPCURRENT 154433 . 161469) (\SOFTCURSORPOSITION 161471 . 162236) (\SOFTCURSORDOWN 162238 .
162946) (CURSORPROP 162948 . 163290) (GETCURSORPROP 163292 . 163480) (PUTCURSORPROP 163482 . 164637) (
\CURSORBITSPERPIXEL 164639 . 166755) (\CURSORIMAGEPROPNAME 166757 . 166981) (\CURSORMASKPROPNAME
166983 . 167203)) (167206 185156 (CURSORCREATE 167216 . 169891) (CURSOR 169893 . 171705) (
\CURSOR-VALID-P 171707 . 172794) (\CURSORUP 172796 . 174511) (\CURSORPOSITION 174513 . 177041) (
\CURSORDOWN 177043 . 177276) (ADJUSTCURSORPOSITION 177278 . 177856) (CURSORPOSITION 177858 . 179400) (
CURSORSCREEN 179402 . 180058) (CURSOREXIT 180060 . 181451) (FLIPCURSOR 181453 . 182579) (FLIPCURSORBAR
182581 . 183561) (LASTMOUSEX 183563 . 183817) (LASTMOUSEY 183819 . 184073) (CREATEPOSITION 184075 .
184281) (POSITIONP 184283 . 184567) (CURSORHOTSPOT 184569 . 185154)) (186394 187942 (GETMOUSESTATE
186404 . 187063) (\EVENTKEYS 187065 . 187940)) (194369 195165 (MACHINETYPE 194379 . 194779) (
SETMAINTPANEL 194781 . 195163)) (195195 196334 (BEEPON 195205 . 195858) (BEEPOFF 195860 . 196332)) (
196785 197048 (WITHOUT-INTERRUPTS 196795 . 197046)))))
STOP

Binary file not shown.

View File

@@ -1,11 +1,11 @@
(DEFINE-FILE-INFO PACKAGE "INTERLISP" READTABLE "INTERLISP" BASE 10)
(FILECREATED "16-Aug-2021 23:40:36" 
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>TTYIN.;12 329478
(FILECREATED "27-Aug-2021 17:02:43" 
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>TTYIN.;19 329093
changes to%: (FNS \TTYIN.RPEOF)
changes to%: (VARS TTYINCOMS)
previous date%: "14-Aug-2021 10:55:39"
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>TTYIN.;10)
previous date%: "27-Aug-2021 16:49:59"
{DSK}<Users>kaplan>Local>medley3.5>git-medley>sources>TTYIN.;18)
(* ; "
@@ -80,8 +80,10 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
 "This is so that you can (MOVD 'TTYINPROMPTFORWORD 'PROMPTFORWORD) and not die")
(P (MOVD? 'PROMPTFORWORD 'NON-TTYIN-PROMPTFORWORD NIL T]
(DECLARE%: DOEVAL@COMPILE DONTCOPY (COMS * TTCOMPILETIME))
(INITVARS (DORADO.RESTORE.BUF.CODES '(194))
(TTYIN.RESTORE.BUF.CODES '(516 530))
(* ;; " The DORADO branch is deprecated (DORADO.RESTORE.BUF.CODES (CHARCODE (%"#B%")))")
(INITVARS (TTYIN.RESTORE.BUF.CODES (CHARCODE ("Function,^D" "Function,^R")))
(TTYINBUFFER)
(?ACTIVATEFLG T)
(EDITPREFIXCHAR)
@@ -375,7 +377,7 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(SETQ TTYINBUFFER \TTYINSTATE])
(TTYIN1
[LAMBDA NIL (* ; "Edited 13-Jun-2021 09:55 by rmk:")
[LAMBDA NIL (* ; "Edited 27-Aug-2021 16:27 by rmk:")
(* ;;; "The main moby subfn of TTYIN. Is errorset protected in caller")
@@ -782,14 +784,17 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
((CHARCODE ^W) (* ; "^W delete last word")
(TTDELETEWORD))
(COND
([MEMB CHAR (OR \RESTOREBUFCODES (SETQ \RESTOREBUFCODES
(APPEND (AND (EQ (MACHINETYPE)
'DORADO)
DORADO.RESTORE.BUF.CODES)
TTYIN.RESTORE.BUF.CODES]
((MEMB CHAR (OR \RESTOREBUFCODES (SETQ \RESTOREBUFCODES
TTYIN.RESTORE.BUF.CODES)))
(* ;; "One of the characters we interpret as %"restore last buffer%". Recomputed after exit in case we change machine. The dorado code is a perfectly good charset 0 code, so don't usually want to usurp it.")
(* We aren't supporting Dorado,
 whose original code maps on to the
 acute accent (SETQ \RESTOREBUFCODES
 (APPEND (AND (EQ (MACHINETYPE)
 (QUOTE DORADO))
 DORADO.RESTORE.BUF.CODES)
 TTYIN.RESTORE.BUF.CODES)))
(RESTOREBUF))
[(> CHAR 32) (* ; "not a control char")
(ADDCHAR (COND
@@ -3109,9 +3114,9 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(T (BACKWARD.DELETE.TO TAIL])
(TTECHO.TO.FILE
[LAMBDA (FILE DRIBBLING) (* ; "Edited 24-May-91 11:11 by jds")
[LAMBDA (FILE DRIBBLING) (* ; "Edited 27-Aug-2021 16:45 by rmk:")
(* ;;; "Echos input to FILE. If DRIBBLING is true, the prompts are also echoed")
(* ;;; "Echos input to FILE. If DRIBBLING is true, the prompts are also echoed")
(for (STREAM _ (GETSTREAM FILE 'OUTPUT))
(LINE _ \FIRSTLINE)
@@ -3119,7 +3124,7 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
X CH END do (COND
([AND DRIBBLING (SETQ X (COND
(FIRSTIME
(* ; "Print the first prompt")
(* ; "Print the first prompt")
(SETQ FIRSTIME NIL)
(AND \PROMPT1
(NOT (EQMEMB 'NOPROMPT OPTIONS))
@@ -3130,20 +3135,20 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(SETQ X (fetch (LINE START) of LINE))
(until (EQ X END) do [COND
([NOT (COMPLEXCHARP (SETQ CH (CAR X]
(BOUTCCODE STREAM CH))
(\OUTCHAR STREAM CH))
[(EQ (fetch (COMPLEXCHAR CPXREALCHAR)
of CH)
(CHARCODE SPACE))
(* ;; "pseudo-tab kludge: instead of printing the 'real' character, ignore it and print only its padding spaces")
(* ;; "pseudo-tab kludge: instead of printing the 'real' character, ignore it and print only its padding spaces")
(FRPTQ (fetch (COMPLEXCHAR CPXNCHARS)
of CH)
(BOUTCCODE STREAM (CHARCODE SPACE]
(T (BOUTCCODE STREAM (fetch
(COMPLEXCHAR
CPXREALCHAR)
of CH]
(\OUTCHAR STREAM (CHARCODE SPACE]
(T (\OUTCHAR STREAM (fetch (COMPLEXCHAR
CPXREALCHAR
)
of CH]
(SETQ X (TTNEXTCHAR X)))
(SETQ LINE (fetch (LINE NEXTLINE) of LINE))
(COND
@@ -3151,12 +3156,12 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
END))
(NOT \PROMPTFORWORD))
(* ;; "Don't terpri on overflow line, since user didn't; except always do it to dribblefile, since that's what's on the screen. Promptforword-style input doesn't have terminating cr.")
(* ;; "Don't terpri on overflow line, since user didn't; except always do it to dribblefile, since that's what's on the screen. Promptforword-style input doesn't have terminating cr.")
(TERPRI FILE)))
repeatwhile (AND LINE (OR (EQ END \ENDBUFFER)
(PROGN (* ;
 "Avoid echoing the terminating empty line, except when it is an empty overflow line")
(PROGN (* ;
 "Avoid echoing the terminating empty line, except when it is an empty overflow line")
(NEQ (fetch (LINE START) of LINE)
\ENDBUFFER])
@@ -3243,38 +3248,34 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(RETURN LINE])
(TTLOADBUF
[LAMBDA (BUF) (* ; "Edited 8-Feb-88 12:54 by bvm:")
(* ;; "BUF is a list, a la READBUF, which is loaded into our character buffer, using DCHCON to convert the s-expressions therein to char codes. If we are READING, then uses PRIN2 pnames, i.e. includes escape chars and such stuff. Alternatively, BUF may be a string, in which case its contents are also loaded into the buffer, a la BKSYSBUF, and the setting of \READING is irrelevant")
[LAMBDA (BUF) (* ; "Edited 27-Aug-2021 16:47 by rmk:")
(* ;; "BUF is a list, a la READBUF, which is loaded into our character buffer, using DCHCON to convert the s-expressions therein to char codes. If we are READING, then uses PRIN2 pnames, i.e. includes escape chars and such stuff. Alternatively, BUF may be a string, in which case its contents are also loaded into the buffer, a la BKSYSBUF, and the setting of \READING is irrelevant")
(COND
[(EQ (CAR (LISTP BUF))
HISTSTR1) (* ;
 "read from file. BUF is (<histstr1> (file start . end))")
(SETQ BUF (CADR BUF))
(SETFILEPTR (CAR BUF)
(CADR BUF))
(bind CHAR NEXTCH (STREAM _ (GETSTREAM (CAR BUF)
'INPUT))
(END _ (CDDR BUF)) while (< (GETFILEPTR STREAM)
END)
END)
do (* ; "Read another character. Unfortunately, we have to go by file pointer to determine end, since stream could have ns chars in it")
(COND
((NEQ (SETQ CHAR (BINCCODE STREAM))
((NEQ (SETQ CHAR (\INCCODE.EOLC STREAM))
(CHARCODE CR))
(ADDNAKEDCHAR CHAR T))
(T (* ; "eat up lf after the cr")
(COND
([OR (>= (GETFILEPTR STREAM)
END)
(PROGN (SETQ NEXTCH (BINCCODE STREAM))
(PROGN (SETQ NEXTCH (\INCCODE.EOLC STREAM))
(AND (EQ NEXTCH (CHARCODE LF))
(>= (GETFILEPTR STREAM)
END] (* ; "Ignore final CR")
(RETURN)))
(ADDNAKEDCHAR CHAR)
(COND
@@ -3285,25 +3286,21 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
((AND (LISTP BUF)
(SETQ START (FMEMB HISTSTR0 BUF))) (* ;
 "HISTSTR0 is a marker used by lispx to denote end of line")
(FRPLACD (NLEFT BUF 1 START]
(SETQ START (DCHCON BUF (CDR \ENDBUFFER)
(AND (LISTP BUF)
\READING)
RDTBL)) (* ;
 "Use our own buffer as a scratchlist for DCHCON as long as it's lying around anyway.")
[COND
((LISTP BUF) (* ;
 "Remove the surrounding parens from the outer list")
(SETQ END (NLEFT (SETQ START (CDR START))
1]
(* ;; "now detach the result from our buffer to avoid conflict of interest. If DCHCON found our scratchlist inadequate, START will not be a tail of \ENDBUFFER so the NLEFT below comes out NIL, which is also fine")
(* ;; "now detach the result from our buffer to avoid conflict of interest. If DCHCON found our scratchlist inadequate, START will not be a tail of \ENDBUFFER so the NLEFT below comes out NIL, which is also fine")
(FRPLACD (NLEFT \ENDBUFFER 1 START)) (* ; "Now unread the CHCON list.")
(READFROMBUF START END])
(TTNEXTLINE
@@ -3838,16 +3835,14 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(SETQ BUF (CDR BUF)) finally (RETURN BUF])
(TTYIN.READ
[LAMBDA (FINALCHAR DONTREAD STREAM) (* ; "Edited 16-Jan-88 17:58 by bvm:")
[LAMBDA (FINALCHAR DONTREAD STREAM) (* ; "Edited 27-Aug-2021 16:43 by rmk:")
(* ;;; "Process buffer for reading. FINALCHAR is what prompted us to terminate the call to TTYIN and is not in the buffer. If DONTREAD is true, then STREAM is the line buffer and we are acting as \FILLBUFFER -- otherwise, STREAM is our own scratch stream, with an eof fn that returns right paren; we read the buffer and return a list of expressions")
(LET (LASTC BUTLASTC)
(while (NEQ \BUFFER \ENDBUFFER) do (SETQ BUTLASTC LASTC)
(* ; "Fill the buffer")
(BOUTCCODE STREAM (SETQ LASTC (FIRSTCHAR \BUFFER))
)
(\OUTCHAR STREAM (SETQ LASTC (FIRSTCHAR \BUFFER)))
(SETQ \BUFFER (TTNEXTCHAR \BUFFER)))
(COND
((AND DONTREAD (SELCHARQ FINALCHAR
@@ -3859,45 +3854,37 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
ESCAPE.RC))
(* ;
 "If it ended in a quoted right paren, then it's just like any other character")
T)
((EQ (\SYNCODE \RDTBLSA (CHARCODE %]))
RIGHTBRACKET.RC)
(* ;; "Line ended in paren. Change to right bracket so READLINE doesn't get confused. Only do this if ] really is right bracket!")
(\SETFILEPTR STREAM (IDIFFERENCE (\GETFILEPTR
STREAM)
(STREAMBYTESPERCHAR
STREAM)))
(BOUTCCODE STREAM (CHARCODE %]))
(* ;; "Line ended in paren. Change to right bracket so READLINE doesn't get confused. Only do this if ] really is right bracket!")
(\BACKCCODE STREAM)
(\OUTCHAR STREAM (CHARCODE %]))
NIL)))
T))
((%) %])
NIL)
T)) (* ;
 "Print FINALCHAR unless terminator was EOL and line already ended in a closing paren or bracket")
(BOUTCCODE STREAM FINALCHAR)))
(\OUTCHAR STREAM FINALCHAR)))
(\SETEOFPTR STREAM (\GETFILEPTR STREAM))
(\SETFILEPTR STREAM 0)
(COND
(DONTREAD (* ;
 "STREAM = \LINEBUF.OFD and caller will take care of reading buf")
(AND (EQ STREAM \LINEBUF.OFD)
(replace (LINEBUFFER LINEBUFSTATE) of STREAM with READING.LBS))
T)
(T (* ;
 "Read from buffer until it's empty")
(PROG1 (bind TERM while [AND (SKIPSEPRS STREAM)
(SETQ TERM (NLSETQ (READ STREAM]
collect (CAR TERM))
(\SETFILEPTR STREAM 0) (* ;
(\SETFILEPTR STREAM 0) (* ;
 "Now clear the stream so nobody reads extra garbage after us")
(\SETEOFPTR STREAM 0])
(\SETEOFPTR STREAM 0))])
)
@@ -5336,7 +5323,7 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(T (GIVE.TTY.PROCESS WINDOW])
(TTYINREADP
[LAMBDA (FLG) (* ; "Edited 14-Apr-87 00:25 by bvm:")
[LAMBDA (FLG) (* ; "Edited 27-Aug-2021 16:49 by rmk:")
(* ;;; "Intended to replace LISPXREADP. Does the right thing when READBUF has just a <cr> in it")
@@ -5345,7 +5332,7 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
HISTSTR0)
FLG))
((NOT (LINEBUFFER-EOFP \LINEBUF.OFD))
(OR FLG (NEQ (PEEKBINCCODE \LINEBUF.OFD)
(OR FLG (NEQ (\PEEKCCODE \LINEBUF.OFD)
(CHARCODE EOL])
(TTYINREAD
@@ -5800,8 +5787,8 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(RPAQQ TTYINMACROS (TYPEAHEAD? AT.END.OF.BUF AT.END.OF.LINE AT.START.OF.BUF AT.START.OF.LINE
BEFOREBUF BREAK.OR.SEPRP DISPLAYTERMP EMPTY.BUFFER EMPTY.LINE EQPOS
NEQPOS INPART ON.FIRST.LINE ON.LAST.LINE METACHARP NONMETACHARBITS
METACHAR COMPLEXCHARP STREAMBYTESPERCHAR SPACEP TTBOUT TTNEXTCHAR
BOUTCCODE PEEKBINCCODE BINCCODE WORDSEPRP FCHARWIDTH FIRSTCHAR))
METACHAR COMPLEXCHARP SPACEP TTBOUT TTNEXTCHAR WORDSEPRP FCHARWIDTH
FIRSTCHAR))
(DECLARE%: EVAL@COMPILE
(PUTPROPS TYPEAHEAD? MACRO (NIL (\SYSBUFP)))
@@ -5865,12 +5852,6 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(PUTPROPS COMPLEXCHARP MACRO (= . LISTP))
(PUTPROPS STREAMBYTESPERCHAR MACRO ((STREAM)
(COND
((\RUNCODED STREAM)
1)
(T 2))))
(PUTPROPS SPACEP MACRO [(CHAR)
(FMEMB CHAR (CHARCODE (SPACE TAB CR])
@@ -5886,13 +5867,6 @@ Copyright (c) 1982-1988, 1990-1991, 2021 by Venue & Xerox Corporation.
(PUTPROPS TTNEXTCHAR MACRO (= . CDR))
(PUTPROPS BOUTCCODE MACRO (OPENLAMBDA (STREAM CHAR)
(PRINTCCODE CHAR STREAM)))
(PUTPROPS PEEKBINCCODE MACRO (= . PEEKCCODE))
(PUTPROPS BINCCODE MACRO (= . READCCODE))
(PUTPROPS WORDSEPRP DMACRO [OPENLAMBDA (X)
(OR (EQ (\SYNCODE \PRIMTERMSA X)
WORDSEPR.TC)
@@ -6008,9 +5982,12 @@ DONTEVAL@LOAD EVAL@COMPILE
)
)
(RPAQ? DORADO.RESTORE.BUF.CODES '(194))
(RPAQ? TTYIN.RESTORE.BUF.CODES '(516 530))
(* ;; " The DORADO branch is deprecated (DORADO.RESTORE.BUF.CODES (CHARCODE (%"#B%")))")
(RPAQ? TTYIN.RESTORE.BUF.CODES (CHARCODE ("Function,^D" "Function,^R")))
(RPAQ? TTYINBUFFER )
@@ -6099,62 +6076,62 @@ DONTEVAL@LOAD EVAL@COMPILE
(PUTPROPS TTYIN COPYRIGHT ("Venue & Xerox Corporation" 1982 1983 1984 1985 1986 1987 1988 1990 1991
2021))
(DECLARE%: DONTCOPY
(FILEMAP (NIL (7706 207384 (TTYIN 7716 . 20949) (TTYIN.SETUP 20951 . 24027) (TTYIN.CLEANUP 24029 .
24857) (TTYIN1 24859 . 51377) (TTYIN1RESTART 51379 . 52643) (TTYIN.FINISH 52645 . 62062) (
TTYIN.BALANCE 62064 . 63190) (ADDCHAR 63192 . 65378) (TTMAKECOMPLEXCHAR 65380 . 65854) (ADDNAKEDCHAR
65856 . 67366) (TTADDTAB 67368 . 68303) (ADJUSTLINE 68305 . 82216) (ADJUSTLINE.AND.RESTORE 82218 .
82656) (AT.END.OF.SCREEN 82658 . 82946) (AT.END.OF.TEXT 82948 . 83403) (AUTOCR? 83405 . 83879) (
BACKSKREAD 83881 . 88466) (BACKWARD.DELETE.TO 88468 . 88650) (BREAKLINE 88652 . 90919) (BUFTAILP 90921
. 91239) (CHECK.MARGIN 91241 . 91864) (CLEAR.LINE? 91866 . 92159) (CURRENT.WORD 92161 . 94561) (
DELETE.TO.END 94563 . 95282) (DELETELINE 95284 . 98241) (DELETETO 98243 . 100065) (DELETETO1 100067 .
101410) (DO.EDIT.COMMAND 101412 . 118731) (DO.EDIT.PP 118733 . 121395) (TTDOTABS 121397 . 122767) (
EDITCOLUMN 122769 . 123225) (EDITNUMBERP 123227 . 123458) (END.DELETE.MODE 123460 . 123977) (ENDREAD?
123979 . 126414) (FIND.LINE 126416 . 127952) (FIND.LINE.BREAK 127954 . 128624) (FIND.MATCHING.QUOTE
128626 . 129471) (FIND.NEXT.WORD 129473 . 130852) (FIND.NON.SPACE 130854 . 131127) (FIND.START.OF.WORD
131129 . 131492) (FORWARD.DELETE.TO 131494 . 133716) (GO.TO.ADDRESSING 133718 . 134674) (
GO.TO.FREELINE 134676 . 135257) (GO.TO.RELATIVE 135259 . 136039) (INIT.CURSOR 136041 . 136938) (
INSERT.NODE 136940 . 137462) (INSERTLINE 137464 . 138968) (KILL.LINES 138970 . 139508) (KILLSEGMENT
139510 . 140633) (L-CASECODE 140635 . 140796) (MOVE.BACK.TO 140798 . 141027) (MOVE.FORWARD.TO 141029
. 141450) (MOVE.TO.LINE 141452 . 142367) (MOVE.TO.NEXT.LINE 142369 . 142639) (MOVE.TO.START.OF.WORD
142641 . 143405) (MOVE.TO.WHEREVER 143407 . 143630) (NTH.COLUMN.OF 143632 . 143963) (
NTH.RELATIVE.COLUMN.OF 143965 . 145265) (OVERFLOW? 145267 . 146215) (OVERFLOWLINE? 146217 . 146543) (
PREVLINE 146545 . 147725) (PREVWORD 147727 . 149868) (PROPERTAILP 149870 . 150077) (READFROMBUF 150079
. 152668) (RENUMBER.LINES 152670 . 153063) (RESTORE.CURSOR 153065 . 153219) (RESTOREBUF 153221 .
155405) (RETYPE.BUFFER 155407 . 157670) (SAVE.CURSOR 157672 . 157844) (SCANBACK 157846 . 159204) (
SCANFORWARD 159206 . 160074) (SCRATCHCONS 160076 . 160678) (SEGMENT.LENGTH 160680 . 161216) (
SEGMENT.BIT.LENGTH 161218 . 161825) (SETLASTC 161827 . 162124) (SETTAIL? 162126 . 162942) (
SHOW.MATCHING.PAREN 162944 . 165444) (SKIP/ZAP 165446 . 167925) (START.NEW.LINE 167927 . 168259) (
START.OF.PARAGRAPH? 168261 . 168642) (TTADJUSTWORD 168644 . 169818) (TTBIN 169820 . 171026) (
TTBITWIDTH 171028 . 171177) (TTCRLF 171179 . 171386) (TTCRLF.ACCOUNT 171388 . 172028) (TTDELETECHAR
172030 . 173174) (TTDELETELINE 173176 . 175124) (TTDELETEWORD 175126 . 175794) (TTECHO.TO.FILE 175796
. 179344) (TTGIVEHELP 179346 . 180611) (TTGIVEHELP1 180613 . 181195) (TTGIVEHELP2 181197 . 181892) (
TTLASTLINE 181894 . 182262) (TTLOADBUF 182264 . 185778) (TTNEXTLINE 185780 . 186100) (TTNEXTNODE
186102 . 186341) (TTNLEFT 186343 . 187570) (TTNTH 187572 . 188031) (TTNTHLINE 188033 . 188565) (
TTPRIN1 188567 . 192380) (TTPRINSPACE 192382 . 192775) (TTPRIN1COMMENT 192777 . 193101) (TTPRIN2
193103 . 195422) (TTPROMPTCHAR 195424 . 196320) (TTRUBOUT 196322 . 197285) (TTUNREADBUF 197287 .
197696) (TTWAITFORINPUT 197698 . 201906) (TTYINSTRING 201908 . 202867) (TYPE.BUFFER 202869 . 204621) (
U-CASECODE 204623 . 204782) (U/L-CASE 204784 . 207382)) (207539 217252 (TTRATOM 207549 . 207993) (
TTREADLIST 207995 . 208362) (TTSKIPSEPR 208364 . 208738) (TTSKREAD 208740 . 213380) (TTYIN.READ 213382
. 217250)) (217299 237343 (FIND.MATCHING.WORD 217309 . 217837) (TTCOMPLETEWORD 217839 . 232267) (
WORD.MATCHES.BUFFER 232269 . 233829) (TTYIN.SHOW.?ALTERNATIVES 233831 . 237341)) (237377 255687 (
DO?CMD 237387 . 243289) (TTYIN.PRINTARGS 243291 . 254149) (TTYIN.READ?=ARGS 254151 . 254932) (
DO?CMD.ERRORHANDLER 254934 . 255685)) (255721 263794 (BEEP 255731 . 255906) (BITBLT.DELETE 255908 .
256555) (BITBLT.ERASE 256557 . 256742) (BITBLT.INSERT 256744 . 257055) (DO.CRLF 257057 . 257376) (
DO.DELETE.LINES 257378 . 258422) (DO.INSERT.LINE 258424 . 260358) (DO.LF 260360 . 260526) (
ERASE.TO.END.OF.LINE 260528 . 260853) (ERASE.TO.END.OF.PAGE 260855 . 261460) (INSERT.TEXT 261462 .
261966) (TTDELSECTION 261968 . 262266) (TTADJUSTWIDTH 262268 . 263132) (TTINSERTSECTION 263134 .
263473) (TTSETCURSOR 263475 . 263792)) (263829 268806 (TTYINBUFFERDEVICE 263839 . 265155) (
TTYINBUFFERSTREAM 265157 . 265919) (TTYINBUFFERBIN 265921 . 266457) (TTYINBUFFERPEEK 266459 . 266937)
(TTYINBUFFERREADP 266939 . 267194) (TTYINBUFFEREOFP 267196 . 267448) (TTYINBUFFERBACKPTR 267450 .
268002) (TTYINWORDRDTBL 268004 . 268804)) (268967 294524 (DO.MOUSE 268977 . 271734) (
DO.SHIFTED.SELECTION 271736 . 282175) (COPY.SEGMENT 282177 . 282381) (DELETE.LONG.SEGMENT 282383 .
282742) (DELETE.LONG.SEGMENT1 282744 . 285220) (INVERT.LONG.SEGMENT 285222 . 286251) (INVERT.SEGMENT
286253 . 287768) (BRACKET.CURRENT.WORD 287770 . 289304) (TTBEFOREPOS 289306 . 290036) (TTNEXTPOS
290038 . 290746) (TTRACKMOUSE 290748 . 294522)) (294668 299924 (SETREADFN 294678 . 295156) (
TTYINENTRYFN 295158 . 295583) (TTYINREADP 295585 . 296050) (TTYINREAD 296052 . 297446) (TTYINFIX
297448 . 298647) (CHARMACRO? 298649 . 299216) (TTYINMETA 299218 . 299346) (TTYIN.LASTINPUT 299348 .
299922)) (299925 308194 (TTYINEDIT 299935 . 302052) (SIMPLETEXTEDIT 302054 . 305098) (
SET.TTYINEDIT.WINDOW 305100 . 306251) (TTYIN.PPTOFILE 306253 . 308192)) (308252 308429 (
MAKE-TTSCRATCHFILE 308262 . 308427)) (308576 309709 (TTYIN.SCRATCHFILE 308586 . 309032) (\TTYIN.RPEOF
309034 . 309707)) (309921 313554 (TTYINPROMPTFORWORD 309931 . 313552)))))
(FILEMAP (NIL (7796 207831 (TTYIN 7806 . 21039) (TTYIN.SETUP 21041 . 24117) (TTYIN.CLEANUP 24119 .
24947) (TTYIN1 24949 . 51833) (TTYIN1RESTART 51835 . 53099) (TTYIN.FINISH 53101 . 62518) (
TTYIN.BALANCE 62520 . 63646) (ADDCHAR 63648 . 65834) (TTMAKECOMPLEXCHAR 65836 . 66310) (ADDNAKEDCHAR
66312 . 67822) (TTADDTAB 67824 . 68759) (ADJUSTLINE 68761 . 82672) (ADJUSTLINE.AND.RESTORE 82674 .
83112) (AT.END.OF.SCREEN 83114 . 83402) (AT.END.OF.TEXT 83404 . 83859) (AUTOCR? 83861 . 84335) (
BACKSKREAD 84337 . 88922) (BACKWARD.DELETE.TO 88924 . 89106) (BREAKLINE 89108 . 91375) (BUFTAILP 91377
. 91695) (CHECK.MARGIN 91697 . 92320) (CLEAR.LINE? 92322 . 92615) (CURRENT.WORD 92617 . 95017) (
DELETE.TO.END 95019 . 95738) (DELETELINE 95740 . 98697) (DELETETO 98699 . 100521) (DELETETO1 100523 .
101866) (DO.EDIT.COMMAND 101868 . 119187) (DO.EDIT.PP 119189 . 121851) (TTDOTABS 121853 . 123223) (
EDITCOLUMN 123225 . 123681) (EDITNUMBERP 123683 . 123914) (END.DELETE.MODE 123916 . 124433) (ENDREAD?
124435 . 126870) (FIND.LINE 126872 . 128408) (FIND.LINE.BREAK 128410 . 129080) (FIND.MATCHING.QUOTE
129082 . 129927) (FIND.NEXT.WORD 129929 . 131308) (FIND.NON.SPACE 131310 . 131583) (FIND.START.OF.WORD
131585 . 131948) (FORWARD.DELETE.TO 131950 . 134172) (GO.TO.ADDRESSING 134174 . 135130) (
GO.TO.FREELINE 135132 . 135713) (GO.TO.RELATIVE 135715 . 136495) (INIT.CURSOR 136497 . 137394) (
INSERT.NODE 137396 . 137918) (INSERTLINE 137920 . 139424) (KILL.LINES 139426 . 139964) (KILLSEGMENT
139966 . 141089) (L-CASECODE 141091 . 141252) (MOVE.BACK.TO 141254 . 141483) (MOVE.FORWARD.TO 141485
. 141906) (MOVE.TO.LINE 141908 . 142823) (MOVE.TO.NEXT.LINE 142825 . 143095) (MOVE.TO.START.OF.WORD
143097 . 143861) (MOVE.TO.WHEREVER 143863 . 144086) (NTH.COLUMN.OF 144088 . 144419) (
NTH.RELATIVE.COLUMN.OF 144421 . 145721) (OVERFLOW? 145723 . 146671) (OVERFLOWLINE? 146673 . 146999) (
PREVLINE 147001 . 148181) (PREVWORD 148183 . 150324) (PROPERTAILP 150326 . 150533) (READFROMBUF 150535
. 153124) (RENUMBER.LINES 153126 . 153519) (RESTORE.CURSOR 153521 . 153675) (RESTOREBUF 153677 .
155861) (RETYPE.BUFFER 155863 . 158126) (SAVE.CURSOR 158128 . 158300) (SCANBACK 158302 . 159660) (
SCANFORWARD 159662 . 160530) (SCRATCHCONS 160532 . 161134) (SEGMENT.LENGTH 161136 . 161672) (
SEGMENT.BIT.LENGTH 161674 . 162281) (SETLASTC 162283 . 162580) (SETTAIL? 162582 . 163398) (
SHOW.MATCHING.PAREN 163400 . 165900) (SKIP/ZAP 165902 . 168381) (START.NEW.LINE 168383 . 168715) (
START.OF.PARAGRAPH? 168717 . 169098) (TTADJUSTWORD 169100 . 170274) (TTBIN 170276 . 171482) (
TTBITWIDTH 171484 . 171633) (TTCRLF 171635 . 171842) (TTCRLF.ACCOUNT 171844 . 172484) (TTDELETECHAR
172486 . 173630) (TTDELETELINE 173632 . 175580) (TTDELETEWORD 175582 . 176250) (TTECHO.TO.FILE 176252
. 179811) (TTGIVEHELP 179813 . 181078) (TTGIVEHELP1 181080 . 181662) (TTGIVEHELP2 181664 . 182359) (
TTLASTLINE 182361 . 182729) (TTLOADBUF 182731 . 186225) (TTNEXTLINE 186227 . 186547) (TTNEXTNODE
186549 . 186788) (TTNLEFT 186790 . 188017) (TTNTH 188019 . 188478) (TTNTHLINE 188480 . 189012) (
TTPRIN1 189014 . 192827) (TTPRINSPACE 192829 . 193222) (TTPRIN1COMMENT 193224 . 193548) (TTPRIN2
193550 . 195869) (TTPROMPTCHAR 195871 . 196767) (TTRUBOUT 196769 . 197732) (TTUNREADBUF 197734 .
198143) (TTWAITFORINPUT 198145 . 202353) (TTYINSTRING 202355 . 203314) (TYPE.BUFFER 203316 . 205068) (
U-CASECODE 205070 . 205229) (U/L-CASE 205231 . 207829)) (207986 217345 (TTRATOM 207996 . 208440) (
TTREADLIST 208442 . 208809) (TTSKIPSEPR 208811 . 209185) (TTSKREAD 209187 . 213827) (TTYIN.READ 213829
. 217343)) (217392 237436 (FIND.MATCHING.WORD 217402 . 217930) (TTCOMPLETEWORD 217932 . 232360) (
WORD.MATCHES.BUFFER 232362 . 233922) (TTYIN.SHOW.?ALTERNATIVES 233924 . 237434)) (237470 255780 (
DO?CMD 237480 . 243382) (TTYIN.PRINTARGS 243384 . 254242) (TTYIN.READ?=ARGS 254244 . 255025) (
DO?CMD.ERRORHANDLER 255027 . 255778)) (255814 263887 (BEEP 255824 . 255999) (BITBLT.DELETE 256001 .
256648) (BITBLT.ERASE 256650 . 256835) (BITBLT.INSERT 256837 . 257148) (DO.CRLF 257150 . 257469) (
DO.DELETE.LINES 257471 . 258515) (DO.INSERT.LINE 258517 . 260451) (DO.LF 260453 . 260619) (
ERASE.TO.END.OF.LINE 260621 . 260946) (ERASE.TO.END.OF.PAGE 260948 . 261553) (INSERT.TEXT 261555 .
262059) (TTDELSECTION 262061 . 262359) (TTADJUSTWIDTH 262361 . 263225) (TTINSERTSECTION 263227 .
263566) (TTSETCURSOR 263568 . 263885)) (263922 268899 (TTYINBUFFERDEVICE 263932 . 265248) (
TTYINBUFFERSTREAM 265250 . 266012) (TTYINBUFFERBIN 266014 . 266550) (TTYINBUFFERPEEK 266552 . 267030)
(TTYINBUFFERREADP 267032 . 267287) (TTYINBUFFEREOFP 267289 . 267541) (TTYINBUFFERBACKPTR 267543 .
268095) (TTYINWORDRDTBL 268097 . 268897)) (269060 294617 (DO.MOUSE 269070 . 271827) (
DO.SHIFTED.SELECTION 271829 . 282268) (COPY.SEGMENT 282270 . 282474) (DELETE.LONG.SEGMENT 282476 .
282835) (DELETE.LONG.SEGMENT1 282837 . 285313) (INVERT.LONG.SEGMENT 285315 . 286344) (INVERT.SEGMENT
286346 . 287861) (BRACKET.CURRENT.WORD 287863 . 289397) (TTBEFOREPOS 289399 . 290129) (TTNEXTPOS
290131 . 290839) (TTRACKMOUSE 290841 . 294615)) (294761 300016 (SETREADFN 294771 . 295249) (
TTYINENTRYFN 295251 . 295676) (TTYINREADP 295678 . 296142) (TTYINREAD 296144 . 297538) (TTYINFIX
297540 . 298739) (CHARMACRO? 298741 . 299308) (TTYINMETA 299310 . 299438) (TTYIN.LASTINPUT 299440 .
300014)) (300017 308286 (TTYINEDIT 300027 . 302144) (SIMPLETEXTEDIT 302146 . 305190) (
SET.TTYINEDIT.WINDOW 305192 . 306343) (TTYIN.PPTOFILE 306345 . 308284)) (308344 308521 (
MAKE-TTSCRATCHFILE 308354 . 308519)) (308668 309801 (TTYIN.SCRATCHFILE 308678 . 309124) (\TTYIN.RPEOF
309126 . 309799)) (310013 313646 (TTYINPROMPTFORWORD 310023 . 313644)))))
STOP

Binary file not shown.

View File

@@ -0,0 +1,12 @@
EASTASIA:
The CDROM came with CJK cross reference mappings for standards such as KSC5601,
GB2312, JIS0208, etc. to Unicode 2.0.
However, these particular mappings are now obsolete and have been removed as per
this note from Unicode.org:
The entire former contents of this directory are obsolete and have been
moved to the OBSOLETE directory. The latest information may be found
in the Unihan data files in the latest Unicode Character Database.
August 1, 2001.
The current set of mappings are available from
https://unicode.org/Public/UNIDATA/Unihan.zip
The format of these files is given in https://unicode.org/reports/tr38/

View File

@@ -1,42 +1,71 @@
# 8859-1.TXT
# Date: 2015-12-02 20:19:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-1 (1987) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO/IEC 8859-1:1998 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-1 (1987) characters map into Unicode.
# ISO/IEC 8859-1:1998 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-1 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-1 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-1 order
# The entries are in ISO/IEC 8859-1 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +161,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x00A1 # INVERTED EXCLAMATION MARK
0xA2 0x00A2 # CENT SIGN

292
unicode/iso8859/8859-10.TXT Normal file
View File

@@ -0,0 +1,292 @@
# 8859-10.TXT
# Date: 2015-12-02 21:53:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-10:1998 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 1999 October 11 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-10:1998 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-10 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-10 order.
#
# Version history
# 1.0 version new.
# 1.1 corrected mistake in mapping of 0xA4
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0xA2 0x0112 # LATIN CAPITAL LETTER E WITH MACRON
0xA3 0x0122 # LATIN CAPITAL LETTER G WITH CEDILLA
0xA4 0x012A # LATIN CAPITAL LETTER I WITH MACRON
0xA5 0x0128 # LATIN CAPITAL LETTER I WITH TILDE
0xA6 0x0136 # LATIN CAPITAL LETTER K WITH CEDILLA
0xA7 0x00A7 # SECTION SIGN
0xA8 0x013B # LATIN CAPITAL LETTER L WITH CEDILLA
0xA9 0x0110 # LATIN CAPITAL LETTER D WITH STROKE
0xAA 0x0160 # LATIN CAPITAL LETTER S WITH CARON
0xAB 0x0166 # LATIN CAPITAL LETTER T WITH STROKE
0xAC 0x017D # LATIN CAPITAL LETTER Z WITH CARON
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x016A # LATIN CAPITAL LETTER U WITH MACRON
0xAF 0x014A # LATIN CAPITAL LETTER ENG
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x0105 # LATIN SMALL LETTER A WITH OGONEK
0xB2 0x0113 # LATIN SMALL LETTER E WITH MACRON
0xB3 0x0123 # LATIN SMALL LETTER G WITH CEDILLA
0xB4 0x012B # LATIN SMALL LETTER I WITH MACRON
0xB5 0x0129 # LATIN SMALL LETTER I WITH TILDE
0xB6 0x0137 # LATIN SMALL LETTER K WITH CEDILLA
0xB7 0x00B7 # MIDDLE DOT
0xB8 0x013C # LATIN SMALL LETTER L WITH CEDILLA
0xB9 0x0111 # LATIN SMALL LETTER D WITH STROKE
0xBA 0x0161 # LATIN SMALL LETTER S WITH CARON
0xBB 0x0167 # LATIN SMALL LETTER T WITH STROKE
0xBC 0x017E # LATIN SMALL LETTER Z WITH CARON
0xBD 0x2015 # HORIZONTAL BAR
0xBE 0x016B # LATIN SMALL LETTER U WITH MACRON
0xBF 0x014B # LATIN SMALL LETTER ENG
0xC0 0x0100 # LATIN CAPITAL LETTER A WITH MACRON
0xC1 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xC2 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xC3 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xC4 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0xC5 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0xC6 0x00C6 # LATIN CAPITAL LETTER AE
0xC7 0x012E # LATIN CAPITAL LETTER I WITH OGONEK
0xC8 0x010C # LATIN CAPITAL LETTER C WITH CARON
0xC9 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0xCA 0x0118 # LATIN CAPITAL LETTER E WITH OGONEK
0xCB 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xCC 0x0116 # LATIN CAPITAL LETTER E WITH DOT ABOVE
0xCD 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xCE 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xCF 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xD0 0x00D0 # LATIN CAPITAL LETTER ETH (Icelandic)
0xD1 0x0145 # LATIN CAPITAL LETTER N WITH CEDILLA
0xD2 0x014C # LATIN CAPITAL LETTER O WITH MACRON
0xD3 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xD4 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xD5 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xD6 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0xD7 0x0168 # LATIN CAPITAL LETTER U WITH TILDE
0xD8 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xD9 0x0172 # LATIN CAPITAL LETTER U WITH OGONEK
0xDA 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xDB 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xDC 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0xDD 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xDE 0x00DE # LATIN CAPITAL LETTER THORN (Icelandic)
0xDF 0x00DF # LATIN SMALL LETTER SHARP S (German)
0xE0 0x0101 # LATIN SMALL LETTER A WITH MACRON
0xE1 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0xE2 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0xE3 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0xE5 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0xE6 0x00E6 # LATIN SMALL LETTER AE
0xE7 0x012F # LATIN SMALL LETTER I WITH OGONEK
0xE8 0x010D # LATIN SMALL LETTER C WITH CARON
0xE9 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0xEA 0x0119 # LATIN SMALL LETTER E WITH OGONEK
0xEB 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0xEC 0x0117 # LATIN SMALL LETTER E WITH DOT ABOVE
0xED 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0xEE 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0xEF 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0xF0 0x00F0 # LATIN SMALL LETTER ETH (Icelandic)
0xF1 0x0146 # LATIN SMALL LETTER N WITH CEDILLA
0xF2 0x014D # LATIN SMALL LETTER O WITH MACRON
0xF3 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0xF4 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0xF5 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0xF6 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0xF7 0x0169 # LATIN SMALL LETTER U WITH TILDE
0xF8 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xF9 0x0173 # LATIN SMALL LETTER U WITH OGONEK
0xFA 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0xFB 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0xFC 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xFD 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xFE 0x00FE # LATIN SMALL LETTER THORN (Icelandic)
0xFF 0x0138 # LATIN SMALL LETTER KRA

286
unicode/iso8859/8859-11.TXT Normal file
View File

@@ -0,0 +1,286 @@
# 8859-11.TXT
# Date: 2015-12-02 21:55:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-11:2001 to Unicode
# Unicode version: 3.2
# Table version: 2.0
# Table format: Format A
# Date: 2002 October 7 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-11:2001 characters map into Unicode.
#
# ISO/IEC 8859-11:2001 is equivalent to TIS 620-2533 (1990) with
# the addition of 0xA0 NO-BREAK SPACE.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-11 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-11 order.
#
# Version history:
# 2002 October 7 Created
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0E01 # THAI CHARACTER KO KAI
0xA2 0x0E02 # THAI CHARACTER KHO KHAI
0xA3 0x0E03 # THAI CHARACTER KHO KHUAT
0xA4 0x0E04 # THAI CHARACTER KHO KHWAI
0xA5 0x0E05 # THAI CHARACTER KHO KHON
0xA6 0x0E06 # THAI CHARACTER KHO RAKHANG
0xA7 0x0E07 # THAI CHARACTER NGO NGU
0xA8 0x0E08 # THAI CHARACTER CHO CHAN
0xA9 0x0E09 # THAI CHARACTER CHO CHING
0xAA 0x0E0A # THAI CHARACTER CHO CHANG
0xAB 0x0E0B # THAI CHARACTER SO SO
0xAC 0x0E0C # THAI CHARACTER CHO CHOE
0xAD 0x0E0D # THAI CHARACTER YO YING
0xAE 0x0E0E # THAI CHARACTER DO CHADA
0xAF 0x0E0F # THAI CHARACTER TO PATAK
0xB0 0x0E10 # THAI CHARACTER THO THAN
0xB1 0x0E11 # THAI CHARACTER THO NANGMONTHO
0xB2 0x0E12 # THAI CHARACTER THO PHUTHAO
0xB3 0x0E13 # THAI CHARACTER NO NEN
0xB4 0x0E14 # THAI CHARACTER DO DEK
0xB5 0x0E15 # THAI CHARACTER TO TAO
0xB6 0x0E16 # THAI CHARACTER THO THUNG
0xB7 0x0E17 # THAI CHARACTER THO THAHAN
0xB8 0x0E18 # THAI CHARACTER THO THONG
0xB9 0x0E19 # THAI CHARACTER NO NU
0xBA 0x0E1A # THAI CHARACTER BO BAIMAI
0xBB 0x0E1B # THAI CHARACTER PO PLA
0xBC 0x0E1C # THAI CHARACTER PHO PHUNG
0xBD 0x0E1D # THAI CHARACTER FO FA
0xBE 0x0E1E # THAI CHARACTER PHO PHAN
0xBF 0x0E1F # THAI CHARACTER FO FAN
0xC0 0x0E20 # THAI CHARACTER PHO SAMPHAO
0xC1 0x0E21 # THAI CHARACTER MO MA
0xC2 0x0E22 # THAI CHARACTER YO YAK
0xC3 0x0E23 # THAI CHARACTER RO RUA
0xC4 0x0E24 # THAI CHARACTER RU
0xC5 0x0E25 # THAI CHARACTER LO LING
0xC6 0x0E26 # THAI CHARACTER LU
0xC7 0x0E27 # THAI CHARACTER WO WAEN
0xC8 0x0E28 # THAI CHARACTER SO SALA
0xC9 0x0E29 # THAI CHARACTER SO RUSI
0xCA 0x0E2A # THAI CHARACTER SO SUA
0xCB 0x0E2B # THAI CHARACTER HO HIP
0xCC 0x0E2C # THAI CHARACTER LO CHULA
0xCD 0x0E2D # THAI CHARACTER O ANG
0xCE 0x0E2E # THAI CHARACTER HO NOKHUK
0xCF 0x0E2F # THAI CHARACTER PAIYANNOI
0xD0 0x0E30 # THAI CHARACTER SARA A
0xD1 0x0E31 # THAI CHARACTER MAI HAN-AKAT
0xD2 0x0E32 # THAI CHARACTER SARA AA
0xD3 0x0E33 # THAI CHARACTER SARA AM
0xD4 0x0E34 # THAI CHARACTER SARA I
0xD5 0x0E35 # THAI CHARACTER SARA II
0xD6 0x0E36 # THAI CHARACTER SARA UE
0xD7 0x0E37 # THAI CHARACTER SARA UEE
0xD8 0x0E38 # THAI CHARACTER SARA U
0xD9 0x0E39 # THAI CHARACTER SARA UU
0xDA 0x0E3A # THAI CHARACTER PHINTHU
0xDF 0x0E3F # THAI CURRENCY SYMBOL BAHT
0xE0 0x0E40 # THAI CHARACTER SARA E
0xE1 0x0E41 # THAI CHARACTER SARA AE
0xE2 0x0E42 # THAI CHARACTER SARA O
0xE3 0x0E43 # THAI CHARACTER SARA AI MAIMUAN
0xE4 0x0E44 # THAI CHARACTER SARA AI MAIMALAI
0xE5 0x0E45 # THAI CHARACTER LAKKHANGYAO
0xE6 0x0E46 # THAI CHARACTER MAIYAMOK
0xE7 0x0E47 # THAI CHARACTER MAITAIKHU
0xE8 0x0E48 # THAI CHARACTER MAI EK
0xE9 0x0E49 # THAI CHARACTER MAI THO
0xEA 0x0E4A # THAI CHARACTER MAI TRI
0xEB 0x0E4B # THAI CHARACTER MAI CHATTAWA
0xEC 0x0E4C # THAI CHARACTER THANTHAKHAT
0xED 0x0E4D # THAI CHARACTER NIKHAHIT
0xEE 0x0E4E # THAI CHARACTER YAMAKKAN
0xEF 0x0E4F # THAI CHARACTER FONGMAN
0xF0 0x0E50 # THAI DIGIT ZERO
0xF1 0x0E51 # THAI DIGIT ONE
0xF2 0x0E52 # THAI DIGIT TWO
0xF3 0x0E53 # THAI DIGIT THREE
0xF4 0x0E54 # THAI DIGIT FOUR
0xF5 0x0E55 # THAI DIGIT FIVE
0xF6 0x0E56 # THAI DIGIT SIX
0xF7 0x0E57 # THAI DIGIT SEVEN
0xF8 0x0E58 # THAI DIGIT EIGHT
0xF9 0x0E59 # THAI DIGIT NINE
0xFA 0x0E5A # THAI CHARACTER ANGKHANKHU
0xFB 0x0E5B # THAI CHARACTER KHOMUT

291
unicode/iso8859/8859-13.TXT Normal file
View File

@@ -0,0 +1,291 @@
# 8859-13.TXT
# Date: 2015-12-02 22:03:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-13:1998 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-13:1998 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-13 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-13 order.
#
# Version history
# 1.0 version: created
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x201D # RIGHT DOUBLE QUOTATION MARK
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A4 # CURRENCY SIGN
0xA5 0x201E # DOUBLE LOW-9 QUOTATION MARK
0xA6 0x00A6 # BROKEN BAR
0xA7 0x00A7 # SECTION SIGN
0xA8 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x0156 # LATIN CAPITAL LETTER R WITH CEDILLA
0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xAC 0x00AC # NOT SIGN
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x00AE # REGISTERED SIGN
0xAF 0x00C6 # LATIN CAPITAL LETTER AE
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x00B2 # SUPERSCRIPT TWO
0xB3 0x00B3 # SUPERSCRIPT THREE
0xB4 0x201C # LEFT DOUBLE QUOTATION MARK
0xB5 0x00B5 # MICRO SIGN
0xB6 0x00B6 # PILCROW SIGN
0xB7 0x00B7 # MIDDLE DOT
0xB8 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xB9 0x00B9 # SUPERSCRIPT ONE
0xBA 0x0157 # LATIN SMALL LETTER R WITH CEDILLA
0xBB 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xBC 0x00BC # VULGAR FRACTION ONE QUARTER
0xBD 0x00BD # VULGAR FRACTION ONE HALF
0xBE 0x00BE # VULGAR FRACTION THREE QUARTERS
0xBF 0x00E6 # LATIN SMALL LETTER AE
0xC0 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0xC1 0x012E # LATIN CAPITAL LETTER I WITH OGONEK
0xC2 0x0100 # LATIN CAPITAL LETTER A WITH MACRON
0xC3 0x0106 # LATIN CAPITAL LETTER C WITH ACUTE
0xC4 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0xC5 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0xC6 0x0118 # LATIN CAPITAL LETTER E WITH OGONEK
0xC7 0x0112 # LATIN CAPITAL LETTER E WITH MACRON
0xC8 0x010C # LATIN CAPITAL LETTER C WITH CARON
0xC9 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0xCA 0x0179 # LATIN CAPITAL LETTER Z WITH ACUTE
0xCB 0x0116 # LATIN CAPITAL LETTER E WITH DOT ABOVE
0xCC 0x0122 # LATIN CAPITAL LETTER G WITH CEDILLA
0xCD 0x0136 # LATIN CAPITAL LETTER K WITH CEDILLA
0xCE 0x012A # LATIN CAPITAL LETTER I WITH MACRON
0xCF 0x013B # LATIN CAPITAL LETTER L WITH CEDILLA
0xD0 0x0160 # LATIN CAPITAL LETTER S WITH CARON
0xD1 0x0143 # LATIN CAPITAL LETTER N WITH ACUTE
0xD2 0x0145 # LATIN CAPITAL LETTER N WITH CEDILLA
0xD3 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xD4 0x014C # LATIN CAPITAL LETTER O WITH MACRON
0xD5 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xD6 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0xD7 0x00D7 # MULTIPLICATION SIGN
0xD8 0x0172 # LATIN CAPITAL LETTER U WITH OGONEK
0xD9 0x0141 # LATIN CAPITAL LETTER L WITH STROKE
0xDA 0x015A # LATIN CAPITAL LETTER S WITH ACUTE
0xDB 0x016A # LATIN CAPITAL LETTER U WITH MACRON
0xDC 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0xDD 0x017B # LATIN CAPITAL LETTER Z WITH DOT ABOVE
0xDE 0x017D # LATIN CAPITAL LETTER Z WITH CARON
0xDF 0x00DF # LATIN SMALL LETTER SHARP S (German)
0xE0 0x0105 # LATIN SMALL LETTER A WITH OGONEK
0xE1 0x012F # LATIN SMALL LETTER I WITH OGONEK
0xE2 0x0101 # LATIN SMALL LETTER A WITH MACRON
0xE3 0x0107 # LATIN SMALL LETTER C WITH ACUTE
0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0xE5 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0xE6 0x0119 # LATIN SMALL LETTER E WITH OGONEK
0xE7 0x0113 # LATIN SMALL LETTER E WITH MACRON
0xE8 0x010D # LATIN SMALL LETTER C WITH CARON
0xE9 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0xEA 0x017A # LATIN SMALL LETTER Z WITH ACUTE
0xEB 0x0117 # LATIN SMALL LETTER E WITH DOT ABOVE
0xEC 0x0123 # LATIN SMALL LETTER G WITH CEDILLA
0xED 0x0137 # LATIN SMALL LETTER K WITH CEDILLA
0xEE 0x012B # LATIN SMALL LETTER I WITH MACRON
0xEF 0x013C # LATIN SMALL LETTER L WITH CEDILLA
0xF0 0x0161 # LATIN SMALL LETTER S WITH CARON
0xF1 0x0144 # LATIN SMALL LETTER N WITH ACUTE
0xF2 0x0146 # LATIN SMALL LETTER N WITH CEDILLA
0xF3 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0xF4 0x014D # LATIN SMALL LETTER O WITH MACRON
0xF5 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0xF6 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0xF7 0x00F7 # DIVISION SIGN
0xF8 0x0173 # LATIN SMALL LETTER U WITH OGONEK
0xF9 0x0142 # LATIN SMALL LETTER L WITH STROKE
0xFA 0x015B # LATIN SMALL LETTER S WITH ACUTE
0xFB 0x016B # LATIN SMALL LETTER U WITH MACRON
0xFC 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xFD 0x017C # LATIN SMALL LETTER Z WITH DOT ABOVE
0xFE 0x017E # LATIN SMALL LETTER Z WITH CARON
0xFF 0x2019 # RIGHT SINGLE QUOTATION MARK

293
unicode/iso8859/8859-14.TXT Normal file
View File

@@ -0,0 +1,293 @@
# 8859-14.TXT
# Date: 2015-12-02 22:05:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-14:1998 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Markus Kuhn <http://www.cl.cam.ac.uk/~mgk25/>
# Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-14:1998 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-14 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-14 order.
#
# Version history
# 1.0 version: created
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x1E02 # LATIN CAPITAL LETTER B WITH DOT ABOVE
0xA2 0x1E03 # LATIN SMALL LETTER B WITH DOT ABOVE
0xA3 0x00A3 # POUND SIGN
0xA4 0x010A # LATIN CAPITAL LETTER C WITH DOT ABOVE
0xA5 0x010B # LATIN SMALL LETTER C WITH DOT ABOVE
0xA6 0x1E0A # LATIN CAPITAL LETTER D WITH DOT ABOVE
0xA7 0x00A7 # SECTION SIGN
0xA8 0x1E80 # LATIN CAPITAL LETTER W WITH GRAVE
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x1E82 # LATIN CAPITAL LETTER W WITH ACUTE
0xAB 0x1E0B # LATIN SMALL LETTER D WITH DOT ABOVE
0xAC 0x1EF2 # LATIN CAPITAL LETTER Y WITH GRAVE
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x00AE # REGISTERED SIGN
0xAF 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xB0 0x1E1E # LATIN CAPITAL LETTER F WITH DOT ABOVE
0xB1 0x1E1F # LATIN SMALL LETTER F WITH DOT ABOVE
0xB2 0x0120 # LATIN CAPITAL LETTER G WITH DOT ABOVE
0xB3 0x0121 # LATIN SMALL LETTER G WITH DOT ABOVE
0xB4 0x1E40 # LATIN CAPITAL LETTER M WITH DOT ABOVE
0xB5 0x1E41 # LATIN SMALL LETTER M WITH DOT ABOVE
0xB6 0x00B6 # PILCROW SIGN
0xB7 0x1E56 # LATIN CAPITAL LETTER P WITH DOT ABOVE
0xB8 0x1E81 # LATIN SMALL LETTER W WITH GRAVE
0xB9 0x1E57 # LATIN SMALL LETTER P WITH DOT ABOVE
0xBA 0x1E83 # LATIN SMALL LETTER W WITH ACUTE
0xBB 0x1E60 # LATIN CAPITAL LETTER S WITH DOT ABOVE
0xBC 0x1EF3 # LATIN SMALL LETTER Y WITH GRAVE
0xBD 0x1E84 # LATIN CAPITAL LETTER W WITH DIAERESIS
0xBE 0x1E85 # LATIN SMALL LETTER W WITH DIAERESIS
0xBF 0x1E61 # LATIN SMALL LETTER S WITH DOT ABOVE
0xC0 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xC1 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xC2 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xC3 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xC4 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0xC5 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0xC6 0x00C6 # LATIN CAPITAL LETTER AE
0xC7 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0xC8 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xC9 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0xCA 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xCB 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xCC 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xCD 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xCE 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xCF 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xD0 0x0174 # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
0xD1 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0xD2 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xD3 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xD4 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xD5 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xD6 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0xD7 0x1E6A # LATIN CAPITAL LETTER T WITH DOT ABOVE
0xD8 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xD9 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xDA 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xDB 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xDC 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0xDD 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xDE 0x0176 # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
0xDF 0x00DF # LATIN SMALL LETTER SHARP S
0xE0 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0xE1 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0xE2 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0xE3 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0xE5 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0xE6 0x00E6 # LATIN SMALL LETTER AE
0xE7 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0xE8 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0xE9 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0xEA 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0xEB 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0xEC 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0xED 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0xEE 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0xEF 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0xF0 0x0175 # LATIN SMALL LETTER W WITH CIRCUMFLEX
0xF1 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0xF2 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0xF3 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0xF4 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0xF5 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0xF6 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0xF7 0x1E6B # LATIN SMALL LETTER T WITH DOT ABOVE
0xF8 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xF9 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0xFA 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0xFB 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0xFC 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xFD 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xFE 0x0177 # LATIN SMALL LETTER Y WITH CIRCUMFLEX
0xFF 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS

295
unicode/iso8859/8859-15.TXT Normal file
View File

@@ -0,0 +1,295 @@
# 8859-15.TXT
# Date: 2015-12-02 22:06:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-15:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Markus Kuhn <http://www.cl.cam.ac.uk/~mgk25/>
# Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-15:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-15 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-15 order.
#
# Version history
#
# Version history
# 1.0 version: created
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x00A1 # INVERTED EXCLAMATION MARK
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
0xA4 0x20AC # EURO SIGN
0xA5 0x00A5 # YEN SIGN
0xA6 0x0160 # LATIN CAPITAL LETTER S WITH CARON
0xA7 0x00A7 # SECTION SIGN
0xA8 0x0161 # LATIN SMALL LETTER S WITH CARON
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x00AA # FEMININE ORDINAL INDICATOR
0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xAC 0x00AC # NOT SIGN
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x00AE # REGISTERED SIGN
0xAF 0x00AF # MACRON
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x00B2 # SUPERSCRIPT TWO
0xB3 0x00B3 # SUPERSCRIPT THREE
0xB4 0x017D # LATIN CAPITAL LETTER Z WITH CARON
0xB5 0x00B5 # MICRO SIGN
0xB6 0x00B6 # PILCROW SIGN
0xB7 0x00B7 # MIDDLE DOT
0xB8 0x017E # LATIN SMALL LETTER Z WITH CARON
0xB9 0x00B9 # SUPERSCRIPT ONE
0xBA 0x00BA # MASCULINE ORDINAL INDICATOR
0xBB 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xBC 0x0152 # LATIN CAPITAL LIGATURE OE
0xBD 0x0153 # LATIN SMALL LIGATURE OE
0xBE 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xBF 0x00BF # INVERTED QUESTION MARK
0xC0 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xC1 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xC2 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xC3 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xC4 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0xC5 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0xC6 0x00C6 # LATIN CAPITAL LETTER AE
0xC7 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0xC8 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xC9 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0xCA 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xCB 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xCC 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xCD 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xCE 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xCF 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xD0 0x00D0 # LATIN CAPITAL LETTER ETH
0xD1 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0xD2 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xD3 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xD4 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xD5 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xD6 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0xD7 0x00D7 # MULTIPLICATION SIGN
0xD8 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xD9 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xDA 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xDB 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xDC 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0xDD 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xDE 0x00DE # LATIN CAPITAL LETTER THORN
0xDF 0x00DF # LATIN SMALL LETTER SHARP S
0xE0 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0xE1 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0xE2 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0xE3 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0xE5 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0xE6 0x00E6 # LATIN SMALL LETTER AE
0xE7 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0xE8 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0xE9 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0xEA 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0xEB 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0xEC 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0xED 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0xEE 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0xEF 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0xF0 0x00F0 # LATIN SMALL LETTER ETH
0xF1 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0xF2 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0xF3 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0xF4 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0xF5 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0xF6 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0xF7 0x00F7 # DIVISION SIGN
0xF8 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xF9 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0xFA 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0xFB 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0xFC 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xFD 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xFE 0x00FE # LATIN SMALL LETTER THORN
0xFF 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS

293
unicode/iso8859/8859-16.TXT Normal file
View File

@@ -0,0 +1,293 @@
# 8859-16.TXT
# Date: 2015-12-02 22:08:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO/IEC 8859-16:2001 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 2001 July 26 (header updated: 2015 December 02)
# Authors: Markus Kuhn <http://www.cl.cam.ac.uk/~mgk25/>
#
# Copyright (c) 1999-2001 Unicode, Inc. All Rights reserved.
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO/IEC 8859-16:2001 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO/IEC 8859-16 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO/IEC 8859-16 order.
#
# Version history
# 1.0 version: created
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0xA2 0x0105 # LATIN SMALL LETTER A WITH OGONEK
0xA3 0x0141 # LATIN CAPITAL LETTER L WITH STROKE
0xA4 0x20AC # EURO SIGN
0xA5 0x201E # DOUBLE LOW-9 QUOTATION MARK
0xA6 0x0160 # LATIN CAPITAL LETTER S WITH CARON
0xA7 0x00A7 # SECTION SIGN
0xA8 0x0161 # LATIN SMALL LETTER S WITH CARON
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x0218 # LATIN CAPITAL LETTER S WITH COMMA BELOW
0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xAC 0x0179 # LATIN CAPITAL LETTER Z WITH ACUTE
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x017A # LATIN SMALL LETTER Z WITH ACUTE
0xAF 0x017B # LATIN CAPITAL LETTER Z WITH DOT ABOVE
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x010C # LATIN CAPITAL LETTER C WITH CARON
0xB3 0x0142 # LATIN SMALL LETTER L WITH STROKE
0xB4 0x017D # LATIN CAPITAL LETTER Z WITH CARON
0xB5 0x201D # RIGHT DOUBLE QUOTATION MARK
0xB6 0x00B6 # PILCROW SIGN
0xB7 0x00B7 # MIDDLE DOT
0xB8 0x017E # LATIN SMALL LETTER Z WITH CARON
0xB9 0x010D # LATIN SMALL LETTER C WITH CARON
0xBA 0x0219 # LATIN SMALL LETTER S WITH COMMA BELOW
0xBB 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xBC 0x0152 # LATIN CAPITAL LIGATURE OE
0xBD 0x0153 # LATIN SMALL LIGATURE OE
0xBE 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xBF 0x017C # LATIN SMALL LETTER Z WITH DOT ABOVE
0xC0 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xC1 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xC2 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xC3 0x0102 # LATIN CAPITAL LETTER A WITH BREVE
0xC4 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0xC5 0x0106 # LATIN CAPITAL LETTER C WITH ACUTE
0xC6 0x00C6 # LATIN CAPITAL LETTER AE
0xC7 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0xC8 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xC9 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0xCA 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xCB 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xCC 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xCD 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xCE 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xCF 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xD0 0x0110 # LATIN CAPITAL LETTER D WITH STROKE
0xD1 0x0143 # LATIN CAPITAL LETTER N WITH ACUTE
0xD2 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xD3 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xD4 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xD5 0x0150 # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
0xD6 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0xD7 0x015A # LATIN CAPITAL LETTER S WITH ACUTE
0xD8 0x0170 # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
0xD9 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xDA 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xDB 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xDC 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0xDD 0x0118 # LATIN CAPITAL LETTER E WITH OGONEK
0xDE 0x021A # LATIN CAPITAL LETTER T WITH COMMA BELOW
0xDF 0x00DF # LATIN SMALL LETTER SHARP S
0xE0 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0xE1 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0xE2 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0xE3 0x0103 # LATIN SMALL LETTER A WITH BREVE
0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0xE5 0x0107 # LATIN SMALL LETTER C WITH ACUTE
0xE6 0x00E6 # LATIN SMALL LETTER AE
0xE7 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0xE8 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0xE9 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0xEA 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0xEB 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0xEC 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0xED 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0xEE 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0xEF 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0xF0 0x0111 # LATIN SMALL LETTER D WITH STROKE
0xF1 0x0144 # LATIN SMALL LETTER N WITH ACUTE
0xF2 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0xF3 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0xF4 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0xF5 0x0151 # LATIN SMALL LETTER O WITH DOUBLE ACUTE
0xF6 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0xF7 0x015B # LATIN SMALL LETTER S WITH ACUTE
0xF8 0x0171 # LATIN SMALL LETTER U WITH DOUBLE ACUTE
0xF9 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0xFA 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0xFB 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0xFC 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xFD 0x0119 # LATIN SMALL LETTER E WITH OGONEK
0xFE 0x021B # LATIN SMALL LETTER T WITH COMMA BELOW
0xFF 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS

View File

@@ -1,42 +1,71 @@
# 8859-2.TXT
# Date: 2015-12-02 21:34:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-2 (1987) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO 8859-2:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-2 (1987) characters map into Unicode.
# ISO/IEC 8859-2:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-2 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-2 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-2 order
# The entries are in ISO/IEC 8859-2 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +161,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0xA2 0x02D8 # BREVE

View File

@@ -1,42 +1,71 @@
# 8859-3.TXT
# Date: 2015-12-02 21:39:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-3 (1988) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO/IEC 8859-3:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-3 (1988) characters map into Unicode.
# ISO/IEC 8859-3:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-3 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-3 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-3 order
# The entries are in ISO/IEC 8859-3 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +161,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0126 # LATIN CAPITAL LETTER H WITH STROKE
0xA2 0x02D8 # BREVE

View File

@@ -1,42 +1,71 @@
# 8859-4.TXT
# Date: 2015-12-02 21:41:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-4 (1988) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO/IEC 8859-4:1998 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-4 (1988) characters map into Unicode.
# ISO/IEC 8859-4:1998 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-4 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-4 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-4 order
# The entries are in ISO/IEC 8859-4 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +161,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0xA2 0x0138 # LATIN SMALL LETTER KRA

View File

@@ -1,42 +1,71 @@
# 8859-5.TXT
# Date: 2015-12-02 21:43:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-5 (1988) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO 8859-5:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-5 (1988) characters map into Unicode.
# ISO/IEC 8859-5:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-5 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-5 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-5 order
# The entries are in ISO/IEC 8859-5 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +161,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0401 # CYRILLIC CAPITAL LETTER IO
0xA2 0x0402 # CYRILLIC CAPITAL LETTER DJE

View File

@@ -1,42 +1,73 @@
# 8859-6.TXT
# Date: 2015-12-02 21:44:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-6 (1987) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO 8859-6:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-6 (1987) characters map into Unicode.
# ISO/IEC 8859-6:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-6 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-6 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-6 order
# The entries are in ISO/IEC 8859-6 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 0x30..0x39 remapped to the ASCII digits (U+0030..U+0039) instead
# of the Arabic digits (U+0660..U+0669).
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -53,16 +84,16 @@
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0660 # ARABIC-INDIC DIGIT ZERO
0x31 0x0661 # ARABIC-INDIC DIGIT ONE
0x32 0x0662 # ARABIC-INDIC DIGIT TWO
0x33 0x0663 # ARABIC-INDIC DIGIT THREE
0x34 0x0664 # ARABIC-INDIC DIGIT FOUR
0x35 0x0665 # ARABIC-INDIC DIGIT FIVE
0x36 0x0666 # ARABIC-INDIC DIGIT SIX
0x37 0x0667 # ARABIC-INDIC DIGIT SEVEN
0x38 0x0668 # ARABIC-INDIC DIGIT EIGHT
0x39 0x0669 # ARABIC-INDIC DIGIT NINE
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
@@ -132,6 +163,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA4 0x00A4 # CURRENCY SIGN
0xAC 0x060C # ARABIC COMMA

View File

@@ -1,42 +1,81 @@
# 8859-7.TXT
# Date: 2015-12-02 21:47:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-7 (1987) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO 8859-7:2003 to Unicode
# Unicode version: 4.0
# Table version: 3.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 2003-Nov-12 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-7 (1987) characters map into Unicode.
# ISO 8859-7:2003 characters map into Unicode.
#
# ISO 8859-7:1987 is equivalent to ISO-IR-126, ELOT 928,
# and ECMA 118. ISO 8859-7:2003 adds two currency signs
# and one other character not in the earlier standard.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-7 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-7 order
# The entries are in ISO 8859-7 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version updates 0.1 version by adding mappings for all
# control characters.
# Remap 0xA1 to U+2018 (instead of 0x02BD) to match text of 8859-7
# Remap 0xA2 to U+2019 (instead of 0x02BC) to match text of 8859-7
#
# 2.0 version updates 1.0 version by adding mappings for the
# three newly added characters 0xA4, 0xA5, 0xAA.
#
# 3.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,14 +171,50 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x02BD # MODIFIER LETTER REVERSED COMMA
0xA2 0x02BC # MODIFIER LETTER APOSTROPHE
0xA1 0x2018 # LEFT SINGLE QUOTATION MARK
0xA2 0x2019 # RIGHT SINGLE QUOTATION MARK
0xA3 0x00A3 # POUND SIGN
0xA4 0x20AC # EURO SIGN
0xA5 0x20AF # DRACHMA SIGN
0xA6 0x00A6 # BROKEN BAR
0xA7 0x00A7 # SECTION SIGN
0xA8 0x00A8 # DIAERESIS
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x037A # GREEK YPOGEGRAMMENI
0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xAC 0x00AC # NOT SIGN
0xAD 0x00AD # SOFT HYPHEN

View File

@@ -1,42 +1,73 @@
# 8859-8.TXT
# Date: 2015-12-02 21:50:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-8 (1988) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO/IEC 8859-8:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 2000-Jan-03 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-8 (1988) characters map into Unicode.
# ISO/IEC 8859-8:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-8 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-8 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-8 order
# The entries are in ISO/IEC 8859-8 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# Version history
# 1.0 version updates 0.1 version by adding mappings for all
# control characters.
# 1.1 version updates to the published 8859-8:1999, correcting
# the mapping of 0xAF and adding mappings for LRM and RLM.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +163,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
@@ -146,7 +210,7 @@
0xAC 0x00AC # NOT SIGN
0xAD 0x00AD # SOFT HYPHEN
0xAE 0x00AE # REGISTERED SIGN
0xAF 0x203E # OVERLINE
0xAF 0x00AF # MACRON
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x00B2 # SUPERSCRIPT TWO
@@ -190,3 +254,6 @@
0xF8 0x05E8 # HEBREW LETTER RESH
0xF9 0x05E9 # HEBREW LETTER SHIN
0xFA 0x05EA # HEBREW LETTER TAV
0xFD 0x200E # LEFT-TO-RIGHT MARK
0xFE 0x200F # RIGHT-TO-LEFT MARK

View File

@@ -1,42 +1,73 @@
# 8859-9.TXT
# Date: 2015-12-02 21:51:00 GMT [KW]
# © 2015 Unicode®, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Name: ISO 8859-9 (1989) to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Name: ISO/IEC 8859-9:1999 to Unicode
# Unicode version: 3.0
# Table version: 2.0
# Table format: Format A
# Date: 16 January 1995
# Authors: Tim Greenwood <greenwood@r2me2.enet.dec.com>
# John H. Jenkins <John_Jenkins@taligent.com>
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Date: 1999 July 27 (header updated: 2015 December 02)
# Authors: Ken Whistler <ken@unicode.org>
#
# General notes:
#
# This table contains the data the Unicode Consortium has on how
# ISO 8859-9 (1989) characters map into Unicode.
# ISO/IEC 8859-9:1999 characters map into Unicode.
#
# Format: Three tab-separated columns
# Column #1 is the ISO 8859-9 code (in hex as 0xXX)
# Column #1 is the ISO/IEC 8859-9 code (in hex as 0xXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 the Unicode name (follows a comment sign, '#')
#
# The entries are in ISO 8859-9 order
# The entries are in ISO/IEC 8859-9 order.
#
# Any comments or problems, contact <John_Jenkins@taligent.com>
# ISO/IEC 8859-9 is also equivalent to ISO-IR-148.
#
# Version history
# 1.0 version: updates 0.1 version by adding mappings for all
# control characters.
# 2.0 version: updates to copyright notice and terms of use; no
# changes to character mappings
#
# Updated versions of this file may be found in:
# http://www.unicode.org/Public/MAPPINGS/
#
# Any comments or problems, contact us at:
# http://www.unicode.org/reporting.html
#
0x00 0x0000 # NULL
0x01 0x0001 # START OF HEADING
0x02 0x0002 # START OF TEXT
0x03 0x0003 # END OF TEXT
0x04 0x0004 # END OF TRANSMISSION
0x05 0x0005 # ENQUIRY
0x06 0x0006 # ACKNOWLEDGE
0x07 0x0007 # BELL
0x08 0x0008 # BACKSPACE
0x09 0x0009 # HORIZONTAL TABULATION
0x0A 0x000A # LINE FEED
0x0B 0x000B # VERTICAL TABULATION
0x0C 0x000C # FORM FEED
0x0D 0x000D # CARRIAGE RETURN
0x0E 0x000E # SHIFT OUT
0x0F 0x000F # SHIFT IN
0x10 0x0010 # DATA LINK ESCAPE
0x11 0x0011 # DEVICE CONTROL ONE
0x12 0x0012 # DEVICE CONTROL TWO
0x13 0x0013 # DEVICE CONTROL THREE
0x14 0x0014 # DEVICE CONTROL FOUR
0x15 0x0015 # NEGATIVE ACKNOWLEDGE
0x16 0x0016 # SYNCHRONOUS IDLE
0x17 0x0017 # END OF TRANSMISSION BLOCK
0x18 0x0018 # CANCEL
0x19 0x0019 # END OF MEDIUM
0x1A 0x001A # SUBSTITUTE
0x1B 0x001B # ESCAPE
0x1C 0x001C # FILE SEPARATOR
0x1D 0x001D # GROUP SEPARATOR
0x1E 0x001E # RECORD SEPARATOR
0x1F 0x001F # UNIT SEPARATOR
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
@@ -132,6 +163,39 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
0x7F 0x007F # DELETE
0x80 0x0080 # <control>
0x81 0x0081 # <control>
0x82 0x0082 # <control>
0x83 0x0083 # <control>
0x84 0x0084 # <control>
0x85 0x0085 # <control>
0x86 0x0086 # <control>
0x87 0x0087 # <control>
0x88 0x0088 # <control>
0x89 0x0089 # <control>
0x8A 0x008A # <control>
0x8B 0x008B # <control>
0x8C 0x008C # <control>
0x8D 0x008D # <control>
0x8E 0x008E # <control>
0x8F 0x008F # <control>
0x90 0x0090 # <control>
0x91 0x0091 # <control>
0x92 0x0092 # <control>
0x93 0x0093 # <control>
0x94 0x0094 # <control>
0x95 0x0095 # <control>
0x96 0x0096 # <control>
0x97 0x0097 # <control>
0x98 0x0098 # <control>
0x99 0x0099 # <control>
0x9A 0x009A # <control>
0x9B 0x009B # <control>
0x9C 0x009C # <control>
0x9D 0x009D # <control>
0x9E 0x009E # <control>
0x9F 0x009F # <control>
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x00A1 # INVERTED EXCLAMATION MARK
0xA2 0x00A2 # CENT SIGN

View File

@@ -1,6 +1,32 @@
8859 to Unicode mapping tables.
These tables are believed to be substantially correct in their
mappings. However, the names in the comment fields (after "#"
on each line of the file) have not been updated since Unicode
Version 1.0.
May 2, 1996.
December 2, 2015
These tables are considered to be authoritative mappings
between the Unicode Standard and different parts of
the ISO/IEC 8859 standard.
Mappings between these standards are done on the basis
of the character names and UCS identifications published
in each part of ISO/IEC 8859, in the revised editions
of each part of the standard.
In cases where prior mappings to ISO/IEC 8859 differed in
some substantial way from the most recent mapping, or where
characters have been added to a part of ISO/IEC 8859, earlier
versions are archived in the DatedVersions directory, with
the relevant year of the published part of ISO/IEC 8859
as part of the mapping table filename, to indicate which
edition the mapping was intended for. The archived mappings
tables can be used to verify mappings which may have been
implemented in earlier versions of software.
Note that some of the older tables internally refer to an
out-of-date mechanism for reporting errors, using an "errata"
email address that is no longer in service. Any errors
should instead be reported using the Unicode Consortium's
reporting form:
<http://www.unicode.org/reporting.html>

View File

@@ -1,980 +0,0 @@
DISCLAIMER, May 1, 1996: This file is obsolete. It was made
for Unicode Version 1.0, and has neither been updated nor
verified for use with any subsequent version of the standard.
Use this data entirely at your own risk.
== == == == == == == == == == == == == == == == == == == == ==
Copyright 1991-1992 Unicode, Inc.
All Rights reserved.
This file is provided as-is by Unicode, Inc. (The Unicode Consortium). No
claims are made as to fitness for any particular purpose. No warranties of
any kind are expressed or implied. The recipient agrees to determine
applicability of information provided. If this file has been provided on
magnetic media by Unicode, Inc., the sole remedy for any claim will be
exchange of defective media within 90 days of receipt.
Unicode Encoding, Version 1.0 to ISO 8879 (SGML) & ISO DIS 6862.2 Mappings
ISO 8879-1986 contains an ASCII-alphabetic encoding of a large number
of "character entities" used as identifiers in SGML (Standard Graphic
Markup Language). The following table maps Unicode character encodings
to the entity reference names of those character entities which clearly
constitute characters in the sense used by the Unicode standard.
However, SGML also contains many identifiers for character entities
which are glyph variants or stylistic variants which do not have
one-to-one mappings with Unicode characters.
ISO/DIS 6862.2 contains two 7-bit character encodings of mathematical
symbols of various types. The first encoding (set G0) is identified by
numbers beginning with a digit 0 in the table below, for example, 07.13,
while the second encoding (set G1) is identified by numbers beginning
with a digit 1 in the table below, for example, 17.13. The other digits
are to be interpreted as decimal values corresponding to regular ISO
conventions for citing the position of characters in code tables; 17.13
can thus be interpreted as position 7/13 in the code table for set G1.
UNIC 6862.2 SGML Unicode character name
0021 excl EXCLAMATION MARK
0023 num NUMBER SIGN
0024 dollar DOLLAR SIGN
0025 percnt PERCENT SIGN
0026 amp AMPERSAND
0027 quot APOSTROPHE-QUOTE
0028 lpar OPENING PARENTHESIS
0029 rpar CLOSING PARENTHESIS
002A ast ASTERISK
002B 05.00 plus PLUS SIGN
002C comma COMMA
002D hyphen HYPHEN-MINUS
002E period PERIOD
002F sol SLASH
003A colon COLON
003B semi SEMICOLON
003C lt LESS-THAN SIGN
003D equals EQUALS SIGN
003E gt GREATER-THAN SIGN
003F quest QUESTION MARK
0040 commat COMMERCIAL AT
005B lsqb OPENING SQUARE BRACKET
005C bsol BACKSLASH
005D rsqb CLOSING SQUARE BRACKET
005E circ SPACING CIRCUMFLEX
005F lowbar SPACING UNDERSCORE
0060 grave SPACING GRAVE
007B lcub OPENING CURLY BRACKET
007C verbar VERTICAL BAR
007D rcub CLOSING CURLY BRACKET
007E tilde TILDE
00A0 nbsp NON-BREAKING SPACE
00A1 iexcl INVERTED EXCLAMATION MARK
00A2 cent CENT SIGN
00A3 pound POUND SIGN
00A4 curren CURRENCY SIGN
00A5 yen YEN SIGN
00A6 brvbar BROKEN VERTICAL BAR
00A7 sect SECTION SIGN
00A8 die,Dot SPACING DIAERESIS
00A8 uml SPACING DIAERESIS
00A9 copy COPYRIGHT SIGN
00AA ordf FEMININE ORDINAL INDICATOR
00AB laquo LEFT POINTING GUILLEMET
00AC 07.05 not NOT SIGN
00AD shy SOFT HYPHEN
00AE reg REGISTERED TRADE MARK SIGN
00AF macr SPACING MACRON
00B0 03.12 deg DEGREE SIGN
00B1 03.01 plusmn PLUS-OR-MINUS SIGN
00B2 sup2 SUPERSCRIPT DIGIT TWO
00B3 sup3 SUPERSCRIPT DIGIT THREE
00B4 acute SPACING ACUTE
00B5 micro MICRO SIGN
00B6 para PARAGRAPH SIGN
00B7 middot MIDDLE DOT
00B8 cedil SPACING CEDILLA
00B9 sup1 SUPERSCRIPT DIGIT ONE
00BA ordm MASCULINE ORDINAL INDICATOR
00BB raquo RIGHT POINTING GUILLEMET
00BC frac14 FRACTION ONE QUARTER
00BD frac12 FRACTION ONE HALF
00BE frac34 FRACTION THREE QUARTERS
00BF iquest INVERTED QUESTION MARK
00C0 Agrave LATIN CAPITAL LETTER A GRAVE
00C1 Aacute LATIN CAPITAL LETTER A ACUTE
00C2 Acirc LATIN CAPITAL LETTER A CIRCUMFLEX
00C3 Atilde LATIN CAPITAL LETTER A TILDE
00C4 Auml LATIN CAPITAL LETTER A DIAERESIS
00C5 Aring LATIN CAPITAL LETTER A RING
00C6 AElig LATIN CAPITAL LETTER A E
00C7 Ccedil LATIN CAPITAL LETTER C CEDILLA
00C8 Egrave LATIN CAPITAL LETTER E GRAVE
00C9 Eacute LATIN CAPITAL LETTER E ACUTE
00CA Ecirc LATIN CAPITAL LETTER E CIRCUMFLEX
00CB Euml LATIN CAPITAL LETTER E DIAERESIS
00CC Igrave LATIN CAPITAL LETTER I GRAVE
00CD Iacute LATIN CAPITAL LETTER I ACUTE
00CE Icirc LATIN CAPITAL LETTER I CIRCUMFLEX
00CF Iuml LATIN CAPITAL LETTER I DIAERESIS
00D0 ETH LATIN CAPITAL LETTER ETH
00D1 Ntilde LATIN CAPITAL LETTER N TILDE
00D2 Ograve LATIN CAPITAL LETTER O GRAVE
00D3 Oacute LATIN CAPITAL LETTER O ACUTE
00D4 Ocirc LATIN CAPITAL LETTER O CIRCUMFLEX
00D5 Otilde LATIN CAPITAL LETTER O TILDE
00D6 Ouml LATIN CAPITAL LETTER O DIAERESIS
00D7 03.00 times MULTIPLICATION SIGN
00D8 Oslash LATIN CAPITAL LETTER O SLASH
00D9 Ugrave LATIN CAPITAL LETTER U GRAVE
00DA Uacute LATIN CAPITAL LETTER U ACUTE
00DB Ucirc LATIN CAPITAL LETTER U CIRCUMFLEX
00DC Uuml LATIN CAPITAL LETTER U DIAERESIS
00DD Yacute LATIN CAPITAL LETTER Y ACUTE
00DE THORN LATIN CAPITAL LETTER THORN
00DF szlig LATIN SMALL LETTER SHARP S
00E0 agrave LATIN SMALL LETTER A GRAVE
00E1 aacute LATIN SMALL LETTER A ACUTE
00E2 acirc LATIN SMALL LETTER A CIRCUMFLEX
00E3 atilde LATIN SMALL LETTER A TILDE
00E4 auml LATIN SMALL LETTER A DIAERESIS
00E5 aring LATIN SMALL LETTER A RING
00E6 aelig LATIN SMALL LETTER A E
00E7 ccedil LATIN SMALL LETTER C CEDILLA
00E8 egrave LATIN SMALL LETTER E GRAVE
00E9 eacute LATIN SMALL LETTER E ACUTE
00EA ecirc LATIN SMALL LETTER E CIRCUMFLEX
00EB euml LATIN SMALL LETTER E DIAERESIS
00EC igrave LATIN SMALL LETTER I GRAVE
00ED iacute LATIN SMALL LETTER I ACUTE
00EE icirc LATIN SMALL LETTER I CIRCUMFLEX
00EF iuml LATIN SMALL LETTER I DIAERESIS
00F0 eth LATIN SMALL LETTER ETH
00F1 ntilde LATIN SMALL LETTER N TILDE
00F2 ograve LATIN SMALL LETTER O GRAVE
00F3 oacute LATIN SMALL LETTER O ACUTE
00F4 ocirc LATIN SMALL LETTER O CIRCUMFLEX
00F5 otilde LATIN SMALL LETTER O TILDE
00F6 ouml LATIN SMALL LETTER O DIAERESIS
00F7 04.00 divide DIVISION SIGN
00F8 oslash LATIN SMALL LETTER O SLASH
00F9 ugrave LATIN SMALL LETTER U GRAVE
00FA uacute LATIN SMALL LETTER U ACUTE
00FB ucirc LATIN SMALL LETTER U CIRCUMFLEX
00FC uuml LATIN SMALL LETTER U DIAERESIS
00FD yacute LATIN SMALL LETTER Y ACUTE
00FE thorn LATIN SMALL LETTER THORN
00FF yuml LATIN SMALL LETTER Y DIAERESIS
0100 Amacr LATIN CAPITAL LETTER A MACRON
0101 amacr LATIN SMALL LETTER A MACRON
0102 Abreve LATIN CAPITAL LETTER A BREVE
0103 abreve LATIN SMALL LETTER A BREVE
0104 Aogon LATIN CAPITAL LETTER A OGONEK
0105 aogon LATIN SMALL LETTER A OGONEK
0106 Cacute LATIN CAPITAL LETTER C ACUTE
0107 cacute LATIN SMALL LETTER C ACUTE
0108 Ccirc LATIN CAPITAL LETTER C CIRCUMFLEX
0109 ccirc LATIN SMALL LETTER C CIRCUMFLEX
010A Cdot LATIN CAPITAL LETTER C DOT
010B cdot LATIN SMALL LETTER C DOT
010C Ccaron LATIN CAPITAL LETTER C HACEK
010D ccaron LATIN SMALL LETTER C HACEK
010E Dcaron LATIN CAPITAL LETTER D HACEK
010F dcaron LATIN SMALL LETTER D HACEK
0110 Dstrok LATIN CAPITAL LETTER D BAR
0111 dstrok LATIN SMALL LETTER D BAR
0112 Emacr LATIN CAPITAL LETTER E MACRON
0113 emacr LATIN SMALL LETTER E MACRON
0116 Edot LATIN CAPITAL LETTER E DOT
0117 edot LATIN SMALL LETTER E DOT
0118 Eogon LATIN CAPITAL LETTER E OGONEK
0119 eogon LATIN SMALL LETTER E OGONEK
011A Ecaron LATIN CAPITAL LETTER E HACEK
011B ecaron LATIN SMALL LETTER E HACEK
011C Gcirc LATIN CAPITAL LETTER G CIRCUMFLEX
011D gcirc LATIN SMALL LETTER G CIRCUMFLEX
011E Gbreve LATIN CAPITAL LETTER G BREVE
011F gbreve LATIN SMALL LETTER G BREVE
0120 Gdot LATIN CAPITAL LETTER G DOT
0121 gdot LATIN SMALL LETTER G DOT
0122 Gcedil LATIN CAPITAL LETTER G CEDILLA
0123 gcedil LATIN SMALL LETTER G CEDILLA
0124 Hcirc LATIN CAPITAL LETTER H CIRCUMFLEX
0125 hcirc LATIN SMALL LETTER H CIRCUMFLEX
0126 Hstrok LATIN CAPITAL LETTER H BAR
0127 hstrok LATIN SMALL LETTER H BAR
0128 Itilde LATIN CAPITAL LETTER I TILDE
0129 itilde LATIN SMALL LETTER I TILDE
012A Imacr LATIN CAPITAL LETTER I MACRON
012B imacr LATIN SMALL LETTER I MACRON
012E Iogon LATIN CAPITAL LETTER I OGONEK
012F iogon LATIN SMALL LETTER I OGONEK
0130 Idot LATIN CAPITAL LETTER I DOT
0131 inodot LATIN SMALL LETTER DOTLESS I
0132 IJlig LATIN CAPITAL LETTER I J
0133 ijlig LATIN SMALL LETTER I J
0134 Jcirc LATIN CAPITAL LETTER J CIRCUMFLEX
0135 jcirc LATIN SMALL LETTER J CIRCUMFLEX
0136 Kcedil LATIN CAPITAL LETTER K CEDILLA
0137 kcedil LATIN SMALL LETTER K CEDILLA
0138 kgreen LATIN SMALL LETTER KRA
0139 Lacute LATIN CAPITAL LETTER L ACUTE
013A lacute LATIN SMALL LETTER L ACUTE
013B Lcedil LATIN CAPITAL LETTER L CEDILLA
013C lcedil LATIN SMALL LETTER L CEDILLA
013D Lcaron LATIN CAPITAL LETTER L HACEK
013E lcaron LATIN SMALL LETTER L HACEK
013F Lmidot LATIN CAPITAL LETTER L WITH MIDDLE DOT
0140 lmidot LATIN SMALL LETTER L WITH MIDDLE DOT
0141 Lstrok LATIN CAPITAL LETTER L SLASH
0142 lstrok LATIN SMALL LETTER L SLASH
0143 Nacute LATIN CAPITAL LETTER N ACUTE
0144 nacute LATIN SMALL LETTER N ACUTE
0145 Ncedil LATIN CAPITAL LETTER N CEDILLA
0146 ncedil LATIN SMALL LETTER N CEDILLA
0147 Ncaron LATIN CAPITAL LETTER N HACEK
0148 ncaron LATIN SMALL LETTER N HACEK
0149 napos LATIN SMALL LETTER APOSTROPHE N
014A ENG LATIN CAPITAL LETTER ENG
014B eng LATIN SMALL LETTER ENG
014C Omacr LATIN CAPITAL LETTER O MACRON
014D omacr LATIN SMALL LETTER O MACRON
0150 Odblac LATIN CAPITAL LETTER O DOUBLE ACUTE
0151 odblac LATIN SMALL LETTER O DOUBLE ACUTE
0152 OElig LATIN CAPITAL LETTER O E
0153 oelig LATIN SMALL LETTER O E
0154 Racute LATIN CAPITAL LETTER R ACUTE
0155 racute LATIN SMALL LETTER R ACUTE
0156 Rcedil LATIN CAPITAL LETTER R CEDILLA
0157 rcedil LATIN SMALL LETTER R CEDILLA
0158 Rcaron LATIN CAPITAL LETTER R HACEK
0159 rcaron LATIN SMALL LETTER R HACEK
015A Sacute LATIN CAPITAL LETTER S ACUTE
015B sacute LATIN SMALL LETTER S ACUTE
015C Scirc LATIN CAPITAL LETTER S CIRCUMFLEX
015D scirc LATIN SMALL LETTER S CIRCUMFLEX
015E Scedil LATIN CAPITAL LETTER S CEDILLA
015F scedil LATIN SMALL LETTER S CEDILLA
0160 Scaron LATIN CAPITAL LETTER S HACEK
0161 scaron LATIN SMALL LETTER S HACEK
0162 Tcedil LATIN CAPITAL LETTER T CEDILLA
0163 tcedil LATIN SMALL LETTER T CEDILLA
0164 Tcaron LATIN CAPITAL LETTER T HACEK
0165 tcaron LATIN SMALL LETTER T HACEK
0166 Tstrok LATIN CAPITAL LETTER T BAR
0167 tstrok LATIN SMALL LETTER T BAR
0168 Utilde LATIN CAPITAL LETTER U TILDE
0169 utilde LATIN SMALL LETTER U TILDE
016A Umacr LATIN CAPITAL LETTER U MACRON
016B umacr LATIN SMALL LETTER U MACRON
016C Ubreve LATIN CAPITAL LETTER U BREVE
016D ubreve LATIN SMALL LETTER U BREVE
016E Uring LATIN CAPITAL LETTER U RING
016F uring LATIN SMALL LETTER U RING
0170 Udblac LATIN CAPITAL LETTER U DOUBLE ACUTE
0171 udblac LATIN SMALL LETTER U DOUBLE ACUTE
0172 Uogon LATIN CAPITAL LETTER U OGONEK
0173 uogon LATIN SMALL LETTER U OGONEK
0174 Wcirc LATIN CAPITAL LETTER W CIRCUMFLEX
0175 wcirc LATIN SMALL LETTER W CIRCUMFLEX
0176 Ycirc LATIN CAPITAL LETTER Y CIRCUMFLEX
0177 ycirc LATIN SMALL LETTER Y CIRCUMFLEX
0178 Yuml LATIN CAPITAL LETTER Y DIAERESIS
0179 Zacute LATIN CAPITAL LETTER Z ACUTE
017A zacute LATIN SMALL LETTER Z ACUTE
017B Zdot LATIN CAPITAL LETTER Z DOT
017C zdot LATIN SMALL LETTER Z DOT
017D Zcaron LATIN CAPITAL LETTER Z HACEK
017E zcaron LATIN SMALL LETTER Z HACEK
0192 fnof LATIN SMALL LETTER SCRIPT F
02BC apos MODIFIER LETTER APOSTROPHE
02C7 caron MODIFIER LETTER HACEK
02D8 breve SPACING BREVE
02D9 dot SPACING DOT ABOVE
02DA ring SPACING RING ABOVE
02DB ogon SPACING OGONEK
02DC tilde SPACING TILDE
02DD dblac SPACING DOUBLE ACUTE
0302 02.12 NON-SPACING CIRCUMFLEX
0307 02.09 NON-SPACING DOT ABOVE
0308 02.10 NON-SPACING DIAERESIS
030C 02.13 NON-SPACING HACEK
0336 02.04 NON-SPACING LONG BAR OVERLAY
0338 02.01 NON-SPACING LONG SLASH OVERLAY
0386 Aacgr GREEK CAPITAL LETTER ALPHA TONOS
0388 Eacgr GREEK CAPITAL LETTER EPSILON TONOS
0389 EEacgr GREEK CAPITAL LETTER ETA TONOS
038A Iacgr GREEK CAPITAL LETTER IOTA TONOS
038C Oacgr GREEK CAPITAL LETTER OMICRON TONOS
038E Uacgr GREEK CAPITAL LETTER UPSILON TONOS
038F OHacgr GREEK CAPITAL LETTER OMEGA TONOS
0390 idiagr GREEK SMALL LETTER IOTA DIAERESIS TONOS
0391 Agr GREEK CAPITAL LETTER ALPHA
0392 Bgr GREEK CAPITAL LETTER BETA
0393 Ggr,Gamma GREEK CAPITAL LETTER GAMMA
0394 Dgr,Delta GREEK CAPITAL LETTER DELTA
0395 Egr GREEK CAPITAL LETTER EPSILON
0396 Zgr GREEK CAPITAL LETTER ZETA
0397 EEgr GREEK CAPITAL LETTER ETA
0398 THgr,Theta GREEK CAPITAL LETTER THETA
0399 Igr GREEK CAPITAL LETTER IOTA
039A Kgr GREEK CAPITAL LETTER KAPPA
039B Lgr,Lambda GREEK CAPITAL LETTER LAMBDA
039C Mgr GREEK CAPITAL LETTER MU
039D Ngr GREEK CAPITAL LETTER NU
039E Xgr,Xi GREEK CAPITAL LETTER XI
039F Ogr GREEK CAPITAL LETTER OMICRON
03A0 Pgr,Pi GREEK CAPITAL LETTER PI
03A1 Rgr GREEK CAPITAL LETTER RHO
03A3 Sgr,Sigma GREEK CAPITAL LETTER SIGMA
03A4 Tgr GREEK CAPITAL LETTER TAU
03A5 Ugr,Upsi GREEK CAPITAL LETTER UPSILON
03A6 PHgr,Phi GREEK CAPITAL LETTER PHI
03A7 KHgr GREEK CAPITAL LETTER CHI
03A8 PSgr,Psi GREEK CAPITAL LETTER PSI
03A9 OHgr,Omega GREEK CAPITAL LETTER OMEGA
03AA Idigr GREEK CAPITAL LETTER IOTA DIAERESIS
03AB Udigr GREEK CAPITAL LETTER UPSILON DIAERESIS
03AC aacgr GREEK SMALL LETTER ALPHA TONOS
03AD eacgr GREEK SMALL LETTER EPSILON TONOS
03AE eeacgr GREEK SMALL LETTER ETA TONOS
03AF iacgr GREEK SMALL LETTER IOTA TONOS
03B0 udiagr GREEK SMALL LETTER UPSILON DIAERESIS TONOS
03B1 agr,alpha GREEK SMALL LETTER ALPHA
03B2 bgr,beta GREEK SMALL LETTER BETA
03B3 ggr,gamma GREEK SMALL LETTER GAMMA
03B4 dgr,delta GREEK SMALL LETTER DELTA
03B5 egr,epsi GREEK SMALL LETTER EPSILON
03B6 zgr,zeta GREEK SMALL LETTER ZETA
03B7 eegr,eta GREEK SMALL LETTER ETA
03B8 thetas GREEK SMALL LETTER THETA
03B8 thgr GREEK SMALL LETTER THETA
03B9 igr,iota GREEK SMALL LETTER IOTA
03BA kgr,kappa GREEK SMALL LETTER KAPPA
03BB lgr,lambda GREEK SMALL LETTER LAMBDA
03BC mgr,mu GREEK SMALL LETTER MU
03BD ngr,nu GREEK SMALL LETTER NU
03BE xgr,xi GREEK SMALL LETTER XI
03BF ogr GREEK SMALL LETTER OMICRON
03C0 pgr,pi GREEK SMALL LETTER PI
03C1 rgr,rho GREEK SMALL LETTER RHO
03C2 sfgr,sigmav GREEK SMALL LETTER FINAL SIGMA
03C3 sgr,sigma GREEK SMALL LETTER SIGMA
03C4 tgr,tau GREEK SMALL LETTER TAU
03C5 ugr,upsi GREEK SMALL LETTER UPSILON
03C6 phgr,phis GREEK SMALL LETTER PHI
03C7 khgr,chi GREEK SMALL LETTER CHI
03C8 psgr,psi GREEK SMALL LETTER PSI
03C9 ohgr,omega GREEK SMALL LETTER OMEGA
03CA idigr GREEK SMALL LETTER IOTA DIAERESIS
03CB udigr GREEK SMALL LETTER UPSILON DIAERESIS
03CC oacgr GREEK SMALL LETTER OMICRON TONOS
03CD uacgr GREEK SMALL LETTER UPSILON TONOS
03CE ohacgr GREEK SMALL LETTER OMEGA TONOS
03D1 thetav GREEK SMALL LETTER SCRIPT THETA
03D5 phiv GREEK SMALL LETTER SCRIPT PHI
03D6 piv GREEK SMALL LETTER OMEGA PI
03DD gammad GREEK SMALL LETTER DIGAMMA
03F0 kappav GREEK SMALL LETTER SCRIPT KAPPA
03F1 rhov GREEK SMALL LETTER TAILED RHO
0401 IOcy CYRILLIC CAPITAL LETTER IO
0402 DJcy CYRILLIC CAPITAL LETTER DJE
0403 GJcy CYRILLIC CAPITAL LETTER GJE
0404 Jukcy CYRILLIC CAPITAL LETTER E
0405 DScy CYRILLIC CAPITAL LETTER DZE
0406 Iukcy CYRILLIC CAPITAL LETTER I
0407 YIcy CYRILLIC CAPITAL LETTER YI
0408 Jsercy CYRILLIC CAPITAL LETTER JE
0409 LJcy CYRILLIC CAPITAL LETTER LJE
040A NJcy CYRILLIC CAPITAL LETTER NJE
040B TSHcy CYRILLIC CAPITAL LETTER TSHE
040C KJcy CYRILLIC CAPITAL LETTER KJE
040E Ubrcy CYRILLIC CAPITAL LETTER SHORT U
040F DZcy CYRILLIC CAPITAL LETTER DZHE
0410 Acy CYRILLIC CAPITAL LETTER A
0411 Bcy CYRILLIC CAPITAL LETTER BE
0412 Vcy CYRILLIC CAPITAL LETTER VE
0413 Gcy CYRILLIC CAPITAL LETTER GE
0414 dcy CYRILLIC CAPITAL LETTER DE
0415 IEcy CYRILLIC CAPITAL LETTER IE
0416 ZHcy CYRILLIC CAPITAL LETTER ZHE
0417 Zcy CYRILLIC CAPITAL LETTER ZE
0418 Icy CYRILLIC CAPITAL LETTER II
0419 Jcy CYRILLIC CAPITAL LETTER SHORT II
041A Kcy CYRILLIC CAPITAL LETTER KA
041B Lcy CYRILLIC CAPITAL LETTER EL
041C Mcy CYRILLIC CAPITAL LETTER EM
041D Ncy CYRILLIC CAPITAL LETTER EN
041E Ocy CYRILLIC CAPITAL LETTER O
041F Pcy CYRILLIC CAPITAL LETTER PE
0420 Rcy CYRILLIC CAPITAL LETTER ER
0421 Scy CYRILLIC CAPITAL LETTER ES
0422 Tcy CYRILLIC CAPITAL LETTER TE
0423 Ucy CYRILLIC CAPITAL LETTER U
0424 Fcy CYRILLIC CAPITAL LETTER EF
0425 KHcy CYRILLIC CAPITAL LETTER KHA
0426 TScy CYRILLIC CAPITAL LETTER TSE
0427 CHcy CYRILLIC CAPITAL LETTER CHE
0428 SHcy CYRILLIC CAPITAL LETTER SHA
0429 SHCHcy CYRILLIC CAPITAL LETTER SHCHA
042A HARDcy CYRILLIC CAPITAL LETTER HARD SIGN
042B Ycy CYRILLIC CAPITAL LETTER YERI
042C SOFTcy CYRILLIC CAPITAL LETTER SOFT SIGN
042D Ecy CYRILLIC CAPITAL LETTER REVERSED E
042E YUcy CYRILLIC CAPITAL LETTER IU
042F YAcy CYRILLIC CAPITAL LETTER IA
0430 acy CYRILLIC SMALL LETTER A
0431 bcy CYRILLIC SMALL LETTER BE
0432 vcy CYRILLIC SMALL LETTER VE
0433 gcy CYRILLIC SMALL LETTER GE
0434 dcy CYRILLIC SMALL LETTER DE
0435 iecy CYRILLIC SMALL LETTER IE
0436 zhcy CYRILLIC SMALL LETTER ZHE
0437 zcy CYRILLIC SMALL LETTER ZE
0438 icy CYRILLIC SMALL LETTER II
0439 jcy CYRILLIC SMALL LETTER SHORT II
043A kcy CYRILLIC SMALL LETTER KA
043B lcy CYRILLIC SMALL LETTER EL
043C mcy CYRILLIC SMALL LETTER EM
043D ncy CYRILLIC SMALL LETTER EN
043E ocy CYRILLIC SMALL LETTER O
043F pcy CYRILLIC SMALL LETTER PE
0440 rcy CYRILLIC SMALL LETTER ER
0441 scy CYRILLIC SMALL LETTER ES
0442 tcy CYRILLIC SMALL LETTER TE
0443 ucy CYRILLIC SMALL LETTER U
0444 fcy CYRILLIC SMALL LETTER EF
0445 khcy CYRILLIC SMALL LETTER KHA
0446 tscy CYRILLIC SMALL LETTER TSE
0447 chcy CYRILLIC SMALL LETTER CHE
0448 shcy CYRILLIC SMALL LETTER SHA
0449 shchcy CYRILLIC SMALL LETTER SHCHA
044A hardcy CYRILLIC SMALL LETTER HARD SIGN
044B ycy CYRILLIC SMALL LETTER YERI
044C softcy CYRILLIC SMALL LETTER SOFT SIGN
044D ecy CYRILLIC SMALL LETTER REVERSED E
044E yucy CYRILLIC SMALL LETTER IU
044F yacy CYRILLIC SMALL LETTER IA
0451 iocy CYRILLIC SMALL LETTER IO
0452 djcy CYRILLIC SMALL LETTER DJE
0453 gjcy CYRILLIC SMALL LETTER GJE
0454 jukcy CYRILLIC SMALL LETTER E
0455 dscy CYRILLIC SMALL LETTER DZE
0456 iukcy CYRILLIC SMALL LETTER I
0457 yicy CYRILLIC SMALL LETTER YI
0458 jsercy CYRILLIC SMALL LETTER JE
0459 ljcy CYRILLIC SMALL LETTER LJE
045A njcy CYRILLIC SMALL LETTER NJE
045B tshcy CYRILLIC SMALL LETTER TSHE
045C kjcy CYRILLIC SMALL LETTER KJE
045E ubrcy CYRILLIC SMALL LETTER SHORT U
045F dzcy CYRILLIC SMALL LETTER DZHE
2002 ensp EN SPACE
2003 emsp EM SPACE
2004 emsp13 THREE-PER-EM SPACE
2005 emsp14 FOUR-PER-EM SPACE
2007 numsp FIGURE SPACE
2008 puncsp PUNCTUATION SPACE
2009 thinsp THIN SPACE
200A hairsp HAIR SPACE
2010 dash HYPHEN
2013 ndash EN DASH
2014 mdash EM DASH
2015 horbar QUOTATION DASH
2016 15.00 Verbar DOUBLE VERTICAL BAR
2018 lsquo SINGLE TURNED COMMA QUOTATION MARK
2018 rsquor SINGLE TURNED COMMA QUOTATION MARK
2019 rsquo SINGLE COMMA QUOTATION MARK
201A lsquor LOW SINGLE COMMA QUOTATION MARK
201C ldquo DOUBLE TURNED COMMA QUOTATION MARK
201C rdquor DOUBLE TURNED COMMA QUOTATION MARK
201D rdquo DOUBLE COMMA QUOTATION MARK
201E ldquor LOW DOUBLE COMMA QUOTATION MARK
2020 dagger DAGGER
2021 Dagger DOUBLE DAGGER
2022 bull BULLET
2025 nldr TWO DOT LEADER
2026 hellip HORIZONTAL ELLIPSIS
2026 mldr HORIZONTAL ELLIPSIS
2030 04.12 permil PER MILLE SIGN
2032 07.00 prime PRIME
2032 vprime PRIME
2033 07.01 Prime DOUBLE PRIME
2034 07.02 tprime TRIPLE PRIME
2035 bprime REVERSED PRIME
2041 caret CARET INSERTION POINT
2043 hybull HYPHEN BULLET
20D2 02.02 NON-SPACING LONG VERTICAL BAR OVERLAY
20D3 02.03 NON-SPACING SHORT VERTICAL BAR OVERLAY
20D4 02.08 NON-SPACING ANTICLOCKWISE ARROW ABOVE
20D5 02.15 NON-SPACING CLOCKWISE ARROW ABOVE
20D6 02.11 NON-SPACING LEFT ARROW ABOVE
20D7 02.14 NON-SPACING RIGHT ARROW ABOVE
20D8 02.05 NON-SPACING RING OVERLAY
20D9 02.07 NON-SPACING CLOCKWISE RING OVERLAY
20DA 02.06 NON-SPACING ANTICLOCKWISE RING OVERLAY
20DB tdot NON-SPACING THREE DOTS ABOVE
20DC DotDot NON-SPACING FOUR DOTS ABOVE
2105 incare CARE OF
210B hamilt SCRIPT H
210E 07.06 PLANCK CONSTANT
210F 07.12 planck PLANCK CONSTANT OVER 2 PI
2111 image BLACK-LETTER I
2112 lagran SCRIPT L
2113 ell SCRIPT SMALL L
2116 numero NUMERO
2117 copysr SOUND RECORDING COPYRIGHT
2118 17.13 weierp SCRIPT P
211C real BLACK-LETTER R
211E 17.14 rx PRESCRIPTION TAKE
2122 trade TRADEMARK
2126 ohm OHM
2129 17.12 TURNED GREEK SMALL LETTER IOTA
212B angst ANGSTROM UNIT
212C bernou SCRIPT B
2133 phmmat SCRIPT M
2134 order SCRIPT SMALL O
2135 07.13 aleph FIRST TRANSFINITE CARDINAL
2136 beth SECOND TRANSFINITE CARDINAL
2137 gimel THIRD TRANSFINITE CARDINAL
2138 daleth FOURTH TRANSFINITE CARDINAL
2153 frac13 FRACTION ONE THIRD
2154 frac23 FRACTION TWO THIRDS
2155 frac15 FRACTION ONE FIFTH
2156 frac25 FRACTION TWO FIFTHS
2157 frac35 FRACTION THREE FIFTHS
2158 frac45 FRACTION FOUR FIFTHS
2159 frac16 FRACTION ONE SIXTH
215A frac56 FRACTION FIVE SIXTHS
215B frac18 FRACTION ONE EIGHTH
215C frac38 FRACTION THREE EIGHTHS
215D frac58 FRACTION FIVE EIGHTHS
215E frac78 FRACTION SEVEN EIGHTHS
2190 larr LEFT ARROW
2191 uarr UP ARROW
2192 rarr RIGHT ARROW
2193 darr DOWN ARROW
2194 05.10 harr LEFT RIGHT ARROW
2195 06.10 varr UP DOWN ARROW
2196 15.07 nwarr UPPER LEFT ARROW
2197 16.07 nearr UPPER RIGHT ARROW
2198 16.08 drarr LOWER RIGHT ARROW
2199 15.08 dlarr LOWER LEFT ARROW
219A 17.08 nlarr LEFT ARROW WITH STROKE
219B 15.10 nrarr RIGHT ARROW WITH STROKE
219D rarrw RIGHT WAVE ARROW
219E Larr LEFT TWO HEADED ARROW
21A0 16.10 Rarr RIGHT TWO HEADED ARROW
21A2 larrtl LEFT ARROW WITH TAIL
21A3 rarrtl RIGHT ARROW WITH TAIL
21A6 05.12 map RIGHT ARROW FROM BAR
21A9 16.11 larrhk LEFT ARROW WITH HOOK
21AA 15.11 rarrhk RIGHT ARROW WITH HOOK
21AB larrlp LEFT ARROW WITH LOOP
21AC rarrlp RIGHT ARROW WITH LOOP
21AD harrw LEFT RIGHT WAVE ARROW
21AE nharr LEFT RIGHT ARROW WITH STROKE
21B0 lsh UP ARROW WITH TIP LEFT
21B1 rsh UP ARROW WITH TIP RIGHT
21B6 05.09 cularr ANTICLOCKWISE TOP SEMICIRCLE ARROW
21B7 06.09 curarr CLOCKWISE TOP SEMICIRCLE ARROW
21BA olarr ANTICLOCKWISE OPEN CIRCLE ARROW
21BB orarr CLOCKWISE OPEN CIRCLE ARROW
21BC lharu LEFT HARPOON WITH BARB UP
21BD lhard LEFT HARPOON WITH BARB DOWN
21BE 15.09 uharr UP HARPOON WITH BARB RIGHT
21BF uharl UP HARPOON WITH BARB LEFT
21C0 rharu RIGHT HARPOON WITH BARB UP
21C1 rhard RIGHT HARPOON WITH BARB DOWN
21C2 dharr DOWN HARPOON WITH BARB RIGHT
21C3 dharl DOWN HARPOON WITH BARB LEFT
21C4 06.11 rlarr2 RIGHT ARROW OVER LEFT ARROW
21C5 06.12 UP ARROW LEFT OF DOWN ARROW
21C6 05.11 lrarr2 LEFT ARROW OVER RIGHT ARROW
21C7 larr2 LEFT PAIRED ARROWS
21C8 uarr2 UP PAIRED ARROWS
21C9 rarr2 RIGHT PAIRED ARROWS
21CA darr2 DOWN PAIRED ARROWS
21CB lrhar2 LEFT HARPOON OVER RIGHT HARPOON
21CC rlhar2 RIGHT HARPOON OVER LEFT HARPOON
21CD nlArr LEFT DOUBLE ARROW WITH STROKE
21CE nhArr LEFT RIGHT DOUBLE ARROW WITH STROKE
21CF nrArr RIGHT DOUBLE ARROW WITH STROKE
21D0 05.14 lArr LEFT DOUBLE ARROW
21D1 05.13 uArr UP DOUBLE ARROW
21D2 06.14 rArr RIGHT DOUBLE ARROW
21D3 06.13 dArr DOWN DOUBLE ARROW
21D4 17.10 hArr,iff LEFT RIGHT DOUBLE ARROW
21D5 17.11 vArr UP DOWN DOUBLE ARROW
21DA lAarr LEFT TRIPLE ARROW
21DB rAarr RIGHT TRIPLE ARROW
21DC 17.09 LEFT SQUIGGLE ARROW
21DD 16.09 rarrw RIGHT SQUIGGLE ARROW
2200 05.05 forall FOR ALL
2201 05.06 comp COMPLEMENT
2202 07.11 part PARTIAL DIFFERENTIAL
2203 06.05 exist THERE EXISTS
2204 nexist THERE DOES NOT EXIST
2205 06.06 empty EMPTY SET
2206 03.11 INCREMENT
2207 04.11 nabla NABLA
2208 05.03 isin ELEMENT OF
2209 notin NOT AN ELEMENT OF
220A 15.06 epsis SMALL ELEMENT OF
220B 06.03 ni CONTAINS AS MEMBER
220D 16.06 bepsi SMALL CONTAINS AS MEMBER
220E 16.13 END OF PROOF
220F 04.15 prod N-ARY PRODUCT
2210 coprod N-ARY COPRODUCT
2210 amalg N-ARY COPRODUCT
2210 samalg N-ARY COPRODUCT
2211 03.15 sum N-ARY SUMMATION
2212 06.00 minus MINUS SIGN
2213 04.01 mnplus MINUS-OR-PLUS SIGN
2214 12.12 plusdo DOT PLUS
2216 16.02 setmn SET MINUS
2218 07.14 compfn RING OPERATOR
221A 06.15 radic SQUARE ROOT
221D vprop PROPORTIONAL TO
221D 17.02 prop PROPORTIONAL TO
221E 05.15 infin INFINITY
221F 03.10 ang90 RIGHT ANGLE
2220 04.10 ang ANGLE
2221 angmsd MEASURED ANGLE
2222 16.15 angsph SPHERICAL ANGLE
2223 mid DIVIDES
2224 17.07 nmid DOES NOT DIVIDE
2225 03.09 par PARALLEL TO
2226 17.06 npar NOT PARALLEL TO
2227 07.04 and LOGICAL AND
2228 07.03 or LOGICAL OR
2229 16.04 cap INTERSECTION
222A 15.04 cup UNION
222B 07.08 int INTEGRAL
222C 07.09 DOUBLE INTEGRAL
222D 07.10 TRIPLE INTEGRAL
222E conint CONTOUR INTEGRAL
2234 12.05 there4 THEREFORE
2235 12.06 becaus BECAUSE
2237 14.01 PROPORTION
2238 13.12 DOT MINUS
223A 17.01 GEOMETRIC PROPORTION
223B 12.14 HOMOTHETIC
223C thksim TILDE OPERATOR
223C 03.02 sim TILDE OPERATOR
223D bsim REVERSED TILDE
223E 12.13 INVERTED LAZY S
2240 wreath WREATH PRODUCT
2241 nsim NOT TILDE
2243 04.02 sime ASYMPTOTICALLY EQUAL TO
2244 nsime NOT ASYMPTOTICALLY EQUAL TO
2245 04.03 cong APPROXIMATELY EQUAL TO
2247 ncong NEITHER APPROXIMATELY NOR ACTUALLY EQUAL TO
2248 thkap ALMOST EQUAL TO
2248 03.03 ap ALMOST EQUAL TO
2249 nap NOT ALMOST EQUAL TO
224A 14.00 ape ALMOST EQUAL OR EQUAL TO
224C bcong ALL EQUAL TO
224D asymp EQUIVALENT TO
224E 14.04 bump GEOMETRICALLY EQUIVALENT TO
224F 04.04 bumpe DIFFERENCE BETWEEN
2250 14.12 esdot APPROACHES THE LIMIT
2251 eDot GEOMETRICALLY EQUAL TO
2252 17.00 efDot APPROXIMATELY EQUAL TO OR THE IMAGE OF
2253 13.01 erDot IMAGE OF OR APPROXIMATELY EQUAL TO
2254 colone COLON EQUAL
2255 ecolon EQUAL COLON
2256 ecir RING IN EQUAL TO
2257 cire RING EQUAL TO
2259 13.13 wedgeq ESTIMATES
225A 14.13 EQUIANGULAR TO
225C trie DELTA EQUAL TO
2260 13.00 ne NOT EQUAL TO
2261 03.04 equiv IDENTICAL TO
2262 nequiv NOT IDENTICAL TO
2264 03.05 le LESS THAN OR EQUAL TO
2264 les LESS THAN OR EQUAL TO
2265 04.05 ge GREATER THAN OR EQUAL TO
2265 ges GREATER THAN OR EQUAL TO
2266 lE LESS THAN OVER EQUAL TO
2267 gE GREATER THAN OVER EQUAL TO
2268 lnE LESS THAN BUT NOT EQUAL TO
2268 lvnE LESS THAN BUT NOT EQUAL TO
2269 gnE GREATER THAN BUT NOT EQUAL TO
2269 gvnE GREATER THAN BUT NOT EQUAL TO
226A Lt MUCH LESS THAN
226A 03.08 MUCH LESS THAN
226B Gt MUCH GREATER THAN
226B 04.08 MUCH GREATER THAN
226C twixt BETWEEN
226E nlt NOT LESS THAN
226F ngt NOT GREATER THAN
2270 nles NEITHER LESS THAN NOR EQUAL TO
2270 nle NEITHER LESS THAN NOR EQUAL TO
2271 nges NEITHER GREATER THAN NOR EQUAL TO
2271 nge NEITHER GREATER THAN NOR EQUAL TO
2272 03.07 lsim LESS THAN OR EQUIVALENT TO
2273 04.07 gsim GREATER THAN OR EQUIVALENT TO
2276 03.06 lg LESS THAN OR GREATER THAN
2277 04.06 gl GREATER THAN OR LESS THAN
227A 13.10 pr PRECEDES
227B 14.10 sc SUCCEEDS
227C 13.11 cupre PRECEDES OR EQUAL TO
227D 14.11 sccue SUCCEEDS OR EQUAL TO
227E 14.09 prsim PRECEDES OR EQUIVALENT TO
227F 13.09 scsim SUCCEEDS OR EQUIVALENT TO
2280 npr DOES NOT PRECEDE
2281 nsc DOES NOT SUCCEED
2282 05.01 sub SUBSET OF
2282 15.05 SUBSET OF
2283 06.01 sup SUPERSET OF
2283 16.05 SUPERSET OF
2284 nsub NOT A SUBSET OF
2285 nsup NOT A SUPERSET OF
2286 05.02 sube SUBSET OF OR EQUAL TO
2287 06.02 supe SUPERSET OF OR EQUAL TO
2288 nsube NEITHER A SUBSET OF NOR EQUAL TO
2289 nsupe NEITHER A SUPERSET OF NOR EQUAL TO
228A subnE SUBSET OF OR NOT EQUAL TO
228B supnE SUPERSET OF OR NOT EQUAL TO
228E uplus MULTISET UNION
228F 12.07 sqsub SQUARE IMAGE OF
2290 12.08 sqsup SQUARE ORIGINAL OF
2291 sqsube SQUARE IMAGE OF OR EQUAL TO
2292 sqsupe SQUARE ORIGINAL OF OR EQUAL TO
2293 sqcap SQUARE CAP
2294 sqcup SQUARE CUP
2295 12.01 oplus CIRCLED PLUS
2296 12.02 ominus CIRCLED MINUS
2297 12.03 otimes CIRCLED TIMES
2298 osol CIRCLED DIVISION SLASH
2299 12.04 odot CIRCLED DOT OPERATOR
229A ocir CIRCLED RING OPERATOR
229B oast CIRCLED ASTERISK OPERATOR
229D odash CIRCLED DASH
229E plusb SQUARED PLUS
229F minusb SQUARED MINUS
22A0 timesb SQUARED TIMES
22A1 sdotb SQUARED DOT OPERATOR
22A2 07.07 vdash RIGHT TACK
22A3 dashv LEFT TACK
22A4 top DOWN TACK
22A5 bottom UP TACK
22A5 04.09 perp UP TACK
22A6 17.05 ASSERTION
22A7 17.04 models MODELS
22A8 vDash TRUE
22A9 Vdash FORCES
22AA Vvdash TRIPLE VERTICAL BAR RIGHT TURNSTILE
22AC nvdash DOES NOT PROVE
22AD nvDash NOT TRUE
22AE nVdash DOES NOT FORCE
22AF nVDash NEGATED DOUBLE VERTICAL BAR DOUBLE RIGHT TURNSTILE
22B0 12.15 PRECEDES UNDER RELATION
22B2 13.08 vltri NORMAL SUBGROUP OF
22B3 vrtri CONTAINS AS NORMAL SUBGROUP
22B4 14.08 ltrie NORMAL SUBGROUP OF OR EQUAL TO
22B5 rtrie CONTAINS AS NORMAL SUBGROUP OR EQUAL TO
22B6 12.10 ORIGINAL OF
22B7 12.09 IMAGE OF
22B8 mumap MULTIMAP
22B9 12.11 HERMITIAN CONJUGATE MATRIX
22BA intcal INTERCALATE
22BB veebar XOR
22BC barwed NAND
22C0 16.03 N-ARY LOGICAL AND
22C1 15.03 N-ARY LOGICAL OR
22C2 06.04 N-ARY INTERSECTION
22C3 05.04 N-ARY UNION
22C4 diam DIAMOND OPERATOR
22C5 sdot DOT OPERATOR
22C6 sstarf STAR OPERATOR
22C7 divonx DIVISION TIMES
22C8 bowtie BOWTIE
22C9 ltimes LEFT NORMAL FACTOR SEMIDIRECT PRODUCT
22CA rtimes RIGHT NORMAL FACTOR SEMIDIRECT PRODUCT
22CB lthree LEFT SEMIDIRECT PRODUCT
22CC rthree RIGHT SEMIDIRECT PRODUCT
22CD bsime REVERSED TILDE EQUALS
22CE cuvee CURLY LOGICAL OR
22CF cuwed CURLY LOGICAL AND
22D0 Sub DOUBLE SUBSET
22D1 Sup DOUBLE SUPERSET
22D2 Cap DOUBLE INTERSECTION
22D3 Cup DOUBLE UNION
22D4 fork PITCHFORK
22D6 ldot LESS THAN WITH DOT
22D7 gsdot GREATER THAN WITH DOT
22D8 Ll VERY MUCH LESS THAN
22D9 Gg VERY MUCH GREATER THAN
22DA leg LESS THAN EQUAL TO OR GREATER THAN
22DB gel GREATER THAN EQUAL TO OR LESS THAN
22DC els EQUAL TO OR LESS THAN
22DD egs EQUAL TO OR GREATER THAN
22DE cuepr EQUAL TO OR PRECEDES
22DF cuesc EQUAL TO OR SUCCEEDS
22E0 npre DOES NOT PRECEDE OR EQUAL
22E1 nsce DOES NOT SUCCEED OR EQUAL
22E6 lnsim LESS THAN BUT NOT EQUIVALENT TO
22E7 gnsim GREATER THAN BUT NOT EQUIVALENT TO
22E8 prnsim PRECEDES BUT NOT EQUIVALENT TO
22E9 scnsim SUCCEEDS BUT NOT EQUIVALENT TO
22EA nltri NOT NORMAL SUBGROUP OF
22EB nrtri DOES NOT CONTAIN AS NORMAL SUBGROUP
22EC nltrie NOT NORMAL SUBGROUP OF OR EQUAL TO
22ED nrtrie DOES NOT CONTAIN AS NORMAL SUBGROUP OR EQUAL
22EE 13.04 vellip VERTICAL ELLIPSIS
2306 Barwed PERSPECTIVE
2307 17.03 WAVY LINE
2308 lceil LEFT CEILING
2309 rceil RIGHT CEILING
230A lfloor LEFT FLOOR
230B rfloor RIGHT FLOOR
230C drcrop BOTTOM RIGHT CROP
230D dlcrop BOTTOM LEFT CROP
230E urcrop TOP RIGHT CROP
230F ulcrop TOP LEFT CROP
2315 telrec TELEPHONE RECORDER
2316 target POSITION INDICATOR
231C ulcorn TOP LEFT CORNER
231D urcorn TOP RIGHT CORNER
231E dlcorn BOTTOM LEFT CORNER
231F drcorn BOTTOM RIGHT CORNER
2322 frown FROWN
2323 smile SMILE
2329 03.13 lang BRA
232A 04.13 rang KET
2423 blank OPEN BOX
24C8 oS CIRCLED LATIN CAPITAL LETTER S
2500 boxh FORMS LIGHT HORIZONTAL
2502 boxv FORMS LIGHT VERTICAL
250C boxdr FORMS LIGHT DOWN AND RIGHT
2510 boxdl FORMS LIGHT DOWN AND LEFT
2514 boxur FORMS LIGHT UP AND RIGHT
2518 boxul FORMS LIGHT UP AND LEFT
251C boxvr FORMS LIGHT VERTICAL AND RIGHT
2524 boxvl FORMS LIGHT VERTICAL AND LEFT
252C boxhd FORMS LIGHT DOWN AND HORIZONTAL
2534 boxhu FORMS LIGHT UP AND HORIZONTAL
253C boxvh FORMS LIGHT VERTICAL AND HORIZONTAL
2550 boxH FORMS DOUBLE HORIZONTAL
2551 boxV FORMS DOUBLE VERTICAL
2552 boxdR FORMS DOWN SINGLE AND RIGHT DOUBLE
2553 boxDr FORMS DOWN DOUBLE AND RIGHT SINGLE
2554 boxDR FORMS DOUBLE DOWN AND RIGHT
2555 boxdL FORMS DOWN SINGLE AND LEFT DOUBLE
2556 boxDl FORMS DOWN DOUBLE AND LEFT SINGLE
2557 boxDL FORMS DOUBLE DOWN AND LEFT
2558 boxuR FORMS UP SINGLE AND RIGHT DOUBLE
2559 boxUr FORMS UP DOUBLE AND RIGHT SINGLE
255A boxUR FORMS DOUBLE UP AND RIGHT
255B boxuL FORMS UP SINGLE AND LEFT DOUBLE
255C boxUl FORMS UP DOUBLE AND LEFT SINGLE
255D boxUL FORMS DOUBLE UP AND LEFT
255E boxvR FORMS VERTICAL SINGLE AND RIGHT DOUBLE
255F boxVr FORMS VERTICAL DOUBLE AND RIGHT SINGLE
2560 boxVR FORMS DOUBLE VERTICAL AND RIGHT
2561 boxvL FORMS VERTICAL SINGLE AND LEFT DOUBLE
2562 boxVl FORMS VERTICAL DOUBLE AND LEFT SINGLE
2563 boxVL FORMS DOUBLE VERTICAL AND LEFT
2564 boxHd FORMS DOWN SINGLE AND HORIZONTAL DOUBLE
2565 boxhD FORMS DOWN DOUBLE AND HORIZONTAL SINGLE
2566 boxHD FORMS DOUBLE DOWN AND HORIZONTAL
2567 boxHu FORMS UP SINGLE AND HORIZONTAL DOUBLE
2568 boxhU FORMS UP DOUBLE AND HORIZONTAL SINGLE
2569 boxHU FORMS DOUBLE UP AND HORIZONTAL
256A boxvH FORMS VERTICAL SINGLE AND HORIZONTAL DOUBLE
256B boxVh FORMS VERTICAL DOUBLE AND HORIZONTAL SINGLE
256C boxVH FORMS DOUBLE VERTICAL AND HORIZONTAL
2571 15.01 FORMS LIGHT DIAGONAL UPPER RIGHT TO LOWER LEFT
2572 16.01 FORMS LIGHT DIAGONAL UPPER LEFT TO LOWER RIGHT
2580 uhblk UPPER HALF BLOCK
2584 lhblk LOWER HALF BLOCK
2588 block FULL BLOCK
2591 blk14 LIGHT SHADE
2592 blk12 MEDIUM SHADE
2593 blk34 DARK SHADE
25A1 15.13 squ,square WHITE SQUARE
25AA squf BLACK SMALL SQUARE
25AD 15.14 rect WHITE RECTANGLE
25AE marker BLACK VERTICAL RECTANGLE
25B1 16.14 WHITE PARALLELOGRAM
25B3 13.15 xutri WHITE UP POINTING TRIANGLE
25B4 utrif BLACK UP POINTING SMALL TRIANGLE
25B5 utri WHITE UP POINTING SMALL TRIANGLE
25B7 14.14 WHITE RIGHT POINTING TRIANGLE
25B8 rtrif BLACK RIGHT POINTING SMALL TRIANGLE
25B9 rtri WHITE RIGHT POINTING SMALL TRIANGLE
25BD 14.15 xdtri WHITE DOWN POINTING TRIANGLE
25BE dtrif BLACK DOWN POINTING SMALL TRIANGLE
25BF dtri WHITE DOWN POINTING SMALL TRIANGLE
25C1 13.14 WHITE LEFT POINTING TRIANGLE
25C2 ltrif BLACK LEFT POINTING SMALL TRIANGLE
25C3 ltri WHITE LEFT POINTING SMALL TRIANGLE
25CA 15.15 LOZENGE
25CB 15.12 cir WHITE CIRCLE
25CB xcirc WHITE CIRCLE
25CF 16.12 BLACK CIRCLE
2605 starf BLACK STAR
2606 star WHITE STAR
260E phone BLACK TELEPHONE
2640 female FEMALE SIGN
2642 male MALE SIGN
2660 spades BLACK SPADE SUIT
2661 hearts WHITE HEART SUIT
2662 diams WHITE DIAMOND SUIT
2663 clubs BLACK CLUB SUIT
266A sung EIGHTH NOTE
266D flat FLAT
266E natur NATURAL
266F sharp SHARP
2713 check CHECK MARK
2717 cross BALLOT X
2720 malt MALTESE CROSS
2726 lozf BLACK FOUR POINTED STAR
2727 loz WHITE FOUR POINTED STAR
2736 sextile SIX POINTED BLACK STAR
3018 13.02 OPENING WHITE TORTOISE SHELL BRACKET
3019 14.02 CLOSING WHITE TORTOISE SHELL BRACKET
301A 03.14 OPENING WHITE SQUARE BRACKET
301B 04.14 CLOSING WHITE SQUARE BRACKET
SGML and ISO 6862.2 ligatures, glyphs, and glyph variants which
are not encoded in Unicode:
fflig
ffilig
ffllig
filig
fllig
fjlig
jnodot
spar
16.00 smid
nsmid
nspar
ssmile
sfrown
lap
gap
lnap
gnap
lEg
gEl
vsupne
vsubne
vsubnE
vsupnE
subne
supne
pre
sce
prnE
scnE
prap
scap
prnap
scnap
xlArr
xrArr
xharr

View File

@@ -3,11 +3,20 @@ This Unicode directory contains mapping files extracted from the CDROM that came
The Xerox subdirectory contains mappings from the Xerox character encoding (version XC1-3-3-0, 1887) into Unicode 3.0. standard into Unicode. That is the version of XCCS corresponding to the fonts in the Medley system. The Xerox mappings did not come from the Unicode CDROM, they were constructed by combining and constrasting information from a binary file (xerox>XCCStoUni) of unknown provenance with code mappings scraped from the Wikipedia page https://en.wikipedia.org/wiki/Xerox_Character_Code_Standard in July 2020. Both sources were errorful and incomplete, so many of the mappings were hand corrected. There are still missing mappings, and there still may be errors.
EASTASIA:
CJK cross reference mappings for standards such as KSC5601,
The CDROM came with CJK cross reference mappings for standards such as KSC5601,
GB2312, JIS0208, etc. to Unicode 2.0.
However, these particular mappings are now obsolete and have been removed as per
this note from Unicode.org:
The entire former contents of this directory are obsolete and have been
moved to the OBSOLETE directory. The latest information may be found
in the Unihan data files in the latest Unicode Character Database.
August 1, 2001.
The current set of mappings are available from
https://unicode.org/Public/UNIDATA/Unihan.zip
The format of these files is given in https://unicode.org/reports/tr38/
ISO8859:
These are the mapping tables of the ISO 8859 series (1 through 9)
These are the mapping tables of the ISO 8859 series (1 through 16)
VENDORS:
Miscellaneous mapping tables for small codesets, typically provided

View File

@@ -1,4 +1,42 @@
The files in this directory are not official. They were created
from the original Unicode Version 1.0 mapping tables for
convenience only.
Contact rick@unicode.org with any problems.
2011 July 12 Note:
The information in this ReadMe.txt is obsolete, but is retained for
historical purposes, as documentation of the original mapping files
included in this directory.
For a more current version of the Adobe glyph naming conventions, see
the document "Glyph", located at:
<http://www.adobe.com/devnet/opentype/archives/glyph.html>
-------------------------------------------------------------------------------
File name: Adobe ReadMe for Unicode 3.0
Date: 30 March 1999
-------------------------------------------------------------------------------
The document "Unicode and Glyph Names," at:
http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
describes Adobe's PostScript glyph naming conventions in the context of
Unicode and also contains links to the following 3 database files:
"The Adobe Glyph List" (AGL), which maps approximately 1000 glyph names to
standard or Corporate Use subarea Unicode values.
"Unicode's Corporate Use Subarea as used by Adobe."
"Zapf Dingbats Glyph Names and Unicode Values."
-------------------------------------------------------------------------------
The 3 files in the current directory:
stdenc.txt
symbol.txt
zdingbat.txt
were originally provided by the Unicode Consortium for use by NeXT
implementations with DPS, and continue to be provided for compatibility with
those implementations. All others should refer to the files on Adobe.com
mentioned above.
-------------------------------------------------------------------------------

View File

@@ -1,51 +1,80 @@
#
# Name: Adobe Standard Encoding to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Table format: Format A
# Date: 05 May 1995
# Name: Adobe Standard Encoding to Unicode
# Unicode version: 2.0
# Table version: 1.0
# Date: 2011 July 12
#
# Copyright (c) 1991-2011 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium). No
# claims are made as to fitness for any particular purpose. No warranties of
# any kind are expressed or implied. The recipient agrees to determine
# applicability of information provided. If this file has been provided on
# magnetic media by Unicode, Inc., the sole remedy for any claim will be
# exchange of defective media within 90 days of receipt.
#
# Unicode, Inc. hereby grants the right to freely use the information
# supplied in this file in the creation of products supporting the
# Unicode Standard, and to make copies of this file in any form for
# internal or external distribution as long as this notice remains
# attached.
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
# Format: 4 tab-delimited fields:
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
# (1) The Unicode value (in hexadecimal)
# (2) The Adobe Standard Encoding code point (in hexadecimal)
# (3) # Unicode name
# (4) # PostScript character name
#
# General Notes:
#
# The Unicode values in this table were produced as the result of applying
# the algorithm described in the section "Populating a Unicode space" in the
# document "Unicode and Glyph Names," at
# http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
# to the characters encoded in Adobe Standard Encoding. Note that some
# Standard Encoding characters, such as "space", are mapped to 2 Unicode
# values. Refer to the above document for more details.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# 2011 July 12: The above link is no longer valid. For comparable,
# more current information, see the document, "Glyph", at:
# <http://www.adobe.com/devnet/opentype/archives/glyph.html>
#
# General notes:
# Revision History:
#
# Format: Three tab-separated columns
# Column #1 is the Unicode code (in hex)
# Column #2 is the Adobe code (in hex)
# Column #3 # Unicode name <tab> # Adobe name
# [v1.0, 2011 July 12]
# Updated terms of use to current wording.
# Updated contact information and document link.
# No changes to the mapping data.
#
# Any comments or questions contact: unicode-inc@unicode.org
# [v0.2, 30 March 1999]
# Different algorithm to produce Unicode values (see notes above) results in
# some character codes being mapped to 2 Unicode values. Updated Unicode
# names to Unicode 2.0 names.
#
# [v0.1, 5 May 1995] First release.
#
# Use the Unicode reporting form <http://www.unicode.org/reporting.html>
# for any questions or comments or to report errors in the data.
#
0020 20 # SPACE # space
00A0 20 # NO-BREAK SPACE # space
0021 21 # EXCLAMATION MARK # exclam
0022 22 # QUOTATION MARK # quotedbl
0023 23 # NUMBER SIGN # numbersign
0024 24 # DOLLAR SIGN # dollar
0025 25 # PERCENT SIGN # percent
0026 26 # AMPERSAND # ampersand
0027 A9 # APOSTROPHE-QUOTE # quotesingle
0028 28 # OPENING PARENTHESIS # parenleft
0029 29 # CLOSING PARENTHESIS # parenright
2019 27 # RIGHT SINGLE QUOTATION MARK # quoteright
0028 28 # LEFT PARENTHESIS # parenleft
0029 29 # RIGHT PARENTHESIS # parenright
002A 2A # ASTERISK # asterisk
002B 2B # PLUS SIGN # plus
002C 2C # COMMA # comma
002D 2D # HYPHEN-MINUS # hyphen
002E 2E # PERIOD # period
002F 2F # SLASH # slash
00AD 2D # SOFT HYPHEN # hyphen
002E 2E # FULL STOP # period
002F 2F # SOLIDUS # slash
0030 30 # DIGIT ZERO # zero
0031 31 # DIGIT ONE # one
0032 32 # DIGIT TWO # two
@@ -89,12 +118,12 @@
0058 58 # LATIN CAPITAL LETTER X # X
0059 59 # LATIN CAPITAL LETTER Y # Y
005A 5A # LATIN CAPITAL LETTER Z # Z
005B 5B # OPENING SQUARE BRACKET # bracketleft
005C 5C # BACKSLASH # backslash
005D 5D # CLOSING SQUARE BRACKET # bracketright
005E 5E # SPACING CIRCUMFLEX # asciicircum
005F 5F # SPACING UNDERSCORE # underscore
0060 C1 # SPACING GRAVE # grave
005B 5B # LEFT SQUARE BRACKET # bracketleft
005C 5C # REVERSE SOLIDUS # backslash
005D 5D # RIGHT SQUARE BRACKET # bracketright
005E 5E # CIRCUMFLEX ACCENT # asciicircum
005F 5F # LOW LINE # underscore
2018 60 # LEFT SINGLE QUOTATION MARK # quoteleft
0061 61 # LATIN SMALL LETTER A # a
0062 62 # LATIN SMALL LETTER B # b
0063 63 # LATIN SMALL LETTER C # c
@@ -121,61 +150,64 @@
0078 78 # LATIN SMALL LETTER X # x
0079 79 # LATIN SMALL LETTER Y # y
007A 7A # LATIN SMALL LETTER Z # z
007B 7B # OPENING CURLY BRACKET # braceleft
007C 7C # VERTICAL BAR # bar
007D 7D # CLOSING CURLY BRACKET # braceright
007B 7B # LEFT CURLY BRACKET # braceleft
007C 7C # VERTICAL LINE # bar
007D 7D # RIGHT CURLY BRACKET # braceright
007E 7E # TILDE # asciitilde
00A1 A1 # INVERTED EXCLAMATION MARK # exclamdown
00A2 A2 # CENT SIGN # cent
00A3 A3 # POUND SIGN # sterling
00A4 A8 # CURRENCY SIGN # currency
00A5 A5 # YEN SIGN # yen
00A7 A7 # SECTION SIGN # section
00A8 C8 # SPACING DIAERESIS # dieresis
00AA E3 # FEMININE ORDINAL INDICATOR # ordfeminine
00AB AB # LEFT POINTING GUILLEMET # guillemotleft
00AF C5 # SPACING MACRON # macron
00B4 C2 # SPACING ACUTE # acute
00B6 B6 # PARAGRAPH SIGN # paragraph
00B7 B4 # MIDDLE DOT # periodcentered
00B8 CB # SPACING CEDILLA # cedilla
00BA EB # MASCULINE ORDINAL INDICATOR # ordmasculine
00BB BB # RIGHT POINTING GUILLEMET # guillemotright
00BF BF # INVERTED QUESTION MARK # questiondown
00C6 E1 # LATIN CAPITAL LETTER A E # AE
00D8 E9 # LATIN CAPITAL LETTER O SLASH # Oslash
00DF FB # LATIN SMALL LETTER SHARP S # germandbls
00E6 F1 # LATIN SMALL LETTER A E # ae
00F8 F9 # LATIN SMALL LETTER O SLASH # oslash
0131 F5 # LATIN SMALL LETTER DOTLESS I # dotlessi
0141 E8 # LATIN CAPITAL LETTER L SLASH # Lslash
0142 F8 # LATIN SMALL LETTER L SLASH # lslash
0152 EA # LATIN CAPITAL LETTER O E # OE
0153 FA # LATIN SMALL LETTER O E # oe
0192 A6 # LATIN SMALL LETTER SCRIPT F # florin
02C6 C3 # MODIFIER LETTER CIRCUMFLEX # circumflex
02C7 CF # MODIFIER LETTER HACEK # caron
02D8 C6 # SPACING BREVE # breve
02D9 C7 # SPACING DOT ABOVE # dotaccent
02DA CA # SPACING RING ABOVE # ring
02DB CE # SPACING OGONEK # ogonek
02DC C4 # SPACING TILDE # tilde
02DD CD # SPACING DOUBLE ACUTE # hungarumlaut
2013 B1 # EN DASH # endash
2014 D0 # EM DASH # emdash
2018 60 # SINGLE TURNED COMMA QUOTATION MARK # quoteleft
2019 27 # SINGLE COMMA QUOTATION MARK # quoteright
201A B8 # LOW SINGLE COMMA QUOTATION MARK # quotesinglbase
201C AA # DOUBLE TURNED COMMA QUOTATION MARK # quotedblleft
201D BA # DOUBLE COMMA QUOTATION MARK # quotedblright
201E B9 # LOW DOUBLE COMMA QUOTATION MARK # quotedblbase
2020 B2 # DAGGER # dagger
2021 B3 # DOUBLE DAGGER # daggerdbl
2022 B7 # BULLET # bullet
2026 BC # HORIZONTAL ELLIPSIS # ellipsis
2030 BD # PER MILLE SIGN # perthousand
2039 AC # LEFT POINTING SINGLE GUILLEMET # guilsinglleft
203A AD # RIGHT POINTING SINGLE GUILLEMET # guilsinglright
2044 A4 # FRACTION SLASH # fraction
2215 A4 # DIVISION SLASH # fraction
00A5 A5 # YEN SIGN # yen
0192 A6 # LATIN SMALL LETTER F WITH HOOK # florin
00A7 A7 # SECTION SIGN # section
00A4 A8 # CURRENCY SIGN # currency
0027 A9 # APOSTROPHE # quotesingle
201C AA # LEFT DOUBLE QUOTATION MARK # quotedblleft
00AB AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK # guillemotleft
2039 AC # SINGLE LEFT-POINTING ANGLE QUOTATION MARK # guilsinglleft
203A AD # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK # guilsinglright
FB01 AE # LATIN SMALL LIGATURE FI # fi
FB02 AF # LATIN SMALL LIGATURE FL # fl
2013 B1 # EN DASH # endash
2020 B2 # DAGGER # dagger
2021 B3 # DOUBLE DAGGER # daggerdbl
00B7 B4 # MIDDLE DOT # periodcentered
2219 B4 # BULLET OPERATOR # periodcentered
00B6 B6 # PILCROW SIGN # paragraph
2022 B7 # BULLET # bullet
201A B8 # SINGLE LOW-9 QUOTATION MARK # quotesinglbase
201E B9 # DOUBLE LOW-9 QUOTATION MARK # quotedblbase
201D BA # RIGHT DOUBLE QUOTATION MARK # quotedblright
00BB BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK # guillemotright
2026 BC # HORIZONTAL ELLIPSIS # ellipsis
2030 BD # PER MILLE SIGN # perthousand
00BF BF # INVERTED QUESTION MARK # questiondown
0060 C1 # GRAVE ACCENT # grave
00B4 C2 # ACUTE ACCENT # acute
02C6 C3 # MODIFIER LETTER CIRCUMFLEX ACCENT # circumflex
02DC C4 # SMALL TILDE # tilde
00AF C5 # MACRON # macron
02C9 C5 # MODIFIER LETTER MACRON # macron
02D8 C6 # BREVE # breve
02D9 C7 # DOT ABOVE # dotaccent
00A8 C8 # DIAERESIS # dieresis
02DA CA # RING ABOVE # ring
00B8 CB # CEDILLA # cedilla
02DD CD # DOUBLE ACUTE ACCENT # hungarumlaut
02DB CE # OGONEK # ogonek
02C7 CF # CARON # caron
2014 D0 # EM DASH # emdash
00C6 E1 # LATIN CAPITAL LETTER AE # AE
00AA E3 # FEMININE ORDINAL INDICATOR # ordfeminine
0141 E8 # LATIN CAPITAL LETTER L WITH STROKE # Lslash
00D8 E9 # LATIN CAPITAL LETTER O WITH STROKE # Oslash
0152 EA # LATIN CAPITAL LIGATURE OE # OE
00BA EB # MASCULINE ORDINAL INDICATOR # ordmasculine
00E6 F1 # LATIN SMALL LETTER AE # ae
0131 F5 # LATIN SMALL LETTER DOTLESS I # dotlessi
0142 F8 # LATIN SMALL LETTER L WITH STROKE # lslash
00F8 F9 # LATIN SMALL LETTER O WITH STROKE # oslash
0153 FA # LATIN SMALL LIGATURE OE # oe
00DF FB # LATIN SMALL LETTER SHARP S # germandbls

View File

@@ -1,71 +1,82 @@
#
# Name: Adobe Symbol Encoding to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Table format: Format A
# Date: 05 May 1995
# Name: Adobe Symbol Encoding to Unicode
# Unicode version: 2.0
# Table version: 1.0
# Date: 2011 July 12
#
# Copyright (c) 1991-2011 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium). No
# claims are made as to fitness for any particular purpose. No warranties of
# any kind are expressed or implied. The recipient agrees to determine
# applicability of information provided. If this file has been provided on
# magnetic media by Unicode, Inc., the sole remedy for any claim will be
# exchange of defective media within 90 days of receipt.
#
# Unicode, Inc. hereby grants the right to freely use the information
# supplied in this file in the creation of products supporting the
# Unicode Standard, and to make copies of this file in any form for
# internal or external distribution as long as this notice remains
# attached.
#
# Format: 4 tab-delimited fields:
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
# (1) The Unicode value (in hexadecimal)
# (2) The Symbol Encoding code point (in hexadecimal)
# (3) # Unicode name
# (4) # PostScript character name
#
# General Notes:
#
# The Unicode values in this table were produced as the result of applying
# the algorithm described in the section "Populating a Unicode space" in the
# document "Unicode and Glyph Names," at
# http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
# to the characters in Symbol. Note that some characters, such as "space",
# are mapped to 2 Unicode values. 29 characters have assignments in the
# Corporate Use Subarea; these are indicated by "(CUS)" in field 4. Refer to
# the above document for more details.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
# 2011 July 12: The above link is no longer valid. For comparable,
# more current information, see the document, "Glyph", at:
# <http://www.adobe.com/devnet/opentype/archives/glyph.html>
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Revision History:
#
# Format: Three tab-separated columns
# Column #1 is the Unicode code (in hex)
# Column #2 is the Adobe code (in hex)
# Column #3 # Unicode name <tab> # Adobe name
# [v1.0, 2011 July 12]
# Updated terms of use to current wording.
# Updated contact information and document link.
# No changes to the mapping data.
#
# General notes: There are a number of glyph parts in the Adobe Symbol
# Encoding which are not in the Unicode set. They are as follows:
# [v0.2, 30 March 1999]
# Different algorithm to produce Unicode values (see notes above) results in
# some character codes being mapped to 2 Unicode values; use of Corporate
# Use subarea values; addition of the euro character; changed assignments of
# some characters such as the COPYRIGHT SIGNs and RADICAL EXTENDER. Updated
# Unicode names to Unicode 2.0 names.
#
# BD arrowvertex
# BE arrowhorizex
# E6 parenlefttp
# E7 parenleftex
# E8 parenleftbt
# E9 bracketlefttp
# EA bracketleftex
# EB bracketleftbt
# EC bracelefttp
# ED braceleftmid
# EE braceleftbt
# EF braceex
# F4 integralex
# F6 parenrighttp
# F7 parenrightex
# F8 parenrightbt
# F9 bracketrighttp
# FA bracketrightex
# FB bracketrightbt
# FC bracerighttp
# FD bracerightmid
# FE bracerightbt
#
# Any comments or questions contact: unicode-inc@unicode.org
# [v0.1, 5 May 1995] First release.
#
# Use the Unicode reporting form <http://www.unicode.org/reporting.html>
# for any questions or comments or to report errors in the data.
#
0020 20 # SPACE # space
00A0 20 # NO-BREAK SPACE # space
0021 21 # EXCLAMATION MARK # exclam
2200 22 # FOR ALL # universal
0023 23 # NUMBER SIGN # numbersign
2203 24 # THERE EXISTS # existential
0025 25 # PERCENT SIGN # percent
0026 26 # AMPERSAND # ampersand
0028 28 # OPENING PARENTHESIS # parenleft
0029 29 # CLOSING PARENTHESIS # parenright
220B 27 # CONTAINS AS MEMBER # suchthat
0028 28 # LEFT PARENTHESIS # parenleft
0029 29 # RIGHT PARENTHESIS # parenright
2217 2A # ASTERISK OPERATOR # asteriskmath
002B 2B # PLUS SIGN # plus
002C 2C # COMMA # comma
002E 2E # PERIOD # period
002F 2F # SLASH # slash
2212 2D # MINUS SIGN # minus
002E 2E # FULL STOP # period
002F 2F # SOLIDUS # slash
0030 30 # DIGIT ZERO # zero
0031 31 # DIGIT ONE # one
0032 32 # DIGIT TWO # two
@@ -82,145 +93,164 @@
003D 3D # EQUALS SIGN # equal
003E 3E # GREATER-THAN SIGN # greater
003F 3F # QUESTION MARK # question
005B 5B # OPENING SQUARE BRACKET # bracketleft
005D 5D # CLOSING SQUARE BRACKET # bracketright
005F 5F # SPACING UNDERSCORE # underscore
007B 7B # OPENING CURLY BRACKET # braceleft
007C 7C # VERTICAL BAR # bar
007D 7D # CLOSING CURLY BRACKET # braceright
00A9 D3 # COPYRIGHT SIGN # copyrightserif
00A9 E3 # COPYRIGHT SIGN # copyrightsans
00AC D8 # NOT SIGN # logicalnot
00AE D2 # REGISTERED TRADE MARK SIGN # registeredserif
00AE E2 # REGISTERED TRADE MARK SIGN # registeredsans
00B0 B0 # DEGREE SIGN # degree
00B1 B1 # PLUS-OR-MINUS SIGN # plusminus
00D7 B4 # MULTIPLICATION SIGN # multiply
00F7 B8 # DIVISION SIGN # divide
0192 A6 # LATIN SMALL LETTER SCRIPT F # florin
2245 40 # APPROXIMATELY EQUAL TO # congruent
0391 41 # GREEK CAPITAL LETTER ALPHA # Alpha
0392 42 # GREEK CAPITAL LETTER BETA # Beta
0393 47 # GREEK CAPITAL LETTER GAMMA # Gamma
03A7 43 # GREEK CAPITAL LETTER CHI # Chi
0394 44 # GREEK CAPITAL LETTER DELTA # Delta
2206 44 # INCREMENT # Delta
0395 45 # GREEK CAPITAL LETTER EPSILON # Epsilon
0396 5A # GREEK CAPITAL LETTER ZETA # Zeta
03A6 46 # GREEK CAPITAL LETTER PHI # Phi
0393 47 # GREEK CAPITAL LETTER GAMMA # Gamma
0397 48 # GREEK CAPITAL LETTER ETA # Eta
0398 51 # GREEK CAPITAL LETTER THETA # Theta
0399 49 # GREEK CAPITAL LETTER IOTA # Iota
03D1 4A # GREEK THETA SYMBOL # theta1
039A 4B # GREEK CAPITAL LETTER KAPPA # Kappa
039B 4C # GREEK CAPITAL LETTER LAMBDA # Lambda
039B 4C # GREEK CAPITAL LETTER LAMDA # Lambda
039C 4D # GREEK CAPITAL LETTER MU # Mu
039D 4E # GREEK CAPITAL LETTER NU # Nu
039E 58 # GREEK CAPITAL LETTER XI # Xi
039F 4F # GREEK CAPITAL LETTER OMICRON # Omicron
03A0 50 # GREEK CAPITAL LETTER PI # Pi
0398 51 # GREEK CAPITAL LETTER THETA # Theta
03A1 52 # GREEK CAPITAL LETTER RHO # Rho
03A3 53 # GREEK CAPITAL LETTER SIGMA # Sigma
03A4 54 # GREEK CAPITAL LETTER TAU # Tau
03A5 55 # GREEK CAPITAL LETTER UPSILON # Upsilon
03A6 46 # GREEK CAPITAL LETTER PHI # Phi
03A7 43 # GREEK CAPITAL LETTER CHI # Chi
03A8 59 # GREEK CAPITAL LETTER PSI # Psi
03C2 56 # GREEK SMALL LETTER FINAL SIGMA # sigma1
03A9 57 # GREEK CAPITAL LETTER OMEGA # Omega
2126 57 # OHM SIGN # Omega
039E 58 # GREEK CAPITAL LETTER XI # Xi
03A8 59 # GREEK CAPITAL LETTER PSI # Psi
0396 5A # GREEK CAPITAL LETTER ZETA # Zeta
005B 5B # LEFT SQUARE BRACKET # bracketleft
2234 5C # THEREFORE # therefore
005D 5D # RIGHT SQUARE BRACKET # bracketright
22A5 5E # UP TACK # perpendicular
005F 5F # LOW LINE # underscore
F8E5 60 # RADICAL EXTENDER # radicalex (CUS)
03B1 61 # GREEK SMALL LETTER ALPHA # alpha
03B2 62 # GREEK SMALL LETTER BETA # beta
03B3 67 # GREEK SMALL LETTER GAMMA # gamma
03C7 63 # GREEK SMALL LETTER CHI # chi
03B4 64 # GREEK SMALL LETTER DELTA # delta
03B5 65 # GREEK SMALL LETTER EPSILON # epsilon
03B6 7A # GREEK SMALL LETTER ZETA # zeta
03C6 66 # GREEK SMALL LETTER PHI # phi
03B3 67 # GREEK SMALL LETTER GAMMA # gamma
03B7 68 # GREEK SMALL LETTER ETA # eta
03B8 71 # GREEK SMALL LETTER THETA # theta
03B9 69 # GREEK SMALL LETTER IOTA # iota
03D5 6A # GREEK PHI SYMBOL # phi1
03BA 6B # GREEK SMALL LETTER KAPPA # kappa
03BB 6C # GREEK SMALL LETTER LAMBDA # lambda
03BB 6C # GREEK SMALL LETTER LAMDA # lambda
00B5 6D # MICRO SIGN # mu
03BC 6D # GREEK SMALL LETTER MU # mu
03BD 6E # GREEK SMALL LETTER NU # nu
03BE 78 # GREEK SMALL LETTER XI # xi
03BF 6F # GREEK SMALL LETTER OMICRON # omicron
03C0 70 # GREEK SMALL LETTER PI # pi
03B8 71 # GREEK SMALL LETTER THETA # theta
03C1 72 # GREEK SMALL LETTER RHO # rho
03C2 56 # GREEK SMALL LETTER FINAL SIGMA # sigma1
03C3 73 # GREEK SMALL LETTER SIGMA # sigma
03C4 74 # GREEK SMALL LETTER TAU # tau
03C5 75 # GREEK SMALL LETTER UPSILON # upsilon
03C6 66 # GREEK SMALL LETTER PHI # phi
03C7 63 # GREEK SMALL LETTER CHI # chi
03C8 79 # GREEK SMALL LETTER PSI # psi
03D6 76 # GREEK PI SYMBOL # omega1
03C9 77 # GREEK SMALL LETTER OMEGA # omega
03D1 4A # GREEK SMALL LETTER SCRIPT THETA # theta1
03D2 A1 # GREEK CAPITAL LETTER UPSILON HOOK # Upsilon1
03D5 6A # GREEK SMALL LETTER SCRIPT PHI # phi1
03D6 76 # GREEK SMALL LETTER OMEGA PI # omega1
2022 B7 # BULLET # bullet
2026 BC # HORIZONTAL ELLIPSIS # ellipsis
2032 A2 # PRIME # minute
2033 B2 # DOUBLE PRIME # second
203E 60 # SPACING OVERSCORE # radicalex
2044 A4 # FRACTION SLASH # fraction
2111 C1 # BLACK-LETTER I # Ifraktur
2118 C3 # SCRIPT P # weierstrass
211C C2 # BLACK-LETTER R # Rfraktur
2122 D4 # TRADEMARK # trademarkserif
2122 E4 # TRADEMARK # trademarksans
2126 57 # OHM # Omega
2135 C0 # FIRST TRANSFINITE CARDINAL # aleph
2190 AC # LEFT ARROW # arrowleft
2191 AD # UP ARROW # arrowup
2192 AE # RIGHT ARROW # arrowright
2193 AF # DOWN ARROW # arrowdown
2194 AB # LEFT RIGHT ARROW # arrowboth
21B5 BF # DOWN ARROW WITH CORNER LEFT # carriagereturn
21D0 DC # LEFT DOUBLE ARROW # arrowdblleft
21D1 DD # UP DOUBLE ARROW # arrowdblup
21D2 DE # RIGHT DOUBLE ARROW # arrowdblright
21D3 DF # DOWN DOUBLE ARROW # arrowdbldown
21D4 DB # LEFT RIGHT DOUBLE ARROW # arrowdblboth
2200 22 # FOR ALL # universal
2202 B6 # PARTIAL DIFFERENTIAL # partialdiff
2203 24 # THERE EXISTS # existential
2205 C6 # EMPTY SET # emptyset
2206 44 # INCREMENT # Delta
2207 D1 # NABLA # gradient
2208 CE # ELEMENT OF # element
2209 CF # NOT AN ELEMENT OF # notelement
220B 27 # CONTAINS AS MEMBER # suchthat
220F D5 # N-ARY PRODUCT # product
2211 E5 # N-ARY SUMMATION # summation
2212 2D # MINUS SIGN # minus
2215 A4 # DIVISION SLASH # fraction
2217 2A # ASTERISK OPERATOR # asteriskmath
221A D6 # SQUARE ROOT # radical
221D B5 # PROPORTIONAL TO # proportional
221E A5 # INFINITY # infinity
2220 D0 # ANGLE # angle
2227 D9 # LOGICAL AND # logicaland
2228 DA # LOGICAL OR # logicalor
2229 C7 # INTERSECTION # intersection
222A C8 # UNION # union
222B F2 # INTEGRAL # integral
2234 5C # THEREFORE # therefore
03BE 78 # GREEK SMALL LETTER XI # xi
03C8 79 # GREEK SMALL LETTER PSI # psi
03B6 7A # GREEK SMALL LETTER ZETA # zeta
007B 7B # LEFT CURLY BRACKET # braceleft
007C 7C # VERTICAL LINE # bar
007D 7D # RIGHT CURLY BRACKET # braceright
223C 7E # TILDE OPERATOR # similar
2245 40 # APPROXIMATELY EQUAL TO # congruent
2248 BB # ALMOST EQUAL TO # approxequal
20AC A0 # EURO SIGN # Euro
03D2 A1 # GREEK UPSILON WITH HOOK SYMBOL # Upsilon1
2032 A2 # PRIME # minute
2264 A3 # LESS-THAN OR EQUAL TO # lessequal
2044 A4 # FRACTION SLASH # fraction
2215 A4 # DIVISION SLASH # fraction
221E A5 # INFINITY # infinity
0192 A6 # LATIN SMALL LETTER F WITH HOOK # florin
2663 A7 # BLACK CLUB SUIT # club
2666 A8 # BLACK DIAMOND SUIT # diamond
2665 A9 # BLACK HEART SUIT # heart
2660 AA # BLACK SPADE SUIT # spade
2194 AB # LEFT RIGHT ARROW # arrowboth
2190 AC # LEFTWARDS ARROW # arrowleft
2191 AD # UPWARDS ARROW # arrowup
2192 AE # RIGHTWARDS ARROW # arrowright
2193 AF # DOWNWARDS ARROW # arrowdown
00B0 B0 # DEGREE SIGN # degree
00B1 B1 # PLUS-MINUS SIGN # plusminus
2033 B2 # DOUBLE PRIME # second
2265 B3 # GREATER-THAN OR EQUAL TO # greaterequal
00D7 B4 # MULTIPLICATION SIGN # multiply
221D B5 # PROPORTIONAL TO # proportional
2202 B6 # PARTIAL DIFFERENTIAL # partialdiff
2022 B7 # BULLET # bullet
00F7 B8 # DIVISION SIGN # divide
2260 B9 # NOT EQUAL TO # notequal
2261 BA # IDENTICAL TO # equivalence
2264 A3 # LESS THAN OR EQUAL TO # lessequal
2265 B3 # GREATER THAN OR EQUAL TO # greaterequal
2282 CC # SUBSET OF # propersubset
2283 C9 # SUPERSET OF # propersuperset
2284 CB # NOT A SUBSET OF # notsubset
2286 CD # SUBSET OF OR EQUAL TO # reflexsubset
2287 CA # SUPERSET OF OR EQUAL TO # reflexsuperset
2295 C5 # CIRCLED PLUS # circleplus
2248 BB # ALMOST EQUAL TO # approxequal
2026 BC # HORIZONTAL ELLIPSIS # ellipsis
F8E6 BD # VERTICAL ARROW EXTENDER # arrowvertex (CUS)
F8E7 BE # HORIZONTAL ARROW EXTENDER # arrowhorizex (CUS)
21B5 BF # DOWNWARDS ARROW WITH CORNER LEFTWARDS # carriagereturn
2135 C0 # ALEF SYMBOL # aleph
2111 C1 # BLACK-LETTER CAPITAL I # Ifraktur
211C C2 # BLACK-LETTER CAPITAL R # Rfraktur
2118 C3 # SCRIPT CAPITAL P # weierstrass
2297 C4 # CIRCLED TIMES # circlemultiply
22A5 5E # UP TACK # perpendicular
2295 C5 # CIRCLED PLUS # circleplus
2205 C6 # EMPTY SET # emptyset
2229 C7 # INTERSECTION # intersection
222A C8 # UNION # union
2283 C9 # SUPERSET OF # propersuperset
2287 CA # SUPERSET OF OR EQUAL TO # reflexsuperset
2284 CB # NOT A SUBSET OF # notsubset
2282 CC # SUBSET OF # propersubset
2286 CD # SUBSET OF OR EQUAL TO # reflexsubset
2208 CE # ELEMENT OF # element
2209 CF # NOT AN ELEMENT OF # notelement
2220 D0 # ANGLE # angle
2207 D1 # NABLA # gradient
F6DA D2 # REGISTERED SIGN SERIF # registerserif (CUS)
F6D9 D3 # COPYRIGHT SIGN SERIF # copyrightserif (CUS)
F6DB D4 # TRADE MARK SIGN SERIF # trademarkserif (CUS)
220F D5 # N-ARY PRODUCT # product
221A D6 # SQUARE ROOT # radical
22C5 D7 # DOT OPERATOR # dotmath
2320 F3 # TOP HALF INTEGRAL # integraltp
2321 F5 # BOTTOM HALF INTEGRAL # integralbt
2329 E1 # BRA # angleleft
232A F1 # KET # angleright
00AC D8 # NOT SIGN # logicalnot
2227 D9 # LOGICAL AND # logicaland
2228 DA # LOGICAL OR # logicalor
21D4 DB # LEFT RIGHT DOUBLE ARROW # arrowdblboth
21D0 DC # LEFTWARDS DOUBLE ARROW # arrowdblleft
21D1 DD # UPWARDS DOUBLE ARROW # arrowdblup
21D2 DE # RIGHTWARDS DOUBLE ARROW # arrowdblright
21D3 DF # DOWNWARDS DOUBLE ARROW # arrowdbldown
25CA E0 # LOZENGE # lozenge
2660 AA # BLACK SPADE SUIT # spade
2663 A7 # BLACK CLUB SUIT # club
2665 A9 # BLACK HEART SUIT # heart
2666 A8 # BLACK DIAMOND SUIT # diamond
2329 E1 # LEFT-POINTING ANGLE BRACKET # angleleft
F8E8 E2 # REGISTERED SIGN SANS SERIF # registersans (CUS)
F8E9 E3 # COPYRIGHT SIGN SANS SERIF # copyrightsans (CUS)
F8EA E4 # TRADE MARK SIGN SANS SERIF # trademarksans (CUS)
2211 E5 # N-ARY SUMMATION # summation
F8EB E6 # LEFT PAREN TOP # parenlefttp (CUS)
F8EC E7 # LEFT PAREN EXTENDER # parenleftex (CUS)
F8ED E8 # LEFT PAREN BOTTOM # parenleftbt (CUS)
F8EE E9 # LEFT SQUARE BRACKET TOP # bracketlefttp (CUS)
F8EF EA # LEFT SQUARE BRACKET EXTENDER # bracketleftex (CUS)
F8F0 EB # LEFT SQUARE BRACKET BOTTOM # bracketleftbt (CUS)
F8F1 EC # LEFT CURLY BRACKET TOP # bracelefttp (CUS)
F8F2 ED # LEFT CURLY BRACKET MID # braceleftmid (CUS)
F8F3 EE # LEFT CURLY BRACKET BOTTOM # braceleftbt (CUS)
F8F4 EF # CURLY BRACKET EXTENDER # braceex (CUS)
232A F1 # RIGHT-POINTING ANGLE BRACKET # angleright
222B F2 # INTEGRAL # integral
2320 F3 # TOP HALF INTEGRAL # integraltp
F8F5 F4 # INTEGRAL EXTENDER # integralex (CUS)
2321 F5 # BOTTOM HALF INTEGRAL # integralbt
F8F6 F6 # RIGHT PAREN TOP # parenrighttp (CUS)
F8F7 F7 # RIGHT PAREN EXTENDER # parenrightex (CUS)
F8F8 F8 # RIGHT PAREN BOTTOM # parenrightbt (CUS)
F8F9 F9 # RIGHT SQUARE BRACKET TOP # bracketrighttp (CUS)
F8FA FA # RIGHT SQUARE BRACKET EXTENDER # bracketrightex (CUS)
F8FB FB # RIGHT SQUARE BRACKET BOTTOM # bracketrightbt (CUS)
F8FC FC # RIGHT CURLY BRACKET TOP # bracerighttp (CUS)
F8FD FD # RIGHT CURLY BRACKET MID # bracerightmid (CUS)
F8FE FE # RIGHT CURLY BRACKET BOTTOM # bracerightbt (CUS)

View File

@@ -1,68 +1,76 @@
#
# Name: Adobe Zapf Dingbats Encoding to Unicode
# Unicode version: 1.1
# Table version: 0.1
# Table format: Format A
# Date: 05 May 1995
# Name: Adobe Zapf Dingbats Encoding to Unicode
# Unicode version: 2.0
# Table version: 1.0
# Date: 2011 July 12
#
# Copyright (c) 1991-2011 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium). No
# claims are made as to fitness for any particular purpose. No warranties of
# any kind are expressed or implied. The recipient agrees to determine
# applicability of information provided. If this file has been provided on
# magnetic media by Unicode, Inc., the sole remedy for any claim will be
# exchange of defective media within 90 days of receipt.
#
# Unicode, Inc. hereby grants the right to freely use the information
# supplied in this file in the creation of products supporting the
# Unicode Standard, and to make copies of this file in any form for
# internal or external distribution as long as this notice remains
# attached.
#
# Format: Three tab-delimited fields:
#
# Copyright (c) 1991-1995 Unicode, Inc. All Rights reserved.
# (1) The Unicode value (in hexadecimal)
# (2) The Zapf Dingbats Encoding code point (in hexadecimal)
# (3) # Unicode 2.0 name
# (4) # PostScript character name
#
# General Notes:
#
# The Unicode values in this table were produced as the result of
# applying the algorithm described in the section "Populating a Unicode
# space" in the document "Unicode and Glyph Names," at
# http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
# to the characters in Zapf Dingbats. Note that some characters, such as
# "space", are mapped to 2 Unicode values. 14 characters have assignments in
# the Corporate Use Subarea; these are indicated by "(CUS)" in field 4.
# Refer to the above document for more details.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
# 2011 July 12: The above link is no longer valid. For comparable,
# more current information, see the document, "Glyph", at:
# <http://www.adobe.com/devnet/opentype/archives/glyph.html>
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# Revision History:
#
# General notes:
# [v1.0, 2011 July 12]
# Updated terms of use to current wording.
# Updated contact information and document link.
# No changes to the mapping data.
#
# Format: Three tab-separated columns
# Column #1 is the Unicode code (in hex)
# Column #2 is the Adobe code (in hex)
# Column #3 # Unicode name <tab> # Adobe name
# [v0.2, 30 March 1999] Different algorithm to produce Unicode values (see
# notes above) results in some character codes being mapped to 2 Unicode
# values; use of Corporate Use subarea values; included BLACK CIRCLE and
# RIGHT HALF BLACK CIRCLE. Updated Unicode names to Unicode 2.0 names.
#
# [v0.1, 5 May 1995] First release.
#
# Any comments or questions contact: unicode-inc@unicode.org
# Use the Unicode reporting form <http://www.unicode.org/reporting.html>
# for any questions or comments or to report errors in the data.
#
2192 D5 # RIGHT ARROW # a161
2194 D6 # LEFT RIGHT ARROW # a163
2195 D7 # UP DOWN ARROW # a164
2460 AC # CIRCLED DIGIT ONE # a120
2461 AD # CIRCLED DIGIT TWO # a121
2462 AE # CIRCLED DIGIT THREE # a122
2463 AF # CIRCLED DIGIT FOUR # a123
2464 B0 # CIRCLED DIGIT FIVE # a124
2465 B1 # CIRCLED DIGIT SIX # a125
2466 B2 # CIRCLED DIGIT SEVEN # a126
2467 B3 # CIRCLED DIGIT EIGHT # a127
2468 B4 # CIRCLED DIGIT NINE # a128
2469 B5 # CIRCLED NUMBER TEN # a129
25A0 6E # BLACK SQUARE # a73
25B2 73 # BLACK UP POINTING TRIANGLE # a76
25BC 74 # BLACK DOWN POINTING TRIANGLE # a77
25C6 75 # BLACK DIAMOND # a78
2605 48 # BLACK STAR # a35
260E 25 # BLACK TELEPHONE # a4
261B 2A # BLACK RIGHT POINTING INDEX # a11
261E 2B # WHITE RIGHT POINTING INDEX # a12
2660 AB # BLACK SPADE SUIT # a109
2663 A8 # BLACK CLUB SUIT # a112
2665 AA # BLACK HEART SUIT # a110
2666 A9 # BLACK DIAMOND SUIT # a111
0020 20 # SPACE # space
00A0 20 # NO-BREAK SPACE # space
2701 21 # UPPER BLADE SCISSORS # a1
2702 22 # BLACK SCISSORS # a2
2703 23 # LOWER BLADE SCISSORS # a202
2704 24 # WHITE SCISSORS # a3
260E 25 # BLACK TELEPHONE # a4
2706 26 # TELEPHONE LOCATION SIGN # a5
2707 27 # TAPE DRIVE # a119
2708 28 # AIRPLANE # a118
2709 29 # ENVELOPE # a117
261B 2A # BLACK RIGHT POINTING INDEX # a11
261E 2B # WHITE RIGHT POINTING INDEX # a12
270C 2C # VICTORY HAND # a13
270D 2D # WRITING HAND # a14
270E 2E # LOWER RIGHT PENCIL # a15
@@ -78,8 +86,8 @@
2718 38 # HEAVY BALLOT X # a24
2719 39 # OUTLINED GREEK CROSS # a25
271A 3A # HEAVY GREEK CROSS # a26
271B 3B # OPEN CENTER CROSS # a27
271C 3C # HEAVY OPEN CENTER CROSS # a28
271B 3B # OPEN CENTRE CROSS # a27
271C 3C # HEAVY OPEN CENTRE CROSS # a28
271D 3D # LATIN CROSS # a6
271E 3E # SHADOWED WHITE LATIN CROSS # a7
271F 3F # OUTLINED LATIN CROSS # a8
@@ -91,16 +99,17 @@
2725 45 # FOUR CLUB-SPOKED ASTERISK # a32
2726 46 # BLACK FOUR POINTED STAR # a33
2727 47 # WHITE FOUR POINTED STAR # a34
2605 48 # BLACK STAR # a35
2729 49 # STRESS OUTLINED WHITE STAR # a36
272A 4A # CIRCLED WHITE STAR # a37
272B 4B # OPEN CENTER BLACK STAR # a38
272C 4C # BLACK CENTER WHITE STAR # a39
272B 4B # OPEN CENTRE BLACK STAR # a38
272C 4C # BLACK CENTRE WHITE STAR # a39
272D 4D # OUTLINED BLACK STAR # a40
272E 4E # HEAVY OUTLINED BLACK STAR # a41
272F 4F # PINWHEEL STAR # a42
2730 50 # SHADOWED WHITE STAR # a43
2731 51 # HEAVY ASTERISK # a44
2732 52 # OPEN CENTER ASTERISK # a45
2732 52 # OPEN CENTRE ASTERISK # a45
2733 53 # EIGHT SPOKED ASTERISK # a46
2734 54 # EIGHT POINTED BLACK STAR # a47
2735 55 # EIGHT POINTED PINWHEEL STAR # a48
@@ -110,13 +119,13 @@
2739 59 # TWELVE POINTED BLACK STAR # a52
273A 5A # SIXTEEN POINTED ASTERISK # a53
273B 5B # TEARDROP-SPOKED ASTERISK # a54
273C 5C # OPEN CENTER TEARDROP-SPOKED ASTERISK # a55
273C 5C # OPEN CENTRE TEARDROP-SPOKED ASTERISK # a55
273D 5D # HEAVY TEARDROP-SPOKED ASTERISK # a56
273E 5E # SIX PETALLED BLACK AND WHITE FLORETTE # a57
273F 5F # BLACK FLORETTE # a58
2740 60 # WHITE FLORETTE # a59
2741 61 # EIGHT PETALLED OUTLINED BLACK FLORETTE # a60
2742 62 # CIRCLED OPEN CENTER EIGHT POINTED STAR # a61
2742 62 # CIRCLED OPEN CENTRE EIGHT POINTED STAR # a61
2743 63 # HEAVY TEARDROP-SPOKED PINWHEEL ASTERISK # a62
2744 64 # SNOWFLAKE # a63
2745 65 # TIGHT TRIFOLIATE SNOWFLAKE # a64
@@ -126,12 +135,18 @@
2749 69 # BALLOON-SPOKED ASTERISK # a68
274A 6A # EIGHT TEARDROP-SPOKED PROPELLER ASTERISK # a69
274B 6B # HEAVY EIGHT TEARDROP-SPOKED PROPELLER ASTERISK # a70
25CF 6C # BLACK CIRCLE # a71
274D 6D # SHADOWED WHITE CIRCLE # a72
25A0 6E # BLACK SQUARE # a73
274F 6F # LOWER RIGHT DROP-SHADOWED WHITE SQUARE # a74
2750 70 # UPPER RIGHT DROP-SHADOWED WHITE SQUARE # a203
2751 71 # LOWER RIGHT SHADOWED WHITE SQUARE # a75
2752 72 # UPPER RIGHT SHADOWED WHITE SQUARE # a204
25B2 73 # BLACK UP-POINTING TRIANGLE # a76
25BC 74 # BLACK DOWN-POINTING TRIANGLE # a77
25C6 75 # BLACK DIAMOND # a78
2756 76 # BLACK DIAMOND MINUS WHITE X # a79
25D7 77 # RIGHT HALF BLACK CIRCLE # a81
2758 78 # LIGHT VERTICAL BAR # a82
2759 79 # MEDIUM VERTICAL BAR # a83
275A 7A # HEAVY VERTICAL BAR # a84
@@ -139,79 +154,110 @@
275C 7C # HEAVY SINGLE COMMA QUOTATION MARK ORNAMENT # a98
275D 7D # HEAVY DOUBLE TURNED COMMA QUOTATION MARK ORNAMENT # a99
275E 7E # HEAVY DOUBLE COMMA QUOTATION MARK ORNAMENT # a100
F8D7 80 # MEDIUM LEFT PARENTHESIS ORNAMENT # a89 (CUS)
F8D8 81 # MEDIUM RIGHT PARENTHESIS ORNAMENT # a90 (CUS)
F8D9 82 # MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT # a93 (CUS)
F8DA 83 # MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT # a94 (CUS)
F8DB 84 # MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT # a91 (CUS)
F8DC 85 # MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT # a92 (CUS)
F8DD 86 # HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT # a205 (CUS)
F8DE 87 # HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT # a85 (CUS)
F8DF 88 # HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT # a206 (CUS)
F8E0 89 # HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT # a86 (CUS)
F8E1 8A # LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT # a87 (CUS)
F8E2 8B # LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT # a88 (CUS)
F8E3 8C # MEDIUM LEFT CURLY BRACKET ORNAMENT # a95 (CUS)
F8E4 8D # MEDIUM RIGHT CURLY BRACKET ORNAMENT # a96 (CUS)
2761 A1 # CURVED STEM PARAGRAPH SIGN ORNAMENT # a101
2762 A2 # HEAVY EXCLAMATION MARK ORNAMENT # a102
2763 A3 # HEAVY HEART EXCLAMATION MARK ORNAMENT # a103
2764 A4 # HEAVY BLACK HEART # a104
2765 A5 # ROTATED HEAVY BLACK HEART BULLET # a105
2766 A6 # FLORAL HEART # a106
2767 A7 # ROTATED FLORAL HEART BULLET # a107
2776 B6 # INVERSE CIRCLED DIGIT ONE # a130
2777 B7 # INVERSE CIRCLED DIGIT TWO # a131
2778 B8 # INVERSE CIRCLED DIGIT THREE # a132
2779 B9 # INVERSE CIRCLED DIGIT FOUR # a133
277A BA # INVERSE CIRCLED DIGIT FIVE # a134
277B BB # INVERSE CIRCLED DIGIT SIX # a135
277C BC # INVERSE CIRCLED DIGIT SEVEN # a136
277D BD # INVERSE CIRCLED DIGIT EIGHT # a137
277E BE # INVERSE CIRCLED DIGIT NINE # a138
277F BF # INVERSE CIRCLED NUMBER TEN # a139
2780 C0 # CIRCLED SANS-SERIF DIGIT ONE # a140
2781 C1 # CIRCLED SANS-SERIF DIGIT TWO # a141
2782 C2 # CIRCLED SANS-SERIF DIGIT THREE # a142
2783 C3 # CIRCLED SANS-SERIF DIGIT FOUR # a143
2784 C4 # CIRCLED SANS-SERIF DIGIT FIVE # a144
2785 C5 # CIRCLED SANS-SERIF DIGIT SIX # a145
2786 C6 # CIRCLED SANS-SERIF DIGIT SEVEN # a146
2787 C7 # CIRCLED SANS-SERIF DIGIT EIGHT # a147
2788 C8 # CIRCLED SANS-SERIF DIGIT NINE # a148
2789 C9 # CIRCLED SANS-SERIF NUMBER TEN # a149
278A CA # INVERSE CIRCLED SANS-SERIF DIGIT ONE # a150
278B CB # INVERSE CIRCLED SANS-SERIF DIGIT TWO # a151
278C CC # INVERSE CIRCLED SANS-SERIF DIGIT THREE # a152
278D CD # INVERSE CIRCLED SANS-SERIF DIGIT FOUR # a153
278E CE # INVERSE CIRCLED SANS-SERIF DIGIT FIVE # a154
278F CF # INVERSE CIRCLED SANS-SERIF DIGIT SIX # a155
2790 D0 # INVERSE CIRCLED SANS-SERIF DIGIT SEVEN # a156
2791 D1 # INVERSE CIRCLED SANS-SERIF DIGIT EIGHT # a157
2792 D2 # INVERSE CIRCLED SANS-SERIF DIGIT NINE # a158
2793 D3 # INVERSE CIRCLED SANS-SERIF NUMBER TEN # a159
2794 D4 # HEAVY WIDE-HEADED RIGHT ARROW # a160
2798 D8 # HEAVY LOWER RIGHT ARROW # a196
2799 D9 # HEAVY RIGHT ARROW # a165
279A DA # HEAVY UPPER RIGHT ARROW # a192
279B DB # DRAFTING POINT RIGHT ARROW # a166
279C DC # HEAVY ROUND-TIPPED RIGHT ARROW # a167
279D DD # TRIANGLE-HEADED RIGHT ARROW # a168
279E DE # HEAVY TRIANGLE-HEADED RIGHT ARROW # a169
279F DF # DASHED TRIANGLE-HEADED RIGHT ARROW # a170
27A0 E0 # HEAVY DASHED TRIANGLE-HEADED RIGHT ARROW # a171
27A1 E1 # BLACK RIGHT ARROW # a172
27A2 E2 # THREE-D TOP-LIGHTED RIGHT ARROWHEAD # a173
27A3 E3 # THREE-D BOTTOM-LIGHTED RIGHT ARROWHEAD # a162
27A4 E4 # BLACK RIGHT ARROWHEAD # a174
27A5 E5 # HEAVY BLACK CURVED DOWN AND RIGHT ARROW # a175
27A6 E6 # HEAVY BLACK CURVED UP AND RIGHT ARROW # a176
27A7 E7 # SQUAT BLACK RIGHT ARROW # a177
27A8 E8 # HEAVY CONCAVE-POINTED BLACK RIGHT ARROW # a178
27A9 E9 # RIGHT-SHADED WHITE RIGHT ARROW # a179
27AA EA # LEFT-SHADED WHITE RIGHT ARROW # a193
27AB EB # BACK-TILTED SHADOWED WHITE RIGHT ARROW # a180
27AC EC # FRONT-TILTED SHADOWED WHITE RIGHT ARROW # a199
27AD ED # HEAVY LOWER RIGHT-SHADOWED WHITE RIGHT ARROW # a181
27AE EE # HEAVY UPPER RIGHT-SHADOWED WHITE RIGHT ARROW # a200
27AF EF # NOTCHED LOWER RIGHT-SHADOWED WHITE RIGHT ARROW # a182
27B1 F1 # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHT ARROW # a201
27B2 F2 # CIRCLED HEAVY WHITE RIGHT ARROW # a183
27B3 F3 # WHITE-FEATHERED RIGHT ARROW # a184
27B4 F4 # BLACK-FEATHERED LOWER RIGHT ARROW # a197
27B5 F5 # BLACK-FEATHERED RIGHT ARROW # a185
27B6 F6 # BLACK-FEATHERED UPPER RIGHT ARROW # a194
27B7 F7 # HEAVY BLACK-FEATHERED LOWER RIGHT ARROW # a198
27B8 F8 # HEAVY BLACK-FEATHERED RIGHT ARROW # a186
27B9 F9 # HEAVY BLACK-FEATHERED UPPER RIGHT ARROW # a195
27BA FA # TEARDROP-BARBED RIGHT ARROW # a187
27BB FB # HEAVY TEARDROP-SHANKED RIGHT ARROW # a188
27BC FC # WEDGE-TAILED RIGHT ARROW # a189
27BD FD # HEAVY WEDGE-TAILED RIGHT ARROW # a190
27BE FE # OPEN-OUTLINED RIGHT ARROW # a191
2765 A5 # ROTATED HEAVY BLACK HEART BULLET # a106
2766 A6 # FLORAL HEART # a107
2767 A7 # ROTATED FLORAL HEART BULLET # a108
2663 A8 # BLACK CLUB SUIT # a112
2666 A9 # BLACK DIAMOND SUIT # a111
2665 AA # BLACK HEART SUIT # a110
2660 AB # BLACK SPADE SUIT # a109
2460 AC # CIRCLED DIGIT ONE # a120
2461 AD # CIRCLED DIGIT TWO # a121
2462 AE # CIRCLED DIGIT THREE # a122
2463 AF # CIRCLED DIGIT FOUR # a123
2464 B0 # CIRCLED DIGIT FIVE # a124
2465 B1 # CIRCLED DIGIT SIX # a125
2466 B2 # CIRCLED DIGIT SEVEN # a126
2467 B3 # CIRCLED DIGIT EIGHT # a127
2468 B4 # CIRCLED DIGIT NINE # a128
2469 B5 # CIRCLED NUMBER TEN # a129
2776 B6 # DINGBAT NEGATIVE CIRCLED DIGIT ONE # a130
2777 B7 # DINGBAT NEGATIVE CIRCLED DIGIT TWO # a131
2778 B8 # DINGBAT NEGATIVE CIRCLED DIGIT THREE # a132
2779 B9 # DINGBAT NEGATIVE CIRCLED DIGIT FOUR # a133
277A BA # DINGBAT NEGATIVE CIRCLED DIGIT FIVE # a134
277B BB # DINGBAT NEGATIVE CIRCLED DIGIT SIX # a135
277C BC # DINGBAT NEGATIVE CIRCLED DIGIT SEVEN # a136
277D BD # DINGBAT NEGATIVE CIRCLED DIGIT EIGHT # a137
277E BE # DINGBAT NEGATIVE CIRCLED DIGIT NINE # a138
277F BF # DINGBAT NEGATIVE CIRCLED NUMBER TEN # a139
2780 C0 # DINGBAT CIRCLED SANS-SERIF DIGIT ONE # a140
2781 C1 # DINGBAT CIRCLED SANS-SERIF DIGIT TWO # a141
2782 C2 # DINGBAT CIRCLED SANS-SERIF DIGIT THREE # a142
2783 C3 # DINGBAT CIRCLED SANS-SERIF DIGIT FOUR # a143
2784 C4 # DINGBAT CIRCLED SANS-SERIF DIGIT FIVE # a144
2785 C5 # DINGBAT CIRCLED SANS-SERIF DIGIT SIX # a145
2786 C6 # DINGBAT CIRCLED SANS-SERIF DIGIT SEVEN # a146
2787 C7 # DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT # a147
2788 C8 # DINGBAT CIRCLED SANS-SERIF DIGIT NINE # a148
2789 C9 # DINGBAT CIRCLED SANS-SERIF NUMBER TEN # a149
278A CA # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ONE # a150
278B CB # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT TWO # a151
278C CC # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT THREE # a152
278D CD # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT FOUR # a153
278E CE # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT FIVE # a154
278F CF # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT SIX # a155
2790 D0 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT SEVEN # a156
2791 D1 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT EIGHT # a157
2792 D2 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT NINE # a158
2793 D3 # DINGBAT NEGATIVE CIRCLED SANS-SERIF NUMBER TEN # a159
2794 D4 # HEAVY WIDE-HEADED RIGHTWARDS ARROW # a160
2192 D5 # RIGHTWARDS ARROW # a161
2194 D6 # LEFT RIGHT ARROW # a163
2195 D7 # UP DOWN ARROW # a164
2798 D8 # HEAVY SOUTH EAST ARROW # a196
2799 D9 # HEAVY RIGHTWARDS ARROW # a165
279A DA # HEAVY NORTH EAST ARROW # a192
279B DB # DRAFTING POINT RIGHTWARDS ARROW # a166
279C DC # HEAVY ROUND-TIPPED RIGHTWARDS ARROW # a167
279D DD # TRIANGLE-HEADED RIGHTWARDS ARROW # a168
279E DE # HEAVY TRIANGLE-HEADED RIGHTWARDS ARROW # a169
279F DF # DASHED TRIANGLE-HEADED RIGHTWARDS ARROW # a170
27A0 E0 # HEAVY DASHED TRIANGLE-HEADED RIGHTWARDS ARROW # a171
27A1 E1 # BLACK RIGHTWARDS ARROW # a172
27A2 E2 # THREE-D TOP-LIGHTED RIGHTWARDS ARROWHEAD # a173
27A3 E3 # THREE-D BOTTOM-LIGHTED RIGHTWARDS ARROWHEAD # a162
27A4 E4 # BLACK RIGHTWARDS ARROWHEAD # a174
27A5 E5 # HEAVY BLACK CURVED DOWNWARDS AND RIGHTWARDS ARROW # a175
27A6 E6 # HEAVY BLACK CURVED UPWARDS AND RIGHTWARDS ARROW # a176
27A7 E7 # SQUAT BLACK RIGHTWARDS ARROW # a177
27A8 E8 # HEAVY CONCAVE-POINTED BLACK RIGHTWARDS ARROW # a178
27A9 E9 # RIGHT-SHADED WHITE RIGHTWARDS ARROW # a179
27AA EA # LEFT-SHADED WHITE RIGHTWARDS ARROW # a193
27AB EB # BACK-TILTED SHADOWED WHITE RIGHTWARDS ARROW # a180
27AC EC # FRONT-TILTED SHADOWED WHITE RIGHTWARDS ARROW # a199
27AD ED # HEAVY LOWER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW # a181
27AE EE # HEAVY UPPER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW # a200
27AF EF # NOTCHED LOWER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW # a182
27B1 F1 # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW # a201
27B2 F2 # CIRCLED HEAVY WHITE RIGHTWARDS ARROW # a183
27B3 F3 # WHITE-FEATHERED RIGHTWARDS ARROW # a184
27B4 F4 # BLACK-FEATHERED SOUTH EAST ARROW # a197
27B5 F5 # BLACK-FEATHERED RIGHTWARDS ARROW # a185
27B6 F6 # BLACK-FEATHERED NORTH EAST ARROW # a194
27B7 F7 # HEAVY BLACK-FEATHERED SOUTH EAST ARROW # a198
27B8 F8 # HEAVY BLACK-FEATHERED RIGHTWARDS ARROW # a186
27B9 F9 # HEAVY BLACK-FEATHERED NORTH EAST ARROW # a195
27BA FA # TEARDROP-BARBED RIGHTWARDS ARROW # a187
27BB FB # HEAVY TEARDROP-SHANKED RIGHTWARDS ARROW # a188
27BC FC # WEDGE-TAILED RIGHTWARDS ARROW # a189
27BD FD # HEAVY WEDGE-TAILED RIGHTWARDS ARROW # a190
27BE FE # OPEN-OUTLINED RIGHTWARDS ARROW # a191

328
unicode/vendors/apple/CELTIC.TXT vendored Normal file
View File

@@ -0,0 +1,328 @@
#=======================================================================
# File name: CELTIC.TXT
#
# Contents: Map (external version) from Mac OS Celtic
# character set to Unicode 2.1 and later
#
# Contacts: charsets@apple.com, everson@evertype.com
#
# Changes:
#
# c01 2005-Apr-01 First posted version. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Celtic code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in Mac OS Celtic code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Celtic character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Celtic (partly from Michael Everson):
# -----------------------------------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# This character set was developed by Michael Everson of Everson
# Typography (everson@evertype.com) and was used for the Irish
# localizations of Mac OS 6.0.8 and 7.1, for the Welsh localization of
# Mac OS 7.1, and for several fonts that can be used on any version of
# Mac OS 7.1 or later. Note that while Apple authorized
# the Irish and Welsh localizations mentioned above, they were not
# systems which shipped with Apple hardware, and were not otherwise
# supported by Apple. Fonts conforming to the Mac OS Celtic character
# set are available from Everson Typography (http://www.evertype.com)
# and MEU Cymru (http://www.meucymru.co.uk). Information about the use
# of this character set is available at
# http://www.evertype.com/celtscript/celtcode.html.
#
# The Mac OS Celtic encoding shares the script code smRoman (0) with
# the standard Mac OS Roman encoding. To determine if the Celtic
# encoding is being used in Mac OS 7-9, you should also check if the
# system region code is 50, verIreland, or 79, verWales. Otherwise,
# you can check for particular fonts that conform to this encoding.
#
# This character set is a variant of standard Mac OS Roman, adding
# capital and small y with acute, grave, and circumflex, and capital
# and small w with acute, grave, circumflex and diaeresis. It has 14
# code point differences from standard Mac OS Roman (0xDE, 0xDF, 0xE2,
# 0xE3, 0xF6-0xFF).
#
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts were updated for Mac OS 8.5 to reflect this. There is
# a "currency sign" variant of the Mac OS Celtic encoding that still
# maps 0xDB to U+00A4; this can be used for older fonts.
# Note: U+20AC is new with Unicode 2.1; for earlier Unicode
# versions, Mac OS Celtic 0xDB may be mapped to private-use
# character U+F8A0.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0x85 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0x86 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0x87 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0x88 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0x89 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0x8C 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0x8D 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0x90 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0x91 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0x94 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0x95 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0x96 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x00DF # LATIN SMALL LETTER SHARP S
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2264 # LESS-THAN OR EQUAL TO
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x00A5 # YEN SIGN
0xB5 0x00B5 # MICRO SIGN
0xB6 0x2202 # PARTIAL DIFFERENTIAL
0xB7 0x2211 # N-ARY SUMMATION
0xB8 0x220F # N-ARY PRODUCT
0xB9 0x03C0 # GREEK SMALL LETTER PI
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK
0xC2 0x00AC # NOT SIGN
0xC3 0x221A # SQUARE ROOT
0xC4 0x0192 # LATIN SMALL LETTER F WITH HOOK
0xC5 0x2248 # ALMOST EQUAL TO
0xC6 0x2206 # INCREMENT
0xC7 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC8 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xCC 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xCD 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xCE 0x0152 # LATIN CAPITAL LIGATURE OE
0xCF 0x0153 # LATIN SMALL LIGATURE OE
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x00F7 # DIVISION SIGN
0xD7 0x25CA # LOZENGE
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x2044 # FRACTION SLASH
0xDB 0x20AC # EURO SIGN # before Mac OS 8.5 this was U+00A4 CURRENCY SIGN
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0x0176 # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
0xDF 0x0177 # LATIN SMALL LETTER Y WITH CIRCUMFLEX
0xE0 0x2021 # DOUBLE DAGGER
0xE1 0x00B7 # MIDDLE DOT
0xE2 0x1EF2 # LATIN CAPITAL LETTER Y WITH GRAVE
0xE3 0x1EF3 # LATIN SMALL LETTER Y WITH GRAVE
0xE4 0x2030 # PER MILLE SIGN
0xE5 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xE6 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xE7 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xE8 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xE9 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xEA 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xEB 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xEC 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xED 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xEE 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xEF 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xF0 0x2663 # BLACK CLUB SUIT = shamrock # future mapping U+2618 SHAMROCK
0xF1 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xF2 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xF3 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xF4 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xF5 0x0131 # LATIN SMALL LETTER DOTLESS I
0xF6 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xF7 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xF8 0x0174 # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
0xF9 0x0175 # LATIN SMALL LETTER W WITH CIRCUMFLEX
0xFA 0x1E84 # LATIN CAPITAL LETTER W WITH DIAERESIS
0xFB 0x1E85 # LATIN SMALL LETTER W WITH DIAERESIS
0xFC 0x1E80 # LATIN CAPITAL LETTER W WITH GRAVE
0xFD 0x1E81 # LATIN SMALL LETTER W WITH GRAVE
0xFE 0x1E82 # LATIN CAPITAL LETTER W WITH ACUTE
0xFF 0x1E83 # LATIN SMALL LETTER W WITH ACUTE

327
unicode/vendors/apple/CENTEURO.TXT vendored Normal file
View File

@@ -0,0 +1,327 @@
#=======================================================================
# File name: CENTEURO.TXT
#
# Contents: Map (external version) from Mac OS Central European
# character set to Unicode 2.1 and later.
#
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-04 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs. Matches internal utom<b1>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n05 1998-Feb-05 Update header comments to new format; no
# mapping changes. Matches internal utom<n3>,
# ufrm<n13>, and Text Encoding Converter
# version 1.3.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n5>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Central European code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in Mac OS Central European code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Central European character set uses the standard control
# characters at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Central European:
# ---------------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported directly in programming
# interfaces for QuickDraw Text, the Script Manager, and related
# Text Utilities. For other purposes it is supported via transcoding
# to and from Unicode.
#
# This character set is intended to cover the following languages:
#
# Polish, Czech, Slovak, Hungarian, Estonian, Latvian, Lithuanian
#
# These are written in Latin script, but using a different set of
# of accented characters than Mac OS Roman. The Mac OS Central
# European character set also includes a number of characters
# needed for the Mac OS user interface and localization (e.g.
# ellipsis, bullet, copyright sign), several typographic
# punctuation symbols, math symbols, etc. However, it has a
# smaller set of punctuation and symbols than Mac OS Roman. All of
# the characters in Mac OS Central European that are also in the
# Mac OS Roman character set are at the same code point in both
# character sets; this improves application compatibility.
#
# Note: This does not have the same letter repertoire as ISO
# 8859-2 (Latin-2); each has some accented letters that the other
# does not have.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0x0100 # LATIN CAPITAL LETTER A WITH MACRON
0x82 0x0101 # LATIN SMALL LETTER A WITH MACRON
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x0104 # LATIN CAPITAL LETTER A WITH OGONEK
0x85 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0x86 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0x87 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0x88 0x0105 # LATIN SMALL LETTER A WITH OGONEK
0x89 0x010C # LATIN CAPITAL LETTER C WITH CARON
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x010D # LATIN SMALL LETTER C WITH CARON
0x8C 0x0106 # LATIN CAPITAL LETTER C WITH ACUTE
0x8D 0x0107 # LATIN SMALL LETTER C WITH ACUTE
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x0179 # LATIN CAPITAL LETTER Z WITH ACUTE
0x90 0x017A # LATIN SMALL LETTER Z WITH ACUTE
0x91 0x010E # LATIN CAPITAL LETTER D WITH CARON
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 0x010F # LATIN SMALL LETTER D WITH CARON
0x94 0x0112 # LATIN CAPITAL LETTER E WITH MACRON
0x95 0x0113 # LATIN SMALL LETTER E WITH MACRON
0x96 0x0116 # LATIN CAPITAL LETTER E WITH DOT ABOVE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 0x0117 # LATIN SMALL LETTER E WITH DOT ABOVE
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x011A # LATIN CAPITAL LETTER E WITH CARON
0x9E 0x011B # LATIN SMALL LETTER E WITH CARON
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x0118 # LATIN CAPITAL LETTER E WITH OGONEK
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x00DF # LATIN SMALL LETTER SHARP S
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x0119 # LATIN SMALL LETTER E WITH OGONEK
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x0123 # LATIN SMALL LETTER G WITH CEDILLA
0xAF 0x012E # LATIN CAPITAL LETTER I WITH OGONEK
0xB0 0x012F # LATIN SMALL LETTER I WITH OGONEK
0xB1 0x012A # LATIN CAPITAL LETTER I WITH MACRON
0xB2 0x2264 # LESS-THAN OR EQUAL TO
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x012B # LATIN SMALL LETTER I WITH MACRON
0xB5 0x0136 # LATIN CAPITAL LETTER K WITH CEDILLA
0xB6 0x2202 # PARTIAL DIFFERENTIAL
0xB7 0x2211 # N-ARY SUMMATION
0xB8 0x0142 # LATIN SMALL LETTER L WITH STROKE
0xB9 0x013B # LATIN CAPITAL LETTER L WITH CEDILLA
0xBA 0x013C # LATIN SMALL LETTER L WITH CEDILLA
0xBB 0x013D # LATIN CAPITAL LETTER L WITH CARON
0xBC 0x013E # LATIN SMALL LETTER L WITH CARON
0xBD 0x0139 # LATIN CAPITAL LETTER L WITH ACUTE
0xBE 0x013A # LATIN SMALL LETTER L WITH ACUTE
0xBF 0x0145 # LATIN CAPITAL LETTER N WITH CEDILLA
0xC0 0x0146 # LATIN SMALL LETTER N WITH CEDILLA
0xC1 0x0143 # LATIN CAPITAL LETTER N WITH ACUTE
0xC2 0x00AC # NOT SIGN
0xC3 0x221A # SQUARE ROOT
0xC4 0x0144 # LATIN SMALL LETTER N WITH ACUTE
0xC5 0x0147 # LATIN CAPITAL LETTER N WITH CARON
0xC6 0x2206 # INCREMENT
0xC7 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC8 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x0148 # LATIN SMALL LETTER N WITH CARON
0xCC 0x0150 # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
0xCD 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xCE 0x0151 # LATIN SMALL LETTER O WITH DOUBLE ACUTE
0xCF 0x014C # LATIN CAPITAL LETTER O WITH MACRON
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x00F7 # DIVISION SIGN
0xD7 0x25CA # LOZENGE
0xD8 0x014D # LATIN SMALL LETTER O WITH MACRON
0xD9 0x0154 # LATIN CAPITAL LETTER R WITH ACUTE
0xDA 0x0155 # LATIN SMALL LETTER R WITH ACUTE
0xDB 0x0158 # LATIN CAPITAL LETTER R WITH CARON
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0x0159 # LATIN SMALL LETTER R WITH CARON
0xDF 0x0156 # LATIN CAPITAL LETTER R WITH CEDILLA
0xE0 0x0157 # LATIN SMALL LETTER R WITH CEDILLA
0xE1 0x0160 # LATIN CAPITAL LETTER S WITH CARON
0xE2 0x201A # SINGLE LOW-9 QUOTATION MARK
0xE3 0x201E # DOUBLE LOW-9 QUOTATION MARK
0xE4 0x0161 # LATIN SMALL LETTER S WITH CARON
0xE5 0x015A # LATIN CAPITAL LETTER S WITH ACUTE
0xE6 0x015B # LATIN SMALL LETTER S WITH ACUTE
0xE7 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xE8 0x0164 # LATIN CAPITAL LETTER T WITH CARON
0xE9 0x0165 # LATIN SMALL LETTER T WITH CARON
0xEA 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xEB 0x017D # LATIN CAPITAL LETTER Z WITH CARON
0xEC 0x017E # LATIN SMALL LETTER Z WITH CARON
0xED 0x016A # LATIN CAPITAL LETTER U WITH MACRON
0xEE 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xEF 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xF0 0x016B # LATIN SMALL LETTER U WITH MACRON
0xF1 0x016E # LATIN CAPITAL LETTER U WITH RING ABOVE
0xF2 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xF3 0x016F # LATIN SMALL LETTER U WITH RING ABOVE
0xF4 0x0170 # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
0xF5 0x0171 # LATIN SMALL LETTER U WITH DOUBLE ACUTE
0xF6 0x0172 # LATIN CAPITAL LETTER U WITH OGONEK
0xF7 0x0173 # LATIN SMALL LETTER U WITH OGONEK
0xF8 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xF9 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xFA 0x0137 # LATIN SMALL LETTER K WITH CEDILLA
0xFB 0x017B # LATIN CAPITAL LETTER Z WITH DOT ABOVE
0xFC 0x0141 # LATIN CAPITAL LETTER L WITH STROKE
0xFD 0x017C # LATIN SMALL LETTER Z WITH DOT ABOVE
0xFE 0x0122 # LATIN CAPITAL LETTER G WITH CEDILLA
0xFF 0x02C7 # CARON

7914
unicode/vendors/apple/CHINSIMP.TXT vendored Normal file

File diff suppressed because it is too large Load Diff

13911
unicode/vendors/apple/CHINTRAD.TXT vendored Normal file

File diff suppressed because it is too large Load Diff

519
unicode/vendors/apple/CORPCHAR.TXT vendored Normal file
View File

@@ -0,0 +1,519 @@
#=======================================================================
# File name: CORPCHAR.TXT
#
# Contents: Registry (external version) of Apple use of
# Unicode corporate-zone characters.
#
# Copyright: (c) 1994-2003, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c03 2005-Apr-04 Deprecate 0xF8E6. Matches internal registry
# <c1.3>
# c02 2003-Feb-18 Add entry for 0xF802.
# b4,c1 2002-Dec-19 Add entries for 0xF700-0xF747 and 0xF803-
# 0xF84F; update replacement characters for
# 0xF883, 0xF8AA, 0xF8B4, 0xF8B7, 0xF8BD,
# 0xF8D7-0xF8E4, 0xF8EB-0xF8F3, 0xF8F5-
# 0xF8FE. Deprecate 0xF8E7, 0xF8F4. Delete Mac
# OS Greek mapping for 0xF8A0. Update URLs.
# Matches internal registry <b7>.
# b03 1999-Sep-22 Update contact e-mail address. Matches
# internal registry <b3> and Text Encoding
# Converter version 1.5.
# b02 1998-Aug-18 Expanded usage of 0xF8A0. Matches internal
# registry <b3>.
# n11 1998-Feb-05 Minor update to header comments
# n09 1997-Dec-14 Update to match internal registry <n23>:
# Add source hint 0xF850, transcoding hints
# 0xF860-0xF86B and 0xF870-0xF872, deprecate
# almost all other non-hint corporate
# characters.
# n08 1997-Jul-17 Update to match internal registry <n13>:
# Add characters for Mac OS Chinese, Korean &
# Farsi. Add CJK source hints. Deprecate some
# characters in favor of combinations of
# standard characters and transcoding hints.
# Change header format.
# n04 1995-Nov-15 Update to match internal registry <n8>:
# Add characters for Mac OS Hebrew and Thai.
# n02 1995-Apr-18 First version. Matches internal registry
# <n5>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Two tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Unicode corporate character code point
# (in hex as 0xNNNN)
# Column #2 is a comment containing:
# 1) an informal name describing the Unicode corporate character,
# or if it is deprecated, information about what to use
# instead.
# 2) optionally, another '#', followed by information on which
# Mac OS encodings use the Unicode corporate character, and -
# if relevant - the Mac OS code points that correspond to the
# corporate character.
#
# The entries are in Unicode order.
#_______________________________________________________________________
# NeXT's OpenStep reserved corporate characters in the range 0xF700 to
# 0xF8FF for transient use as keyboard function keys. The ones actually
# assigned in NextStep are 0xF700-0xF747, as follows. These are still
# used in the Mac OS X AppKit frameworks. Note that there is no glyph
# associated with these, and they are not mapped or used by the Mac OS
# Text Encoding Converter.
0xF700 # NSUpArrowFunctionKey
0xF701 # NSDownArrowFunctionKey
0xF702 # NSLeftArrowFunctionKey
0xF703 # NSRightArrowFunctionKey
0xF704 # NSF1FunctionKey
0xF705 # NSF2FunctionKey
0xF706 # NSF3FunctionKey
0xF707 # NSF4FunctionKey
0xF708 # NSF5FunctionKey
0xF709 # NSF6FunctionKey
0xF70A # NSF7FunctionKey
0xF70B # NSF8FunctionKey
0xF70C # NSF9FunctionKey
0xF70D # NSF10FunctionKey
0xF70E # NSF11FunctionKey
0xF70F # NSF12FunctionKey
0xF710 # NSF13FunctionKey
0xF711 # NSF14FunctionKey
0xF712 # NSF15FunctionKey
0xF713 # NSF16FunctionKey
0xF714 # NSF17FunctionKey
0xF715 # NSF18FunctionKey
0xF716 # NSF19FunctionKey
0xF717 # NSF20FunctionKey
0xF718 # NSF21FunctionKey
0xF719 # NSF22FunctionKey
0xF71A # NSF23FunctionKey
0xF71B # NSF24FunctionKey
0xF71C # NSF25FunctionKey
0xF71D # NSF26FunctionKey
0xF71E # NSF27FunctionKey
0xF71F # NSF28FunctionKey
0xF720 # NSF29FunctionKey
0xF721 # NSF30FunctionKey
0xF722 # NSF31FunctionKey
0xF723 # NSF32FunctionKey
0xF724 # NSF33FunctionKey
0xF725 # NSF34FunctionKey
0xF726 # NSF35FunctionKey
0xF727 # NSInsertFunctionKey
0xF728 # NSDeleteFunctionKey
0xF729 # NSHomeFunctionKey
0xF72A # NSBeginFunctionKey
0xF72B # NSEndFunctionKey
0xF72C # NSPageUpFunctionKey
0xF72D # NSPageDownFunctionKey
0xF72E # NSPrintScreenFunctionKey
0xF72F # NSScrollLockFunctionKey
0xF730 # NSPauseFunctionKey
0xF731 # NSSysReqFunctionKey
0xF732 # NSBreakFunctionKey
0xF733 # NSResetFunctionKey
0xF734 # NSStopFunctionKey
0xF735 # NSMenuFunctionKey
0xF736 # NSUserFunctionKey
0xF737 # NSSystemFunctionKey
0xF738 # NSPrintFunctionKey
0xF739 # NSClearLineFunctionKey
0xF73A # NSClearDisplayFunctionKey
0xF73B # NSInsertLineFunctionKey
0xF73C # NSDeleteLineFunctionKey
0xF73D # NSInsertCharFunctionKey
0xF73E # NSDeleteCharFunctionKey
0xF73F # NSPrevFunctionKey
0xF740 # NSNextFunctionKey
0xF741 # NSSelectFunctionKey
0xF742 # NSExecuteFunctionKey
0xF743 # NSUndoFunctionKey
0xF744 # NSRedoFunctionKey
0xF745 # NSFindFunctionKey
0xF746 # NSHelpFunctionKey
0xF747 # NSModeSwitchFunctionKey
# The following (11) are for mapping the Mac OS Keyboard and Mac OS Korean
# encodings (for Mac OS Korean also see 0xF83D, 0xF840-0xF84F).
0xF802 # lower left pencil # Keyboard-0x0F
0xF803 # contextual menu symbol # Keyboard-0x6D
0xF804 # eject symbol # Keyboard-0x8C
0xF805 # black diamond minus white square # Korean-0xA658
0xF806 # black square minus white diamond # Korean-0xA663
0xF807 # telephone dial # Korean-0xA69F
0xF808 # five vertical lines # Korean-0xA68F
0xF809 # one downward-pointing black triangle over two others # Korean-0xA681
0xF80A # two interwoven eye shapes # Korean-0xA674
0xF80B # narrow-leaf four-petal florette # Korean-0xA696
0xF80C # four interleaved fisheyes # Korean-0xA69A
# The following (51) are mainly for mapping the dingbat/fleuron repetoire
# of the Hoefler Ornaments font, which is otherwise unmappable to Unicode.
# 0xF83D is also used for mapping MacKorean.
0xF80D # horizontal line thickening at center # Hoefler Ornaments glyph 6
0xF80E # dotted X design 1 # Hoefler Ornaments glyph 7
0xF80F # dotted X design 2 # Hoefler Ornaments glyph 8
0xF810 # dotted X design 3 # Hoefler Ornaments glyph 9
0xF811 # dotted X design 4 # Hoefler Ornaments glyph 10
0xF812 # horizontal line with wasp waist at center # Hoefler Ornaments glyph 11
0xF813 # horizontal line thickening at center, alternate # Hoefler Ornaments glyph 12
0xF814 # half-filled fleuron 1 # Hoefler Ornaments glyph 13
0xF815 # half-filled fleuron 2 # Hoefler Ornaments glyph 14
0xF816 # half-filled fleuron 3 # Hoefler Ornaments glyph 15
0xF817 # half-filled fleuron 4 # Hoefler Ornaments glyph 16
0xF818 # half-filled fleuron 5 # Hoefler Ornaments glyph 17
0xF819 # half-filled fleuron 6 # Hoefler Ornaments glyph 18
0xF81A # half-filled fleuron 7 # Hoefler Ornaments glyph 19
0xF81B # half-filled fleuron 8 # Hoefler Ornaments glyph 20
0xF81C # half-filled fleuron 9 # Hoefler Ornaments glyph 21
0xF81D # half-filled fleuron 10 # Hoefler Ornaments glyph 22
0xF81E # half-filled fleuron 11 # Hoefler Ornaments glyph 23
0xF81F # half-filled fleuron 12 # Hoefler Ornaments glyph 24
0xF820 # half-filled fleuron 13 # Hoefler Ornaments glyph 25
0xF821 # half-filled fleuron 14 # Hoefler Ornaments glyph 26
0xF822 # half-filled fleuron 15 # Hoefler Ornaments glyph 27
0xF823 # half-filled fleuron 16 # Hoefler Ornaments glyph 28
0xF824 # half-filled dingbat 1 # Hoefler Ornaments glyph 29
0xF825 # half-filled dingbat 2 # Hoefler Ornaments glyph 30
0xF826 # half-filled dingbat 3 # Hoefler Ornaments glyph 31
0xF827 # filled fleuron 1 # Hoefler Ornaments glyph 34
0xF828 # filled fleuron 2 # Hoefler Ornaments glyph 35
0xF829 # filled fleuron 3 # Hoefler Ornaments glyph 36
0xF82A # filled fleuron 4 # Hoefler Ornaments glyph 37
0xF82B # filled fleuron 5 # Hoefler Ornaments glyph 38
0xF82C # filled fleuron 6 # Hoefler Ornaments glyph 39
0xF82D # filled fleuron 7 # Hoefler Ornaments glyph 40
0xF82E # filled fleuron 8 # Hoefler Ornaments glyph 41
0xF82F # filled fleuron 9 # Hoefler Ornaments glyph 42
0xF830 # filled fleuron 10 # Hoefler Ornaments glyph 43
0xF831 # filled fleuron 11 # Hoefler Ornaments glyph 44
0xF832 # filled fleuron 12 # Hoefler Ornaments glyph 45
0xF833 # filled fleuron 13 # Hoefler Ornaments glyph 46
0xF834 # filled fleuron 14 # Hoefler Ornaments glyph 47
0xF835 # filled fleuron 15 # Hoefler Ornaments glyph 48
0xF836 # filled fleuron 16 # Hoefler Ornaments glyph 49
0xF837 # filled dingbat 1 # Hoefler Ornaments glyph 50
0xF838 # filled dingbat 2 # Hoefler Ornaments glyph 51
0xF839 # filled dingbat 3 # Hoefler Ornaments glyph 52
0xF83A # sun with face # Hoefler Ornaments glyph 53
0xF83B # moon with face # Hoefler Ornaments glyph 54
0xF83C # crown # Hoefler Ornaments glyph 55
0xF83D # fleur-de-lis # Korean-0xA642, Hoefler Ornaments glyph 57
0xF83E # sailing ship # Hoefler Ornaments glyph 58
0xF83F # fleuron 17 # Hoefler Ornaments glyph 59
# The following (16) are for mapping the Mac OS Korean encoding
# (also see 0xF805-0xF80C, 0xF83D).
0xF840 # three asterisks aligned vertically # Korean-0xA16E
0xF841 # left right up down arrow # Korean-0xA894
0xF842 # downwards wave arrow # Korean-0xAC54
0xF843 # leftwards white arrow from wall (cf. U+21F0) # Korean-0xAC42
0xF844 # black leftwards arrowhead (cf. U+27A4) # Korean-0xAC49
0xF845 # black-feathered leftwards arrow (cf. U+27B5) # Korean-0xAC5F
0xF846 # leftwards arrowhead with tail of spreading ripples # Korean-0xA867
0xF847 # rightwards arrowhead with tail of spreading ripples # Korean-0xA868
0xF848 # large white leftwards arrow with white fins # Korean-0xA89D
0xF849 # large white rightwards arrow with white fins # Korean-0xA89C
0xF84A # leftwards arrow with bow # Korean-0xAC4B
0xF84B # rightwards arrow with bow # Korean-0xAC4A
0xF84C # pentagon # Korean-0xA747
0xF84D # trapezoid # Korean-0xA74B
0xF84E # quadrilateral with shorter right side # Korean-0xA74C
0xF84F # quadrilateral with shorter left side # Korean-0xA74D
# The block of 16 characters 0xF850-0xF85F is for source hint characters.
# These have no display (like zero-width no-break space). If they appear
# in text, they can only be mapped to tables that include them. If a run
# of Unicode characters such as Han characters could otherwise be mapped
# to any of several encodings, including one of these hint characters can
# force the text to be mapped only to an encoding whose mapping table
# includes the hint character. Once they have forced mapping to a particular
# encoding, they no longer apply (they don't need to be cancelled); if a
# subsequent character cannot be mapped to that encoding, it may be mapped
# to another encoding. Currently source hints are mainly defined for CJK
# source disambiguation.
# NOTE: These are only defined for application developers who have requested
# them. The Mac OS Text Encoding Converter does not generate these when
# converting from other CJK encodings to Unicode. However, it will handle
# these characters correctly when converting from Unicode to other encodings.
0xF850 # source hint: Reset, try all candidate encodings in preferred order.
0xF85C # source hint: Chinese simplified
0xF85D # source hint: Chinese traditional
0xF85E # source hint: Japanese
0xF85F # source hint: Korean
# The block of 32 characters 0xF860-0xF87F is for transcoding hints.
# These are used in combination with standard Unicode characters to force
# them to be treated in a special way for mapping to other encodings;
# they have no other effect.
#
# 0xF870-0xF87F are "variant tags" - they are like combining characters,
# and can follow a standard Unicode (or a sequence consisting of a base
# character and other combining characters) to tag it so that it will be
# unique, treated in a special way for transcoding. These always terminate
# a sequence of combining characters.
#
# 0xF860-0xF86B are "grouping hints" - they precede a group of two to
# four standard Unicode characters to indicate that they are treated as a
# group for transcoding. This grouping overrides any other combining
# behavior.
#
# Here are the ones defined so far:
0xF860 # transcoding hint: group next 2 characters # Japanese,Korean
0xF861 # transcoding hint: group next 3 characters # Japanese,Korean
0xF862 # transcoding hint: group next 4 characters # Japanese,Korean
0xF863 # transcoding hint: group next 4 characters, alt1 # Korean
0xF864 # transcoding hint: group next 4 characters, alt2 # Korean
0xF865 # transcoding hint: group next 4 characters, alt3 # Korean
0xF866 # transcoding hint: group next 4 characters, alt4 # Korean
0xF867 # transcoding hint: group next 2 characters, alt1 # Korean
0xF868 # transcoding hint: group next 2 characters, alt2 # Korean
0xF869 # transcoding hint: group next 2 characters, alt3 # Korean
0xF86A # transcoding hint: group next 2 characters, RL # Hebrew
0xF86B # transcoding hint: group next 4 characters, RL # Farsi variant
#
0xF870 # transcoding hint: variant tag 16 # Symbol, Korean
0xF871 # transcoding hint: variant tag 15 # Symbol, Korean
0xF872 # transcoding hint: variant tag 14 # Symbol
0xF873 # transcoding hint: variant tag 13 # Korean, Thai
0xF874 # transcoding hint: variant tag 12 # Korean, Thai
0xF875 # transcoding hint: variant tag 11 # Korean, Thai
0xF876 # transcoding hint: variant tag 10 # Korean
0xF877 # transcoding hint: variant tag 9 # Korean
0xF878 # transcoding hint: variant tag 8 # Korean
0xF879 # transcoding hint: variant tag 7 # Korean
0xF87A # transcoding hint: variant tag 6 # Korean
0xF87B # transcoding hint: variant tag 5 # Korean
0xF87C # transcoding hint: variant tag 4 # ChineseTrad, Korean, Dingbats
0xF87D # transcoding hint: variant tag 3 # ChineseTrad
0xF87E # transcoding hint: variant tag 2 # Chinese,Japanese
0xF87F # transcoding hint: variant tag 1 # CJK,Symbol,Dingbats,Hebrew
# The following (2) are metrics "characters" so applications can get the
# height and width of double-byte character glyphs by measuring the glyph of a
# one-byte character (e.g. calling CharWidth for character 0x82 in a Chinese
# Traditional font); this approach assumes that the glyphs for all double-byte
# characters in a font have the same metrics, which is currently true. Note
# that the width-metric character glyphs are used differently for TrueType and
# old-style bitmap fonts; for TrueType fonts the metric glyph width is equal
# to the full width of a double-byte character glyph, while for FBIT/FDEF
# bitmap fonts the metric glyph width is half the width of a double-byte
# character glyph.
0xF880 # height-metric character for double-byte fonts # Chinese Simp&Trad-0x81
0xF881 # width-metric character for double-byte fonts # Chinese Simp&Trad-0x82
# The following (2) are for the TrueType variant of Mac OS Farsi.
# NOTE: 0xF883 is deprecated, but is still loosely mapped to 0xA4 in the
# Mac OS Farsi TrueType variant.
0xF882 # Arabic ligature "peace on him" # Farsi(TrueType variant)-0x8B
0xF883 # deprecated, use 0xFDFC (3.2) or 0xF86B+0x0631+0x06CC+0x0627+0x0644 # Farsi(TrueType variant)-0xA4
# The following (22) are for the Mac OS Thai encoding.
# In this encoding, positional variants of upper vowels, tone marks,
# and other marks are normally handled automatically by WorldScript I.
# However, the Thai-DTP keyboard allows the codes for the positional
# variants to be entered directly, so they must be treated as
# characters. When the abstract character is treated as a positional
# variant, it has the right (and high, if relevant) position.
# NOTE: These are now all deprecated in favor of combinations of standard
# characters and transcoding hints. The deprecated characters will still
# be loosely mapped to the appropriate Mac OS Thai character.
0xF884 # deprecated, use 0x0E31+0xF874 # Thai-0x92
0xF885 # deprecated, use 0x0E34+0xF874 # Thai-0x94
0xF886 # deprecated, use 0x0E35+0xF874 # Thai-0x95
0xF887 # deprecated, use 0x0E36+0xF874 # Thai-0x96
0xF888 # deprecated, use 0x0E37+0xF874 # Thai-0x97
0xF889 # deprecated, use 0x0E47+0xF874 # Thai-0x93
0xF88A # deprecated, use 0x0E48+0xF874 # Thai-0x98
0xF88B # deprecated, use 0x0E48+0xF873 # Thai-0x88
0xF88C # deprecated, use 0x0E48+0xF875 # Thai-0x83
0xF88D # deprecated, use 0x0E49+0xF874 # Thai-0x99
0xF88E # deprecated, use 0x0E49+0xF873 # Thai-0x89
0xF88F # deprecated, use 0x0E49+0xF875 # Thai-0x84
0xF890 # deprecated, use 0x0E4A+0xF874 # Thai-0x9A
0xF891 # deprecated, use 0x0E4A+0xF873 # Thai-0x8A
0xF892 # deprecated, use 0x0E4A+0xF875 # Thai-0x85
0xF893 # deprecated, use 0x0E4B+0xF874 # Thai-0x9B
0xF894 # deprecated, use 0x0E4B+0xF873 # Thai-0x8B
0xF895 # deprecated, use 0x0E4B+0xF875 # Thai-0x86
0xF896 # deprecated, use 0x0E4C+0xF874 # Thai-0x9C
0xF897 # deprecated, use 0x0E4C+0xF873 # Thai-0x8C
0xF898 # deprecated, use 0x0E4C+0xF875 # Thai-0x87
0xF899 # deprecated, use 0x0E4D+0xF874 # Thai-0x8F
# The following (6) are for the Mac OS Hebrew encoding. Four of
# these are for the obsolete "canoral" codes that were used before
# System 7.1/Worldscript to control positioning of nikud marks (points).
# In the future these 4 code points may be redefined.
# NOTE: Some of these are deprecated in favor of a combination of standard
# character and transcoding hint. The deprecated characters will still
# be loosely mapped to the appropriate Mac OS Hebrew character.
0xF89A # deprecated, use 0xF86A+0x05DC+0x05B9 # Hebrew-0xC0
0xF89B # Hebrew canoral 1 # Hebrew-0xC2
0xF89C # Hebrew canoral 2 # Hebrew-0xC3
0xF89D # Hebrew canoral 3 # Hebrew-0xC4
0xF89E # Hebrew canoral 4 # Hebrew-0xC5
0xF89F # deprecated, use 0x05B8+0xF87F # Hebrew-0xDE
# The following (1) is for mapping the single undefined code point in
# the Mac OS Greek and Turkish encodings, thus permitting full
# round-trip fidelity. This character is also used for mapping EURO SIGN
# when mapping to Unicode 1.1 (e.g. for Mac OS Roman and Symbol).
0xF8A0 # undefined1, also EURO SIGN for Unicode 1.1 # Turkish-0xF5, Roman-0xDB, Symbol-0xA0
# The following (54) are for the Mac OS Japanese encoding.
# part 1 - Apple corporate Unicode chars for Mac OS Japanese extended
# characters not in Unicode.
# NOTE: These are now all deprecated in favor of combinations of standard
# characters and transcoding hints. The deprecated characters will still
# be loosely mapped to the appropriate Mac OS Japanese character.
0xF8A1 # deprecated, use 0xF860+0x0030+0x002E # Jpn-0x8591
0xF8A2 # deprecated, use 0xF862+0x0058+0x0049+0x0049+0x0049 # Jpn-0x85AB
0xF8A3 # deprecated, use 0xF861+0x0058+0x0049+0x0056 # Jpn-0x85AC
0xF8A4 # deprecated, use 0xF860+0x0058+0x0056 # Jpn-0x85AD
0xF8A5 # deprecated, use 0xF862+0x0078+0x0069+0x0069+0x0069 # Jpn-0x85BF
0xF8A6 # deprecated, use 0xF861+0x0078+0x0069+0x0076 # Jpn-0x85C0
0xF8A7 # deprecated, use 0xF860+0x0078+0x0076 # Jpn-0x85C1
0xF8A8 # deprecated, use 0xFF4D+0xF87F # Jpn-0x8645
0xF8A9 # deprecated, use 0xFF47+0xF87F # Jpn-0x864B
0xF8AA # deprecated, use 0x2113 # Jpn-0x8650
0xF8AB # deprecated, use 0xF860+0x0054+0x0042 # Jpn-0x865D
0xF8AC # deprecated, use 0xF861+0x0046+0x0041+0x0058 # Jpn-0x869E
0xF8AD # deprecated, use 0xF860+0x2193+0x2191 # Jpn-0x86CE
0xF8AE # deprecated, use 0x21E8+0xF87A # Jpn-0x86D3
0xF8AF # deprecated, use 0x21E6+0xF87A # Jpn-0x86D4
0xF8B0 # deprecated, use 0x21E7+0xF87A # Jpn-0x86D5
0xF8B1 # deprecated, use 0x21E9+0xF87A # Jpn-0x86D6
0xF8B2 # deprecated, use 0xF862+0x6709+0x9650+0x4F1A+0x793E # Jpn-0x87FB
0xF8B3 # deprecated, use 0xF862+0x8CA1+0x56E3+0x6CD5+0x4EBA # Jpn-0x87FC
0xF8B4 # deprecated, use 0x301F # Jpn-0x8855
# part 2 - Apple corporate Unicode chars for Mac OS Japanese vertical
# forms not in Unicode.
# NOTE: These are now all deprecated in favor of combinations of standard
# characters and transcoding hints. The deprecated characters will still
# be loosely mapped to the appropriate Mac OS Japanese character.
0xF8B5 # deprecated, use 0x3001+0xF87E # Jpn-0xEB41
0xF8B6 # deprecated, use 0x3002+0xF87E # Jpn-0xEB42
0xF8B7 # deprecated, use 0xFFE3+0xF87E # Jpn-0xEB50
0xF8B8 # deprecated, use 0x30FC+0xF87E # Jpn-0xEB5B
0xF8B9 # deprecated, use 0x2010+0xF87E # Jpn-0xEB5D
0xF8BA # deprecated, use 0x301C+0xF87E # Jpn-0xEB60
0xF8BB # deprecated, use 0x2016+0xF87E # Jpn-0xEB61
0xF8BC # deprecated, use 0xFF5C+0xF87E # Jpn-0xEB62
0xF8BD # deprecated, use 0x2026+0xF87E # Jpn-0xEB63
0xF8BE # deprecated, use 0xFF3B+0xF87E # Jpn-0xEB6D
0xF8BF # deprecated, use 0xFF3D+0xF87E # Jpn-0xEB6E
0xF8C0 # deprecated, use 0xFF1D+0xF87E # Jpn-0xEB81
0xF8C1 # deprecated, use 0x3041+0xF87E # Jpn-0xEC9F
0xF8C2 # deprecated, use 0x3043+0xF87E # Jpn-0xECA1
0xF8C3 # deprecated, use 0x3045+0xF87E # Jpn-0xECA3
0xF8C4 # deprecated, use 0x3047+0xF87E # Jpn-0xECA5
0xF8C5 # deprecated, use 0x3049+0xF87E # Jpn-0xECA7
0xF8C6 # deprecated, use 0x3063+0xF87E # Jpn-0xECC1
0xF8C7 # deprecated, use 0x3083+0xF87E # Jpn-0xECE1
0xF8C8 # deprecated, use 0x3085+0xF87E # Jpn-0xECE3
0xF8C9 # deprecated, use 0x3087+0xF87E # Jpn-0xECE5
0xF8CA # deprecated, use 0x308E+0xF87E # Jpn-0xECEC
0xF8CB # deprecated, use 0x30A1+0xF87E # Jpn-0xED40
0xF8CC # deprecated, use 0x30A3+0xF87E # Jpn-0xED42
0xF8CD # deprecated, use 0x30A5+0xF87E # Jpn-0xED44
0xF8CE # deprecated, use 0x30A7+0xF87E # Jpn-0xED46
0xF8CF # deprecated, use 0x30A9+0xF87E # Jpn-0xED48
0xF8D0 # deprecated, use 0x30C3+0xF87E # Jpn-0xED62
0xF8D1 # deprecated, use 0x30E3+0xF87E # Jpn-0xED83
0xF8D2 # deprecated, use 0x30E5+0xF87E # Jpn-0xED85
0xF8D3 # deprecated, use 0x30E7+0xF87E # Jpn-0xED87
0xF8D4 # deprecated, use 0x30EE+0xF87E # Jpn-0xED8E
0xF8D5 # deprecated, use 0x30F5+0xF87E # Jpn-0xED95
0xF8D6 # deprecated, use 0x30F6+0xF87E # Jpn-0xED96
# The following (14) are for the Mac OS Dingbats encoding.
# NOTE: These are now all deprecated in favor of standard characters or
# combinations of standard characters and transcoding hints. The
# deprecated characters will still be loosely mapped to the appropriate
# Mac OS Dingbats character.
0xF8D7 # deprecated, use 0x2768 (3.2) or 0x0028 # Dingbats-0x80
0xF8D8 # deprecated, use 0x2769 (3.2) or 0x0029 # Dingbats-0x81
0xF8D9 # deprecated, use 0x276A (3.2) or 0x0028+0xF87F # Dingbats-0x82
0xF8DA # deprecated, use 0x276B (3.2) or 0x0029+0xF87F # Dingbats-0x83
0xF8DB # deprecated, use 0x276C (3.2) or 0x3008 # Dingbats-0x84
0xF8DC # deprecated, use 0x276D (3.2) or 0x3009 # Dingbats-0x85
0xF8DD # deprecated, use 0x276E (3.2) or 0x2039 # Dingbats-0x86
0xF8DE # deprecated, use 0x276F (3.2) or 0x203A # Dingbats-0x87
0xF8DF # deprecated, use 0x2770 (3.2) or 0x3008+0xF87C # Dingbats-0x88
0xF8E0 # deprecated, use 0x2771 (3.2) or 0x3009+0xF87C # Dingbats-0x89
0xF8E1 # deprecated, use 0x2772 (3.2) or 0x3014 # Dingbats-0x8A
0xF8E2 # deprecated, use 0x2773 (3.2) or 0x3015 # Dingbats-0x8B
0xF8E3 # deprecated, use 0x2774 (3.2) or 0x007B # Dingbats-0x8C
0xF8E4 # deprecated, use 0x2775 (3.2) or 0x007D # Dingbats-0x8D
# The following (26) are for the Mac OS Symbol encoding.
# NOTE: Some of these are deprecated in favor of combinations of standard
# characters and transcoding hints. The deprecated characters will still
# be loosely mapped to the appropriate Mac OS Symbol character.
0xF8E5 # radical extender # Symbol-0x60
0xF8E6 # deprecated, use 0x23D0 (4.0) # Symbol-0xBD
0xF8E7 # deprecated, use 0x23AF (3.2) # Symbol-0xBE
0xF8E8 # deprecated, use 0x00AE+0xF87F # Symbol-0xE2
0xF8E9 # deprecated, use 0x00A9+0xF87F # Symbol-0xE3
0xF8EA # deprecated, use 0x2122+0xF87F # Symbol-0xE4
0xF8EB # deprecated, use 0x239B (3.2) or 0x0028+0xF870 # Symbol-0xE6
0xF8EC # deprecated, use 0x239C (3.2) or 0x0028+0xF871 # Symbol-0xE7
0xF8ED # deprecated, use 0x239D (3.2) or 0x0028+0xF872 # Symbol-0xE8
0xF8EE # deprecated, use 0x23A1 (3.2) or 0x005B+0xF870 # Symbol-0xE9
0xF8EF # deprecated, use 0x23A2 (3.2) or 0x005B+0xF871 # Symbol-0xEA
0xF8F0 # deprecated, use 0x23A3 (3.2) or 0x005B+0xF872 # Symbol-0xEB
0xF8F1 # deprecated, use 0x23A7 (3.2) or 0x007B+0xF870 # Symbol-0xEC
0xF8F2 # deprecated, use 0x23A8 (3.2) or 0x007B+0xF871 # Symbol-0xED
0xF8F3 # deprecated, use 0x23A9 (3.2) or 0x007B+0xF872 # Symbol-0xEE
0xF8F4 # deprecated, use 0x23AA (3.2) # Symbol-0xEF
0xF8F5 # deprecated, use 0x23AE (3.2) or 0x222B+0xF871 # Symbol-0xF4
0xF8F6 # deprecated, use 0x239E (3.2) or 0x0029+0xF870 # Symbol-0xF6
0xF8F7 # deprecated, use 0x239F (3.2) or 0x0029+0xF871 # Symbol-0xF7
0xF8F8 # deprecated, use 0x23A0 (3.2) or 0x0029+0xF872 # Symbol-0xF8
0xF8F9 # deprecated, use 0x23A4 (3.2) or 0x005D+0xF870 # Symbol-0xF9
0xF8FA # deprecated, use 0x23A5 (3.2) or 0x005D+0xF871 # Symbol-0xFA
0xF8FB # deprecated, use 0x23A6 (3.2) or 0x005D+0xF872 # Symbol-0xFB
0xF8FC # deprecated, use 0x23AB (3.2) or 0x007D+0xF870 # Symbol-0xFC
0xF8FD # deprecated, use 0x23AC (3.2) or 0x007D+0xF871 # Symbol-0xFD
0xF8FE # deprecated, use 0x23AD (3.2) or 0x007D+0xF872 # Symbol-0xFE
# The following (1) is for the Mac OS Roman encoding
# (also used in Symbol & Croatian).
# NOTE: The graphic image associated with the Apple logo character is
# not authorized for use without permission of Apple, and unauthorized
# use might constitute trademark infringement.
0xF8FF # Apple logo # Roman-0xF0, Symbol-0xF0, Croatian-0xD8

447
unicode/vendors/apple/DEVANAGA.TXT vendored Normal file
View File

@@ -0,0 +1,447 @@
#=======================================================================
# File name: DEVANAGA.TXT
#
# Contents: Map (external version) from Mac OS Devanagari
# encoding to Unicode 2.1 and later.
#
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments; add section on
# roundtrip considerations. Matches internal
# xml <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs. Matches internal utom<b1>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n04 1998-Feb-05 First version; matches internal utom<n9>,
# ufrm<n15>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Devanagari code or code sequence
# (in hex as 0xNN or 0xNN+0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN).
# Column #3 is a comment containing the Unicode name or sequence
# of names. In some cases an additional comment follows the
# Unicode name(s).
#
# The entries are in two sections. The first section is for pairs of
# Mac OS Devanagari code points that must be mapped in a special way.
# The second section maps individual code points.
#
# Within each section, the entries are in Mac OS Devanagari code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Devanagari character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Devanagari:
# ---------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Devanagari is based on IS 13194:1991 (ISCII-91), with the
# addition of several punctuation and symbol characters. However,
# Mac OS Devanagari does not support the ATR (attribute) mechanism of
# ISCII-91.
#
# 1. ISCII-91 features in Mac OS Devanagari include:
#
# a) Overloading of nukta
#
# In addition to using the nukta (0xE9) like a combining dot below,
# nukta is overloaded to function as a general character modifier.
# In this role, certain code points followed by 0xE9 are treated as
# a two-byte code point representing a character which may be
# rather different than the characters represented by either of
# the code points alone. For example, the character DEVANAGARI OM
# (U+0950) is represented in ISCII-91 as candrabindu + nukta.
#
# b) Explicit halant and soft halant
#
# A double halant (0xE8 + 0xE8) constitutes an "explicit halant",
# which will always appear as a halant instead of causing formation
# of a ligature or half-form consonant.
#
# Halant followed by nukta (0xE8 + 0xE9) constitutes a "soft
# halant", which prevents formation of a ligature and instead
# retains the half-form of the first consonant.
#
# c) Invisible consonant
#
# The byte 0xD9 (called INV in ISCII-91) is an invisible consonant:
# It behaves like a consonant but has no visible appearance. It is
# intended to be used (often in combination with halant) to display
# dependent forms in isolation, such as the RA forms or consonant
# half-forms.
#
# d) Extensions for Vedic, etc.
#
# The byte 0xF0 (called EXT in ISCII-91) followed by any byte in
# the range 0xA1-0xEE constitutes a two-byte code point which can
# be used to represent additional characters for Vedic (or other
# extensions); 0xF0 followed by any other byte value constitutes
# malformed text. Mac OS Devanagari supports this mechanism, but
# does not currently map any of these two-byte code points to
# anything.
#
# 2. Mac OS Devanagari additions
#
# Mac OS Devanagari adds characters using the code points
# 0x80-0x8A and 0x90-0x91 (the latter are some Devanagari additions
# from Unicode).
#
# 3. Unused code points
#
# The following code points are currently unused, and are not shown
# here: 0x8B-0x8F, 0x92-0xA0, 0xEB-0xEF, 0xFB-0xFF. In addition,
# 0xF0 is not shown here, but it has a special function as described
# above.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Mapping the byte pairs
#
# If one of the following byte values is encountered when mapping
# Mac OS Devanagari text - 0xA1, 0xA6, 0xA7, 0xAA, 0xDB, 0xDC, 0xDF,
# 0xE8, or 0xEA - then the next byte (if there is one) should be
# examined. If the next byte is 0xE9 - or also 0xE8, if the first
# byte was 0xE8 - then the byte pair should be mapped using the
# first section of the mapping table below. Otherwise, each byte
# should be mapped using the second section of the mapping table
# below.
#
# - The Unicode Standard, Version 2.0, specifies how explicit
# halant and soft halant should be represented in Unicode;
# these mappings are used below.
#
# If the byte value 0xF0 is encountered when mapping Mac OS
# Devanagari text, then the next byte should be examined. If there
# is no next byte (e.g. 0xF0 at end of buffer), the mapping
# process should indicate incomplete character. If there is a next
# byte but it is not in the range 0xA1-0xEE, the mapping process
# should indicate malformed text. Otherwise, the mapping process
# should treat the byte pair as a valid two-byte code point with no
# mapping (e.g. map it to QUESTION MARK, REPLACEMENT CHARACTER,
# etc.).
#
# 2. Mapping the invisible consonant
#
# It has been suggested that INV in ISCII-91 should map to ZERO
# WIDTH NON-JOINER in Unicode. However, this causes problems with
# roundtrip fidelity: The ISCII-91 sequences 0xE8+0xE8 and 0xE8+0xD9
# would map to the same sequence of Unicode characters. We have
# instead mapped INV to LEFT-TO-RIGHT MARK, which avoids these
# problems.
#
# 3. Additional loose mappings from Unicode
#
# These are not preserved in roundtrip mappings.
#
# U+0958 0xB3+0xE9 # DEVANAGARI LETTER QA
# U+0959 0xB4+0xE9 # DEVANAGARI LETTER KHHA
# U+095A 0xB5+0xE9 # DEVANAGARI LETTER GHHA
# U+095B 0xBA+0xE9 # DEVANAGARI LETTER ZA
# U+095C 0xBF+0xE9 # DEVANAGARI LETTER DDDHA
# U+095D 0xC0+0xE9 # DEVANAGARI LETTER RHA
# U+095E 0xC9+0xE9 # DEVANAGARI LETTER FA
#
# 4. Roundtrip considerations when mapping to decomposed Unicode
#
# Both ISCII-91 (hence Mac OS Devanagari) and Unicode provide multiple
# ways of representing certain Devanagari consonants. For example,
# DEVANAGARI LETTER NNNA can be represented in Unicode as the single
# character 0x0929 or as the sequence 0x0928 0x093C; similarly, this
# consonant can be represented in Mac OS Devanagari as 0xC7 or as the
# sequence 0xC6 0xE9. This leads to some roundtrip problems. First
# note that we have the following mappings without such problems:
#
# ISCII/ standard decomposition of reverse mapping
# Mac OS Unicode mapping standard mapping of decomposition
# ------ ----------------------- ---------------- ----------------
# 0xC6 0x0928 ... LETTER NA 0x0928 (same) 0xC6
# 0xCD 0x092F ... LETTER YA 0x092F (same) 0xCD
# 0xCF 0x0930 ... LETTER RA 0x0930 (same) 0xCF
# 0xD2 0x0933 ... LETTER LLA 0x0933 (same) 0xD2
# 0xE9 0x093C ... SIGN NUKTA 0x093C (same) 0xE9
#
# However, those mappings above cause roundtrip problems for the
# the following mappings if they are decomposed:
#
# ISCII/ standard decomposition of reverse mapping
# Mac OS Unicode mapping standard mapping of decomposition
# ------ ----------------------- ---------------- ----------------
# 0xC7 0x0929 ... LETTER NNNA 0x0928 0x093C 0xC6 0xE9
# 0xCE 0x095F ... LETTER YYA 0x092F 0x093C 0xCD 0xE9
# 0xD0 0x0931 ... LETTER RRA 0x0930 0x093C 0xCF 0xE9
# 0xD3 0x0934 ... LETTER LLLA 0x0933 0x093C 0xD2 0xE9
#
# One solution is to use a grouping transcoding hint with the four
# decompositions above to mark the decomposed sequence for special
# treatment in transcoding. This yields the following mappings to
# decomposed Unicode:
#
# ISCII/ decomposed
# Mac OS Unicode mapping
# ------ ----------------
# 0xC7 0xF860 0x0928 0x093C
# 0xCE 0xF860 0x092F 0x093C
# 0xD0 0xF860 0x0930 0x093C
# 0xD3 0xF860 0x0933 0x093C
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
# Section 1: Map the following byte pairs as indicated:
# (ZWNJ means ZERO WIDTH NON-JOINER, ZWJ means ZERO WIDTH JOINER)
# (Also see note about 0xF0 in comments above)
0xA1+0xE9 0x0950 # DEVANAGARI OM
0xA6+0xE9 0x090C # DEVANAGARI LETTER VOCALIC L
0xA7+0xE9 0x0961 # DEVANAGARI LETTER VOCALIC LL
0xAA+0xE9 0x0960 # DEVANAGARI LETTER VOCALIC RR
0xDB+0xE9 0x0962 # DEVANAGARI VOWEL SIGN VOCALIC L
0xDC+0xE9 0x0963 # DEVANAGARI VOWEL SIGN VOCALIC LL
0xDF+0xE9 0x0944 # DEVANAGARI VOWEL SIGN VOCALIC RR
0xE8+0xE8 0x094D+0x200C # DEVANAGARI SIGN VIRAMA + ZWNJ # explicit halant
0xE8+0xE9 0x094D+0x200D # DEVANAGARI SIGN VIRAMA + ZWJ # soft halant
0xEA+0xE9 0x093D # DEVANAGARI SIGN AVAGRAHA
# Section 2: Map the remaining bytes as follows:
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00D7 # MULTIPLICATION SIGN
0x81 0x2212 # MINUS SIGN
0x82 0x2013 # EN DASH
0x83 0x2014 # EM DASH
0x84 0x2018 # LEFT SINGLE QUOTATION MARK
0x85 0x2019 # RIGHT SINGLE QUOTATION MARK
0x86 0x2026 # HORIZONTAL ELLIPSIS
0x87 0x2022 # BULLET
0x88 0x00A9 # COPYRIGHT SIGN
0x89 0x00AE # REGISTERED SIGN
0x8A 0x2122 # TRADE MARK SIGN
#
0x90 0x0965 # DEVANAGARI DOUBLE DANDA
0x91 0x0970 # DEVANAGARI ABBREVIATION SIGN
#
0xA1 0x0901 # DEVANAGARI SIGN CANDRABINDU
0xA2 0x0902 # DEVANAGARI SIGN ANUSVARA
0xA3 0x0903 # DEVANAGARI SIGN VISARGA
0xA4 0x0905 # DEVANAGARI LETTER A
0xA5 0x0906 # DEVANAGARI LETTER AA
0xA6 0x0907 # DEVANAGARI LETTER I
0xA7 0x0908 # DEVANAGARI LETTER II
0xA8 0x0909 # DEVANAGARI LETTER U
0xA9 0x090A # DEVANAGARI LETTER UU
0xAA 0x090B # DEVANAGARI LETTER VOCALIC R
0xAB 0x090E # DEVANAGARI LETTER SHORT E
0xAC 0x090F # DEVANAGARI LETTER E
0xAD 0x0910 # DEVANAGARI LETTER AI
0xAE 0x090D # DEVANAGARI LETTER CANDRA E
0xAF 0x0912 # DEVANAGARI LETTER SHORT O
0xB0 0x0913 # DEVANAGARI LETTER O
0xB1 0x0914 # DEVANAGARI LETTER AU
0xB2 0x0911 # DEVANAGARI LETTER CANDRA O
0xB3 0x0915 # DEVANAGARI LETTER KA
0xB4 0x0916 # DEVANAGARI LETTER KHA
0xB5 0x0917 # DEVANAGARI LETTER GA
0xB6 0x0918 # DEVANAGARI LETTER GHA
0xB7 0x0919 # DEVANAGARI LETTER NGA
0xB8 0x091A # DEVANAGARI LETTER CA
0xB9 0x091B # DEVANAGARI LETTER CHA
0xBA 0x091C # DEVANAGARI LETTER JA
0xBB 0x091D # DEVANAGARI LETTER JHA
0xBC 0x091E # DEVANAGARI LETTER NYA
0xBD 0x091F # DEVANAGARI LETTER TTA
0xBE 0x0920 # DEVANAGARI LETTER TTHA
0xBF 0x0921 # DEVANAGARI LETTER DDA
0xC0 0x0922 # DEVANAGARI LETTER DDHA
0xC1 0x0923 # DEVANAGARI LETTER NNA
0xC2 0x0924 # DEVANAGARI LETTER TA
0xC3 0x0925 # DEVANAGARI LETTER THA
0xC4 0x0926 # DEVANAGARI LETTER DA
0xC5 0x0927 # DEVANAGARI LETTER DHA
0xC6 0x0928 # DEVANAGARI LETTER NA
0xC7 0x0929 # DEVANAGARI LETTER NNNA
0xC8 0x092A # DEVANAGARI LETTER PA
0xC9 0x092B # DEVANAGARI LETTER PHA
0xCA 0x092C # DEVANAGARI LETTER BA
0xCB 0x092D # DEVANAGARI LETTER BHA
0xCC 0x092E # DEVANAGARI LETTER MA
0xCD 0x092F # DEVANAGARI LETTER YA
0xCE 0x095F # DEVANAGARI LETTER YYA
0xCF 0x0930 # DEVANAGARI LETTER RA
0xD0 0x0931 # DEVANAGARI LETTER RRA
0xD1 0x0932 # DEVANAGARI LETTER LA
0xD2 0x0933 # DEVANAGARI LETTER LLA
0xD3 0x0934 # DEVANAGARI LETTER LLLA
0xD4 0x0935 # DEVANAGARI LETTER VA
0xD5 0x0936 # DEVANAGARI LETTER SHA
0xD6 0x0937 # DEVANAGARI LETTER SSA
0xD7 0x0938 # DEVANAGARI LETTER SA
0xD8 0x0939 # DEVANAGARI LETTER HA
0xD9 0x200E # LEFT-TO-RIGHT MARK # invisible consonant
0xDA 0x093E # DEVANAGARI VOWEL SIGN AA
0xDB 0x093F # DEVANAGARI VOWEL SIGN I
0xDC 0x0940 # DEVANAGARI VOWEL SIGN II
0xDD 0x0941 # DEVANAGARI VOWEL SIGN U
0xDE 0x0942 # DEVANAGARI VOWEL SIGN UU
0xDF 0x0943 # DEVANAGARI VOWEL SIGN VOCALIC R
0xE0 0x0946 # DEVANAGARI VOWEL SIGN SHORT E
0xE1 0x0947 # DEVANAGARI VOWEL SIGN E
0xE2 0x0948 # DEVANAGARI VOWEL SIGN AI
0xE3 0x0945 # DEVANAGARI VOWEL SIGN CANDRA E
0xE4 0x094A # DEVANAGARI VOWEL SIGN SHORT O
0xE5 0x094B # DEVANAGARI VOWEL SIGN O
0xE6 0x094C # DEVANAGARI VOWEL SIGN AU
0xE7 0x0949 # DEVANAGARI VOWEL SIGN CANDRA O
0xE8 0x094D # DEVANAGARI SIGN VIRAMA # halant
0xE9 0x093C # DEVANAGARI SIGN NUKTA
0xEA 0x0964 # DEVANAGARI DANDA
#
0xF1 0x0966 # DEVANAGARI DIGIT ZERO
0xF2 0x0967 # DEVANAGARI DIGIT ONE
0xF3 0x0968 # DEVANAGARI DIGIT TWO
0xF4 0x0969 # DEVANAGARI DIGIT THREE
0xF5 0x096A # DEVANAGARI DIGIT FOUR
0xF6 0x096B # DEVANAGARI DIGIT FIVE
0xF7 0x096C # DEVANAGARI DIGIT SIX
0xF8 0x096D # DEVANAGARI DIGIT SEVEN
0xF9 0x096E # DEVANAGARI DIGIT EIGHT
0xFA 0x096F # DEVANAGARI DIGIT NINE

329
unicode/vendors/apple/DINGBATS.TXT vendored Normal file
View File

@@ -0,0 +1,329 @@
#=======================================================================
# File name: DINGBATS.TXT
#
# Contents: Map (external version) from Mac OS Dingbats
# character set to Unicode 3.2 and later.
#
# Copyright: (c) 1994-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update mappings for 0x80-0x8D to use new
# Unicode 3.2 characters. Update URLs, notes.
# Matches internal utom<b2>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n05 1998-Feb-05 Update to match internal utom<n4>, ufrm<n14>,
# and Text Encoding Converter version 1.3:
# Change all mappings to single corporate-zone
# Unicodes to either use standard Unicodes
# or standard Unicodes plus transcoding hints;
# see details below. Also update header
# comments to new format.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n4>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Dingbats code (in hex as 0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN).
# Column #3 is a comment containing the Unicode name.
# In some cases an additional comment follows the Unicode name.
#
# The entries are in Mac OS Dingbats code order.
#
# Some of these mappings require the use of corporate characters.
# See the file "CORPCHAR.TXT" and notes below.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Dingbats character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Dingbats:
# -------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported directly in programming
# interfaces for QuickDraw Text, the Script Manager, and related
# Text Utilities. For other purposes it is supported via transcoding
# to and from Unicode.
#
# The Mac OS Dingbats encoding shares the script code smRoman
# (0) with the standard Mac OS Roman encoding. To determine if
# the Dingbats encoding is being used, you must check if the
# font name is "Zapf Dingbats".
#
# The layout of the Dingbats character set is identical to or
# a superset of the layout of the Adobe Zapf Dingbats encoding
# vector.
#
# The following code points are unused, and are not shown here:
# 0x8E-0xA0, 0xF0, 0xFF.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - The mappings for the following Mac OS Dingbats characters
# were changed to use standard Unicode characters added for
# Unicode 3.2: 0x80-0x8D.
#
# Changes from version n03 to version n05:
#
# - The mappings for the following Mac OS Dingbats characters
# were changed from single corporate-zone Unicode characters
# to standard Unicode characters:
# 0x80-0x81, 0x84-0x87, 0x8A-0x8D.
#
# - The mappings for the following Mac OS Dingbats characters
# were changed from single corporate-zone Unicode characters
# to combinations of a standard Unicode and a transcoding hint:
# 0x82-0x83, 0x88-0x89.
#
##################
0x20 0x0020 # SPACE
0x21 0x2701 # UPPER BLADE SCISSORS
0x22 0x2702 # BLACK SCISSORS
0x23 0x2703 # LOWER BLADE SCISSORS
0x24 0x2704 # WHITE SCISSORS
0x25 0x260E # BLACK TELEPHONE
0x26 0x2706 # TELEPHONE LOCATION SIGN
0x27 0x2707 # TAPE DRIVE
0x28 0x2708 # AIRPLANE
0x29 0x2709 # ENVELOPE
0x2A 0x261B # BLACK RIGHT POINTING INDEX
0x2B 0x261E # WHITE RIGHT POINTING INDEX
0x2C 0x270C # VICTORY HAND
0x2D 0x270D # WRITING HAND
0x2E 0x270E # LOWER RIGHT PENCIL
0x2F 0x270F # PENCIL
0x30 0x2710 # UPPER RIGHT PENCIL
0x31 0x2711 # WHITE NIB
0x32 0x2712 # BLACK NIB
0x33 0x2713 # CHECK MARK
0x34 0x2714 # HEAVY CHECK MARK
0x35 0x2715 # MULTIPLICATION X
0x36 0x2716 # HEAVY MULTIPLICATION X
0x37 0x2717 # BALLOT X
0x38 0x2718 # HEAVY BALLOT X
0x39 0x2719 # OUTLINED GREEK CROSS
0x3A 0x271A # HEAVY GREEK CROSS
0x3B 0x271B # OPEN CENTRE CROSS
0x3C 0x271C # HEAVY OPEN CENTRE CROSS
0x3D 0x271D # LATIN CROSS
0x3E 0x271E # SHADOWED WHITE LATIN CROSS
0x3F 0x271F # OUTLINED LATIN CROSS
0x40 0x2720 # MALTESE CROSS
0x41 0x2721 # STAR OF DAVID
0x42 0x2722 # FOUR TEARDROP-SPOKED ASTERISK
0x43 0x2723 # FOUR BALLOON-SPOKED ASTERISK
0x44 0x2724 # HEAVY FOUR BALLOON-SPOKED ASTERISK
0x45 0x2725 # FOUR CLUB-SPOKED ASTERISK
0x46 0x2726 # BLACK FOUR POINTED STAR
0x47 0x2727 # WHITE FOUR POINTED STAR
0x48 0x2605 # BLACK STAR
0x49 0x2729 # STRESS OUTLINED WHITE STAR
0x4A 0x272A # CIRCLED WHITE STAR
0x4B 0x272B # OPEN CENTRE BLACK STAR
0x4C 0x272C # BLACK CENTRE WHITE STAR
0x4D 0x272D # OUTLINED BLACK STAR
0x4E 0x272E # HEAVY OUTLINED BLACK STAR
0x4F 0x272F # PINWHEEL STAR
0x50 0x2730 # SHADOWED WHITE STAR
0x51 0x2731 # HEAVY ASTERISK
0x52 0x2732 # OPEN CENTRE ASTERISK
0x53 0x2733 # EIGHT SPOKED ASTERISK
0x54 0x2734 # EIGHT POINTED BLACK STAR
0x55 0x2735 # EIGHT POINTED PINWHEEL STAR
0x56 0x2736 # SIX POINTED BLACK STAR
0x57 0x2737 # EIGHT POINTED RECTILINEAR BLACK STAR
0x58 0x2738 # HEAVY EIGHT POINTED RECTILINEAR BLACK STAR
0x59 0x2739 # TWELVE POINTED BLACK STAR
0x5A 0x273A # SIXTEEN POINTED ASTERISK
0x5B 0x273B # TEARDROP-SPOKED ASTERISK
0x5C 0x273C # OPEN CENTRE TEARDROP-SPOKED ASTERISK
0x5D 0x273D # HEAVY TEARDROP-SPOKED ASTERISK
0x5E 0x273E # SIX PETALLED BLACK AND WHITE FLORETTE
0x5F 0x273F # BLACK FLORETTE
0x60 0x2740 # WHITE FLORETTE
0x61 0x2741 # EIGHT PETALLED OUTLINED BLACK FLORETTE
0x62 0x2742 # CIRCLED OPEN CENTRE EIGHT POINTED STAR
0x63 0x2743 # HEAVY TEARDROP-SPOKED PINWHEEL ASTERISK
0x64 0x2744 # SNOWFLAKE
0x65 0x2745 # TIGHT TRIFOLIATE SNOWFLAKE
0x66 0x2746 # HEAVY CHEVRON SNOWFLAKE
0x67 0x2747 # SPARKLE
0x68 0x2748 # HEAVY SPARKLE
0x69 0x2749 # BALLOON-SPOKED ASTERISK
0x6A 0x274A # EIGHT TEARDROP-SPOKED PROPELLER ASTERISK
0x6B 0x274B # HEAVY EIGHT TEARDROP-SPOKED PROPELLER ASTERISK
0x6C 0x25CF # BLACK CIRCLE
0x6D 0x274D # SHADOWED WHITE CIRCLE
0x6E 0x25A0 # BLACK SQUARE
0x6F 0x274F # LOWER RIGHT DROP-SHADOWED WHITE SQUARE
0x70 0x2750 # UPPER RIGHT DROP-SHADOWED WHITE SQUARE
0x71 0x2751 # LOWER RIGHT SHADOWED WHITE SQUARE
0x72 0x2752 # UPPER RIGHT SHADOWED WHITE SQUARE
0x73 0x25B2 # BLACK UP-POINTING TRIANGLE
0x74 0x25BC # BLACK DOWN-POINTING TRIANGLE
0x75 0x25C6 # BLACK DIAMOND
0x76 0x2756 # BLACK DIAMOND MINUS WHITE X
0x77 0x25D7 # RIGHT HALF BLACK CIRCLE
0x78 0x2758 # LIGHT VERTICAL BAR
0x79 0x2759 # MEDIUM VERTICAL BAR
0x7A 0x275A # HEAVY VERTICAL BAR
0x7B 0x275B # HEAVY SINGLE TURNED COMMA QUOTATION MARK ORNAMENT
0x7C 0x275C # HEAVY SINGLE COMMA QUOTATION MARK ORNAMENT
0x7D 0x275D # HEAVY DOUBLE TURNED COMMA QUOTATION MARK ORNAMENT
0x7E 0x275E # HEAVY DOUBLE COMMA QUOTATION MARK ORNAMENT
#
0x80 0x2768 # MEDIUM LEFT PARENTHESIS ORNAMENT # for Unicode 3.2 and later
0x81 0x2769 # MEDIUM RIGHT PARENTHESIS ORNAMENT # for Unicode 3.2 and later
0x82 0x276A # MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT # for Unicode 3.2 and later
0x83 0x276B # MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT # for Unicode 3.2 and later
0x84 0x276C # MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT # for Unicode 3.2 and later
0x85 0x276D # MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT # for Unicode 3.2 and later
0x86 0x276E # HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT # for Unicode 3.2 and later
0x87 0x276F # HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT # for Unicode 3.2 and later
0x88 0x2770 # HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT # for Unicode 3.2 and later
0x89 0x2771 # HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT # for Unicode 3.2 and later
0x8A 0x2772 # LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT # for Unicode 3.2 and later
0x8B 0x2773 # LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT # for Unicode 3.2 and later
0x8C 0x2774 # MEDIUM LEFT CURLY BRACKET ORNAMENT # for Unicode 3.2 and later
0x8D 0x2775 # MEDIUM RIGHT CURLY BRACKET ORNAMENT # for Unicode 3.2 and later
#
0xA1 0x2761 # CURVED STEM PARAGRAPH SIGN ORNAMENT
0xA2 0x2762 # HEAVY EXCLAMATION MARK ORNAMENT
0xA3 0x2763 # HEAVY HEART EXCLAMATION MARK ORNAMENT
0xA4 0x2764 # HEAVY BLACK HEART
0xA5 0x2765 # ROTATED HEAVY BLACK HEART BULLET
0xA6 0x2766 # FLORAL HEART
0xA7 0x2767 # ROTATED FLORAL HEART BULLET
0xA8 0x2663 # BLACK CLUB SUIT
0xA9 0x2666 # BLACK DIAMOND SUIT
0xAA 0x2665 # BLACK HEART SUIT
0xAB 0x2660 # BLACK SPADE SUIT
0xAC 0x2460 # CIRCLED DIGIT ONE
0xAD 0x2461 # CIRCLED DIGIT TWO
0xAE 0x2462 # CIRCLED DIGIT THREE
0xAF 0x2463 # CIRCLED DIGIT FOUR
0xB0 0x2464 # CIRCLED DIGIT FIVE
0xB1 0x2465 # CIRCLED DIGIT SIX
0xB2 0x2466 # CIRCLED DIGIT SEVEN
0xB3 0x2467 # CIRCLED DIGIT EIGHT
0xB4 0x2468 # CIRCLED DIGIT NINE
0xB5 0x2469 # CIRCLED NUMBER TEN
0xB6 0x2776 # DINGBAT NEGATIVE CIRCLED DIGIT ONE
0xB7 0x2777 # DINGBAT NEGATIVE CIRCLED DIGIT TWO
0xB8 0x2778 # DINGBAT NEGATIVE CIRCLED DIGIT THREE
0xB9 0x2779 # DINGBAT NEGATIVE CIRCLED DIGIT FOUR
0xBA 0x277A # DINGBAT NEGATIVE CIRCLED DIGIT FIVE
0xBB 0x277B # DINGBAT NEGATIVE CIRCLED DIGIT SIX
0xBC 0x277C # DINGBAT NEGATIVE CIRCLED DIGIT SEVEN
0xBD 0x277D # DINGBAT NEGATIVE CIRCLED DIGIT EIGHT
0xBE 0x277E # DINGBAT NEGATIVE CIRCLED DIGIT NINE
0xBF 0x277F # DINGBAT NEGATIVE CIRCLED NUMBER TEN
0xC0 0x2780 # DINGBAT CIRCLED SANS-SERIF DIGIT ONE
0xC1 0x2781 # DINGBAT CIRCLED SANS-SERIF DIGIT TWO
0xC2 0x2782 # DINGBAT CIRCLED SANS-SERIF DIGIT THREE
0xC3 0x2783 # DINGBAT CIRCLED SANS-SERIF DIGIT FOUR
0xC4 0x2784 # DINGBAT CIRCLED SANS-SERIF DIGIT FIVE
0xC5 0x2785 # DINGBAT CIRCLED SANS-SERIF DIGIT SIX
0xC6 0x2786 # DINGBAT CIRCLED SANS-SERIF DIGIT SEVEN
0xC7 0x2787 # DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT
0xC8 0x2788 # DINGBAT CIRCLED SANS-SERIF DIGIT NINE
0xC9 0x2789 # DINGBAT CIRCLED SANS-SERIF NUMBER TEN
0xCA 0x278A # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ONE
0xCB 0x278B # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT TWO
0xCC 0x278C # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT THREE
0xCD 0x278D # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT FOUR
0xCE 0x278E # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT FIVE
0xCF 0x278F # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT SIX
0xD0 0x2790 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT SEVEN
0xD1 0x2791 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT EIGHT
0xD2 0x2792 # DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT NINE
0xD3 0x2793 # DINGBAT NEGATIVE CIRCLED SANS-SERIF NUMBER TEN
0xD4 0x2794 # HEAVY WIDE-HEADED RIGHTWARDS ARROW
0xD5 0x2192 # RIGHTWARDS ARROW
0xD6 0x2194 # LEFT RIGHT ARROW
0xD7 0x2195 # UP DOWN ARROW
0xD8 0x2798 # HEAVY SOUTH EAST ARROW
0xD9 0x2799 # HEAVY RIGHTWARDS ARROW
0xDA 0x279A # HEAVY NORTH EAST ARROW
0xDB 0x279B # DRAFTING POINT RIGHTWARDS ARROW
0xDC 0x279C # HEAVY ROUND-TIPPED RIGHTWARDS ARROW
0xDD 0x279D # TRIANGLE-HEADED RIGHTWARDS ARROW
0xDE 0x279E # HEAVY TRIANGLE-HEADED RIGHTWARDS ARROW
0xDF 0x279F # DASHED TRIANGLE-HEADED RIGHTWARDS ARROW
0xE0 0x27A0 # HEAVY DASHED TRIANGLE-HEADED RIGHTWARDS ARROW
0xE1 0x27A1 # BLACK RIGHTWARDS ARROW
0xE2 0x27A2 # THREE-D TOP-LIGHTED RIGHTWARDS ARROWHEAD
0xE3 0x27A3 # THREE-D BOTTOM-LIGHTED RIGHTWARDS ARROWHEAD
0xE4 0x27A4 # BLACK RIGHTWARDS ARROWHEAD
0xE5 0x27A5 # HEAVY BLACK CURVED DOWNWARDS AND RIGHTWARDS ARROW
0xE6 0x27A6 # HEAVY BLACK CURVED UPWARDS AND RIGHTWARDS ARROW
0xE7 0x27A7 # SQUAT BLACK RIGHTWARDS ARROW
0xE8 0x27A8 # HEAVY CONCAVE-POINTED BLACK RIGHTWARDS ARROW
0xE9 0x27A9 # RIGHT-SHADED WHITE RIGHTWARDS ARROW
0xEA 0x27AA # LEFT-SHADED WHITE RIGHTWARDS ARROW
0xEB 0x27AB # BACK-TILTED SHADOWED WHITE RIGHTWARDS ARROW
0xEC 0x27AC # FRONT-TILTED SHADOWED WHITE RIGHTWARDS ARROW
0xED 0x27AD # HEAVY LOWER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW
0xEE 0x27AE # HEAVY UPPER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW
0xEF 0x27AF # NOTCHED LOWER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW
#
0xF1 0x27B1 # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARDS ARROW
0xF2 0x27B2 # CIRCLED HEAVY WHITE RIGHTWARDS ARROW
0xF3 0x27B3 # WHITE-FEATHERED RIGHTWARDS ARROW
0xF4 0x27B4 # BLACK-FEATHERED SOUTH EAST ARROW
0xF5 0x27B5 # BLACK-FEATHERED RIGHTWARDS ARROW
0xF6 0x27B6 # BLACK-FEATHERED NORTH EAST ARROW
0xF7 0x27B7 # HEAVY BLACK-FEATHERED SOUTH EAST ARROW
0xF8 0x27B8 # HEAVY BLACK-FEATHERED RIGHTWARDS ARROW
0xF9 0x27B9 # HEAVY BLACK-FEATHERED NORTH EAST ARROW
0xFA 0x27BA # TEARDROP-BARBED RIGHTWARDS ARROW
0xFB 0x27BB # HEAVY TEARDROP-SHANKED RIGHTWARDS ARROW
0xFC 0x27BC # WEDGE-TAILED RIGHTWARDS ARROW
0xFD 0x27BD # HEAVY WEDGE-TAILED RIGHTWARDS ARROW
0xFE 0x27BE # OPEN-OUTLINED RIGHTWARDS ARROW

521
unicode/vendors/apple/FARSI.TXT vendored Normal file
View File

@@ -0,0 +1,521 @@
#=======================================================================
# File name: FARSI.TXT
#
# Contents: Map (external version) from Mac OS Farsi
# character set to Unicode 2.1 and later.
#
# Copyright: (c) 1997-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Add comments about character display and
# direction overrides. Update URLs, notes.
# Matches internal utom<b3>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n04 1998-Feb-05 Show required Unicode character
# directionality in a different way. Matches
# internal utom<n3>, ufrm<n9>, and Text
# Encoding Converter version 1.3. Update
# header comments; include information on
# loose mapping of digits, and changes to
# mapping for the TrueType variant.
# n01 1997-Jul-17 First version. Matches internal utom<n1>,
# ufrm<n2>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Farsi code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN),
# possibly preceded by a tag indicating required directionality
# (i.e. <LR>+0xNNNN or <RL>+0xNNNN).
# Column #3 is a comment containing the Unicode name.
#
# The entries are in Mac OS Farsi code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Farsi character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Farsi:
# ----------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# 1. General
#
# The Mac OS Farsi character set is based on the Mac OS Arabic
# character set. The main difference is in the right-to-left digits
# 0xB0-0xB9: For Mac OS Arabic these correspond to right-left
# versions of the Unicode ARABIC-INDIC DIGITs 0660-0669; for
# Mac OS Farsi these correspond to right-left versions of the
# Unicode EXTENDED ARABIC-INDIC DIGITs 06F0-06F9. The other
# difference is in the nature of the font variants.
#
# For more information, see the comments in the mapping table for
# Mac OS Arabic.
#
# Mac OS Farsi characters 0xEB-0xF2 are non-spacing/combining marks.
#
# 2. Directional characters and roundtrip fidelity
#
# The Mac OS Arabic character set (on which Mac OS Farsi is based)
# was developed in 1986-1987. At that time the bidirectional line
# layout algorithm used in the Mac OS Arabic system was fairly simple;
# it used only a few direction classes (instead of the 19 now used in
# the Unicode bidirectional algorithm). In order to permit users to
# handle some tricky layout problems, certain punctuation and symbol
# characters were encoded twice, one with a left-right direction
# attribute and the other with a right-left direction attribute. This
# is the case in Mac OS Farsi too.
#
# For example, plus sign is encoded at 0x2B with a left-right
# attribute, and at 0xAB with a right-left attribute. However, there
# is only one PLUS SIGN character in Unicode. This leads to some
# interesting problems when mapping between Mac OS Farsi and Unicode;
# see below.
#
# A related problem is that even when a particular character is
# encoded only once in Mac OS Farsi, it may have a different
# direction attribute than the corresponding Unicode character.
#
# For example, the Mac OS Farsi character at 0x93 is HORIZONTAL
# ELLIPSIS with strong right-left direction. However, the Unicode
# character HORIZONTAL ELLIPSIS has direction class neutral.
#
# 3. Behavior of ASCII-range numbers in WorldScript
#
# Mac OS Farsi also has two sets of digit codes.
# The digits at 0x30-0x39 may be displayed using either European
# digit forms or Persian digit forms, depending on context. If there
# is a "strong European" character such as a Latin letter on either
# side of a sequence consisting of digits 0x30-0x39 and possibly comma
# 0x2C or period 0x2E, then the characters will be displayed using
# European forms (This will happen even if there are neutral characters
# between the digits and the strong European character). Otherwise, the
# digits will be displayed using Persian forms, the comma will be
# displayed as Arabic thousands separator, and the period as Arabic
# decimal separator. In any case, 0x2C, 0x2E, and 0x30-0x39 are always
# left-right.
#
# The digits at 0xB0-0xB9 are always displayed using Persian digit
# shapes, and moreover, these digits always have strong right-left
# directionality. These are mainly intended for special layout
# purposes such as part numbers, etc.
#
# 4. Font variants
#
# The table in this file gives the Unicode mappings for the standard
# Mac OS Farsi encoding. This encoding is supported by the Tehran font
# (the system font for Farsi), and is the encoding supported by the
# text processing utilities. However, the other Farsi fonts actually
# implement a somewhat different encoding; this affects nine code
# points including 0xAA and 0xC0 (which are also affected by font
# variants in Mac OS Arabic). For these nine code points the standard
# Mac OS Farsi encoding has the following mappings:
# 0x8B -> 0x06BA ARABIC LETTER NOON GHUNNA (Urdu)
# 0xA4 -> <RL>+0x0024 DOLLAR SIGN, right-left
# 0xAA -> <RL>+0x002A ASTERISK, right-left
# 0xC0 -> <RL>+0x274A EIGHT TEARDROP-SPOKED PROPELLER ASTERISK,
# right-left
# 0xF4 -> 0x0679 ARABIC LETTER TTEH (Urdu)
# 0xF7 -> 0x06A4 ARABIC LETTER VEH (for transliteration)
# 0xF9 -> 0x0688 ARABIC LETTER DDAL (Urdu)
# 0xFA -> 0x0691 ARABIC LETTER RREH (Urdu)
# 0xFF -> 0x06D2 ARABIC LETTER YEH BARREE (Urdu)
#
# The TrueType variant is used for the Farsi TrueType fonts: Ashfahan,
# Amir, Kamran, Mashad, NadeemFarsi. It differs from the standard
# variant in the following ways:
# 0x8B -> 0xF882 Arabic ligature "peace on him" (corporate char.)
# 0xA4 -> 0xFDFC RIAL SIGN (added in Unicode 3.2)
# 0xAA -> <RL>+0x00D7 MULTIPLICATION SIGN, right-left
# 0xC0 -> <RL>+0x002A ASTERISK, right-left
# 0xF4 -> <RL>+0x00B0 DEGREE SIGN, right-left
# 0xF7 -> 0xFDFA ARABIC LIGATURE SALLALLAHOU ALAYHE WASALLAM
# 0xF9 -> <RL>+0x25CF BLACK CIRCLE, right-left
# 0xFA -> <RL>+0x25A0 BLACK SQUARE, right-left
# 0xFF -> <RL>+0x25B2 BLACK UP-POINTING TRIANGLE, right-left
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Matching the direction of Mac OS Farsi characters
#
# When Mac OS Farsi encodes a character twice but with different
# direction attributes for the two code points - as in the case of
# plus sign mentioned above - we need a way to map both Mac OS Farsi
# code points to Unicode and back again without loss of information.
# With the plus sign, for example, mapping one of the Mac OS Farsi
# characters to a code in the Unicode corporate use zone is
# undesirable, since both of the plus sign characters are likely to
# be used in text that is interchanged.
#
# The problem is solved with the use of direction override characters
# and direction-dependent mappings. When mapping from Mac OS Farsi
# to Unicode, we use direction overrides as necessary to force the
# direction of the resulting Unicode characters.
#
# The required direction is indicated by a direction tag in the
# mappings. A tag of <LR> means the corresponding Unicode character
# must have a strong left-right context, and a tag of <RL> indicates
# a right-left context.
#
# For example, the mapping of 0x2B is given as <LR>+0x002B; the
# mapping of 0xAB is given as <RL>+0x002B. If we map an isolated
# instance of 0x2B to Unicode, it should be mapped as follows (LRO
# indicates LEFT-RIGHT OVERRIDE, PDF indicates POP DIRECTION
# FORMATTING):
#
# 0x2B -> 0x202D (LRO) + 0x002B (PLUS SIGN) + 0x202C (PDF)
#
# When mapping several characters in a row that require direction
# forcing, the overrides need only be used at the beginning and end.
# For example:
#
# 0x24 0x20 0x28 0x29 -> 0x202D 0x0024 0x0020 0x0028 0x0029 0x202C
#
# If neutral characters that require direction forcing are already
# between strong-direction characters with matching directionality,
# then direction overrides need not be used. Direction overrides are
# always needed to map the right-left digits at 0xB0-0xB9.
#
# When mapping from Unicode to Mac OS Farsi, the Unicode
# bidirectional algorithm should be used to determine resolved
# direction of the Unicode characters. The mapping from Unicode to
# Mac OS Farsi can then be disambiguated by the use of the resolved
# direction:
#
# Unicode 0x002B -> Mac OS Farsi 0x2B (if L) or 0xAB (if R)
#
# However, this also means the direction override characters should
# be discarded when mapping from Unicode to Mac OS Farsi (after
# they have been used to determine resolved direction), since the
# direction override information is carried by the code point itself.
#
# Even when direction overrides are not needed for roundtrip
# fidelity, they are sometimes used when mapping Mac OS Farsi
# characters to Unicode in order to achieve similar text layout with
# the resulting Unicode text. For example, the single Mac OS Farsi
# ellipsis character has direction class right-left,and there is no
# left-right version. However, the Unicode HORIZONTAL ELLIPSIS
# character has direction class neutral (which means it may end up
# with a resolved direction of left-right if surrounded by left-right
# characters). When mapping the Mac OS Farsi ellipsis to Unicode, it
# is surrounded with a direction override to help preserve proper
# text layout. The resolved direction is not needed or used when
# mapping the Unicode HORIZONTAL ELLIPSIS back to Mac OS Farsi.
#
# 2. Mapping the Mac OS Farsi digits
#
# The main table below contains mappings that should be used when
# strict round-trip fidelity is required. However, for numeric
# values, the mappings in that table will produce Unicode characters
# that may appear different than the Mac OS Farsi text displayed on
# a Mac OS system using WorldScript. This is because WorldScript
# uses context-dependent display for the 0x30-0x39 digits.
#
# If roundtrip fidelity is not required, then the following
# alternate mappings should be used when a sequence of 0x30-0x39
# digits - possibly including 0x2C and 0x2E - occurs in an Arabic
# context (that is, when the first "strong" character on either side
# of the digit sequence is Arabic, or there is no strong character):
#
# 0x2C 0x066C # ARABIC THOUSANDS SEPARATOR
# 0x2E 0x066B # ARABIC DECIMAL SEPARATOR
# 0x30 0x06F0 # EXTENDED ARABIC-INDIC DIGIT ZERO
# 0x31 0x06F1 # EXTENDED ARABIC-INDIC DIGIT ONE
# 0x32 0x06F2 # EXTENDED ARABIC-INDIC DIGIT TWO
# 0x33 0x06F3 # EXTENDED ARABIC-INDIC DIGIT THREE
# 0x34 0x06F4 # EXTENDED ARABIC-INDIC DIGIT FOUR
# 0x35 0x06F5 # EXTENDED ARABIC-INDIC DIGIT FIVE
# 0x36 0x06F6 # EXTENDED ARABIC-INDIC DIGIT SIX
# 0x37 0x06F7 # EXTENDED ARABIC-INDIC DIGIT SEVEN
# 0x38 0x06F8 # EXTENDED ARABIC-INDIC DIGIT EIGHT
# 0x39 0x06F9 # EXTENDED ARABIC-INDIC DIGIT NINE
#
# 3. Use of corporate-zone Unicodes (mapping the TrueType variant)
#
# The following corporate zone Unicode character is used in this
# mapping:
#
# 0xF882 Arabic ligature "peace on him"
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - Update mapping of 0xA4 in TrueType variant to use new Unicode
# character U+FDFC RIAL SIGN addded for Unicode 3.2
#
# Changes from version n01 to version n04:
#
# - Change mapping of 0xA4 in TrueType variant (just described in
# header comment) from single corporate character to use
# grouping hint
#
##################
0x20 <LR>+0x0020 # SPACE, left-right
0x21 <LR>+0x0021 # EXCLAMATION MARK, left-right
0x22 <LR>+0x0022 # QUOTATION MARK, left-right
0x23 <LR>+0x0023 # NUMBER SIGN, left-right
0x24 <LR>+0x0024 # DOLLAR SIGN, left-right
0x25 <LR>+0x0025 # PERCENT SIGN, left-right
0x26 <LR>+0x0026 # AMPERSAND, left-right
0x27 <LR>+0x0027 # APOSTROPHE, left-right
0x28 <LR>+0x0028 # LEFT PARENTHESIS, left-right
0x29 <LR>+0x0029 # RIGHT PARENTHESIS, left-right
0x2A <LR>+0x002A # ASTERISK, left-right
0x2B <LR>+0x002B # PLUS SIGN, left-right
0x2C <LR>+0x002C # COMMA, left-right; in Arabic-script context, displayed as 0x066C ARABIC THOUSANDS SEPARATOR
0x2D <LR>+0x002D # HYPHEN-MINUS, left-right
0x2E <LR>+0x002E # FULL STOP, left-right; in Arabic-script context, displayed as 0x066B ARABIC DECIMAL SEPARATOR
0x2F <LR>+0x002F # SOLIDUS, left-right
0x30 0x0030 # DIGIT ZERO; in Arabic-script context, displayed as 0x06F0 EXTENDED ARABIC-INDIC DIGIT ZERO
0x31 0x0031 # DIGIT ONE; in Arabic-script context, displayed as 0x06F1 EXTENDED ARABIC-INDIC DIGIT ONE
0x32 0x0032 # DIGIT TWO; in Arabic-script context, displayed as 0x06F2 EXTENDED ARABIC-INDIC DIGIT TWO
0x33 0x0033 # DIGIT THREE; in Arabic-script context, displayed as 0x06F3 EXTENDED ARABIC-INDIC DIGIT THREE
0x34 0x0034 # DIGIT FOUR; in Arabic-script context, displayed as 0x06F4 EXTENDED ARABIC-INDIC DIGIT FOUR
0x35 0x0035 # DIGIT FIVE; in Arabic-script context, displayed as 0x06F5 EXTENDED ARABIC-INDIC DIGIT FIVE
0x36 0x0036 # DIGIT SIX; in Arabic-script context, displayed as 0x06F6 EXTENDED ARABIC-INDIC DIGIT SIX
0x37 0x0037 # DIGIT SEVEN; in Arabic-script context, displayed as 0x06F7 EXTENDED ARABIC-INDIC DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT; in Arabic-script context, displayed as 0x06F8 EXTENDED ARABIC-INDIC DIGIT EIGHT
0x39 0x0039 # DIGIT NINE; in Arabic-script context, displayed as 0x06F9 EXTENDED ARABIC-INDIC DIGIT NINE
0x3A <LR>+0x003A # COLON, left-right
0x3B <LR>+0x003B # SEMICOLON, left-right
0x3C <LR>+0x003C # LESS-THAN SIGN, left-right
0x3D <LR>+0x003D # EQUALS SIGN, left-right
0x3E <LR>+0x003E # GREATER-THAN SIGN, left-right
0x3F <LR>+0x003F # QUESTION MARK, left-right
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B <LR>+0x005B # LEFT SQUARE BRACKET, left-right
0x5C <LR>+0x005C # REVERSE SOLIDUS, left-right
0x5D <LR>+0x005D # RIGHT SQUARE BRACKET, left-right
0x5E <LR>+0x005E # CIRCUMFLEX ACCENT, left-right
0x5F <LR>+0x005F # LOW LINE, left-right
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B <LR>+0x007B # LEFT CURLY BRACKET, left-right
0x7C <LR>+0x007C # VERTICAL LINE, left-right
0x7D <LR>+0x007D # RIGHT CURLY BRACKET, left-right
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 <RL>+0x00A0 # NO-BREAK SPACE, right-left
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0x85 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0x86 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0x87 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0x88 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0x89 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x06BA # ARABIC LETTER NOON GHUNNA
0x8C <RL>+0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left
0x8D 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0x90 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0x91 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 <RL>+0x2026 # HORIZONTAL ELLIPSIS, right-left
0x94 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0x95 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0x96 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 <RL>+0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B <RL>+0x00F7 # DIVISION SIGN, right-left
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 <RL>+0x0020 # SPACE, right-left
0xA1 <RL>+0x0021 # EXCLAMATION MARK, right-left
0xA2 <RL>+0x0022 # QUOTATION MARK, right-left
0xA3 <RL>+0x0023 # NUMBER SIGN, right-left
0xA4 <RL>+0x0024 # DOLLAR SIGN, right-left
0xA5 0x066A # ARABIC PERCENT SIGN
0xA6 <RL>+0x0026 # AMPERSAND, right-left
0xA7 <RL>+0x0027 # APOSTROPHE, right-left
0xA8 <RL>+0x0028 # LEFT PARENTHESIS, right-left
0xA9 <RL>+0x0029 # RIGHT PARENTHESIS, right-left
0xAA <RL>+0x002A # ASTERISK, right-left
0xAB <RL>+0x002B # PLUS SIGN, right-left
0xAC 0x060C # ARABIC COMMA
0xAD <RL>+0x002D # HYPHEN-MINUS, right-left
0xAE <RL>+0x002E # FULL STOP, right-left
0xAF <RL>+0x002F # SOLIDUS, right-left
0xB0 <RL>+0x06F0 # EXTENDED ARABIC-INDIC DIGIT ZERO, right-left (need override)
0xB1 <RL>+0x06F1 # EXTENDED ARABIC-INDIC DIGIT ONE, right-left (need override)
0xB2 <RL>+0x06F2 # EXTENDED ARABIC-INDIC DIGIT TWO, right-left (need override)
0xB3 <RL>+0x06F3 # EXTENDED ARABIC-INDIC DIGIT THREE, right-left (need override)
0xB4 <RL>+0x06F4 # EXTENDED ARABIC-INDIC DIGIT FOUR, right-left (need override)
0xB5 <RL>+0x06F5 # EXTENDED ARABIC-INDIC DIGIT FIVE, right-left (need override)
0xB6 <RL>+0x06F6 # EXTENDED ARABIC-INDIC DIGIT SIX, right-left (need override)
0xB7 <RL>+0x06F7 # EXTENDED ARABIC-INDIC DIGIT SEVEN, right-left (need override)
0xB8 <RL>+0x06F8 # EXTENDED ARABIC-INDIC DIGIT EIGHT, right-left (need override)
0xB9 <RL>+0x06F9 # EXTENDED ARABIC-INDIC DIGIT NINE, right-left (need override)
0xBA <RL>+0x003A # COLON, right-left
0xBB 0x061B # ARABIC SEMICOLON
0xBC <RL>+0x003C # LESS-THAN SIGN, right-left
0xBD <RL>+0x003D # EQUALS SIGN, right-left
0xBE <RL>+0x003E # GREATER-THAN SIGN, right-left
0xBF 0x061F # ARABIC QUESTION MARK
0xC0 <RL>+0x274A # EIGHT TEARDROP-SPOKED PROPELLER ASTERISK, right-left
0xC1 0x0621 # ARABIC LETTER HAMZA
0xC2 0x0622 # ARABIC LETTER ALEF WITH MADDA ABOVE
0xC3 0x0623 # ARABIC LETTER ALEF WITH HAMZA ABOVE
0xC4 0x0624 # ARABIC LETTER WAW WITH HAMZA ABOVE
0xC5 0x0625 # ARABIC LETTER ALEF WITH HAMZA BELOW
0xC6 0x0626 # ARABIC LETTER YEH WITH HAMZA ABOVE
0xC7 0x0627 # ARABIC LETTER ALEF
0xC8 0x0628 # ARABIC LETTER BEH
0xC9 0x0629 # ARABIC LETTER TEH MARBUTA
0xCA 0x062A # ARABIC LETTER TEH
0xCB 0x062B # ARABIC LETTER THEH
0xCC 0x062C # ARABIC LETTER JEEM
0xCD 0x062D # ARABIC LETTER HAH
0xCE 0x062E # ARABIC LETTER KHAH
0xCF 0x062F # ARABIC LETTER DAL
0xD0 0x0630 # ARABIC LETTER THAL
0xD1 0x0631 # ARABIC LETTER REH
0xD2 0x0632 # ARABIC LETTER ZAIN
0xD3 0x0633 # ARABIC LETTER SEEN
0xD4 0x0634 # ARABIC LETTER SHEEN
0xD5 0x0635 # ARABIC LETTER SAD
0xD6 0x0636 # ARABIC LETTER DAD
0xD7 0x0637 # ARABIC LETTER TAH
0xD8 0x0638 # ARABIC LETTER ZAH
0xD9 0x0639 # ARABIC LETTER AIN
0xDA 0x063A # ARABIC LETTER GHAIN
0xDB <RL>+0x005B # LEFT SQUARE BRACKET, right-left
0xDC <RL>+0x005C # REVERSE SOLIDUS, right-left
0xDD <RL>+0x005D # RIGHT SQUARE BRACKET, right-left
0xDE <RL>+0x005E # CIRCUMFLEX ACCENT, right-left
0xDF <RL>+0x005F # LOW LINE, right-left
0xE0 0x0640 # ARABIC TATWEEL
0xE1 0x0641 # ARABIC LETTER FEH
0xE2 0x0642 # ARABIC LETTER QAF
0xE3 0x0643 # ARABIC LETTER KAF
0xE4 0x0644 # ARABIC LETTER LAM
0xE5 0x0645 # ARABIC LETTER MEEM
0xE6 0x0646 # ARABIC LETTER NOON
0xE7 0x0647 # ARABIC LETTER HEH
0xE8 0x0648 # ARABIC LETTER WAW
0xE9 0x0649 # ARABIC LETTER ALEF MAKSURA
0xEA 0x064A # ARABIC LETTER YEH
0xEB 0x064B # ARABIC FATHATAN
0xEC 0x064C # ARABIC DAMMATAN
0xED 0x064D # ARABIC KASRATAN
0xEE 0x064E # ARABIC FATHA
0xEF 0x064F # ARABIC DAMMA
0xF0 0x0650 # ARABIC KASRA
0xF1 0x0651 # ARABIC SHADDA
0xF2 0x0652 # ARABIC SUKUN
0xF3 0x067E # ARABIC LETTER PEH
0xF4 0x0679 # ARABIC LETTER TTEH
0xF5 0x0686 # ARABIC LETTER TCHEH
0xF6 0x06D5 # ARABIC LETTER AE
0xF7 0x06A4 # ARABIC LETTER VEH
0xF8 0x06AF # ARABIC LETTER GAF
0xF9 0x0688 # ARABIC LETTER DDAL
0xFA 0x0691 # ARABIC LETTER RREH
0xFB <RL>+0x007B # LEFT CURLY BRACKET, right-left
0xFC <RL>+0x007C # VERTICAL LINE, right-left
0xFD <RL>+0x007D # RIGHT CURLY BRACKET, right-left
0xFE 0x0698 # ARABIC LETTER JEH
0xFF 0x06D2 # ARABIC LETTER YEH BARREE

337
unicode/vendors/apple/GAELIC.TXT vendored Normal file
View File

@@ -0,0 +1,337 @@
#=======================================================================
# File name: GAELIC.TXT
#
# Contents: Map (external version) from Mac OS Celtic
# character set to Unicode 3.0 and later
#
# Contacts: charsets@apple.com, everson@evertype.com
#
# Changes:
#
# c01 2005-Apr-01 First posted version. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Gaelic code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in Mac OS Gaelic code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Gaelic character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Gaelic (partly from Michael Everson):
# -----------------------------------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# This character set was developed by Michael Everson of Everson
# Typography (everson@evertype.com) and was used for fonts in his
# Celtic Utilities and CeltScript font packages for the Mac, as well
# as some fonts included with the Irish localizations of Mac OS 6.0.8
# and 7.1. Note that while Apple authorized this Irish localization,
# it was not a system which shipped with Apple hardware, and was not
# otherwise supported by Apple. Fonts conforming to the Mac OS Gaelic
# character set are available from Everson Typography
# (http://www.evertype.com/celtscript/). Information about the use of
# this character set is available at
# http://www.evertype.com/celtscript/celtcode.html.
#
# The Mac OS Gaelic encoding shares the script code smRoman (0) with
# the standard Mac OS Roman encoding. To determine if the Gaelic
# encoding is being used in Mac OS 7-9, you should also check if the
# system region code is 81. Otherwise, you can check for particular
# fonts that conform to this encoding (since in practice Gaelic fonts
# are used with the ordinary US or UK system versions).
#
# This character set is a variant of standard Mac OS Roman, adding
# capital and small y with acute, grave, and circumflex; capital and
# small w with acute, grave, circumflex and diaeresis; capital and
# small b, c, d, f, g, m, p, s, t with dot above; tironian et; small
# long r, small long s, and small long s with dot above. It has 36
# code point differences from standard Mac OS Roman.
#
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts are updated for Mac OS 8.5 to reflect this. There is
# a "currency sign" variant of the Latin 8 Extended encoding that still
# maps 0xDB to U+00A4; this can be used for older fonts.
# Note: U+20AC is new with Unicode 2.1; for earlier Unicode
# versions, Latin 8 Extended 0xDB may be mapped to private-use
# character U+F8A0.
#
# Before Unicode 3.0, code point 0xE4 was PER MILLE SIGN, and was
# mapped to U+2030. Since August 1998, code point 0xE4 is changed
# to TIRONIAN SIGN ET and maps to U+204A. There is a "per mille
# sign" variant of the Mac OS Gaelic encoding that still
# maps 0xE4 to U+2030; this can be used for older fonts.
# Note: U+204A is new with Unicode 3.0; for earlier Unicode
# versions, Mac OS Gaelic was unified with AMPERSAND.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0x85 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0x86 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0x87 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0x88 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0x89 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0x8C 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0x8D 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0x90 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0x91 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0x94 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0x95 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0x96 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x00DF # LATIN SMALL LETTER SHARP S
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x1E02 # LATIN CAPITAL LETTER B WITH DOT ABOVE
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2264 # LESS-THAN OR EQUAL TO
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x1E03 # LATIN SMALL LETTER B WITH DOT ABOVE
0xB5 0x010A # LATIN CAPITAL LETTER C WITH DOT ABOVE
0xB6 0x010B # LATIN SMALL LETTER C WITH DOT ABOVE
0xB7 0x1E0A # LATIN CAPITAL LETTER D WITH DOT ABOVE
0xB8 0x1E0B # LATIN SMALL LETTER D WITH DOT ABOVE
0xB9 0x1E1E # LATIN CAPITAL LETTER F WITH DOT ABOVE
0xBA 0x1E1F # LATIN SMALL LETTER F WITH DOT ABOVE
0xBB 0x0120 # LATIN CAPITAL LETTER G WITH DOT ABOVE
0xBC 0x0121 # LATIN SMALL LETTER G WITH DOT ABOVE
0xBD 0x1E40 # LATIN CAPITAL LETTER M WITH DOT ABOVE
0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x1E41 # LATIN SMALL LETTER M WITH DOT ABOVE
0xC1 0x1E56 # LATIN CAPITAL LETTER P WITH DOT ABOVE
0xC2 0x1E57 # LATIN SMALL LETTER P WITH DOT ABOVE
0xC3 0x027C # LATIN SMALL LETTER R WITH LONG LEG
0xC4 0x0192 # LATIN SMALL LETTER F WITH HOOK
0xC5 0x017F # LATIN SMALL LETTER LONG S
0xC6 0x1E60 # LATIN CAPITAL LETTER S WITH DOT ABOVE
0xC7 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC8 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xCC 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xCD 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xCE 0x0152 # LATIN CAPITAL LIGATURE OE
0xCF 0x0153 # LATIN SMALL LIGATURE OE
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x1E61 # LATIN SMALL LETTER S WITH DOT ABOVE
0xD7 0x1E9B # LATIN SMALL LETTER LONG S WITH DOT ABOVE
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x1E6A # LATIN CAPITAL LETTER T WITH DOT ABOVE
0xDB 0x20AC # EURO SIGN # before Mac OS 8.5 this was U+00A4 CURRENCY SIGN
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0x0176 # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
0xDF 0x0177 # LATIN SMALL LETTER Y WITH CIRCUMFLEX
0xE0 0x1E6B # LATIN SMALL LETTER T WITH DOT ABOVE
0xE1 0x00B7 # MIDDLE DOT
0xE2 0x1EF2 # LATIN CAPITAL LETTER Y WITH GRAVE
0xE3 0x1EF3 # LATIN SMALL LETTER Y WITH GRAVE
0xE4 0x204A # TIRONIAN SIGN ET # change from MacCeltic for Unicode 3.0; before Aug. 1998 this was U+2030 PER MILLE SIGN
0xE5 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xE6 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xE7 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xE8 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xE9 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xEA 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xEB 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xEC 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xED 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xEE 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xEF 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xF0 0x2663 # BLACK CLUB SUIT = shamrock # future mapping U+2618 SHAMROCK
0xF1 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xF2 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xF3 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xF4 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xF5 0x0131 # LATIN SMALL LETTER DOTLESS I
0xF6 0x00DD # LATIN CAPITAL LETTER Y WITH ACUTE
0xF7 0x00FD # LATIN SMALL LETTER Y WITH ACUTE
0xF8 0x0174 # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
0xF9 0x0175 # LATIN SMALL LETTER W WITH CIRCUMFLEX
0xFA 0x1E84 # LATIN CAPITAL LETTER W WITH DIAERESIS
0xFB 0x1E85 # LATIN SMALL LETTER W WITH DIAERESIS
0xFC 0x1E80 # LATIN CAPITAL LETTER W WITH GRAVE
0xFD 0x1E81 # LATIN SMALL LETTER W WITH GRAVE
0xFE 0x1E82 # LATIN CAPITAL LETTER W WITH ACUTE
0xFF 0x1E83 # LATIN SMALL LETTER W WITH ACUTE

383
unicode/vendors/apple/GUJARATI.TXT vendored Normal file
View File

@@ -0,0 +1,383 @@
#=======================================================================
# File name: GUJARATI.TXT
#
# Contents: Map (external version) from Mac OS Gujarati
# encoding to Unicode 2.1 and later.
#
# Copyright: (c) 1997-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs. Matches internal utom<b1>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n02 1998-Feb-05 First version; matches internal utom<n4>,
# ufrm<n5>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Gujarati code or code sequence
# (in hex as 0xNN or 0xNN+0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN).
# Column #3 is a comment containing the Unicode name or sequence
# of names. In some cases an additional comment follows the
# Unicode name(s).
#
# The entries are in two sections. The first section is for pairs of
# Mac OS Gujarati code points that must be mapped in a special way.
# The second section maps individual code points.
#
# Within each section, the entries are in Mac OS Gujarati code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Gujarati character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Gujarati:
# -------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Gujarati is based on IS 13194:1991 (ISCII-91), with the
# addition of several punctuation and symbol characters. However,
# Mac OS Gujarati does not support the ATR (attribute) mechanism of
# ISCII-91.
#
# 1. ISCII-91 features in Mac OS Gujarati include:
#
# a) Overloading of nukta
#
# In addition to using the nukta (0xE9) like a combining dot below,
# nukta is overloaded to function as a general character modifier.
# In this role, certain code points followed by 0xE9 are treated as
# a two-byte code point representing a character which may be
# rather different than the characters represented by either of
# the code points alone. For example, the character GUJARATI OM
# (U+0AD0) is represented in ISCII-91 as candrabindu + nukta.
#
# b) Explicit halant and soft halant
#
# A double halant (0xE8 + 0xE8) constitutes an "explicit halant",
# which will always appear as a halant instead of causing formation
# of a ligature or half-form consonant.
#
# Halant followed by nukta (0xE8 + 0xE9) constitutes a "soft
# halant", which prevents formation of a ligature and instead
# retains the half-form of the first consonant.
#
# c) Invisible consonant
#
# The byte 0xD9 (called INV in ISCII-91) is an invisible consonant:
# It behaves like a consonant but has no visible appearance. It is
# intended to be used (often in combination with halant) to display
# dependent forms in isolation, such as the RA forms or consonant
# half-forms.
#
# d) Extensions for Vedic, etc.
#
# The byte 0xF0 (called EXT in ISCII-91) followed by any byte in
# the range 0xA1-0xEE constitutes a two-byte code point which can
# be used to represent additional characters for Vedic (or other
# extensions); 0xF0 followed by any other byte value constitutes
# malformed text. Mac OS Gujarati supports this mechanism, but
# does not currently map any of these two-byte code points to
# anything.
#
# 2. Mac OS Gujarati additions
#
# Mac OS Gujarati adds characters using the code points
# 0x80-0x8A and 0x90.
#
# 3. Unused code points
#
# The following code points are currently unused, and are not shown
# here: 0x8B-0x8F, 0x91-0xA0, 0xAB, 0xAF, 0xC7, 0xCE, 0xD0, 0xD3,
# 0xE0, 0xE4, 0xEB-0xEF, 0xFB-0xFF. In addition, 0xF0 is not shown
# here, but it has a special function as described above.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Mapping the byte pairs
#
# If one of the following byte values is encountered when mapping
# Mac OS Gujarati text - xA1, xAA, xDF, or 0xE8 - then the next
# byte (if there is one) should be examined. If the next byte is
# 0xE9 - or also 0xE8, if the first byte was 0xE8 - then the byte
# pair should be mapped using the first section of the mapping
# table below. Otherwise, each byte should be mapped using the
# second section of the mapping table below.
#
# - The Unicode Standard, Version 2.0, specifies how explicit
# halant and soft halant should be represented in Unicode;
# these mappings are used below.
#
# If the byte value 0xF0 is encountered when mapping Mac OS
# Gujarati text, then the next byte should be examined. If there
# is no next byte (e.g. 0xF0 at end of buffer), the mapping
# process should indicate incomplete character. If there is a next
# byte but it is not in the range 0xA1-0xEE, the mapping process
# should indicate malformed text. Otherwise, the mapping process
# should treat the byte pair as a valid two-byte code point with no
# mapping (e.g. map it to QUESTION MARK, REPLACEMENT CHARACTER,
# etc.).
#
# 2. Mapping the invisible consonant
#
# It has been suggested that INV in ISCII-91 should map to ZERO
# WIDTH NON-JOINER in Unicode. However, this causes problems with
# roundtrip fidelity: The ISCII-91 sequences 0xE8+0xE8 and 0xE8+0xD9
# would map to the same sequence of Unicode characters. We have
# instead mapped INV to LEFT-TO-RIGHT MARK, which avoids these
# problems.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
# Section 1: Map the following byte pairs as indicated:
# (ZWNJ means ZERO WIDTH NON-JOINER, ZWJ means ZERO WIDTH JOINER)
# (Also see note about 0xF0 in comments above)
0xA1+0xE9 0x0AD0 # GUJARATI OM
0xAA+0xE9 0x0AE0 # GUJARATI LETTER VOCALIC RR
0xDF+0xE9 0x0AC4 # GUJARATI VOWEL SIGN VOCALIC RR
0xE8+0xE8 0x0ACD+0x200C # GUJARATI SIGN VIRAMA + ZWNJ # explicit halant
0xE8+0xE9 0x0ACD+0x200D # GUJARATI SIGN VIRAMA + ZWJ # soft halant
# Section 2: Map the remaining bytes as follows:
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00D7 # MULTIPLICATION SIGN
0x81 0x2212 # MINUS SIGN
0x82 0x2013 # EN DASH
0x83 0x2014 # EM DASH
0x84 0x2018 # LEFT SINGLE QUOTATION MARK
0x85 0x2019 # RIGHT SINGLE QUOTATION MARK
0x86 0x2026 # HORIZONTAL ELLIPSIS
0x87 0x2022 # BULLET
0x88 0x00A9 # COPYRIGHT SIGN
0x89 0x00AE # REGISTERED SIGN
0x8A 0x2122 # TRADE MARK SIGN
#
0x90 0x0965 # DEVANAGARI DOUBLE DANDA
#
0xA1 0x0A81 # GUJARATI SIGN CANDRABINDU
0xA2 0x0A82 # GUJARATI SIGN ANUSVARA
0xA3 0x0A83 # GUJARATI SIGN VISARGA
0xA4 0x0A85 # GUJARATI LETTER A
0xA5 0x0A86 # GUJARATI LETTER AA
0xA6 0x0A87 # GUJARATI LETTER I
0xA7 0x0A88 # GUJARATI LETTER II
0xA8 0x0A89 # GUJARATI LETTER U
0xA9 0x0A8A # GUJARATI LETTER UU
0xAA 0x0A8B # GUJARATI LETTER VOCALIC R
#
0xAC 0x0A8F # GUJARATI LETTER E
0xAD 0x0A90 # GUJARATI LETTER AI
0xAE 0x0A8D # GUJARATI VOWEL CANDRA E
#
0xB0 0x0A93 # GUJARATI LETTER O
0xB1 0x0A94 # GUJARATI LETTER AU
0xB2 0x0A91 # GUJARATI VOWEL CANDRA O
0xB3 0x0A95 # GUJARATI LETTER KA
0xB4 0x0A96 # GUJARATI LETTER KHA
0xB5 0x0A97 # GUJARATI LETTER GA
0xB6 0x0A98 # GUJARATI LETTER GHA
0xB7 0x0A99 # GUJARATI LETTER NGA
0xB8 0x0A9A # GUJARATI LETTER CA
0xB9 0x0A9B # GUJARATI LETTER CHA
0xBA 0x0A9C # GUJARATI LETTER JA
0xBB 0x0A9D # GUJARATI LETTER JHA
0xBC 0x0A9E # GUJARATI LETTER NYA
0xBD 0x0A9F # GUJARATI LETTER TTA
0xBE 0x0AA0 # GUJARATI LETTER TTHA
0xBF 0x0AA1 # GUJARATI LETTER DDA
0xC0 0x0AA2 # GUJARATI LETTER DDHA
0xC1 0x0AA3 # GUJARATI LETTER NNA
0xC2 0x0AA4 # GUJARATI LETTER TA
0xC3 0x0AA5 # GUJARATI LETTER THA
0xC4 0x0AA6 # GUJARATI LETTER DA
0xC5 0x0AA7 # GUJARATI LETTER DHA
0xC6 0x0AA8 # GUJARATI LETTER NA
#
0xC8 0x0AAA # GUJARATI LETTER PA
0xC9 0x0AAB # GUJARATI LETTER PHA
0xCA 0x0AAC # GUJARATI LETTER BA
0xCB 0x0AAD # GUJARATI LETTER BHA
0xCC 0x0AAE # GUJARATI LETTER MA
0xCD 0x0AAF # GUJARATI LETTER YA
#
0xCF 0x0AB0 # GUJARATI LETTER RA
#
0xD1 0x0AB2 # GUJARATI LETTER LA
0xD2 0x0AB3 # GUJARATI LETTER LLA
#
0xD4 0x0AB5 # GUJARATI LETTER VA
0xD5 0x0AB6 # GUJARATI LETTER SHA
0xD6 0x0AB7 # GUJARATI LETTER SSA
0xD7 0x0AB8 # GUJARATI LETTER SA
0xD8 0x0AB9 # GUJARATI LETTER HA
0xD9 0x200E # LEFT-TO-RIGHT MARK # invisible consonant
0xDA 0x0ABE # GUJARATI VOWEL SIGN AA
0xDB 0x0ABF # GUJARATI VOWEL SIGN I
0xDC 0x0AC0 # GUJARATI VOWEL SIGN II
0xDD 0x0AC1 # GUJARATI VOWEL SIGN U
0xDE 0x0AC2 # GUJARATI VOWEL SIGN UU
0xDF 0x0AC3 # GUJARATI VOWEL SIGN VOCALIC R
#
0xE1 0x0AC7 # GUJARATI VOWEL SIGN E
0xE2 0x0AC8 # GUJARATI VOWEL SIGN AI
0xE3 0x0AC5 # GUJARATI VOWEL SIGN CANDRA E
#
0xE5 0x0ACB # GUJARATI VOWEL SIGN O
0xE6 0x0ACC # GUJARATI VOWEL SIGN AU
0xE7 0x0AC9 # GUJARATI VOWEL SIGN CANDRA O
0xE8 0x0ACD # GUJARATI SIGN VIRAMA # halant
0xE9 0x0ABC # GUJARATI SIGN NUKTA
0xEA 0x0964 # DEVANAGARI DANDA
#
0xF1 0x0AE6 # GUJARATI DIGIT ZERO
0xF2 0x0AE7 # GUJARATI DIGIT ONE
0xF3 0x0AE8 # GUJARATI DIGIT TWO
0xF4 0x0AE9 # GUJARATI DIGIT THREE
0xF5 0x0AEA # GUJARATI DIGIT FOUR
0xF6 0x0AEB # GUJARATI DIGIT FIVE
0xF7 0x0AEC # GUJARATI DIGIT SIX
0xF8 0x0AED # GUJARATI DIGIT SEVEN
0xF9 0x0AEE # GUJARATI DIGIT EIGHT
0xFA 0x0AEF # GUJARATI DIGIT NINE

441
unicode/vendors/apple/GURMUKHI.TXT vendored Normal file
View File

@@ -0,0 +1,441 @@
#=======================================================================
# File name: GURMUKHI.TXT
#
# Contents: Map (external version) from Mac OS Gurmukhi
# encoding to Unicode 2.1 and later.
#
# Copyright: (c) 1997-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Change mappings for 0x91, 0xD5 based on
# new decomposition rules. Update URLs,
# notes. Matches internal utom<b2>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n02 1998-Feb-05 First version; matches internal utom<n5>,
# ufrm<n6>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Gurmukhi code or code sequence
# (in hex as 0xNN or 0xNN+0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN).
# Column #3 is a comment containing the Unicode name or sequence
# of names. In some cases an additional comment follows the
# Unicode name(s).
#
# The entries are in two sections. The first section is for pairs of
# Mac OS Gurmukhi code points that must be mapped in a special way.
# The second section maps individual code points.
#
# Within each section, the entries are in Mac OS Gurmukhi code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Gurmukhi character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Gurmukhi:
# -------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Gurmukhi is based on IS 13194:1991 (ISCII-91), with the
# addition of several punctuation and symbol characters. However,
# Mac OS Gurmukhi does not support the ATR (attribute) mechanism of
# ISCII-91.
#
# 1. ISCII-91 features in Mac OS Gurmukhi include:
#
# a) Explicit halant and soft halant
#
# A double halant (0xE8 + 0xE8) constitutes an "explicit halant",
# which will always appear as a halant instead of causing formation
# of a ligature or half-form consonant.
#
# Halant followed by nukta (0xE8 + 0xE9) constitutes a "soft
# halant", which prevents formation of a ligature and instead
# retains the half-form of the first consonant.
#
# b) Invisible consonant
#
# The byte 0xD9 (called INV in ISCII-91) is an invisible consonant:
# It behaves like a consonant but has no visible appearance. It is
# intended to be used (often in combination with halant) to display
# dependent forms in isolation, such as the RA forms or consonant
# half-forms.
#
# c) Extensions for Vedic, etc.
#
# The byte 0xF0 (called EXT in ISCII-91) followed by any byte in
# the range 0xA1-0xEE constitutes a two-byte code point which can
# be used to represent additional characters for Vedic (or other
# extensions); 0xF0 followed by any other byte value constitutes
# malformed text. Mac OS Gurmukhi supports this mechanism, but
# does not currently map any of these two-byte code points to
# anything.
#
# 2. Mac OS Gurmukhi additions
#
# Mac OS Gurmukhi adds characters using the code points
# 0x80-0x8A and 0x90-0x94 (the latter are some Gurmukhi additions).
#
# 3. Unused code points
#
# The following code points are currently unused, and are not shown
# here: 0x8B-0x8F, 0x95-0xA1, 0xA3, 0xAA-0xAB, 0xAE-0xAF, 0xB2,
# 0xC7, 0xCE, 0xD0, 0xD2-0xD3, 0xD6, 0xDF-0xE0, 0xE3-0xE4, 0xE7,
# 0xEB-0xEF, 0xFB-0xFF. In addition, 0xF0 is not shown here, but it
# has a special function as described above.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Mapping the byte pairs
#
# If the byte value 0xE8 is encountered when mapping Mac OS
# Gurmukhi text, then the next byte (if there is one) should be
# examined. If the next byte is 0xE8 or 0xE9, then the byte pair
# should be mapped using the first section of the mapping table
# below. Otherwise, each byte should be mapped using the second
# section of the mapping table below.
#
# - The Unicode Standard, Version 2.0, specifies how explicit
# halant and soft halant should be represented in Unicode;
# these mappings are used below.
#
# If the byte value 0xF0 is encountered when mapping Mac OS
# Gurmukhi text, then the next byte should be examined. If there
# is no next byte (e.g. 0xF0 at end of buffer), the mapping
# process should indicate incomplete character. If there is a next
# byte but it is not in the range 0xA1-0xEE, the mapping process
# should indicate malformed text. Otherwise, the mapping process
# should treat the byte pair as a valid two-byte code point with no
# mapping (e.g. map it to QUESTION MARK, REPLACEMENT CHARACTER,
# etc.).
#
# 2. Mapping the invisible consonant
#
# It has been suggested that INV in ISCII-91 should map to ZERO
# WIDTH NON-JOINER in Unicode. However, this causes problems with
# roundtrip fidelity: The ISCII-91 sequences 0xE8+0xE8 and 0xE8+0xD9
# would map to the same sequence of Unicode characters. We have
# instead mapped INV to LEFT-TO-RIGHT MARK, which avoids these
# problems.
#
# 3. Mappings using corporate characters
#
# Mapping the GURMUKHI LETTER SHA 0xD5 presents an interesting
# problem. At first glance, we could map it to the single Unicode
# character 0x0A36.
#
# However, our goal is that the mappings provided here should also
# be able to generate the mappings to maximally decomposed Unicode
# by simple recursive substitution of the canonical decompositions
# in the Unicode database. We want mapping tables derived this way
# to retain full roundtrip fidelity.
#
# Since the canonical decomposition of 0x0A36 is 0x0A38+0x0A3C,
# the decomposition mapping for 0xD5 would be identical with the
# decomposition mapping for 0xD7+0xE9, and roundtrip fidelity would
# be lost.
#
# We solve this problem by using a grouping hint (one of the set of
# transcoding hints defined by Apple).
#
# Apple has defined a block of 32 corporate characters as "transcoding
# hints." These are used in combination with standard Unicode characters
# to force them to be treated in a special way for mapping to other
# encodings; they have no other effect. Sixteen of these transcoding
# hints are "grouping hints" - they indicate that the next 2-4 Unicode
# characters should be treated as a single entity for transcoding. The
# other sixteen transcoding hints are "variant tags" - they are like
# combining characters, and can follow a standard Unicode (or a sequence
# consisting of a base character and other combining characters) to
# cause it to be treated in a special way for transcoding. These always
# terminate a combining-character sequence.
#
# The transcoding coding hint used in this mapping table is:
# 0xF860 group next 2 characters
#
# Then we can map 0x91 as follows:
# 0xD5 -> 0xF860+0x0A38+0x0A3C
#
# We could also have used a variant tag such as 0xF87F and mapped it
# this way:
# 0xD5 -> 0x0A36+0xF87F
#
# 4. Additional loose mappings from Unicode
#
# These are not preserved in roundtrip mappings.
#
# 0A59 -> 0xB4+0xE9 # GURMUKHI LETTER KHHA
# 0A5A -> 0xB5+0xE9 # GURMUKHI LETTER GHHA
# 0A5B -> 0xBA+0xE9 # GURMUKHI LETTER ZA
# 0A5E -> 0xC9+0xE9 # GURMUKHI LETTER FA
#
# 0A70 -> 0xA2 # GURMUKHI TIPPI
#
# Loose mappings from Unicode should also map U+0A71 (GURMUKHI ADDAK)
# followed by any Gurmukhi consonant to the equivalent ISCII-91
# consonant plus halant plus the consonant again. For example:
#
# 0A71+0A15 -> 0xB3+0xE8+0xB3
# 0A71+0A16 -> 0xB4+0xE8+0xB4
# ...
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - Change mapping of 0x91 from 0xF860+0x0A21+0x0A3C to 0x0A5C GURMUKHI
# LETTER RRA, now that the canonical decomposition of 0x0A5C to
# 0x0A21+0x0A3C has been deleted
#
# - Change mapping of 0xD5 from 0x0A36 GURMUKHI LETTER SHA to
# 0xF860+0x0A38+0x0A3C, now that a canonical decomposition of 0x0A36
# to 0x0A38+0x0A3C has been added.
#
##################
# Section 1: Map the following byte pairs as indicated:
# (ZWNJ means ZERO WIDTH NON-JOINER, ZWJ means ZERO WIDTH JOINER)
# (Also see note about 0xF0 in comments above)
0xE8+0xE8 0x0A4D+0x200C # GURMUKHI SIGN VIRAMA + ZWNJ # explicit halant
0xE8+0xE9 0x0A4D+0x200D # GURMUKHI SIGN VIRAMA + ZWJ # soft halant
# Section 2: Map the remaining bytes as follows:
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00D7 # MULTIPLICATION SIGN
0x81 0x2212 # MINUS SIGN
0x82 0x2013 # EN DASH
0x83 0x2014 # EM DASH
0x84 0x2018 # LEFT SINGLE QUOTATION MARK
0x85 0x2019 # RIGHT SINGLE QUOTATION MARK
0x86 0x2026 # HORIZONTAL ELLIPSIS
0x87 0x2022 # BULLET
0x88 0x00A9 # COPYRIGHT SIGN
0x89 0x00AE # REGISTERED SIGN
0x8A 0x2122 # TRADE MARK SIGN
#
0x90 0x0A71 # GURMUKHI ADDAK
0x91 0x0A5C # GURMUKHI LETTER RRA
0x92 0x0A73 # GURMUKHI URA
0x93 0x0A72 # GURMUKHI IRI
0x94 0x0A74 # GURMUKHI EK ONKAR
#
0xA2 0x0A02 # GURMUKHI SIGN BINDI
#
0xA4 0x0A05 # GURMUKHI LETTER A
0xA5 0x0A06 # GURMUKHI LETTER AA
0xA6 0x0A07 # GURMUKHI LETTER I
0xA7 0x0A08 # GURMUKHI LETTER II
0xA8 0x0A09 # GURMUKHI LETTER U
0xA9 0x0A0A # GURMUKHI LETTER UU
#
0xAC 0x0A0F # GURMUKHI LETTER EE
0xAD 0x0A10 # GURMUKHI LETTER AI
#
0xB0 0x0A13 # GURMUKHI LETTER OO
0xB1 0x0A14 # GURMUKHI LETTER AU
#
0xB3 0x0A15 # GURMUKHI LETTER KA
0xB4 0x0A16 # GURMUKHI LETTER KHA
0xB5 0x0A17 # GURMUKHI LETTER GA
0xB6 0x0A18 # GURMUKHI LETTER GHA
0xB7 0x0A19 # GURMUKHI LETTER NGA
0xB8 0x0A1A # GURMUKHI LETTER CA
0xB9 0x0A1B # GURMUKHI LETTER CHA
0xBA 0x0A1C # GURMUKHI LETTER JA
0xBB 0x0A1D # GURMUKHI LETTER JHA
0xBC 0x0A1E # GURMUKHI LETTER NYA
0xBD 0x0A1F # GURMUKHI LETTER TTA
0xBE 0x0A20 # GURMUKHI LETTER TTHA
0xBF 0x0A21 # GURMUKHI LETTER DDA
0xC0 0x0A22 # GURMUKHI LETTER DDHA
0xC1 0x0A23 # GURMUKHI LETTER NNA
0xC2 0x0A24 # GURMUKHI LETTER TA
0xC3 0x0A25 # GURMUKHI LETTER THA
0xC4 0x0A26 # GURMUKHI LETTER DA
0xC5 0x0A27 # GURMUKHI LETTER DHA
0xC6 0x0A28 # GURMUKHI LETTER NA
#
0xC8 0x0A2A # GURMUKHI LETTER PA
0xC9 0x0A2B # GURMUKHI LETTER PHA
0xCA 0x0A2C # GURMUKHI LETTER BA
0xCB 0x0A2D # GURMUKHI LETTER BHA
0xCC 0x0A2E # GURMUKHI LETTER MA
0xCD 0x0A2F # GURMUKHI LETTER YA
#
0xCF 0x0A30 # GURMUKHI LETTER RA
#
0xD1 0x0A32 # GURMUKHI LETTER LA
#
0xD4 0x0A35 # GURMUKHI LETTER VA
0xD5 0xF860+0x0A38+0x0A3C # GURMUKHI LETTER SHA
#
0xD7 0x0A38 # GURMUKHI LETTER SA
0xD8 0x0A39 # GURMUKHI LETTER HA
0xD9 0x200E # LEFT-TO-RIGHT MARK # invisible consonant
0xDA 0x0A3E # GURMUKHI VOWEL SIGN AA
0xDB 0x0A3F # GURMUKHI VOWEL SIGN I
0xDC 0x0A40 # GURMUKHI VOWEL SIGN II
0xDD 0x0A41 # GURMUKHI VOWEL SIGN U
0xDE 0x0A42 # GURMUKHI VOWEL SIGN UU
#
0xE1 0x0A47 # GURMUKHI VOWEL SIGN EE
0xE2 0x0A48 # GURMUKHI VOWEL SIGN AI
#
0xE5 0x0A4B # GURMUKHI VOWEL SIGN OO
0xE6 0x0A4C # GURMUKHI VOWEL SIGN AU
#
0xE8 0x0A4D # GURMUKHI SIGN VIRAMA # halant
0xE9 0x0A3C # GURMUKHI SIGN NUKTA
0xEA 0x0964 # DEVANAGARI DANDA
#
0xF1 0x0A66 # GURMUKHI DIGIT ZERO
0xF2 0x0A67 # GURMUKHI DIGIT ONE
0xF3 0x0A68 # GURMUKHI DIGIT TWO
0xF4 0x0A69 # GURMUKHI DIGIT THREE
0xF5 0x0A6A # GURMUKHI DIGIT FOUR
0xF6 0x0A6B # GURMUKHI DIGIT FIVE
0xF7 0x0A6C # GURMUKHI DIGIT SIX
0xF8 0x0A6D # GURMUKHI DIGIT SEVEN
0xF9 0x0A6E # GURMUKHI DIGIT EIGHT
0xFA 0x0A6F # GURMUKHI DIGIT NINE

322
unicode/vendors/apple/INUIT.TXT vendored Normal file
View File

@@ -0,0 +1,322 @@
#=======================================================================
# File name: INUIT.TXT
#
# Contents: Map (external version) from Mac OS Inuit
# character set to Unicode 3.0 and later
#
# Contacts: charsets@apple.com, everson@evertype.com
#
# Changes:
#
# c01 2005-Apr-01 First posted version. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Inuit code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in Mac OS Inuit code order.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Inuit character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Inuit (partly from Michael Everson):
# ----------------------------------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# This character set was developed by Michael Everson of Everson
# Typography (everson@evertype.com) and was used for the Inuktitut
# localizations of Mac OS, as well as for the Inuktitut utilities
# package from Everson Typography. Note that while Apple authorized
# the Inuktitut localization mentioned above, it was not shipped with
# Apple hardware, and was not otherwise supported by Apple. Fonts
# conforming to the Mac OS Inuit character set are available from
# Everson Typography (http://www.evertype.com/software/apple/).
# Information about the use of this character set is available at
# http://www.evertype.com/standards/iu/.
#
# The Mac OS Inuit character set shares the script code smEthiopic
# (28) with the Ethiopic encoding. To determine if the Inuktitut
# encoding is being used, you must also check if the system region
# code is 78, verNunavut.
#
# The Mac OS Inuit character set includes the full syllabic letter
# repertoire required for Inuktitut; it is a subset of the Unified
# Canadian Aboriginal Syllabics set encoded in Unicode. The encoding
# is InuitSCII, designed by Doug Hitch for the Government of the
# Northwest Territories.
#
# The Mac OS Inuit character set also includes a number of characters
# that were needed for the classic Mac OS user interface and
# localization (e.g. ellipsis, bullet, copyright sign). All of the
# characters in Mac OS Inuit that are also in the Mac OS Roman
# encoding are at the same code point in both; this improves
# application compatibility.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x1403 # CANADIAN SYLLABICS I
0x81 0x1404 # CANADIAN SYLLABICS II
0x82 0x1405 # CANADIAN SYLLABICS O
0x83 0x1406 # CANADIAN SYLLABICS OO
0x84 0x140A # CANADIAN SYLLABICS A
0x85 0x140B # CANADIAN SYLLABICS AA
0x86 0x1431 # CANADIAN SYLLABICS PI
0x87 0x1432 # CANADIAN SYLLABICS PII
0x88 0x1433 # CANADIAN SYLLABICS PO
0x89 0x1434 # CANADIAN SYLLABICS POO
0x8A 0x1438 # CANADIAN SYLLABICS PA
0x8B 0x1439 # CANADIAN SYLLABICS PAA
0x8C 0x1449 # CANADIAN SYLLABICS P
0x8D 0x144E # CANADIAN SYLLABICS TI
0x8E 0x144F # CANADIAN SYLLABICS TII
0x8F 0x1450 # CANADIAN SYLLABICS TO
0x90 0x1451 # CANADIAN SYLLABICS TOO
0x91 0x1455 # CANADIAN SYLLABICS TA
0x92 0x1456 # CANADIAN SYLLABICS TAA
0x93 0x1466 # CANADIAN SYLLABICS T
0x94 0x146D # CANADIAN SYLLABICS KI
0x95 0x146E # CANADIAN SYLLABICS KII
0x96 0x146F # CANADIAN SYLLABICS KO
0x97 0x1470 # CANADIAN SYLLABICS KOO
0x98 0x1472 # CANADIAN SYLLABICS KA
0x99 0x1473 # CANADIAN SYLLABICS KAA
0x9A 0x1483 # CANADIAN SYLLABICS K
0x9B 0x148B # CANADIAN SYLLABICS CI
0x9C 0x148C # CANADIAN SYLLABICS CII
0x9D 0x148D # CANADIAN SYLLABICS CO
0x9E 0x148E # CANADIAN SYLLABICS COO
0x9F 0x1490 # CANADIAN SYLLABICS CA
0xA0 0x1491 # CANADIAN SYLLABICS CAA
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x14A1 # CANADIAN SYLLABICS C
0xA3 0x14A5 # CANADIAN SYLLABICS MI
0xA4 0x14A6 # CANADIAN SYLLABICS MII
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x14A7 # CANADIAN SYLLABICS MO
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x14A8 # CANADIAN SYLLABICS MOO
0xAC 0x14AA # CANADIAN SYLLABICS MA
0xAD 0x14AB # CANADIAN SYLLABICS MAA
0xAE 0x14BB # CANADIAN SYLLABICS M
0xAF 0x14C2 # CANADIAN SYLLABICS NI
0xB0 0x14C3 # CANADIAN SYLLABICS NII
0xB1 0x14C4 # CANADIAN SYLLABICS NO
0xB2 0x14C5 # CANADIAN SYLLABICS NOO
0xB3 0x14C7 # CANADIAN SYLLABICS NA
0xB4 0x14C8 # CANADIAN SYLLABICS NAA
0xB5 0x14D0 # CANADIAN SYLLABICS N
0xB6 0x14EF # CANADIAN SYLLABICS SI
0xB7 0x14F0 # CANADIAN SYLLABICS SII
0xB8 0x14F1 # CANADIAN SYLLABICS SO
0xB9 0x14F2 # CANADIAN SYLLABICS SOO
0xBA 0x14F4 # CANADIAN SYLLABICS SA
0xBB 0x14F5 # CANADIAN SYLLABICS SAA
0xBC 0x1505 # CANADIAN SYLLABICS S
0xBD 0x14D5 # CANADIAN SYLLABICS LI
0xBE 0x14D6 # CANADIAN SYLLABICS LII
0xBF 0x14D7 # CANADIAN SYLLABICS LO
0xC0 0x14D8 # CANADIAN SYLLABICS LOO
0xC1 0x14DA # CANADIAN SYLLABICS LA
0xC2 0x14DB # CANADIAN SYLLABICS LAA
0xC3 0x14EA # CANADIAN SYLLABICS L
0xC4 0x1528 # CANADIAN SYLLABICS YI
0xC5 0x1529 # CANADIAN SYLLABICS YII
0xC6 0x152A # CANADIAN SYLLABICS YO
0xC7 0x152B # CANADIAN SYLLABICS YOO
0xC8 0x152D # CANADIAN SYLLABICS YA
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x152E # CANADIAN SYLLABICS YAA
0xCC 0x153E # CANADIAN SYLLABICS Y
0xCD 0x1555 # CANADIAN SYLLABICS FI
0xCE 0x1556 # CANADIAN SYLLABICS FII
0xCF 0x1557 # CANADIAN SYLLABICS FO
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x1558 # CANADIAN SYLLABICS FOO
0xD7 0x1559 # CANADIAN SYLLABICS FA
0xD8 0x155A # CANADIAN SYLLABICS FAA
0xD9 0x155D # CANADIAN SYLLABICS F
0xDA 0x1546 # CANADIAN SYLLABICS RI
0xDB 0x1547 # CANADIAN SYLLABICS RII
0xDC 0x1548 # CANADIAN SYLLABICS RO
0xDD 0x1549 # CANADIAN SYLLABICS ROO
0xDE 0x154B # CANADIAN SYLLABICS RA
0xDF 0x154C # CANADIAN SYLLABICS RAA
0xE0 0x1550 # CANADIAN SYLLABICS R
0xE1 0x157F # CANADIAN SYLLABICS QI
0xE2 0x1580 # CANADIAN SYLLABICS QII
0xE3 0x1581 # CANADIAN SYLLABICS QO
0xE4 0x1582 # CANADIAN SYLLABICS QOO
0xE5 0x1583 # CANADIAN SYLLABICS QA
0xE6 0x1584 # CANADIAN SYLLABICS QAA
0xE7 0x1585 # CANADIAN SYLLABICS Q
0xE8 0x158F # CANADIAN SYLLABICS NGI
0xE9 0x1590 # CANADIAN SYLLABICS NGII
0xEA 0x1591 # CANADIAN SYLLABICS NGO
0xEB 0x1592 # CANADIAN SYLLABICS NGOO
0xEC 0x1593 # CANADIAN SYLLABICS NGA
0xED 0x1594 # CANADIAN SYLLABICS NGAA
0xEE 0x1595 # CANADIAN SYLLABICS NG
0xEF 0x1671 # CANADIAN SYLLABICS NNGI
0xF0 0x1672 # CANADIAN SYLLABICS NNGII
0xF1 0x1673 # CANADIAN SYLLABICS NNGO
0xF2 0x1674 # CANADIAN SYLLABICS NNGOO
0xF3 0x1675 # CANADIAN SYLLABICS NNGA
0xF4 0x1676 # CANADIAN SYLLABICS NNGAA
0xF5 0x1596 # CANADIAN SYLLABICS NNG
0xF6 0x15A0 # CANADIAN SYLLABICS LHI
0xF7 0x15A1 # CANADIAN SYLLABICS LHII
0xF8 0x15A2 # CANADIAN SYLLABICS LHO
0xF9 0x15A3 # CANADIAN SYLLABICS LHOO
0xFA 0x15A4 # CANADIAN SYLLABICS LHA
0xFB 0x15A5 # CANADIAN SYLLABICS LHAA
0xFC 0x15A6 # CANADIAN SYLLABICS LH
0xFD 0x157C # CANADIAN SYLLABICS NUNAVUT H
0xFE 0x0141 # LATIN CAPITAL LETTER L WITH STROKE
0xFF 0x0142 # LATIN SMALL LETTER L WITH STROKE

7728
unicode/vendors/apple/JAPANESE.TXT vendored Normal file

File diff suppressed because it is too large Load Diff

234
unicode/vendors/apple/KEYBOARD.TXT vendored Normal file
View File

@@ -0,0 +1,234 @@
#=======================================================================
# File name: KEYBOARD.TXT
#
# Contents: Map (external version) from Mac OS Keyboard
# character set to Unicode 4.0 and later.
#
# Copyright: (c) 2001-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Change mappings for 0x09, 0x0F, 0x8C; add
# Mac OS X-only mappings for 0x8D-9x8F.
# Update header comments, including
# clarification of Mac OS X usage. Matches
# internal xml <c1.2> and Text Encoding
# Converter 2.0.
# b1,c1 2002-Dec-19 First version. Matches internal utom<b6>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Keyboard code (in hex as 0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN, etc.).
# Column #3 is a comment containing the Unicode name.
# In some cases an additional comment follows the Unicode name.
#
# The entries are in Mac OS Keyboard code order.
#
# Some of these mappings require the use of corporate characters.
# See the file "CORPCHAR.TXT" and notes below.
#
# The Mac OS Keyboard character set uses the ranges normally set aside
# for controls, so those ranges are present in this table.
#
# Notes on Mac OS Keyboard:
# -------------------------
#
# This is the encoding for the legacy font named ".Keyboard". Before
# Mac OS X, this font was used by the user-interface system to display
# glyphs for special keys on the keyboard. In Mac OS X, that font is
# not present and this mapping is not associated with a font; it is
# only used as a way to map from a set of Menu Manager constants to
# associated Unicode sequences. As such, new mappings added for Mac OS
# X only may be one-way mappings: From the Keyboard glyph "encoding"
# to Unicode, but not back.
#
# The Mac OS Keyboard encoding shares the script code smRoman
# (0) with the Mac OS Roman encoding. To determine if the Keyboard
# encoding is being used in Mac OS 8 or Mac OS 9, you must check if
# the font name is ".Keyboard".
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The goals in the mappings provided here are:
# - For mappings used in Mac OS 8 and Mac OS 9, ensure roundtrip
# mapping from every character in the Mac OS Keyboard character set
# to Unicode and back. This consideration does not apply to mappings
# added for Mac OS X only (noted below).
# - Use standard Unicode characters as much as possible, to
# maximize interchangeability of the resulting Unicode text.
# Whenever possible, avoid having content carried by private-use
# characters.
#
# Some of the characters in the Mac OS Keyboard character set do not
# correspond to distinct, single Unicode characters. To map these
# and satisfy both goals above, we employ various strategies.
#
# a) If possible, use private use characters in combination with
# standard Unicode characters to mark variants of the standard
# Unicode character.
#
# Apple has defined a block of 32 corporate characters as "transcoding
# hints." These are used in combination with standard Unicode
# characters to force them to be treated in a special way for mapping
# to other encodings; they have no other effect. Sixteen of these
# transcoding hints are "grouping hints" - they indicate that the next
# 2-4 Unicode characters should be treated as a single entity for
# transcoding. The other sixteen transcoding hints are "variant tags"
# - they are like combining characters, and can follow a standard
# Unicode (or a sequence consisting of a base character and other
# combining characters) to cause it to be treated in a special way for
# transcoding. These always terminate a combining-character sequence.
#
# The transcoding coding hints used in this mapping table are two
# grouping tags, 0xF860-61, and one variant tag, 0xF87F. Since these
# are combined with standard Unicode characters, some characters in
# the Mac OS Keyboard character set map to a sequence of two to four
# Unicodes instead of a single Unicode character.
#
# For example, the Mac OS Keyboard character at 0x6F, representing the
# F1 key, is mapped to Unicode using the grouping tag F860 (group next
# two) followed by U+0046 (LATIN CAPITAL LETTER F) and U+0031 (DIGIT
# ONE).
#
# b) Otherwise, use private use characters by themselves to map Mac OS
# Keyboard characters which have no relationship to any standard
# Unicode character.
#
# The following additional corporate zone Unicode characters are
# used for this purpose here:
#
# 0xF802 Lower left pencil
# 0xF803 Contextual menu key symbol
# 0xF8FF Apple logo
#
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version c01 to version c02:
#
# - Mapping for 0x09 changed from 0x0009 (wrong) to 0x2423
# - Mapping for 0x0F changed from 0x270E (wrong) to 0xF802
# - Mapping for 0x8C changed from 0xF804 to 0x23CF (Unicode 4.0)
# - Add Mac OS X-only mappings for 0x8D-0x8F
#
##################
0x00 0x0000 # control - NUL
#
0x02 0x21E5 # RIGHTWARDS ARROW TO BAR # Tab right (left-to-right text)
0x03 0x21E4 # LEFTWARDS ARROW TO BAR # Tab left (right-to-left text)
0x04 0x2324 # UP ARROWHEAD BETWEEN TWO HORIZONTAL BARS # Enter key
0x05 0x21E7 # UPWARDS WHITE ARROW # Shift key
0x06 0x2303 # UP ARROWHEAD # Control key
0x07 0x2325 # OPTION KEY # Option key
0x08 0x0008 # control - BS
0x09 0x2423 # OPEN BOX # Space key (Mac OS X mapping, duplicates mapping for 0x61, hence no round-trip)
0x0A 0x2326 # ERASE TO THE RIGHT # Delete right (right-to-left text)
0x0B 0x21A9 # LEFTWARDS ARROW WITH HOOK # Return key (left-to-right text)
0x0C 0x21AA # RIGHTWARDS ARROW WITH HOOK # Return key (right-to-left text)
0x0D 0x000D # control - CR
#
0x0F 0xF802 # lower left pencil
0x10 0x21E3 # DOWNWARDS DASHED ARROW
0x11 0x2318 # PLACE OF INTEREST SIGN # Command key
0x12 0x2713 # CHECK MARK
0x13 0x25C6 # BLACK DIAMOND
0x14 0xF8FF # Apple logo
#
0x17 0x232B # ERASE TO THE LEFT # Delete left (left-to-right text)
0x18 0x21E0 # LEFTWARDS DASHED ARROW
0x19 0x21E1 # UPWARDS DASHED ARROW
0x1A 0x21E2 # RIGHTWARDS DASHED ARROW
0x1B 0x238B # BROKEN CIRCLE WITH NORTHWEST ARROW # Escape key; for Unicode 3.0 and later
0x1C 0x2327 # X IN A RECTANGLE BOX # Clear key
#
0x20 0x0020 # SPACE
#
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
#
0x46 0x0046 # LATIN CAPITAL LETTER F
#
0x61 0x2423 # OPEN BOX # Blank key
0x62 0x21DE # UPWARDS ARROW WITH DOUBLE STROKE # Page up key
0x63 0x21EA # UPWARDS WHITE ARROW FROM BAR # Caps lock key
0x64 0x2190 # LEFTWARDS ARROW
0x65 0x2192 # RIGHTWARDS ARROW
0x66 0x2196 # NORTH WEST ARROW
0x67 0x003F+0x20DD # QUESTION MARK + COMBINING ENCLOSING CIRCLE # Help key
0x68 0x2191 # UPWARDS ARROW
0x69 0x2198 # SOUTH EAST ARROW
0x6A 0x2193 # DOWNWARDS ARROW
0x6B 0x21DF # DOWNWARDS ARROW WITH DOUBLE STROKE # Page down key
0x6C 0xF8FF+0xF87F # Apple logo, outline
0x6D 0xF803 # Contextual menu key symbol
0x6E 0x2758+0x20DD # LIGHT VERTICAL BAR + COMBINING ENCLOSING CIRCLE # Power key
0x6F 0xF860+0x0046+0x0031 # group_2 + F + 1 # F1 key
0x70 0xF860+0x0046+0x0032 # group_2 + F + 2 # F2 key
0x71 0xF860+0x0046+0x0033 # group_2 + F + 3 # F3 key
0x72 0xF860+0x0046+0x0034 # group_2 + F + 4 # F4 key
0x73 0xF860+0x0046+0x0035 # group_2 + F + 5 # F5 key
0x74 0xF860+0x0046+0x0036 # group_2 + F + 6 # F6 key
0x75 0xF860+0x0046+0x0037 # group_2 + F + 7 # F7 key
0x76 0xF860+0x0046+0x0038 # group_2 + F + 8 # F8 key
0x77 0xF860+0x0046+0x0039 # group_2 + F + 9 # F9 key
0x78 0xF861+0x0046+0x0031+0x0030 # group_3 + F + 1 + 0 # F10 key
0x79 0xF861+0x0046+0x0031+0x0031 # group_3 + F + 1 + 1 # F11 key
0x7A 0xF861+0x0046+0x0031+0x0032 # group_3 + F + 1 + 2 # F12 key
#
0x87 0xF861+0x0046+0x0031+0x0033 # group_3 + F + 1 + 3 # F13 key
0x88 0xF861+0x0046+0x0031+0x0034 # group_3 + F + 1 + 4 # F14 key
0x89 0xF861+0x0046+0x0031+0x0035 # group_3 + F + 1 + 5 # F15 key
0x8A 0x2388 # HELM SYMBOL # Control key (ISO standard), Unicode 3.0 and later
0x8B 0x2387 # ALTERNATIVE KEY SYMBOL # Unicode 3.0 and later
0x8C 0x23CF # EJECT SYMBOL # Unicode 4.0 and later, Mac OS X only
0x8D 0x82F1+0x6570 # Japanese "eisu" key symbol # Mac OS X only
0x8E 0x304B+0x306A # Japanese "kana" key symbol # Mac OS X only
0x8F 0xF861+0x0046+0x0031+0x0036 # group_3 + F + 1 + 6 # F16 key, Mac OS X only
#

9942
unicode/vendors/apple/KOREAN.TXT vendored Normal file

File diff suppressed because it is too large Load Diff

365
unicode/vendors/apple/ROMANIAN.TXT vendored Normal file
View File

@@ -0,0 +1,365 @@
#=======================================================================
# File name: ROMANIAN.TXT
#
# Contents: Map (external version) from Mac OS Romanian
# character set to Unicode 3.0 and later.
#
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.2> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update mappings for 0xAF, 0xBF, 0xDE, 0xDF
# to use new composed characters added in
# Unicode 3.0. Update URLs, notes. Matches
# internal utom<b3>.
# b02 1999-Sep-22 Encoding changed for Mac OS 8.5; change
# mapping of 0xDB from CURRENCY SIGN to EURO
# SIGN. Update contact e-mail address. Matches
# internal utom<b2>, ufrm<b2>, and Text
# Encoding Converter version 1.5.
# n05 1998-Feb-05 Minor update to header comments
# n03 1997-Dec-14 Update to match internal utom<n5>, ufrm<n16>:
# Change standard mapping for 0xBD from U+2126
# to its canonical decomposition, U+03A9.
# Change mapping of 0xAF,0xBF,0xDE,0xDF from
# composed S/T WITH CEDILLA to S/T with
# COMBINING COMMA BELOW (to match our
# decomposition mappings).
# n02 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n4>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Mac OS Romanian code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in Mac OS Romanian code order.
#
# One of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Romanian character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Romanian:
# -------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Romanian is used only for Romanian.
#
# The Mac OS Romanian encoding shares the script code smRoman
# (0) with the standard Mac OS Roman encoding. To determine if
# the Romanian encoding is being used, you must also check if the
# system region code is 39, verRomania.
#
# This character set is a variant of standard Mac OS Roman, adding
# upper and lower A breve, S comma below, and T comma below. It
# has 6 code point differences from standard Mac OS Roman.
#
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts are updated for Mac OS 8.5 to reflect this. There is
# a "currency sign" variant of the Mac OS Romanian encoding that
# still maps 0xDB to U+00A4; this can be used for older fonts.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The following corporate zone Unicode character is used in this
# mapping:
#
# 0xF8FF Apple logo
#
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - Update the mappings for 0xAF, 0xBF, 0xDE, 0xDF to use new
# composed Unicode characters 0x0218-0x021B added in Unicode 3.0;
# the previous mappings were to the equivalent decomposition
# sequences.
#
# Changes from version n05 to version b02:
#
# - Encoding changed for Mac OS 8.5; change mapping of 0xDB from
# CURRENCY SIGN (U+00A4) to EURO SIGN (U+20AC).
#
# Changes from version n02 to version n03:
#
# - Change mapping of 0xBD from U+2126 to its canonical
# decomposition, U+03A9.
# - Change mapping of 0xAF,0xBF,0xDE,0xDF from composed S or T
# WITH CEDILLA to S or T with COMBINING COMMA BELOW (to match
# our decomposition mappings).
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0x00C5 # LATIN CAPITAL LETTER A WITH RING ABOVE
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
0x85 0x00D6 # LATIN CAPITAL LETTER O WITH DIAERESIS
0x86 0x00DC # LATIN CAPITAL LETTER U WITH DIAERESIS
0x87 0x00E1 # LATIN SMALL LETTER A WITH ACUTE
0x88 0x00E0 # LATIN SMALL LETTER A WITH GRAVE
0x89 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x00E3 # LATIN SMALL LETTER A WITH TILDE
0x8C 0x00E5 # LATIN SMALL LETTER A WITH RING ABOVE
0x8D 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0x90 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0x91 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 0x00EC # LATIN SMALL LETTER I WITH GRAVE
0x94 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0x95 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0x96 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 0x00F2 # LATIN SMALL LETTER O WITH GRAVE
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x00F5 # LATIN SMALL LETTER O WITH TILDE
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x00A2 # CENT SIGN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x00DF # LATIN SMALL LETTER SHARP S
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x0102 # LATIN CAPITAL LETTER A WITH BREVE
0xAF 0x0218 # LATIN CAPITAL LETTER S WITH COMMA BELOW # for Unicode 3.0 and later
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2264 # LESS-THAN OR EQUAL TO
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x00A5 # YEN SIGN
0xB5 0x00B5 # MICRO SIGN
0xB6 0x2202 # PARTIAL DIFFERENTIAL
0xB7 0x2211 # N-ARY SUMMATION
0xB8 0x220F # N-ARY PRODUCT
0xB9 0x03C0 # GREEK SMALL LETTER PI
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x0103 # LATIN SMALL LETTER A WITH BREVE
0xBF 0x0219 # LATIN SMALL LETTER S WITH COMMA BELOW # for Unicode 3.0 and later
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK
0xC2 0x00AC # NOT SIGN
0xC3 0x221A # SQUARE ROOT
0xC4 0x0192 # LATIN SMALL LETTER F WITH HOOK
0xC5 0x2248 # ALMOST EQUAL TO
0xC6 0x2206 # INCREMENT
0xC7 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC8 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x00C0 # LATIN CAPITAL LETTER A WITH GRAVE
0xCC 0x00C3 # LATIN CAPITAL LETTER A WITH TILDE
0xCD 0x00D5 # LATIN CAPITAL LETTER O WITH TILDE
0xCE 0x0152 # LATIN CAPITAL LIGATURE OE
0xCF 0x0153 # LATIN SMALL LIGATURE OE
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x00F7 # DIVISION SIGN
0xD7 0x25CA # LOZENGE
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x2044 # FRACTION SLASH
0xDB 0x20AC # EURO SIGN
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0x021A # LATIN CAPITAL LETTER T WITH COMMA BELOW # for Unicode 3.0 and later
0xDF 0x021B # LATIN SMALL LETTER T WITH COMMA BELOW # for Unicode 3.0 and later
0xE0 0x2021 # DOUBLE DAGGER
0xE1 0x00B7 # MIDDLE DOT
0xE2 0x201A # SINGLE LOW-9 QUOTATION MARK
0xE3 0x201E # DOUBLE LOW-9 QUOTATION MARK
0xE4 0x2030 # PER MILLE SIGN
0xE5 0x00C2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
0xE6 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xE7 0x00C1 # LATIN CAPITAL LETTER A WITH ACUTE
0xE8 0x00CB # LATIN CAPITAL LETTER E WITH DIAERESIS
0xE9 0x00C8 # LATIN CAPITAL LETTER E WITH GRAVE
0xEA 0x00CD # LATIN CAPITAL LETTER I WITH ACUTE
0xEB 0x00CE # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
0xEC 0x00CF # LATIN CAPITAL LETTER I WITH DIAERESIS
0xED 0x00CC # LATIN CAPITAL LETTER I WITH GRAVE
0xEE 0x00D3 # LATIN CAPITAL LETTER O WITH ACUTE
0xEF 0x00D4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
0xF0 0xF8FF # Apple logo
0xF1 0x00D2 # LATIN CAPITAL LETTER O WITH GRAVE
0xF2 0x00DA # LATIN CAPITAL LETTER U WITH ACUTE
0xF3 0x00DB # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
0xF4 0x00D9 # LATIN CAPITAL LETTER U WITH GRAVE
0xF5 0x0131 # LATIN SMALL LETTER DOTLESS I
0xF6 0x02C6 # MODIFIER LETTER CIRCUMFLEX ACCENT
0xF7 0x02DC # SMALL TILDE
0xF8 0x00AF # MACRON
0xF9 0x02D8 # BREVE
0xFA 0x02D9 # DOT ABOVE
0xFB 0x02DA # RING ABOVE
0xFC 0x00B8 # CEDILLA
0xFD 0x02DD # DOUBLE ACUTE ACCENT
0xFE 0x02DB # OGONEK
0xFF 0x02C7 # CARON

View File

@@ -1,12 +1,40 @@
#=======================================================================
# File name: ARABIC.TXT
#
# Name: MacOS_Arabic [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <11>)
# Date: 15 April 1995
# Authors: Peter Edberg <edberg1@applelink.apple.com>
# Frank Tang
# Contents: Map (external version) from Mac OS Arabic
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1994-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-04 Update header comments. Matches internal xml
# <c1.2> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Add comments about character display and
# direction overrides. Update URLs, notes.
# Matches internal utom<b4>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n10 1998-Feb-05 Show required Unicode character
# directionality in a different way. Matches
# internal utom<n4>, ufrm<n21>, and Text
# Encoding Converter version 1.3. Update
# header comments; include information on
# loose mapping of digits.
# n07 1997-Jul-17 Update to match internal utom<n2>, ufrm<n17>:
# Change standard mapping for 0xC0 from U+066D
# to U+274A. Add direction overrides to
# mappings for 0x25, 0x2C, 0x3B, 0x3F. Add
# information on variants.
# n03 1995-Apr-18 First version (after fixing some typos).
# Matches internal ufrm<n11>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -15,69 +43,69 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Arabic code (in hex as 0xNN)
# Column #2 is the Unicode or Unicode sequence (in hex as 0xNNNN
# or 0xNNNN+0xNNNN+0xNNNN).
# Column #3 is the Unicode name (follows a comment sign, '#')
# Note: The abbreviations LRO, RLO, and PDF are used for
# LEFT-TO-RIGHT OVERRIDE, RIGHT-TO-LEFT OVERRIDE, and
# POP DIRECTIONAL FORMATTING, respectively.
# Column #1 is the Mac OS Arabic code (in hex as 0xNN).
# Column #2 is the corresponding Unicode (in hex as 0xNNNN),
# possibly preceded by a tag indicating required directionality
# (i.e. <LR>+0xNNNN or <RL>+0xNNNN).
# Column #3 is a comment containing the Unicode name.
#
# The entries are in MacOS Arabic code order.
# The entries are in Mac OS Arabic code order.
#
# Note that in many cases, a single MacOS Arabic character maps
# to a sequence of Unicode characters: LRO or RLO plus some Unicode
# character + PDF. This is indicated by joining the Unicode
# characters with '+'. This happens when the direction class of
# the MacOS Arabic character is different than the direction class
# of the Unicode character (usually the MacOS Arabic character has
# a strong direction class and the corresponding Unicode character
# is neutral or has a wek direction class).
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Arabic character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on MacOS Arabic:
# ----------------------
# Notes on Mac OS Arabic:
# -----------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# 1. General
#
# The MacOS Arabic character set is used for the Arabic and Persian
# (Farsi) localizations.
# The Mac OS Arabic character set is intended to cover Arabic as
# used in North Africa, the Arabian peninsula, and the Levant. It
# also contains several characters needed for Urdu and/or Farsi.
#
# The MacOS Arabic character set is essentially a superset of ISO
# The Mac OS Arabic character set is essentially a superset of ISO
# 8859-6. The 8859-6 code points that are interpreted differently
# in the MacOS Arabic set are as follows:
# 0xA0 is no-break space in 8859-6 and right-left space in MacOS
# Arabic; NBSP is 0x81 in MacOS Arabic.
# 0xA4 is currency sign in 8859-6 and right-left dollar sign in
# MacOS Arabic.
# 0xAD is soft hyphen in 8859-6 and right-left hyphen in MacOS
# Arabic.
# in the Mac OS Arabic set are as follows:
# 0xA0 is NO-BREAK SPACE in 8859-6 and right-left SPACE in Mac OS
# Arabic; NO-BREAK is 0x81 in Mac OS Arabic.
# 0xA4 is CURRENCY SIGN in 8859-6 and right-left DOLLAR SIGN in
# Mac OS Arabic.
# 0xAD is SOFT HYPHEN in 8859-6 and right-left HYPHEN-MINUS in
# Mac OS Arabic.
# ISO 8859-6 specifies that codes 0x30-0x39 can be rendered either
# with European digit shapes or Arabic digit shapes. This is also
# true MacOS Arabic, which determines from context which digit shapes
# to use.
# true in Mac OS Arabic, which determines from context which digit
# shapes to use (see below).
#
# The MacOS Arabic character set uses the C1 controls area and other
# The Mac OS Arabic character set uses the C1 controls area and other
# code points which are undefined in ISO 8859-6 for additional
# graphic characters: additional Arabic letters for Persian and Urdu,
# graphic characters: additional Arabic letters for Farsi and Urdu,
# some accented Roman letters for European languages (such as French),
# and duplicates of some of the punctuation, symbols, and digits in
# the ASCII block. The duplicate punctuation, symbol, and digit
@@ -85,119 +113,235 @@
# have left-right directionality. See the next section for more
# information on this.
#
# MacOS Arabic characters 0xEB-0xF2 are non-spacing/combining marks.
# Mac OS Arabic characters 0xEB-0xF2 are non-spacing/combining marks.
#
# 2. Directional characters and roundtrip fidelity
#
# The MacOS Arabic character set was developed in 1986-1987. At that
# time the bidirectional line line layout algorithm used in the MacOS
# The Mac OS Arabic character set was developed in 1986-1987. At that
# time the bidirectional line layout algorithm used in the Mac OS
# Arabic system was fairly simple; it used only a few direction
# classes (instead of the 13 or so now used in the Unicode
# bidirectional algorithm). In order to permit users to handle some
# tricky layout problems, certain punctuation and symbol characters
# have duplicate code points, one with a left-right direction
# attribute and the other with a right-left direction attribute.
# classes (instead of the 19 now used in the Unicode bidirectional
# algorithm). In order to permit users to handle some tricky layout
# problems, certain punctuation and symbol characters were encoded
# twice, one with a left-right direction attribute and the other with
# a right-left direction attribute.
#
# For example, ampersand is encoded at 0x26 with a left-right
# attribute, and at 0xA6 with a right-left attribute. However, there
# is only one ampersand character in Unicode. We need to have a way
# to map both MacOS Arabic ampersand characters to Unicode and back
# again without loss of information. Mapping one of the MacOS Arabic
# ampersand characters to a code in the Unicode corporate use zone is
# undesirable, since both of the ampersand characters are likely to
# For example, plus sign is encoded at 0x2B with a left-right
# attribute, and at 0xAB with a right-left attribute. However, there
# is only one PLUS SIGN character in Unicode. This leads to some
# interesting problems when mapping between Mac OS Arabic and Unicode;
# see below.
#
# A related problem is that even when a particular character is
# encoded only once in Mac OS Arabic, it may have a different
# direction attribute than the corresponding Unicode character.
#
# For example, the Mac OS Arabic character at 0x93 is HORIZONTAL
# ELLIPSIS with strong right-left direction. However, the Unicode
# character HORIZONTAL ELLIPSIS has direction class neutral.
#
# 3. Behavior of ASCII-range numbers in WorldScript
#
# Mac OS Arabic also has two sets of digit codes.
#
# The digits at 0x30-0x39 may be displayed using either European
# digit forms or Arabic digit forms, depending on context. If there
# is a "strong European" character such as a Latin letter on either
# side of a sequence consisting of digits 0x30-0x39 and possibly comma
# 0x2C or period 0x2E, then the characters will be displayed using
# European forms (This will happen even if there are neutral characters
# between the digits and the strong European character). Otherwise, the
# digits will be displayed using Arabic forms, the comma will be
# displayed as Arabic thousands separator, and the period as Arabic
# decimal separator. In any case, 0x2C, 0x2E, and 0x30-0x39 are always
# left-right.
#
# The digits at 0xB0-0xB9 are always displayed using Arabic digit
# shapes, and moreover, these digits always have strong right-left
# directionality. These are mainly intended for special layout
# purposes such as part numbers, etc.
#
# 4. Font variants
#
# The table in this file gives the Unicode mappings for the standard
# Mac OS Arabic encoding. This encoding is supported by the Cairo font
# (the system font for Arabic), and is the encoding supported by the
# text processing utilities. However, the other Arabic fonts actually
# implement slightly different encodings; this mainly affects the code
# points 0xAA and 0xC0. For these code points the standard Mac OS
# Arabic encoding has the following mappings:
# 0xAA -> <RL>+0x002A ASTERISK, right-left
# 0xC0 -> <RL>+0x274A EIGHT TEARDROP-SPOKED PROPELLER ASTERISK,
# right-left
# This mapping of 0xAA is consistent with the normal convention for
# Mac OS Arabic and Hebrew that the right-left duplicates have codes
# that are equal to the ASCII code of the left-right character plus
# 0x80. However, in all of the other fonts, 0xAA is MULTIPLY SIGN, and
# right-left ASTERISK may be at a different code point. The other
# variants are described below.
#
# The TrueType variant is used for most of the Arabic TrueType fonts:
# Baghdad, Geeza, Kufi, Nadeem. It differs from the standard variant
# in the following way:
# 0xAA -> <RL>+0x00D7 MULTIPLICATION SIGN, right-left
# 0xC0 -> <RL>+0x002A ASTERISK, right-left
#
# The Thuluth variant is used for the Arabic Postscript-only fonts:
# Thuluth and Thuluth bold. It differs from the standard variant in
# the following way:
# 0xAA -> <RL>+0x00D7 MULTIPLICATION SIGN, right-left
# 0xC0 -> 0x066D ARABIC FIVE POINTED STAR
#
# The AlBayan variant is used for the Arabic TrueType font Al Bayan.
# It differs from the standard variant in the following way:
# 0x81 -> no mapping (glyph just has authorship information, etc.)
# 0xA3 -> 0xFDFA ARABIC LIGATURE SALLALLAHOU ALAYHE WASALLAM
# 0xA4 -> 0xFDF2 ARABIC LIGATURE ALLAH ISOLATED FORM
# 0xAA -> <RL>+0x00D7 MULTIPLICATION SIGN, right-left
# 0xDC -> <RL>+0x25CF BLACK CIRCLE, right-left
# 0xFC -> <RL>+0x25A0 BLACK SQUARE, right-left
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Matching the direction of Mac OS Arabic characters
#
# When Mac OS Arabic encodes a character twice but with different
# direction attributes for the two code points - as in the case of
# plus sign mentioned above - we need a way to map both Mac OS Arabic
# code points to Unicode and back again without loss of information.
# With the plus sign, for example, mapping one of the Mac OS Arabic
# characters to a code in the Unicode corporate use zone is
# undesirable, since both of the plus sign characters are likely to
# be used in text that is interchanged.
#
# The problem is solved with the use of direction override characters
# and direction-dependent mappings. When mapping from MacOS Arabic to
# Unicode, such problem characters are surrounded with an appropriate
# direction override:
# MacOS Arabic 0x26 ampersand (left) ->
# Unicode 0x202D (LRO) + 0x0026 (AMPERSAND) + 0x202C (PDF)
# MacOS Arabic 0xA6 ampersand (right) ->
# Unicode 0x202E (RLO) + 0x0026 (AMPERSAND) + 0x202C (PDF)
# When mapping from Unicode to MacOS Arabic, the MacOS Unicode
# converter uses the Unicode bidirectional algorithm to determine
# resolved directions. The mapping from Unicode to MacOS Arabic can
# then be disambiguated by the use of the resolved direction:
# Unicode 0x0026 -> MacOS Arabic 0x26 (if L) or 0xA6 (if R)
# and direction-dependent mappings. When mapping from Mac OS Arabic
# to Unicode, we use direction overrides as necessary to force the
# direction of the resulting Unicode characters.
#
# However, note that this means we also need to discard the direction
# override characters when mapping from Unicode to MacOS Arabic.
# The required direction is indicated by a direction tag in the
# mappings. A tag of <LR> means the corresponding Unicode character
# must have a strong left-right context, and a tag of <RL> indicates
# a right-left context.
#
# For example, the mapping of 0x2B is given as <LR>+0x002B; the
# mapping of 0xAB is given as <RL>+0x002B. If we map an isolated
# instance of 0x2B to Unicode, it should be mapped as follows (LRO
# indicates LEFT-RIGHT OVERRIDE, PDF indicates POP DIRECTION
# FORMATTING):
#
# 0x2B -> 0x202D (LRO) + 0x002B (PLUS SIGN) + 0x202C (PDF)
#
# When mapping several characters in a row that require direction
# forcing, the overrides need only be used at the beginning and end.
# For example:
#
# 0x24 0x20 0x28 0x29 -> 0x202D 0x0024 0x0020 0x0028 0x0029 0x202C
#
# If neutral characters that require direction forcing are already
# between strong-direction characters with matching directionality,
# then direction overrides need not be used. Direction overrides are
# always needed to map the right-left digits at 0xB0-0xB9.
#
# When mapping from Unicode to Mac OS Arabic, the Unicode
# bidirectional algorithm should be used to determine resolved
# direction of the Unicode characters. The mapping from Unicode to
# Mac OS Arabic can then be disambiguated by the use of the resolved
# direction:
#
# Unicode 0x002B -> Mac OS Arabic 0x2B (if L) or 0xAB (if R)
#
# However, this also means the direction override characters should
# be discarded when mapping from Unicode to Mac OS Arabic (after
# they have been used to determine resolved direction), since the
# direction override information is carried by the code point itself.
#
# Even when direction overrides are not needed for roundtrip
# fidelity, they are sometimes used when mapping MacOS Arabic
# fidelity, they are sometimes used when mapping Mac OS Arabic
# characters to Unicode in order to achieve similar text layout with
# the resulting Unicode text. For example, the single MacOS Arabic
# the resulting Unicode text. For example, the single Mac OS Arabic
# ellipsis character has direction class right-left,and there is no
# left-right version. However, the Unicode HORIZONTAL ELLIPSIS
# character has direction class neutral (which means it may end up
# with a resolved direction of left-right if surrounded by left-right
# characters). When mapping the MacOS Arabic ellipsis to Unicode, it
# characters). When mapping the Mac OS Arabic ellipsis to Unicode, it
# is surrounded with a direction override to help preserve proper
# text layout. The resolved direction is not needed or used when
# mapping the Unicode HORIZONTAL ELLIPSIS back to MacOS Arabic.
# mapping the Unicode HORIZONTAL ELLIPSIS back to Mac OS Arabic.
#
# MacOS Arabic also has duplicate digit codes at 0xB0-0xB9. These
# have right-left direction and are always displayed with Arabic
# digit glyphs (unlike the 0x30-0x39 digits, which have left-right
# direction). The MacOS Arabic 0xB0-0xB9 digits are mapped to the
# Unicode Arabic digits U+0660-U+0669 and surrounded with direction
# overrides, since the Unicode Arabic digits have a weak left-right
# direction.
# 2. Mapping the Mac OS Arabic digits
#
# 3. Problematic character assignments
# The main table below contains mappings that should be used when
# strict round-trip fidelity is required. However, for numeric
# values, the mappings in that table will produce Unicode characters
# that may appear different than the Mac OS Arabic text displayed on
# a Mac OS system using WorldScript. This is because WorldScript
# uses context-dependent display for the 0x30-0x39 digits.
#
# In the Cairo font, the characters at 0x2A and 0xAA are rendered as
# an asterisk (which normally has 6 points) and the character at 0xC0
# is rendered as something that looks like a large 8-pointed asterisk.
# This handling of 0x2A and 0xAA is consistent with (1) the general
# principle that in MacOS character sets, the ASCII part should be
# identical to ASCII (0x2A is asterisk in ASCII), and (2) in MacOS
# Arabic, the right-left duplicates have codes that are equal to the
# ASCII code of the left-right version plus 0x80. However, in all of
# the other MacOS Arabic fonts, 0x2A and 0xAA are rendered as
# multiply sign (U+00D7), and 0xC0 is rendered as asterisk (with 6
# points). Also note that Unicode has a character ARABIC FIVE POINTED
# STAR (U+066D), which is similar to an asterisk but has five points.
# If roundtrip fidelity is not required, then the following
# alternate mappings should be used when a sequence of 0x30-0x39
# digits - possibly including 0x2C and 0x2E - occurs in an Arabic
# context (that is, when the first "strong" character on either side
# of the digit sequence is Arabic, or there is no strong character):
#
# For now the strict mappings treat 0x2A and 0xAA as asterisk; the
# loose mappings also map U+00D7 to 0xAA; and 0xC0 is treated as
# ARABIC FIVE POINTED STAR (until we find a better mapping).
# 0x2C 0x066C # ARABIC THOUSANDS SEPARATOR
# 0x2E 0x066B # ARABIC DECIMAL SEPARATOR
# 0x30 0x0660 # ARABIC-INDIC DIGIT ZERO
# 0x31 0x0661 # ARABIC-INDIC DIGIT ONE
# 0x32 0x0662 # ARABIC-INDIC DIGIT TWO
# 0x33 0x0663 # ARABIC-INDIC DIGIT THREE
# 0x34 0x0664 # ARABIC-INDIC DIGIT FOUR
# 0x35 0x0665 # ARABIC-INDIC DIGIT FIVE
# 0x36 0x0666 # ARABIC-INDIC DIGIT SIX
# 0x37 0x0667 # ARABIC-INDIC DIGIT SEVEN
# 0x38 0x0668 # ARABIC-INDIC DIGIT EIGHT
# 0x39 0x0669 # ARABIC-INDIC DIGIT NINE
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version n03 to version n07:
#
# - Change mapping for 0xC0 from U+066D to U+274A.
#
# - Add direction overrides (required directionality) to mappings
# for 0x25, 0x2C, 0x3B, 0x3F.
#
##################
0x20 0x202D+0x0020+0x202C # LRO + SPACE + PDF
0x21 0x202D+0x0021+0x202C # LRO + EXCLAMATION MARK + PDF
0x22 0x202D+0x0022+0x202C # LRO + QUOTATION MARK + PDF
0x23 0x202D+0x0023+0x202C # LRO + NUMBER SIGN + PDF
0x24 0x202D+0x0024+0x202C # LRO + DOLLAR SIGN + PDF
0x25 0x0025 # PERCENT SIGN
0x26 0x202D+0x0026+0x202C # LRO + AMPERSAND + PDF
0x27 0x202D+0x0027+0x202C # LRO + APOSTROPHE + PDF
0x28 0x202D+0x0028+0x202C # LRO + LEFT PARENTHESIS + PDF
0x29 0x202D+0x0029+0x202C # LRO + RIGHT PARENTHESIS + PDF
0x2A 0x202D+0x002A+0x202C # LRO + ASTERISK + PDF
0x2B 0x202D+0x002B+0x202C # LRO + PLUS SIGN + PDF
0x2C 0x002C # COMMA
0x2D 0x202D+0x002D+0x202C # LRO + HYPHEN-MINUS + PDF
0x2E 0x202D+0x002E+0x202C # LRO + FULL STOP + PDF
0x2F 0x202D+0x002F+0x202C # LRO + SOLIDUS + PDF
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x202D+0x003A+0x202C # LRO + COLON + PDF
0x3B 0x003B # SEMICOLON
0x3C 0x202D+0x003C+0x202C # LRO + LESS-THAN SIGN + PDF
0x3D 0x202D+0x003D+0x202C # LRO + EQUALS SIGN + PDF
0x3E 0x202D+0x003E+0x202C # LRO + GREATER-THAN SIGN + PDF
0x3F 0x003F # QUESTION MARK
0x20 <LR>+0x0020 # SPACE, left-right
0x21 <LR>+0x0021 # EXCLAMATION MARK, left-right
0x22 <LR>+0x0022 # QUOTATION MARK, left-right
0x23 <LR>+0x0023 # NUMBER SIGN, left-right
0x24 <LR>+0x0024 # DOLLAR SIGN, left-right
0x25 <LR>+0x0025 # PERCENT SIGN, left-right
0x26 <LR>+0x0026 # AMPERSAND, left-right
0x27 <LR>+0x0027 # APOSTROPHE, left-right
0x28 <LR>+0x0028 # LEFT PARENTHESIS, left-right
0x29 <LR>+0x0029 # RIGHT PARENTHESIS, left-right
0x2A <LR>+0x002A # ASTERISK, left-right
0x2B <LR>+0x002B # PLUS SIGN, left-right
0x2C <LR>+0x002C # COMMA, left-right; in Arabic-script context, displayed as 0x066C ARABIC THOUSANDS SEPARATOR
0x2D <LR>+0x002D # HYPHEN-MINUS, left-right
0x2E <LR>+0x002E # FULL STOP, left-right; in Arabic-script context, displayed as 0x066B ARABIC DECIMAL SEPARATOR
0x2F <LR>+0x002F # SOLIDUS, left-right
0x30 0x0030 # DIGIT ZERO; in Arabic-script context, displayed as 0x0660 ARABIC-INDIC DIGIT ZERO
0x31 0x0031 # DIGIT ONE; in Arabic-script context, displayed as 0x0661 ARABIC-INDIC DIGIT ONE
0x32 0x0032 # DIGIT TWO; in Arabic-script context, displayed as 0x0662 ARABIC-INDIC DIGIT TWO
0x33 0x0033 # DIGIT THREE; in Arabic-script context, displayed as 0x0663 ARABIC-INDIC DIGIT THREE
0x34 0x0034 # DIGIT FOUR; in Arabic-script context, displayed as 0x0664 ARABIC-INDIC DIGIT FOUR
0x35 0x0035 # DIGIT FIVE; in Arabic-script context, displayed as 0x0665 ARABIC-INDIC DIGIT FIVE
0x36 0x0036 # DIGIT SIX; in Arabic-script context, displayed as 0x0666 ARABIC-INDIC DIGIT SIX
0x37 0x0037 # DIGIT SEVEN; in Arabic-script context, displayed as 0x0667 ARABIC-INDIC DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT; in Arabic-script context, displayed as 0x0668 ARABIC-INDIC DIGIT EIGHT
0x39 0x0039 # DIGIT NINE; in Arabic-script context, displayed as 0x0669 ARABIC-INDIC DIGIT NINE
0x3A <LR>+0x003A # COLON, left-right
0x3B <LR>+0x003B # SEMICOLON, left-right
0x3C <LR>+0x003C # LESS-THAN SIGN, left-right
0x3D <LR>+0x003D # EQUALS SIGN, left-right
0x3E <LR>+0x003E # GREATER-THAN SIGN, left-right
0x3F <LR>+0x003F # QUESTION MARK, left-right
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
@@ -225,11 +369,11 @@
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x202D+0x005B+0x202C # LRO + LEFT SQUARE BRACKET + PDF
0x5C 0x202D+0x005C+0x202C # LRO + REVERSE SOLIDUS + PDF
0x5D 0x202D+0x005D+0x202C # LRO + RIGHT SQUARE BRACKET + PDF
0x5E 0x202D+0x005E+0x202C # LRO + CIRCUMFLEX ACCENT + PDF
0x5F 0x202D+0x005F+0x202C # LRO + LOW LINE + PDF
0x5B <LR>+0x005B # LEFT SQUARE BRACKET, left-right
0x5C <LR>+0x005C # REVERSE SOLIDUS, left-right
0x5D <LR>+0x005D # RIGHT SQUARE BRACKET, left-right
0x5E <LR>+0x005E # CIRCUMFLEX ACCENT, left-right
0x5F <LR>+0x005F # LOW LINE, left-right
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
@@ -257,13 +401,13 @@
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x202D+0x007B+0x202C # LRO + LEFT CURLY BRACKET + PDF
0x7C 0x202D+0x007C+0x202C # LRO + VERTICAL LINE + PDF
0x7D 0x202D+0x007D+0x202C # LRO + RIGHT CURLY BRACKET + PDF
0x7B <LR>+0x007B # LEFT CURLY BRACKET, left-right
0x7C <LR>+0x007C # VERTICAL LINE, left-right
0x7D <LR>+0x007D # RIGHT CURLY BRACKET, left-right
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0x202E+0x00A0+0x202C # RLO + NO-BREAK SPACE + PDF
0x81 <RL>+0x00A0 # NO-BREAK SPACE, right-left
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
@@ -274,59 +418,59 @@
0x89 0x00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
0x8A 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS
0x8B 0x06BA # ARABIC LETTER NOON GHUNNA
0x8C 0x202E+0x00AB+0x202C # RLO + LEFT-POINTING DOUBLE ANGLE QUOTATION MARK + PDF
0x8C <RL>+0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left
0x8D 0x00E7 # LATIN SMALL LETTER C WITH CEDILLA
0x8E 0x00E9 # LATIN SMALL LETTER E WITH ACUTE
0x8F 0x00E8 # LATIN SMALL LETTER E WITH GRAVE
0x90 0x00EA # LATIN SMALL LETTER E WITH CIRCUMFLEX
0x91 0x00EB # LATIN SMALL LETTER E WITH DIAERESIS
0x92 0x00ED # LATIN SMALL LETTER I WITH ACUTE
0x93 0x202E+0x2026+0x202C # RLO + HORIZONTAL ELLIPSIS + PDF
0x93 <RL>+0x2026 # HORIZONTAL ELLIPSIS, right-left
0x94 0x00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
0x95 0x00EF # LATIN SMALL LETTER I WITH DIAERESIS
0x96 0x00F1 # LATIN SMALL LETTER N WITH TILDE
0x97 0x00F3 # LATIN SMALL LETTER O WITH ACUTE
0x98 0x202E+0x00BB+0x202C # RLO + RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK + PDF
0x98 <RL>+0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x202E+0x00F7+0x202C # RLO + DIVISION SIGN + PDF
0x9B <RL>+0x00F7 # DIVISION SIGN, right-left
0x9C 0x00FA # LATIN SMALL LETTER U WITH ACUTE
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x202E+0x0020+0x202C # RLO + SPACE + PDF
0xA1 0x202E+0x0021+0x202C # RLO + EXCLAMATION MARK + PDF
0xA2 0x202E+0x0022+0x202C # RLO + QUOTATION MARK + PDF
0xA3 0x202E+0x0023+0x202C # RLO + NUMBER SIGN + PDF
0xA4 0x202E+0x0024+0x202C # RLO + DOLLAR SIGN + PDF
0xA0 <RL>+0x0020 # SPACE, right-left
0xA1 <RL>+0x0021 # EXCLAMATION MARK, right-left
0xA2 <RL>+0x0022 # QUOTATION MARK, right-left
0xA3 <RL>+0x0023 # NUMBER SIGN, right-left
0xA4 <RL>+0x0024 # DOLLAR SIGN, right-left
0xA5 0x066A # ARABIC PERCENT SIGN
0xA6 0x202E+0x0026+0x202C # RLO + AMPERSAND + PDF
0xA7 0x202E+0x0027+0x202C # RLO + APOSTROPHE + PDF
0xA8 0x202E+0x0028+0x202C # RLO + LEFT PARENTHESIS + PDF
0xA9 0x202E+0x0029+0x202C # RLO + RIGHT PARENTHESIS + PDF
0xAA 0x202E+0x002A+0x202C # RLO + ASTERISK + PDF
0xAB 0x202E+0x002B+0x202C # RLO + PLUS SIGN + PDF
0xA6 <RL>+0x0026 # AMPERSAND, right-left
0xA7 <RL>+0x0027 # APOSTROPHE, right-left
0xA8 <RL>+0x0028 # LEFT PARENTHESIS, right-left
0xA9 <RL>+0x0029 # RIGHT PARENTHESIS, right-left
0xAA <RL>+0x002A # ASTERISK, right-left
0xAB <RL>+0x002B # PLUS SIGN, right-left
0xAC 0x060C # ARABIC COMMA
0xAD 0x202E+0x002D+0x202C # RLO + HYPHEN-MINUS + PDF
0xAE 0x202E+0x002E+0x202C # RLO + FULL STOP + PDF
0xAF 0x202E+0x002F+0x202C # RLO + SOLIDUS + PDF
0xB0 0x202E+0x0660+0x202C # RLO + ARABIC-INDIC DIGIT ZERO + PDF
0xB1 0x202E+0x0661+0x202C # RLO + ARABIC-INDIC DIGIT ONE + PDF
0xB2 0x202E+0x0662+0x202C # RLO + ARABIC-INDIC DIGIT TWO + PDF
0xB3 0x202E+0x0663+0x202C # RLO + ARABIC-INDIC DIGIT THREE + PDF
0xB4 0x202E+0x0664+0x202C # RLO + ARABIC-INDIC DIGIT FOUR + PDF
0xB5 0x202E+0x0665+0x202C # RLO + ARABIC-INDIC DIGIT FIVE + PDF
0xB6 0x202E+0x0666+0x202C # RLO + ARABIC-INDIC DIGIT SIX + PDF
0xB7 0x202E+0x0667+0x202C # RLO + ARABIC-INDIC DIGIT SEVEN + PDF
0xB8 0x202E+0x0668+0x202C # RLO + ARABIC-INDIC DIGIT EIGHT + PDF
0xB9 0x202E+0x0669+0x202C # RLO + ARABIC-INDIC DIGIT NINE + PDF
0xBA 0x202E+0x003A+0x202C # RLO + COLON + PDF
0xAD <RL>+0x002D # HYPHEN-MINUS, right-left
0xAE <RL>+0x002E # FULL STOP, right-left
0xAF <RL>+0x002F # SOLIDUS, right-left
0xB0 <RL>+0x0660 # ARABIC-INDIC DIGIT ZERO, right-left (need override)
0xB1 <RL>+0x0661 # ARABIC-INDIC DIGIT ONE, right-left (need override)
0xB2 <RL>+0x0662 # ARABIC-INDIC DIGIT TWO, right-left (need override)
0xB3 <RL>+0x0663 # ARABIC-INDIC DIGIT THREE, right-left (need override)
0xB4 <RL>+0x0664 # ARABIC-INDIC DIGIT FOUR, right-left (need override)
0xB5 <RL>+0x0665 # ARABIC-INDIC DIGIT FIVE, right-left (need override)
0xB6 <RL>+0x0666 # ARABIC-INDIC DIGIT SIX, right-left (need override)
0xB7 <RL>+0x0667 # ARABIC-INDIC DIGIT SEVEN, right-left (need override)
0xB8 <RL>+0x0668 # ARABIC-INDIC DIGIT EIGHT, right-left (need override)
0xB9 <RL>+0x0669 # ARABIC-INDIC DIGIT NINE, right-left (need override)
0xBA <RL>+0x003A # COLON, right-left
0xBB 0x061B # ARABIC SEMICOLON
0xBC 0x202E+0x003C+0x202C # RLO + LESS-THAN SIGN + PDF
0xBD 0x202E+0x003D+0x202C # RLO + EQUALS SIGN + PDF
0xBE 0x202E+0x003E+0x202C # RLO + GREATER-THAN SIGN + PDF
0xBC <RL>+0x003C # LESS-THAN SIGN, right-left
0xBD <RL>+0x003D # EQUALS SIGN, right-left
0xBE <RL>+0x003E # GREATER-THAN SIGN, right-left
0xBF 0x061F # ARABIC QUESTION MARK
0xC0 0x066D # ARABIC FIVE POINTED STAR
0xC0 <RL>+0x274A # EIGHT TEARDROP-SPOKED PROPELLER ASTERISK, right-left
0xC1 0x0621 # ARABIC LETTER HAMZA
0xC2 0x0622 # ARABIC LETTER ALEF WITH MADDA ABOVE
0xC3 0x0623 # ARABIC LETTER ALEF WITH HAMZA ABOVE
@@ -353,11 +497,11 @@
0xD8 0x0638 # ARABIC LETTER ZAH
0xD9 0x0639 # ARABIC LETTER AIN
0xDA 0x063A # ARABIC LETTER GHAIN
0xDB 0x202E+0x005B+0x202C # RLO + LEFT SQUARE BRACKET + PDF
0xDC 0x202E+0x005C+0x202C # RLO + REVERSE SOLIDUS + PDF
0xDD 0x202E+0x005D+0x202C # RLO + RIGHT SQUARE BRACKET + PDF
0xDE 0x202E+0x005E+0x202C # RLO + CIRCUMFLEX ACCENT + PDF
0xDF 0x202E+0x005F+0x202C # RLO + LOW LINE + PDF
0xDB <RL>+0x005B # LEFT SQUARE BRACKET, right-left
0xDC <RL>+0x005C # REVERSE SOLIDUS, right-left
0xDD <RL>+0x005D # RIGHT SQUARE BRACKET, right-left
0xDE <RL>+0x005E # CIRCUMFLEX ACCENT, right-left
0xDF <RL>+0x005F # LOW LINE, right-left
0xE0 0x0640 # ARABIC TATWEEL
0xE1 0x0641 # ARABIC LETTER FEH
0xE2 0x0642 # ARABIC LETTER QAF
@@ -385,9 +529,8 @@
0xF8 0x06AF # ARABIC LETTER GAF
0xF9 0x0688 # ARABIC LETTER DDAL
0xFA 0x0691 # ARABIC LETTER RREH
0xFB 0x202E+0x007B+0x202C # RLO + LEFT CURLY BRACKET + PDF
0xFC 0x202E+0x007C+0x202C # RLO + VERTICAL LINE + PDF
0xFD 0x202E+0x007D+0x202C # RLO + RIGHT CURLY BRACKET + PDF
0xFB <RL>+0x007B # LEFT CURLY BRACKET, right-left
0xFC <RL>+0x007C # VERTICAL LINE, right-left
0xFD <RL>+0x007D # RIGHT CURLY BRACKET, right-left
0xFE 0x0698 # ARABIC LETTER JEH
0xFF 0x06D2 # ARABIC LETTER YEH BARREE

View File

@@ -1,11 +1,34 @@
#=======================================================================
# File name: CROATIAN.TXT
#
# Name: MacOS_Croatian [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <6>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Croatian
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-04 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs, notes. Matches internal
# utom<b3>.
# b02 1999-Sep-22 Encoding changed for Mac OS 8.5; change
# mapping of 0xDB from CURRENCY SIGN to EURO
# SIGN. Update contact e-mail address. Matches
# internal utom<b2>, ufrm<b2>, and Text
# Encoding Converter version 1.5.
# n07 1998-Feb-05 Minor update to header comments
# n05 1997-Dec-14 Update to match internal utom<5>, ufrm<16>:
# Change standard mapping for 0xBD from U+2126
# to its canonical decomposition, U+03A9.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<6>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,50 +37,91 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Croatian code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Croatian code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Croatian code order.
# The entries are in Mac OS Croatian code order.
#
# One of these mappings requires the use of a corporate character
# (for the Apple logo character). See the file "MacOS-CorpCharList".
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
# One of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Notes on MacOS Croatian:
# ------------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Croatian character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# The MacOS Croatian encoding shares the script code smRoman
# (0) with the standard MacOS Roman encoding. To determine if
# Notes on Mac OS Croatian:
# -------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Croatian is used for Croatian and Slovene.
#
# The Mac OS Croatian encoding shares the script code smRoman
# (0) with the standard Mac OS Roman encoding. To determine if
# the Croatian encoding is being used, you must check if the
# system region code is 68, verCroatia (or 25, verYugoCroatian,
# only used in older systems).
#
# This character set is a variant of standard MacOS Roman
# This character set is a variant of standard Mac OS Roman
# encoding, adding five accented letter case pairs to handle
# Croatian. It has 20 code point differences from standard
# MacOS Roman, but only 10 differences in repertoire.
# Mac OS Roman, but only 10 differences in repertoire.
#
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts are updated for Mac OS 8.5 to reflect this. There is
# a "currency sign" variant of the Mac OS Croatian encoding that
# still maps 0xDB to U+00A4; this can be used for older fonts.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The following corporate zone Unicode character is used in this
# mapping:
#
# 0xF8FF Apple logo
#
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version n07 to version b02:
#
# - Encoding changed for Mac OS 8.5; change mapping of 0xDB from
# CURRENCY SIGN (U+00A4) to EURO SIGN (U+20AC).
#
# Changes from version n03 to version n05:
#
# - Change mapping of 0xBD from U+2126 to its canonical
# decomposition, U+03A9.
#
##################
@@ -218,7 +282,7 @@
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x2126 # OHM SIGN
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x017E # LATIN SMALL LETTER Z WITH CARON
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
@@ -248,10 +312,10 @@
0xD8 0xF8FF # Apple logo
0xD9 0x00A9 # COPYRIGHT SIGN
0xDA 0x2044 # FRACTION SLASH
0xDB 0x00A4 # CURRENCY SIGN
0xDB 0x20AC # EURO SIGN
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0x00C6 # LATIN CAPITAL LIGATURE AE
0xDE 0x00C6 # LATIN CAPITAL LETTER AE
0xDF 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xE0 0x2013 # EN DASH
0xE1 0x00B7 # MIDDLE DOT
@@ -283,5 +347,5 @@
0xFB 0x02DA # RING ABOVE
0xFC 0x00B8 # CEDILLA
0xFD 0x00CA # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
0xFE 0x00E6 # LATIN SMALL LIGATURE AE
0xFE 0x00E6 # LATIN SMALL LETTER AE
0xFF 0x02C7 # CARON

View File

@@ -1,12 +1,35 @@
#=======================================================================
# File name: CYRILLIC.TXT
#
# Name: MacOS_Cyrillic [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <5>)
# Date: 15 April 1995
# Authors: Peter Edberg <edberg1@applelink.apple.com>
# Judy Kettenhofen
# Contents: Map (external version) from Mac OS Cyrillic
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c03 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs, notes. Matches internal
# utom<b2>.
# b02 1999-Sep-22 Encoding changed for Mac OS 9.0 to merge
# with Mac OS Ukrainian and support EURO SIGN;
# Change mappings for 0xA2, 0xB6, and 0xFF.
# Update contact e-mail address. Matches
# internal utom<b2>, ufrm<b2>, and Text
# Encoding Converter version 1.5.
# n05 1998-Feb-05 Update header comments to new format; no
# mapping changes. Matches internal utom<n3>,
# ufrm<n13>, and Text Encoding Converter
# version 1.3.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n5>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -15,49 +38,91 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Cyrillic code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Cyrillic code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Cyrillic code order.
# The entries are in Mac OS Cyrillic code order.
#
# Notes on MacOS Cyrillic:
# ------------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Cyrillic character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# The MacOS Cyrillic encoding includes the full Cyrillic letter
# repertoire of ISO 8859-5 (although not at the same code points).
# This covers most of the Slavic languages written in Cyrillic
# script.
# Notes on Mac OS Cyrillic:
# -------------------------
#
# The MacOS Cyrillic encoding also includes a number of characters
# needed for the MacOS user interface (e.g. ellipsis), MacOS
# localization (bullet for echoing passwords, copyright sign, etc.
# All of the characters in MacOS Cyrillic that are also in the MacOS
# Roman encoding are at the same code point as MacOS Roman. This
# improves application compatibility (since some naughty applications
# hard-code the MacOS Roman code points of certain characters).
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported directly in programming
# interfaces for QuickDraw Text, the Script Manager, and related
# Text Utilities. For other purposes it is supported via transcoding
# to and from Unicode.
#
# A variant of MacOS Cyrillic is used for Ukrainian. A separate
# mapping table is available for Ukrainian.
# This is the "Euro sign" version of Mac Cyrillic for Mac OS 9.0 and
# later. Before Mac OS 9.0, there were two separate Slavic Cyrillic
# encodings:
#
# 1. The Cyrillic currency sign variant (used for localized Russian
# and Bulgarian systems), which had the following:
# 0xA2 U+00A2 CENT SIGN
# 0xB6 U+2202 PARTIAL DIFFERENTIAL
# 0xFF U+00A4 CURRENCY SIGN
#
# 2. The Ukrainian currency sign variant (used for localized Ukrainian
# systems and the pre-9.0 Cyrillic Language Kit), which had the
# following:
# 0xA2 U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN
# 0xB6 U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN
# 0xFF U+00A4 CURRENCY SIGN
#
# This new Cyrillic Euro sign version is based on the old Ukrainian
# currency sign variant, with 0xFF changed to be EURO SIGN.
#
# The Mac OS Cyrillic encoding includes the Cyrillic letter repertoire
# of ISO 8859-5 (although not at the same code points). This covers
# most of the Slavic languages written in Cyrillic script.
#
# The Mac OS Cyrillic encoding also includes a number of characters
# needed for the Mac OS user interface and localization (e.g.
# ellipsis, bullet, copyright sign). All of the characters in Mac OS
# Cyrillic that are also in the Mac OS Roman encoding are at the
# same code point in both; this improves application compatibility.
#
# Note: There is a common Ukrainian glyph variation in which the glyph
# for CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I may or may not
# have a dot above.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version n05 to version b02:
#
# - Encoding changed for Mac OS 9.0 to merge with Mac OS Ukrainian and
# support EURO SIGN. 0xA2 changed from U+00A2 to U+0490; 0xB6 changed
# from U+2202 to U+0491; 0xFF changed from U+00A4 to U+20AC.
#
##################
@@ -191,7 +256,7 @@
0x9F 0x042F # CYRILLIC CAPITAL LETTER YA
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x00A2 # CENT SIGN
0xA2 0x0490 # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
@@ -211,7 +276,7 @@
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x0456 # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
0xB5 0x00B5 # MICRO SIGN
0xB6 0x2202 # PARTIAL DIFFERENTIAL
0xB6 0x0491 # CYRILLIC SMALL LETTER GHE WITH UPTURN
0xB7 0x0408 # CYRILLIC CAPITAL LETTER JE
0xB8 0x0404 # CYRILLIC CAPITAL LETTER UKRAINIAN IE
0xB9 0x0454 # CYRILLIC SMALL LETTER UKRAINIAN IE
@@ -284,4 +349,4 @@
0xFC 0x044C # CYRILLIC SMALL LETTER SOFT SIGN
0xFD 0x044D # CYRILLIC SMALL LETTER E
0xFE 0x044E # CYRILLIC SMALL LETTER YU
0xFF 0x00A4 # CURRENCY SIGN
0xFF 0x20AC # EURO SIGN

View File

@@ -1,11 +1,35 @@
#=======================================================================
# File name: GREEK.TXT
#
# Name: MacOS_Greek [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <7>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Greek
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update to match changes in Mac OS Greek
# encoding for Mac OS 9.2.2 and later.
# Update URLs, notes. Matches internal
# utom<b3>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n06 1998-Feb-05 Update to match internal utom<n4>, ufrm<n17>,
# and Text Encoding Converter versions 1.3:
# Change mapping for 0xAF from U+0387 to its
# canonical decomposition, U+00B7. Also
# update header comments to new format.
# n04 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n7>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,53 +38,94 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Greek code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Greek code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Greek code order.
# The entries are in Mac OS Greek code order.
#
# One of these mappings requires the use of a corporate character
# (for an undefined code point). See the file "MacOS-CorpCharList".
# One of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Notes on MacOS Greek:
# ---------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Greek character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Although a MacOS script code is defined for Greek (smGreek = 6),
# Notes on Mac OS Greek:
# ----------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Although a Mac OS script code is defined for Greek (smGreek = 6),
# the Greek localized system does not currently use it (the font
# family IDs are in the MacOS Roman range). To determine if the
# family IDs are in the Mac OS Roman range). To determine if the
# Greek encoding is being used when the script code is smRoman (0),
# you must check if the system region code is 20, verGreece.
#
# The MacOS Greek encoding is a superset of the repertoire of
# ISO 8859-5 (although characters are not at the same code points).
# The Mac OS Greek encoding is a superset of the repertoire of
# ISO 8859-7 (although characters are not at the same code points),
# except that LEFT & RIGHT SINGLE QUOTATION MARK replace the
# MODIFIER LETTER REVERSED COMMA & APOSTROPHE (spacing versions of
# Greek rough & smooth breathing marks) that are in ISO 8859-7.
# The added characters in Mac OS Greek include more punctuation and
# symbols and several accented Latin letters.
#
# This character set was used by all MacOS Greek systems 6.x and
# 7.x except for system 6.0.7, which used a variant character set
# but was quickly replaced with Greek system 6.0.7.1 using the
# standard character set documented here. Greek system 4.1 used a
# variant Greek set that had ISO 8859-7 in 0xA0-0xFF (with some
# holes filled in with DTP characters), and MacOS Roman accented
# Roman letters in 0x80-0x9F.
# Before Mac OS 9.2.2, code point 0x9C was SOFT HYPHEN (U+00AD), and
# code point 0xFF was undefined. In Mac OS 9.2.2 and later versions,
# SOFT HYPHEN was moved to 0xFF, and code point 0x9C was changed to be
# EURO SIGN (U+20AC); the standard Apple fonts are updated for Mac OS
# 9.2.2 to reflect this. There is a "no Euro sign" variant of the Mac
# OS Greek encoding that uses the older mapping; this can be used for
# older fonts.
#
# This "no Euro sign" variant of Mac OS Greek was the character set
# used by Mac OS Greek systems before 9.2.2 except for system 6.0.7,
# which used a variant character set but was quickly replaced with
# Greek system 6.0.7.1 using the no Euro sign" character set
# documented here. Greek system 4.1 used a variant Greek set that had
# ISO 8859-7 in 0xA0-0xFF (with some holes filled in with DTP
# characters), and Mac OS Roman accented Roman letters in 0x80-0x9F.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - The Mac OS Greek encoding changed for Mac OS 9.2.2 and later
# as follows:
# 0x9C, changed from 0x00AD SOFT HYPHEN to 0x20AC EURO SIGN
# 0xFF, changed from undefined to 0x00AD SOFT HYPHEN
#
# Changes from version n04 to version n06:
#
# - Change mapping of 0xAF from U+0387 to its canonical
# decomposition, U+00B7.
#
##################
@@ -188,7 +253,7 @@
0x99 0x00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
0x9A 0x00F6 # LATIN SMALL LETTER O WITH DIAERESIS
0x9B 0x00A6 # BROKEN BAR
0x9C 0x00AD # SOFT HYPHEN
0x9C 0x20AC # EURO SIGN # before Mac OS 9.2.2, was SOFT HYPHEN
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
@@ -196,7 +261,7 @@
0xA1 0x0393 # GREEK CAPITAL LETTER GAMMA
0xA2 0x0394 # GREEK CAPITAL LETTER DELTA
0xA3 0x0398 # GREEK CAPITAL LETTER THETA
0xA4 0x039B # GREEK CAPITAL LETTER LAMBDA
0xA4 0x039B # GREEK CAPITAL LETTER LAMDA
0xA5 0x039E # GREEK CAPITAL LETTER XI
0xA6 0x03A0 # GREEK CAPITAL LETTER PI
0xA7 0x00DF # LATIN SMALL LETTER SHARP S
@@ -207,7 +272,7 @@
0xAC 0x00A7 # SECTION SIGN
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00B0 # DEGREE SIGN
0xAF 0x0387 # GREEK ANO TELEIA
0xAF 0x00B7 # MIDDLE DOT
0xB0 0x0391 # GREEK CAPITAL LETTER ALPHA
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2264 # LESS-THAN OR EQUAL TO
@@ -268,7 +333,7 @@
0xE9 0x03B9 # GREEK SMALL LETTER IOTA
0xEA 0x03BE # GREEK SMALL LETTER XI
0xEB 0x03BA # GREEK SMALL LETTER KAPPA
0xEC 0x03BB # GREEK SMALL LETTER LAMBDA
0xEC 0x03BB # GREEK SMALL LETTER LAMDA
0xED 0x03BC # GREEK SMALL LETTER MU
0xEE 0x03BD # GREEK SMALL LETTER NU
0xEF 0x03BF # GREEK SMALL LETTER OMICRON
@@ -287,4 +352,4 @@
0xFC 0x03CB # GREEK SMALL LETTER UPSILON WITH DIALYTIKA
0xFD 0x0390 # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
0xFE 0x03B0 # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS
0xFF 0xF8A0 # undefined1
0xFF 0x00AD # SOFT HYPHEN # before Mac OS 9.2.2, was undefined

View File

@@ -1,12 +1,37 @@
#=======================================================================
# File name: HEBREW.TXT
#
# Name: MacOS_Hebrew [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.1 (from internal ufrm version <8>)
# Date: 15 November 1995
# Authors: Peter Edberg <edberg1@applelink.apple.com>
# Judy Kettenhofen
# Contents: Map (external version) from Mac OS Hebrew
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments; add section on
# roundtrip considerations. Matches internal
# xml <c1.4> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Don't require left-right context for digits
# 0x30-0x39. Change mapping of 0x81 to use
# decomposition. Reverse the mappings of 0xA8,
# 0xA9. Update URLs, notes. Matches internal
# utom<b7>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n03 1998-Feb-05 Show required Unicode character
# directionality in a different way. Update
# mappings for 0xC0 and 0xDE to use
# transcoding hints; matches internal utom<n6>,
# ufrm<n20>, and Text Encoding Converter
# version 1.3. Rewrite header comments.
# n01 1995-Nov-15 First version. Matches internal ufrm<n8>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -15,83 +40,86 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Hebrew code (in hex as 0xNN)
# Column #2 is the Unicode or Unicode sequence (in hex as 0xNNNN
# or 0xNNNN+0xNNNN+0xNNNN).
# Column #3 is the Unicode name (follows a comment sign, '#')
# Note: The abbreviations LRO, RLO, and PDF are used for
# LEFT-TO-RIGHT OVERRIDE, RIGHT-TO-LEFT OVERRIDE, and
# POP DIRECTIONAL FORMATTING, respectively.
# Column #1 is the Mac OS Hebrew code (in hex as 0xNN).
# Column #2 is the corresponding Unicode or Unicode sequence (in
# hex as 0xNNNN, 0xNNNN+0xNNNN, etc.). Sequences of up to 3
# Unicode characters are used here. A single Unicode character
# may be preceded by a tag indicating required directionality
# (i.e. <LR>+0xNNNN or <RL>+0xNNNN).
# Column #3 is a comment containing the Unicode name.
#
# The entries are in MacOS Hebrew code order.
# The entries are in Mac OS Hebrew code order.
#
# Note that in many cases, a single MacOS Hebrew character maps
# to a sequence of Unicode characters: LRO or RLO plus some Unicode
# character + PDF. This is indicated by joining the Unicode
# characters with '+'. This happens when the direction class of
# the MacOS Hebrew character is different than the direction class
# of the Unicode character (usually the MacOS Hebrew character has
# a strong direction class and the corresponding Unicode character
# is neutral or has a wek direction class).
# Some of these mappings require the use of corporate characters.
# See the file "CORPCHAR.TXT" and notes below.
#
# Notes on MacOS Hebrew:
# ----------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Hebrew character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Hebrew:
# -----------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# 1. General
#
# The MacOS Hebrew character set incorporates the Hebrew letter
# repertoire of ISO 8859-8, and uses the same code points, 0xE0-0xFA.
# The Mac OS Hebrew character set supports the Hebrew and Yiddish
# languages. It incorporates the Hebrew letter repertoire of
# ISO 8859-8, and uses the same code points for them, 0xE0-0xFA.
# It also incorporates the ASCII character set. In addition, the
# MacOS Hebrew character set includes the following:
# Mac OS Hebrew character set includes the following:
#
# - Hebrew points (nikud marks) at xC6, xCB-xCF and xD8-xDF. These are
# non-spacing combining marks. Note that the RAFE point at 0xD8 is
# not displayed correctly in some fonts, and cannot be typed using
# the keyboard layouts in the current Hebrew localized systems. Also
# note: The character given in Unicode as QAMATS (U+05B8) actually
# refers to two different sounds, depending on context. For example,
# when ALEF is followed by QAMATS, the QAMATS can actually refer to
# two different sounds depending on the following letters. The MacOS
# Hebrew character set separately encodes these two sounds for the
# same graphic shape, as "qamats" (0xCB) and "qamats qatan" (0xDE).
# The "qamats" character is more common, so it is mapped to the
# Unicode QAMATS; "qamats qatan" can only be used with a limited
# number of characters, and it is mapped (for roundtrip fidelity) to
# a character in the corporate use zone.
# - Hebrew points (nikud marks) at 0xC6, 0xCB-0xCF and 0xD8-0xDF.
# These are non-spacing combining marks. Note that the RAFE point
# at 0xD8 is not displayed correctly in some fonts, and cannot be
# typed using the keyboard layouts in the current Hebrew localized
# systems. Also note: The character given in Unicode as QAMATS
# (U+05B8) actually refers to two different sounds, depending on
# context. For example, when ALEF is followed by QAMATS, the QAMATS
# can actually refer to two different sounds depending on the
# following letters. The Mac OS Hebrew character set separately
# encodes these two sounds for the same graphic shape, as "qamats"
# (0xCB) and "qamats qatan" (0xDE). The "qamats" character is more
# common, so it is mapped to the Unicode QAMATS; "qamats qatan" can
# only be used with a limited number of characters, and it is
# mapped using a corporate-zone variant tag (see below).
#
# - Various Hebrew ligatures at 0x81, 0xC0, 0xC7, 0xC8, 0xD6, and
# 0xD7. One of these must be mapped to a corporate zone character
# for roundtrip fidelity. Also note that the Yiddish YOD YOD PATAH
# ligature at 0x81 is missing in some fonts.
# 0xD7. Also note that the Yiddish YOD YOD PATAH ligature at 0x81
# is missing in some fonts.
#
# - The NEW SHEQEL SIGN at 0xA6.
#
# - Latin characters with diacritics at 0x80 and 0x82-0x9F. However,
# - Latin characters with diacritics at 0x80 and 0x82-0x9F. However,
# most of these cannot be typed using the keyboard layouts in the
# Hebrew localized systems.
#
# - Right-left versions of certain ASCII punctuation, symbols and
# digits: 0xA0-0xA5, 0xA7-0xBF, 0xFB-0xFF. See section 2 (below)
# on directional characters.
# digits: 0xA0-0xA5, 0xA7-0xBF, 0xFB-0xFF. See below.
#
# - Miscellaneous additional punctuation at 0xC1, 0xC9, 0xCA, and
# 0xD0-0xD5. There is a variant of the Hebrew encoding in which
@@ -103,97 +131,282 @@
# be confused with cantillation marks!). These were used for
# manual positioning of nikud marks before System 7.1 (at which
# point nikud positioning became automatic with WorldScript.).
# However, they may have been incorporated in stored text, so they
# must be mapped to corporate zone characters for roundtrip
# fidelity.
#
# The MacOS Hebrew character set supports the Hebrew and Yiddish
# languages.
#
# 2. Directional characters and roundtrip fidelity
#
# The MacOS Hebrew character set was developed around 1987. At that
# time the bidirectional line line layout algorithm used in the MacOS
# The Mac OS Hebrew character set was developed around 1987. At that
# time the bidirectional line line layout algorithm used in the Mac OS
# Hebrew system was fairly simple; it used only a few direction
# classes (instead of the 12 or so now used in the Unicode
# bidirectional algorithm). In order to permit users to handle some
# tricky layout problems, certain punctuation, symbol, and digit
# characters have duplicate code points, one with a left-right
# direction attribute and the other with a right-left direction attribute.
# classes (instead of the 19 now used in the Unicode bidirectional
# algorithm). In order to permit users to handle some tricky layou
# problems, certain punctuation, symbol, and digit characters have
# duplicate code points, one with a left-right direction attribute and
# the other with a right-left direction attribute.
#
# For example, percent is encoded at 0x25 with a left-right
# attribute, and at 0xA5 with a right-left attribute. However, there
# is only one percent character in Unicode. We need to have a way
# to map both MacOS Hebrew percent characters to Unicode and back
# again without loss of information. Mapping one of the MacOS Hebrew
# percent characters to a code in the Unicode corporate use zone is
# undesirable, since both of the percent characters are likely to
# For example, plus sign is encoded at 0x2B with a left-right
# attribute, and at 0xAB with a right-left attribute. However, there
# is only one PLUS SIGN character in Unicode. This leads to some
# interesting problems when mapping between Mac OS Hebrew and Unicode;
# see below.
#
# A related problem is that even when a particular character is
# encoded only once in Mac OS Hebrew, it may have a different
# direction attribute than the corresponding Unicode character.
#
# For example, the Mac OS Hebrew character at 0xC9 is HORIZONTAL
# ELLIPSIS with strong right-left direction. However, the Unicode
# character HORIZONTAL ELLIPSIS has direction class neutral.
#
# 3. Font variants
#
# The table in this file gives the Unicode mappings for the standard
# Mac OS Hebrew encoding. This encoding is supported by many of the
# Apple fonts (including all of the fonts in the Hebrew Language Kit),
# and is the encoding supported by the text processing utilities.
# However, some TrueType fonts provided with the localized Hebrew
# system implement a slightly different encoding; the difference is
# only in one code point, 0xD4. For the standard variant, this is:
# 0xD4 -> <RL>+0x2018 LEFT SINGLE QUOTATION MARK, right-left
#
# The TrueType variant is used by the following TrueType fonts from
# the localized system: Caesarea, Carmel Book, Gilboa, Ramat Sharon,
# and Sinai Book. For these, 0xD4 is as follows:
# 0xD4 -> <RL>+0x2007 FIGURE SPACE, right-left
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# 1. Matching the direction of Mac OS Hebrew characters
#
# When Mac OS Hebrew encodes a character twice but with different
# direction attributes for the two code points - as in the case of
# plus sign mentioned above - we need a way to map both Mac OS Hebrew
# code points to Unicode and back again without loss of information.
# With the plus sign, for example, mapping one of the Mac OS Hebrew
# characters to a code in the Unicode corporate use zone is
# undesirable, since both of the plus sign characters are likely to
# be used in text that is interchanged.
#
# The problem is solved with the use of direction override characters
# and direction-dependent mappings. When mapping from MacOS Hebrew to
# Unicode, such problem characters are surrounded with an appropriate
# direction override:
# MacOS Hebrew 0x25 percent (left) ->
# Unicode 0x202D (LRO) + 0x0025 (PERCENT SIGN) + 0x202C (PDF)
# MacOS Hebrew 0xA5 percent (right) ->
# Unicode 0x202E (RLO) + 0x0025 (PERCENT SIGN) + 0x202C (PDF)
# When mapping from Unicode to MacOS Hebrew, the MacOS Unicode
# converter uses the Unicode bidirectional algorithm to determine
# resolved directions. The mapping from Unicode to MacOS Hebrew can
# then be disambiguated by the use of the resolved direction:
# Unicode 0x0025 -> MacOS Hebrew 0x25 (if L) or 0xA5 (if R)
# and direction-dependent mappings. When mapping from Mac OS Hebrew
# to Unicode, we use direction overrides as necessary to force the
# direction of the resulting Unicode characters.
#
# However, note that this means we also need to discard the direction
# override characters when mapping from Unicode to MacOS Hebrew.
# The required direction is indicated by a direction tag in the
# mappings. A tag of <LR> means the corresponding Unicode character
# must have a strong left-right context, and a tag of <RL> indicates
# a right-left context.
#
# For example, the mapping of 0x2B is given as <LR>+0x002B; the
# mapping of 0xAB is given as <RL>+0x002B. If we map an isolated
# instance of 0x2B to Unicode, it should be mapped as follows (LRO
# indicates LEFT-RIGHT OVERRIDE, PDF indicates POP DIRECTION
# FORMATTING):
#
# 0x2B -> 0x202D (LRO) + 0x002B (PLUS SIGN) + 0x202C (PDF)
#
# When mapping several characters in a row that require direction
# forcing, the overrides need only be used at the beginning and end.
# For example:
#
# 0x24 0x20 0x28 0x29 -> 0x202D 0x0024 0x0020 0x0028 0x0029 0x202C
#
# If neutral characters that require direction forcing are already
# between strong-direction characters with matching directionality,
# then direction overrides need not be used. Direction overrides are
# always needed to map the right-left digits at 0xB0-0xB9.
#
# When mapping from Unicode to Mac OS Hebrew, the Unicode
# bidirectional algorithm should be used to determine resolved
# direction of the Unicode characters. The mapping from Unicode to
# Mac OS Hebrew can then be disambiguated by the use of the resolved
# direction:
#
# Unicode 0x002B -> Mac OS Hebrew 0x2B (if L) or 0xAB (if R)
#
# However, this also means the direction override characters should
# be discarded when mapping from Unicode to Mac OS Hebrew (after
# they have been used to determine resolved direction), since the
# direction override information is carried by the code point itself.
#
# Even when direction overrides are not needed for roundtrip
# fidelity, they are sometimes used when mapping MacOS Hebrew
# fidelity, they are sometimes used when mapping Mac OS Hebrew
# characters to Unicode in order to achieve similar text layout with
# the resulting Unicode text. For example, the single MacOS Hebrew
# the resulting Unicode text. For example, the single Mac OS Hebrew
# ellipsis character has direction class right-left,and there is no
# left-right version. However, the Unicode HORIZONTAL ELLIPSIS
# character has direction class neutral (which means it may end up
# with a resolved direction of left-right if surrounded by left-right
# characters). When mapping the MacOS Hebrew ellipsis to Unicode, it
# characters). When mapping the Mac OS Hebrew ellipsis to Unicode, it
# is surrounded with a direction override to help preserve proper
# text layout. The resolved direction is not needed or used when
# mapping the Unicode HORIZONTAL ELLIPSIS back to MacOS Hebrew.
# mapping the Unicode HORIZONTAL ELLIPSIS back to Mac OS Hebrew.
#
# 2. Use of corporate-zone Unicodes
#
# The goals in the mappings provided here are:
# - Ensure roundtrip mapping from every character in the Mac OS
# Hebrew character set to Unicode and back
# - Use standard Unicode characters as much as possible, to
# maximize interchangeability of the resulting Unicode text.
# Whenever possible, avoid having content carried by private-use
# characters.
#
# Some of the characters in the Mac OS Hebrew character set do not
# correspond to distinct, single Unicode characters. To map these
# and satisfy both goals above, we employ various strategies.
#
# a) If possible, use private use characters in combination with
# standard Unicode characters to mark variants of the standard
# Unicode character.
#
# Apple has defined a block of 32 corporate characters as "transcoding
# hints." These are used in combination with standard Unicode characters
# to force them to be treated in a special way for mapping to other
# encodings; they have no other effect. Sixteen of these transcoding
# hints are "grouping hints" - they indicate that the next 2-4 Unicode
# characters should be treated as a single entity for transcoding. The
# other sixteen transcoding hints are "variant tags" - they are like
# combining characters, and can follow a standard Unicode (or a sequence
# consisting of a base character and other combining characters) to
# cause it to be treated in a special way for transcoding. These always
# terminate a combining-character sequence.
#
# Two transcoding hints are used in this mapping table: a grouping hint
# and a variant tag:
# hint:
# 0xF86A group next 2 characters, right-left directionality
# 0xF87F variant tag
#
# In Mac OS Hebrew, 0xC0 is a ligature for lamed holam. This can also
# be represented in Mac OS Hebrew as 0xEC+0xDD, using separate
# characters for lamed and holam. The latter sequence is mapped to
# Unicode as 0x05DC+0x05B9, i.e. as the sequence HEBREW LETTER LAMED +
# HEBREW POINT HOLAM. We want to map the ligature 0xC0 using the same
# standard Unicode characters, but for round-trip fidelity we need to
# distinguish it from the mapping of the sequence 0xEC+0xDD. Thus for
# 0xC0 we use a grouping hint, and map as follows:
#
# 0xC0 -> 0xF86A+0x05DC+0x05B9
#
# The variant tag is used for "qamats qatan" to mark it as an alternate
# for HEBREW POINT QAMATS, as follows:
#
# 0xDE -> 0x05B8+0xF87F
#
# b) Otherwise, use private use characters by themselves to map Mac OS
# Hebrew characters which have no relationship to any standard Unicode
# character.
#
# The following additional corporate zone Unicode characters are used
# for this purpose here (to map the obsolete "canorals", see above):
#
# 0xF89B Hebrew canoral 1
# 0xF89C Hebrew canoral 2
# 0xF89D Hebrew canoral 3
# 0xF89E Hebrew canoral 4
#
# 3. Roundtrip considerations when mapping to decomposed Unicode
#
# Both Mac OS Hebrew and Unicode provide multiple ways of representing
# certain letter-and-point combinations. For example, HEBREW LETTER
# VAV WITH HOLAM can be represented in Unicode as the single character
# 0xFB4B or as the sequence 0x05D5 0x05B9; similarly, it can be
# represented in Mac OS Hebrew as 0xC7 or as the sequence 0xE5 0xDD.
# This leads to some roundtrip problems. First note that we have the
# following mappings without such problems:
#
# Mac standard decomp. of reverse map
# OS Unicode mapping std. mapping of decomp.
# ---- ---------------------------------- ------------- -----------
# 0xC6 0x05BC ... POINT DAGESH OR MAPIQ 0x05BC (same) 0xC6
# 0xE5 0x05D5 ... LETTER VAV 0x05D5 (same) 0xE5
# 0xDD 0x05B9 ... POINT HOLAM 0x05B9 (same) 0xDD
#
# However, those mappings above cause roundtrip problems for the
# the following mappings if they are decomposed:
#
# Mac standard decomp. of reverse map
# OS Unicode mapping std. mapping of decomp.
# ---- ---------------------------------- ------------- -----------
# 0xC7 0xFB4B ... LETTER VAV WITH HOLAM 0x05D5 0x05B9 0xE5 0xDD
# 0xC8 0xFB35 ... LETTER VAV WITH DAGESH 0x05D5 0x05BC 0xE5 0xC6
#
# One solution is to use a grouping transcoding hint with the two
# decompositions above to mark the decomposed sequence for special
# treatment in transcoding. This yields the following mappings to
# decomposed Unicode:
#
# Mac decomposed
# OS Unicode mapping
# ---- --------------------
# 0xC7 0xF86A 0x05D5 0x05B9
# 0xC8 0xF86A 0x05D5 0x05BC
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - Stop specifying left-right context for digits 0x30-0x39, since the
# corresponding Unicodes 0x0030-0x0039 already have left-right
# directionality.
#
# - Change mapping of 0x81 from 0xFB1F HEBREW LIGATURE YIDDISH YOD YOD
# PATAH to its canonical decomposition 0x05F2+0x05B7 to improve
# cross-platform compatibility (Windows doesn't handle 0xFB1F)
#
# - Interchange the mappings of 0xA8 and 0xA9 to obtain the correct
# open/close behavior; they work differently than in Mac Arabic.
# The old mapping was
# 0xA8 <RL>+0x0028 # LEFT PARENTHESIS, right-left
# 0xA9 <RL>+0x0029 # RIGHT PARENTHESIS, right-left
# and the new mapping is
# 0xA8 <RL>+0x0029 # RIGHT PARENTHESIS, right-left
# 0xA9 <RL>+0x0028 # LEFT PARENTHESIS, right-left
#
# Changes from version n01 to version n03:
#
# - Change mapping for 0xC0 from single corporate character to
# grouping hint plus standard Unicodes
#
# - Change mapping for 0xDE from single corporate character to
# standard Unicode plus variant tag
#
##################
0x20 0x202D+0x0020+0x202C # LRO + SPACE + PDF
0x21 0x202D+0x0021+0x202C # LRO + EXCLAMATION MARK + PDF
0x22 0x202D+0x0022+0x202C # LRO + QUOTATION MARK + PDF
0x23 0x202D+0x0023+0x202C # LRO + NUMBER SIGN + PDF
0x24 0x202D+0x0024+0x202C # LRO + DOLLAR SIGN + PDF
0x25 0x202D+0x0025+0x202C # LRO + PERCENT SIGN + PDF
0x20 <LR>+0x0020 # SPACE, left-right
0x21 <LR>+0x0021 # EXCLAMATION MARK, left-right
0x22 <LR>+0x0022 # QUOTATION MARK, left-right
0x23 <LR>+0x0023 # NUMBER SIGN, left-right
0x24 <LR>+0x0024 # DOLLAR SIGN, left-right
0x25 <LR>+0x0025 # PERCENT SIGN, left-right
0x26 0x0026 # AMPERSAND
0x27 0x202D+0x0027+0x202C # LRO + APOSTROPHE + PDF
0x28 0x202D+0x0028+0x202C # LRO + LEFT PARENTHESIS + PDF
0x29 0x202D+0x0029+0x202C # LRO + RIGHT PARENTHESIS + PDF
0x2A 0x202D+0x002A+0x202C # LRO + ASTERISK + PDF
0x2B 0x202D+0x002B+0x202C # LRO + PLUS SIGN + PDF
0x2C 0x202D+0x002C+0x202C # LRO + COMMA + PDF
0x2D 0x202D+0x002D+0x202C # LRO + HYPHEN-MINUS + PDF
0x2E 0x202D+0x002E+0x202C # LRO + FULL STOP + PDF
0x2F 0x202D+0x002F+0x202C # LRO + SOLIDUS + PDF
0x30 0x202D+0x0030+0x202C # LRO + DIGIT ZERO + PDF
0x31 0x202D+0x0031+0x202C # LRO + DIGIT ONE + PDF
0x32 0x202D+0x0032+0x202C # LRO + DIGIT TWO + PDF
0x33 0x202D+0x0033+0x202C # LRO + DIGIT THREE + PDF
0x34 0x202D+0x0034+0x202C # LRO + DIGIT FOUR + PDF
0x35 0x202D+0x0035+0x202C # LRO + DIGIT FIVE + PDF
0x36 0x202D+0x0036+0x202C # LRO + DIGIT SIX + PDF
0x37 0x202D+0x0037+0x202C # LRO + DIGIT SEVEN + PDF
0x38 0x202D+0x0038+0x202C # LRO + DIGIT EIGHT + PDF
0x39 0x202D+0x0039+0x202C # LRO + DIGIT NINE + PDF
0x3A 0x202D+0x003A+0x202C # LRO + COLON + PDF
0x3B 0x202D+0x003B+0x202C # LRO + SEMICOLON + PDF
0x3C 0x202D+0x003C+0x202C # LRO + LESS-THAN SIGN + PDF
0x3D 0x202D+0x003D+0x202C # LRO + EQUALS SIGN + PDF
0x3E 0x202D+0x003E+0x202C # LRO + GREATER-THAN SIGN + PDF
0x3F 0x202D+0x003F+0x202C # LRO + QUESTION MARK + PDF
0x27 <LR>+0x0027 # APOSTROPHE, left-right
0x28 <LR>+0x0028 # LEFT PARENTHESIS, left-right
0x29 <LR>+0x0029 # RIGHT PARENTHESIS, left-right
0x2A <LR>+0x002A # ASTERISK, left-right
0x2B <LR>+0x002B # PLUS SIGN, left-right
0x2C <LR>+0x002C # COMMA, left-right
0x2D <LR>+0x002D # HYPHEN-MINUS, left-right
0x2E <LR>+0x002E # FULL STOP, left-right
0x2F <LR>+0x002F # SOLIDUS, left-right
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A <LR>+0x003A # COLON, left-right
0x3B <LR>+0x003B # SEMICOLON, left-right
0x3C <LR>+0x003C # LESS-THAN SIGN, left-right
0x3D <LR>+0x003D # EQUALS SIGN, left-right
0x3E <LR>+0x003E # GREATER-THAN SIGN, left-right
0x3F <LR>+0x003F # QUESTION MARK, left-right
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
@@ -221,9 +434,9 @@
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x202D+0x005B+0x202C # LRO + LEFT SQUARE BRACKET + PDF
0x5B <LR>+0x005B # LEFT SQUARE BRACKET, left-right
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x202D+0x005D+0x202C # LRO + RIGHT SQUARE BRACKET + PDF
0x5D <LR>+0x005D # RIGHT SQUARE BRACKET, left-right
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
@@ -253,13 +466,13 @@
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x202D+0x007B+0x202C # LRO + LEFT CURLY BRACKET + PDF
0x7C 0x202D+0x007C+0x202C # LRO + VERTICAL LINE + PDF
0x7D 0x202D+0x007D+0x202C # LRO + RIGHT CURLY BRACKET + PDF
0x7B <LR>+0x007B # LEFT CURLY BRACKET, left-right
0x7C <LR>+0x007C # VERTICAL LINE, left-right
0x7D <LR>+0x007D # RIGHT CURLY BRACKET, left-right
0x7E 0x007E # TILDE
#
0x80 0x00C4 # LATIN CAPITAL LETTER A WITH DIAERESIS
0x81 0xFB1F # HEBREW LIGATURE YIDDISH YOD YOD PATAH
0x81 0x05F2+0x05B7 # HEBREW LIGATURE YIDDISH YOD YOD PATAH
0x82 0x00C7 # LATIN CAPITAL LETTER C WITH CEDILLA
0x83 0x00C9 # LATIN CAPITAL LETTER E WITH ACUTE
0x84 0x00D1 # LATIN CAPITAL LETTER N WITH TILDE
@@ -290,40 +503,40 @@
0x9D 0x00F9 # LATIN SMALL LETTER U WITH GRAVE
0x9E 0x00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
0x9F 0x00FC # LATIN SMALL LETTER U WITH DIAERESIS
0xA0 0x202E+0x0020+0x202C # RLO + SPACE + PDF
0xA1 0x202E+0x0021+0x202C # RLO + EXCLAMATION MARK + PDF
0xA2 0x202E+0x0022+0x202C # RLO + QUOTATION MARK + PDF
0xA3 0x202E+0x0023+0x202C # RLO + NUMBER SIGN + PDF
0xA4 0x202E+0x0024+0x202C # RLO + DOLLAR SIGN + PDF
0xA5 0x202E+0x0025+0x202C # RLO + PERCENT SIGN + PDF
0xA0 <RL>+0x0020 # SPACE, right-left
0xA1 <RL>+0x0021 # EXCLAMATION MARK, right-left
0xA2 <RL>+0x0022 # QUOTATION MARK, right-left
0xA3 <RL>+0x0023 # NUMBER SIGN, right-left
0xA4 <RL>+0x0024 # DOLLAR SIGN, right-left
0xA5 <RL>+0x0025 # PERCENT SIGN, right-left
0xA6 0x20AA # NEW SHEQEL SIGN
0xA7 0x202E+0x0027+0x202C # RLO + APOSTROPHE + PDF
0xA8 0x202E+0x0028+0x202C # RLO + LEFT PARENTHESIS + PDF
0xA9 0x202E+0x0029+0x202C # RLO + RIGHT PARENTHESIS + PDF
0xAA 0x202E+0x002A+0x202C # RLO + ASTERISK + PDF
0xAB 0x202E+0x002B+0x202C # RLO + PLUS SIGN + PDF
0xAC 0x202E+0x002C+0x202C # RLO + COMMA + PDF
0xAD 0x202E+0x002D+0x202C # RLO + HYPHEN-MINUS + PDF
0xAE 0x202E+0x002E+0x202C # RLO + FULL STOP + PDF
0xAF 0x202E+0x002F+0x202C # RLO + SOLIDUS + PDF
0xB0 0x202E+0x0030+0x202C # RLO + DIGIT ZERO + PDF
0xB1 0x202E+0x0031+0x202C # RLO + DIGIT ONE + PDF
0xB2 0x202E+0x0032+0x202C # RLO + DIGIT TWO + PDF
0xB3 0x202E+0x0033+0x202C # RLO + DIGIT THREE + PDF
0xB4 0x202E+0x0034+0x202C # RLO + DIGIT FOUR + PDF
0xB5 0x202E+0x0035+0x202C # RLO + DIGIT FIVE + PDF
0xB6 0x202E+0x0036+0x202C # RLO + DIGIT SIX + PDF
0xB7 0x202E+0x0037+0x202C # RLO + DIGIT SEVEN + PDF
0xB8 0x202E+0x0038+0x202C # RLO + DIGIT EIGHT + PDF
0xB9 0x202E+0x0039+0x202C # RLO + DIGIT NINE + PDF
0xBA 0x202E+0x003A+0x202C # RLO + COLON + PDF
0xBB 0x202E+0x003B+0x202C # RLO + SEMICOLON + PDF
0xBC 0x202E+0x003C+0x202C # RLO + LESS-THAN SIGN + PDF
0xBD 0x202E+0x003D+0x202C # RLO + EQUALS SIGN + PDF
0xBE 0x202E+0x003E+0x202C # RLO + GREATER-THAN SIGN + PDF
0xBF 0x202E+0x003F+0x202C # RLO + QUESTION MARK + PDF
0xC0 0xF89A # Hebrew ligature lamed holam
0xC1 0x202E+0x201E+0x202C # RLO + DOUBLE LOW-9 QUOTATION MARK + PDF
0xA7 <RL>+0x0027 # APOSTROPHE, right-left
0xA8 <RL>+0x0029 # RIGHT PARENTHESIS, right-left # close parenthesis
0xA9 <RL>+0x0028 # LEFT PARENTHESIS, right-left # open parenthesis
0xAA <RL>+0x002A # ASTERISK, right-left
0xAB <RL>+0x002B # PLUS SIGN, right-left
0xAC <RL>+0x002C # COMMA, right-left
0xAD <RL>+0x002D # HYPHEN-MINUS, right-left
0xAE <RL>+0x002E # FULL STOP, right-left
0xAF <RL>+0x002F # SOLIDUS, right-left
0xB0 <RL>+0x0030 # DIGIT ZERO, right-left (need override)
0xB1 <RL>+0x0031 # DIGIT ONE, right-left (need override)
0xB2 <RL>+0x0032 # DIGIT TWO, right-left (need override)
0xB3 <RL>+0x0033 # DIGIT THREE, right-left (need override)
0xB4 <RL>+0x0034 # DIGIT FOUR, right-left (need override)
0xB5 <RL>+0x0035 # DIGIT FIVE, right-left (need override)
0xB6 <RL>+0x0036 # DIGIT SIX, right-left (need override)
0xB7 <RL>+0x0037 # DIGIT SEVEN, right-left (need override)
0xB8 <RL>+0x0038 # DIGIT EIGHT, right-left (need override)
0xB9 <RL>+0x0039 # DIGIT NINE, right-left (need override)
0xBA <RL>+0x003A # COLON, right-left
0xBB <RL>+0x003B # SEMICOLON, right-left
0xBC <RL>+0x003C # LESS-THAN SIGN, right-left
0xBD <RL>+0x003D # EQUALS SIGN, right-left
0xBE <RL>+0x003E # GREATER-THAN SIGN, right-left
0xBF <RL>+0x003F # QUESTION MARK, right-left
0xC0 0xF86A+0x05DC+0x05B9 # Hebrew ligature lamed holam
0xC1 <RL>+0x201E # DOUBLE LOW-9 QUOTATION MARK, right-left
0xC2 0xF89B # Hebrew canoral 1
0xC3 0xF89C # Hebrew canoral 2
0xC4 0xF89D # Hebrew canoral 3
@@ -331,19 +544,19 @@
0xC6 0x05BC # HEBREW POINT DAGESH OR MAPIQ
0xC7 0xFB4B # HEBREW LETTER VAV WITH HOLAM
0xC8 0xFB35 # HEBREW LETTER VAV WITH DAGESH
0xC9 0x202E+0x2026+0x202C # RLO + HORIZONTAL ELLIPSIS
0xCA 0x202E+0x00A0+0x202C # RLO + NO-BREAK SPACE
0xC9 <RL>+0x2026 # HORIZONTAL ELLIPSIS, right-left
0xCA <RL>+0x00A0 # NO-BREAK SPACE, right-left
0xCB 0x05B8 # HEBREW POINT QAMATS
0xCC 0x05B7 # HEBREW POINT PATAH
0xCD 0x05B5 # HEBREW POINT TSERE
0xCE 0x05B6 # HEBREW POINT SEGOL
0xCF 0x05B4 # HEBREW POINT HIRIQ
0xD0 0x202E+0x2013+0x202C # RLO + EN DASH + PDF
0xD1 0x202E+0x2014+0x202C # RLO + EM DASH + PDF
0xD2 0x202E+0x201C+0x202C # RLO + LEFT DOUBLE QUOTATION MARK + PDF
0xD3 0x202E+0x201D+0x202C # RLO + RIGHT DOUBLE QUOTATION MARK + PDF
0xD4 0x202E+0x2018+0x202C # RLO + LEFT SINGLE QUOTATION MARK + PDF
0xD5 0x202E+0x2019+0x202C # RLO + RIGHT SINGLE QUOTATION MARK + PDF
0xD0 <RL>+0x2013 # EN DASH, right-left
0xD1 <RL>+0x2014 # EM DASH, right-left
0xD2 <RL>+0x201C # LEFT DOUBLE QUOTATION MARK, right-left
0xD3 <RL>+0x201D # RIGHT DOUBLE QUOTATION MARK, right-left
0xD4 <RL>+0x2018 # LEFT SINGLE QUOTATION MARK, right-left
0xD5 <RL>+0x2019 # RIGHT SINGLE QUOTATION MARK, right-left
0xD6 0xFB2A # HEBREW LETTER SHIN WITH SHIN DOT
0xD7 0xFB2B # HEBREW LETTER SHIN WITH SIN DOT
0xD8 0x05BF # HEBREW POINT RAFE
@@ -352,7 +565,7 @@
0xDB 0x05B1 # HEBREW POINT HATAF SEGOL
0xDC 0x05BB # HEBREW POINT QUBUTS
0xDD 0x05B9 # HEBREW POINT HOLAM
0xDE 0xF89F # Hebrew point qamats qatan
0xDE 0x05B8+0xF87F # HEBREW POINT QAMATS, alternate form "qamats qatan"
0xDF 0x05B3 # HEBREW POINT HATAF QAMATS
0xE0 0x05D0 # HEBREW LETTER ALEF
0xE1 0x05D1 # HEBREW LETTER BET
@@ -381,8 +594,8 @@
0xF8 0x05E8 # HEBREW LETTER RESH
0xF9 0x05E9 # HEBREW LETTER SHIN
0xFA 0x05EA # HEBREW LETTER TAV
0xFB 0x202E+0x007D+0x202C # RLO + RIGHT CURLY BRACKET + PDF
0xFC 0x202E+0x005D+0x202C # RLO + RIGHT SQUARE BRACKET + PDF
0xFD 0x202E+0x007B+0x202C # RLO + LEFT CURLY BRACKET + PDF
0xFE 0x202E+0x005B+0x202C # RLO + LEFT SQUARE BRACKET + PDF
0xFF 0x202E+0x007C+0x202C # RLO + VERTICAL LINE + PDF
0xFB <RL>+0x007D # RIGHT CURLY BRACKET, right-left
0xFC <RL>+0x005D # RIGHT SQUARE BRACKET, right-left
0xFD <RL>+0x007B # LEFT CURLY BRACKET, right-left
0xFE <RL>+0x005B # LEFT SQUARE BRACKET, right-left
0xFF <RL>+0x007C # VERTICAL LINE, right-left

View File

@@ -1,11 +1,35 @@
#=======================================================================
# File name: ICELAND.TXT
#
# Name: MacOS_Icelandic [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <5>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Icelandic
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs, notes. Matches internal
# utom<b3>.
# b02 1999-Sep-22 Encoding changed for Mac OS 8.5; change
# mapping of 0xDB from CURRENCY SIGN to EURO
# SIGN. Update contact e-mail address. Matches
# internal utom<b2>, ufrm<b2>, and Text
# Encoding Converter version 1.5.
# n06 1998-Feb-05 Minor update to header comments, add
# information on font variants
# n03 1997-Dec-14 Update to match internal utom<n4>, ufrm<n16>:
# Change standard mapping for 0xBD from U+2126
# to its canonical decomposition, U+03A9.
# n02 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n5>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,48 +38,108 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Icelandic code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Icelandic code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Icelandic code order.
# The entries are in Mac OS Icelandic code order.
#
# One of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Icelandic character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Icelandic:
# --------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# 1. General
#
# Mac OS Icelandic is used for Icelandic and Faroese.
#
# The Mac OS Icelandic encoding shares the script code smRoman
# (0) with the standard Mac OS Roman encoding. To determine if
# the Icelandic encoding is being used, you must also check if
# the system region code is 21, verIceland.
#
# This character set is a variant of standard Mac OS Roman,
# adding upper and lower eth, thorn, and Y acute. It has 6 code
# point differences from standard Mac OS Roman.
#
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts are updated for Mac OS 8.5 to reflect this. There are
# "currency sign" variants of the Mac OS Icelandic encoding that
# still map 0xDB to U+00A4; these can be used for older fonts.
#
# 2. Font variants
#
# The table in this file gives the Unicode mappings for the standard
# Mac OS Icelandic encoding. This encoding is supported by the
# Icelandic versions of the fonts Chicago, Geneva, Monaco, and New
# York, and is the encoding supported by the text processing
# utilities. However, other TrueType fonts implement a slightly
# different encoding; the difference is only in two code points.
# For the standard variant, these are:
# 0xBB -> 0x00AA FEMININE ORDINAL INDICATOR
# 0xBC -> 0x00BA MASCULINE ORDINAL INDICATOR
#
# For the TrueType variant (used by the Icelandic versions of the
# fonts Courier, Helvetica, Palatino, and Times), these are:
# 0xBB -> 0xFB01 LATIN SMALL LIGATURE FI
# 0xBC -> 0xFB02 LATIN SMALL LIGATURE FL
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The following corporate zone Unicode character is used in this
# mapping:
#
# 0xF8FF Apple logo
#
# One of these mappings requires the use of a corporate character
# (for the Apple logo character). See the file "MacOS-CorpCharList".
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Notes on MacOS Icelandic:
# -------------------------
# Details of mapping changes in each version:
# -------------------------------------------
#
# The MacOS Icelandic encoding shares the script code smRoman
# (0) with the standard MacOS Roman encoding. To determine if
# the Icelandic encoding is being used, you must also check if
# the system region code is 21, verIceland.
# Changes from version n06 to version b02:
#
# This character set is a variant of standard MacOS Roman,
# adding upper and lower eth, thorn, and Y acute. It has 6 code
# point differences from standard MacOS Roman.
# - Encoding changed for Mac OS 8.5; change mapping of 0xDB from
# CURRENCY SIGN (U+00A4) to EURO SIGN (U+20AC).
#
# Changes from version n02 to version n03:
#
# - Change mapping of 0xBD from U+2126 to its canonical
# decomposition, U+03A9.
#
##################
@@ -201,7 +285,7 @@
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00C6 # LATIN CAPITAL LIGATURE AE
0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
@@ -216,8 +300,8 @@
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x2126 # OHM SIGN
0xBE 0x00E6 # LATIN SMALL LIGATURE AE
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK
@@ -246,7 +330,7 @@
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x2044 # FRACTION SLASH
0xDB 0x00A4 # CURRENCY SIGN
0xDB 0x20AC # EURO SIGN
0xDC 0x00D0 # LATIN CAPITAL LETTER ETH
0xDD 0x00F0 # LATIN SMALL LETTER ETH
0xDE 0x00DE # LATIN CAPITAL LETTER THORN

File diff suppressed because it is too large Load Diff

View File

@@ -1,11 +1,39 @@
#=======================================================================
# File name: ROMAN.TXT
#
# Name: MacOS_Roman [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <9>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Roman
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1994-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b4,c1 2002-Dec-19 Update URLs, notes. Matches internal
# utom<b5>.
# b03 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b4>, ufrm<b3>, and Text
# Encoding Converter version 1.5.
# b02 1998-Aug-18 Encoding changed for Mac OS 8.5; change
# mapping of 0xDB from CURRENCY SIGN to
# EURO SIGN. Matches internal utom<b3>,
# ufrm<b3>.
# n08 1998-Feb-05 Minor update to header comments
# n06 1997-Dec-14 Add warning about future changes to 0xDB
# from CURRENCY SIGN to EURO SIGN. Clarify
# some header information
# n04 1997-Dec-01 Update to match internal utom<n3>, ufrm<n22>:
# Change standard mapping for 0xBD from U+2126
# to its canonical decomposition, U+03A9.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n9>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,58 +42,74 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Roman code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Roman code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Roman code order.
# The entries are in Mac OS Roman code order.
#
# One of these mappings requires the use of a corporate character
# (for the Apple logo character). See the file "MacOS-CorpCharList".
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
# One of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Notes on MacOS Roman:
# ---------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Roman character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# This character set is used for at least the following MacOS
# Notes on Mac OS Roman:
# ----------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported directly in programming
# interfaces for QuickDraw Text, the Script Manager, and related
# Text Utilities. For other purposes it is supported via transcoding
# to and from Unicode.
#
# This character set is used for at least the following Mac OS
# localizations: U.S., British, Canadian French, French, Swiss
# French, German, Swiss German, Italian, Swiss Italian, Dutch,
# Swedish, Norwegian, Danish, Finnish, Spanish, Catalan,
# Portuguese, Brazilian, and the default International system.
#
# Variants of MacRoman are used for Croatian, Icelandic,
# Turkish, and Romanian. Separate mapping tables are available
# for these encodings.
# Variants of Mac OS Roman are used for Croatian, Icelandic,
# Turkish, Romanian, and other encodings. Separate mapping tables
# are available for these encodings.
#
# At least through System 7.5, the bitmap versions of the fonts
# Chicago, New York, Geneva, and Monaco do not implement the
# full Roman character set; they only support MacOS Roman character
# codes up to 0xD8. The TrueType versions of these fonts do
# implement the full character set, as do both the bitmap and
# TrueType versions of the other standard Roman fonts.
# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
# Apple fonts are updated for Mac OS 8.5 to reflect this. There is
# a "currency sign" variant of the Mac OS Roman encoding that still
# maps 0xDB to U+00A4; this can be used for older fonts.
#
# In all MacOS encodings, fonts such as Chicago which are used
# Before Mac OS 8.5, the ROM bitmap versions of the fonts Chicago,
# New York, Geneva, and Monaco did not implement the full Mac OS
# Roman character set; they only supported character codes up to
# 0xD8. The TrueType versions of these fonts have always implemented
# the full character set, as with the bitmap and TrueType versions
# of the other standard Roman fonts.
#
# In all Mac OS encodings, fonts such as Chicago which are used
# as "system" fonts (for menus, dialogs, etc.) have four glyphs
# at code points 0x11-0x14 for transient use by the Menu Manager.
# These glyphs are not intended as characters for use in normal
@@ -73,6 +117,31 @@
# interpreted as associated with these glyphs; they are usually
# interpreted (if at all) as the control codes DC1-DC4.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The following corporate zone Unicode character is used in this
# mapping:
#
# 0xF8FF Apple logo
#
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version n08 to version b02:
#
# - Encoding changed for Mac OS 8.5; change mapping of 0xDB from
# CURRENCY SIGN (U+00A4) to EURO SIGN (U+20AC).
#
# Changes from version n03 to version n04:
#
# - Change mapping of 0xBD from U+2126 to its canonical
# decomposition, U+03A9.
#
##################
0x20 0x0020 # SPACE
@@ -217,7 +286,7 @@
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00C6 # LATIN CAPITAL LIGATURE AE
0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
@@ -232,8 +301,8 @@
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x2126 # OHM SIGN
0xBE 0x00E6 # LATIN SMALL LIGATURE AE
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK
@@ -262,7 +331,7 @@
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x2044 # FRACTION SLASH
0xDB 0x00A4 # CURRENCY SIGN
0xDB 0x20AC # EURO SIGN
0xDC 0x2039 # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0xDD 0x203A # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0xDE 0xFB01 # LATIN SMALL LIGATURE FI

View File

@@ -1,11 +1,43 @@
#=======================================================================
# File name: SYMBOL.TXT
#
# Name: MacOS_Symbol [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <4>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Symbol
# character set to Unicode 4.0 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1994-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Change mappings for 0xBD, 0xE0. Update
# header comments. Matches internal xml <c1.2>
# and Text Encoding Converter 2.0.
# b4,c1 2002-Dec-19 Update mappings for encoded glyph fragments
# 0xBE, 0xE6-EF, 0xF4, 0xF6-FE to use new
# Unicode 3.2 characters instead of sequences
# involving corporate-use characters. Update
# URLs, notes. Matches internal utom<b4>.
# b03 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b3>, ufrm<b3>, and Text
# Encoding Converter version 1.5.
# b02 1998-Aug-18 Encoding changed for Mac OS 8.5; add new
# mapping from 0xA0 to EURO SIGN. Matches
# internal utom<b3>, ufrm<b3>.
# n05 1998-Feb-05 Update to match internal utom<n5>, ufrm<n15>
# and Text Encoding Converter version 1.3:
# Use standard Unicodes plus transcoding hints
# instead of single corporate characters, also
# change mappings for 0xE1 & 0xF1 from U+2329
# & U+232A to their canonical decompositions;
# see details below. Also update header
# comments to new format.
# n03 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n4>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,47 +46,62 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Symbol code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Symbol code (in hex as 0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN).
# Column #3 is a comment containing the Unicode name.
# In some cases an additional comment follows the Unicode name.
#
# The entries are in MacOS Symbol code order.
# The entries are in Mac OS Symbol code order.
#
# Several of these mappings require the use of corporate
# characters, including the Apple logo character. See the file
# "MacOS-CorpCharList".
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
# Some of these mappings require the use of corporate characters.
# See the file "CORPCHAR.TXT" and notes below.
#
# Notes on MacOS Symbol:
# ----------------------
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Symbol character set uses the standard control characters
# at 0x00-0x1F and 0x7F.
#
# The MacOS Symbol encoding shares the script code smRoman
# (0) with the standard MacOS Roman encoding. To determine if
# the Symbol encoding is being used, you must check if the
# font name is "Symbol".
# Notes on Mac OS Symbol:
# -----------------------
#
# The layout of the MacOS Symbol character set is identical to
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported directly in programming
# interfaces for QuickDraw Text, the Script Manager, and related
# Text Utilities. For other purposes it is supported via transcoding
# to and from Unicode.
#
# The Mac OS Symbol encoding shares the script code smRoman
# (0) with the Mac OS Roman encoding. To determine if the Symbol
# encoding is being used, you must check if the font name is
# "Symbol".
#
# Before Mac OS 8.5, code point 0xA0 was unused. In Mac OS 8.5
# and later versions, code point 0xA0 is EURO SIGN and maps to
# U+20AC (the Symbol font is updated for Mac OS 8.5 to reflect
# this).
#
# The layout of the Mac OS Symbol character set is identical to
# the layout of the Adobe Symbol encoding vector, with the
# addition of the Apple logo character at 0xF0.
#
@@ -63,14 +110,105 @@
# 0xBE are used to extend vertical and horizontal arrows, etc. In
# addition, there are top, bottom, and center sections for
# parentheses, brackets, integral signs, and other signs that may
# extend vertically for 2 or more lines of normal text. In general,
# Unicode does not encode such fragments, so corporate characters
# are used for round-trip fidelity.
# extend vertically for 2 or more lines of normal text. As of
# Unicode 3.2, most of these are now encoded in Unicode; a few are
# not, so these are mapped using corporate-zone Unicode characters
# (see below).
#
# In addition, Symbol separately encodes both serif and sans-serif
# forms for copyright, trademark, and registered signs. Unicode
# encodes only the abstract characters, so one set of these (the
# sans-serif forms) are mapped using corporate characters.
# sans-serif forms) are also mapped using corporate-zone Unicode
# characters (see below).
#
# The following code points are unused, and are not shown here:
# 0x80-0x9F, 0xFF.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The goals in the mappings provided here are:
# - Ensure roundtrip mapping from every character in the Mac OS
# Symbol character set to Unicode and back
# - Use standard Unicode characters as much as possible, to
# maximize interchangeability of the resulting Unicode text.
# Whenever possible, avoid having content carried by private-use
# characters.
#
# Some of the characters in the Mac OS Symbol character set do not
# correspond to distinct, single Unicode characters. To map these
# and satisfy both goals above, we employ various strategies.
#
# a) If possible, use private use characters in combination with
# standard Unicode characters to mark variants of the standard
# Unicode character.
#
# Apple has defined a block of 32 corporate characters as "transcoding
# hints." These are used in combination with standard Unicode
# characters to force them to be treated in a special way for mapping
# to other encodings; they have no other effect. Sixteen of these
# transcoding hints are "grouping hints" - they indicate that the next
# 2-4 Unicode characters should be treated as a single entity for
# transcoding. The other sixteen transcoding hints are "variant tags"
# - they are like combining characters, and can follow a standard
# Unicode (or a sequence consisting of a base character and other
# combining characters) to cause it to be treated in a special way for
# transcoding. These always terminate a combining-character sequence.
#
# The transcoding coding hint used in this mapping table is the
# variant tag 0xF87F. Since this is combined with standard Unicode
# characters, some characters in the Mac OS Symbol character set map
# to a sequence of two Unicodes instead of a single Unicode character.
#
# For example, the Mac OS Symbol character at 0xE2 is an alternate,
# sans-serif form of the REGISTERED SIGN (the standard mapping is for
# the abstract character at 0xD2, which here has a serif form). So 0xE2
# is mapped to 0x00AE (REGISTERED SIGN) + 0xF87F (a variant tag).
#
# b) Otherwise, use private use characters by themselves to map
# Mac OS Symbol characters which have no relationship to any standard
# Unicode character.
#
# The following additional corporate zone Unicode characters are
# used for this purpose here:
#
# 0xF8E5 radical extender
# 0xF8FF Apple logo
#
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version c01 to version c02:
#
# - Update mappings for 0xBD from 0xF8E6 to 0x23D0 (use new Unicode
# 4.0 char)
# - Correct mapping for 0xE0 from 0x22C4 to 0x25CA
#
# Changes from version b02 to version b03/c01:
#
# - Update mappings for encoded glyph fragments 0xBE, 0xE6-EF, 0xF4,
# 0xF6-FE to use new Unicode 3.2 characters instead of using either
# single corporate-use characters (e.g. 0xBE was mapped to 0xF8E7) or
# sequences combining a standard Unicode character with a transcoding
# hint (e.g. 0xE6 was mapped to 0x0028+0xF870).
#
# Changes from version n05 to version b02:
#
# - Encoding changed for Mac OS 8.5; 0xA0 now maps to 0x20AC, EURO
# SIGN. 0xA0 was unmapped in earlier versions.
#
# Changes from version n03 to version n05:
#
# - Change strict mapping for 0xE1 & 0xF1 from U+2329 & U+232A
# to their canonical decompositions, U+3008 & U+3009.
#
# - Change mapping for the following to use standard Unicode +
# transcoding hint, instead of single corporate-zone
# character: 0xE2-0xE4, 0xE6-0xEE, 0xF4, 0xF6-0xFE.
#
##################
@@ -118,7 +256,7 @@
0x49 0x0399 # GREEK CAPITAL LETTER IOTA
0x4A 0x03D1 # GREEK THETA SYMBOL
0x4B 0x039A # GREEK CAPITAL LETTER KAPPA
0x4C 0x039B # GREEK CAPITAL LETTER LAMBDA
0x4C 0x039B # GREEK CAPITAL LETTER LAMDA
0x4D 0x039C # GREEK CAPITAL LETTER MU
0x4E 0x039D # GREEK CAPITAL LETTER NU
0x4F 0x039F # GREEK CAPITAL LETTER OMICRON
@@ -138,7 +276,7 @@
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x22A5 # UP TACK
0x5F 0x005F # LOW LINE
0x60 0xF8E5 # radical extender
0x60 0xF8E5 # radical extender # corporate char
0x61 0x03B1 # GREEK SMALL LETTER ALPHA
0x62 0x03B2 # GREEK SMALL LETTER BETA
0x63 0x03C7 # GREEK SMALL LETTER CHI
@@ -150,7 +288,7 @@
0x69 0x03B9 # GREEK SMALL LETTER IOTA
0x6A 0x03D5 # GREEK PHI SYMBOL
0x6B 0x03BA # GREEK SMALL LETTER KAPPA
0x6C 0x03BB # GREEK SMALL LETTER LAMBDA
0x6C 0x03BB # GREEK SMALL LETTER LAMDA
0x6D 0x03BC # GREEK SMALL LETTER MU
0x6E 0x03BD # GREEK SMALL LETTER NU
0x6F 0x03BF # GREEK SMALL LETTER OMICRON
@@ -169,8 +307,10 @@
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x223C # TILDE OPERATOR
#
0xA0 0x20AC # EURO SIGN
0xA1 0x03D2 # GREEK UPSILON WITH HOOK SYMBOL
0xA2 0x2032 # PRIME
0xA2 0x2032 # PRIME # minute
0xA3 0x2264 # LESS-THAN OR EQUAL TO
0xA4 0x2044 # FRACTION SLASH
0xA5 0x221E # INFINITY
@@ -186,7 +326,7 @@
0xAF 0x2193 # DOWNWARDS ARROW
0xB0 0x00B0 # DEGREE SIGN
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2033 # DOUBLE PRIME
0xB2 0x2033 # DOUBLE PRIME # second
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x00D7 # MULTIPLICATION SIGN
0xB5 0x221D # PROPORTIONAL TO
@@ -197,8 +337,8 @@
0xBA 0x2261 # IDENTICAL TO
0xBB 0x2248 # ALMOST EQUAL TO
0xBC 0x2026 # HORIZONTAL ELLIPSIS
0xBD 0xF8E6 # vertical arrow extender
0xBE 0xF8E7 # horizontal arrow extender
0xBD 0x23D0 # VERTICAL LINE EXTENSION (for arrows) # for Unicode 4.0 and later
0xBE 0x23AF # HORIZONTAL LINE EXTENSION (for arrows) # for Unicode 3.2 and later
0xBF 0x21B5 # DOWNWARDS ARROW WITH CORNER LEFTWARDS
0xC0 0x2135 # ALEF SYMBOL
0xC1 0x2111 # BLACK-LETTER CAPITAL I
@@ -218,9 +358,9 @@
0xCF 0x2209 # NOT AN ELEMENT OF
0xD0 0x2220 # ANGLE
0xD1 0x2207 # NABLA
0xD2 0x00AE # REGISTERED SIGN
0xD3 0x00A9 # COPYRIGHT SIGN
0xD4 0x2122 # TRADE MARK SIGN
0xD2 0x00AE # REGISTERED SIGN # serif
0xD3 0x00A9 # COPYRIGHT SIGN # serif
0xD4 0x2122 # TRADE MARK SIGN # serif
0xD5 0x220F # N-ARY PRODUCT
0xD6 0x221A # SQUARE ROOT
0xD7 0x22C5 # DOT OPERATOR
@@ -232,34 +372,34 @@
0xDD 0x21D1 # UPWARDS DOUBLE ARROW
0xDE 0x21D2 # RIGHTWARDS DOUBLE ARROW
0xDF 0x21D3 # DOWNWARDS DOUBLE ARROW
0xE0 0x22C4 # DIAMOND OPERATOR
0xE1 0x2329 # LEFT-POINTING ANGLE BRACKET
0xE2 0xF8E8 # registered sign sans serif
0xE3 0xF8E9 # copyright sign sans serif
0xE4 0xF8EA # trade mark sign sans serif
0xE0 0x25CA # LOZENGE # previously mapped to 0x22C4 DIAMOND OPERATOR
0xE1 0x3008 # LEFT ANGLE BRACKET
0xE2 0x00AE+0xF87F # REGISTERED SIGN, alternate: sans serif
0xE3 0x00A9+0xF87F # COPYRIGHT SIGN, alternate: sans serif
0xE4 0x2122+0xF87F # TRADE MARK SIGN, alternate: sans serif
0xE5 0x2211 # N-ARY SUMMATION
0xE6 0xF8EB # left paren top
0xE7 0xF8EC # left paren extender
0xE8 0xF8ED # left paren bottom
0xE9 0xF8EE # left square bracket top
0xEA 0xF8EF # left square bracket extender
0xEB 0xF8F0 # left square bracket bottom
0xEC 0xF8F1 # left curly bracket top
0xED 0xF8F2 # left curly bracket mid
0xEE 0xF8F3 # left curly bracket bottom
0xEF 0xF8F4 # curly bracket extender
0xE6 0x239B # LEFT PARENTHESIS UPPER HOOK # for Unicode 3.2 and later
0xE7 0x239C # LEFT PARENTHESIS EXTENSION # for Unicode 3.2 and later
0xE8 0x239D # LEFT PARENTHESIS LOWER HOOK # for Unicode 3.2 and later
0xE9 0x23A1 # LEFT SQUARE BRACKET UPPER CORNER # for Unicode 3.2 and later
0xEA 0x23A2 # LEFT SQUARE BRACKET EXTENSION # for Unicode 3.2 and later
0xEB 0x23A3 # LEFT SQUARE BRACKET LOWER CORNER # for Unicode 3.2 and later
0xEC 0x23A7 # LEFT CURLY BRACKET UPPER HOOK # for Unicode 3.2 and later
0xED 0x23A8 # LEFT CURLY BRACKET MIDDLE PIECE # for Unicode 3.2 and later
0xEE 0x23A9 # LEFT CURLY BRACKET LOWER HOOK # for Unicode 3.2 and later
0xEF 0x23AA # CURLY BRACKET EXTENSION # for Unicode 3.2 and later
0xF0 0xF8FF # Apple logo
0xF1 0x232A # RIGHT-POINTING ANGLE BRACKET
0xF1 0x3009 # RIGHT ANGLE BRACKET
0xF2 0x222B # INTEGRAL
0xF3 0x2320 # TOP HALF INTEGRAL
0xF4 0xF8F5 # integral extender
0xF4 0x23AE # INTEGRAL EXTENSION # for Unicode 3.2 and later
0xF5 0x2321 # BOTTOM HALF INTEGRAL
0xF6 0xF8F6 # right paren top
0xF7 0xF8F7 # right paren extender
0xF8 0xF8F8 # right paren bottom
0xF9 0xF8F9 # right square bracket top
0xFA 0xF8FA # right square bracket extender
0xFB 0xF8FB # right square bracket bottom
0xFC 0xF8FC # right curly bracket top
0xFD 0xF8FD # right curly bracket mid
0xFE 0xF8FE # right curly bracket bottom
0xF6 0x239E # RIGHT PARENTHESIS UPPER HOOK # for Unicode 3.2 and later
0xF7 0x239F # RIGHT PARENTHESIS EXTENSION # for Unicode 3.2 and later
0xF8 0x23A0 # RIGHT PARENTHESIS LOWER HOOK # for Unicode 3.2 and later
0xF9 0x23A4 # RIGHT SQUARE BRACKET UPPER CORNER # for Unicode 3.2 and later
0xFA 0x23A5 # RIGHT SQUARE BRACKET EXTENSION # for Unicode 3.2 and later
0xFB 0x23A6 # RIGHT SQUARE BRACKET LOWER CORNER # for Unicode 3.2 and later
0xFC 0x23AB # RIGHT CURLY BRACKET UPPER HOOK # for Unicode 3.2 and later
0xFD 0x23AC # RIGHT CURLY BRACKET MIDDLE PIECE # for Unicode 3.2 and later
0xFE 0x23AD # RIGHT CURLY BRACKET LOWER HOOK # for Unicode 3.2 and later

View File

@@ -1,12 +1,35 @@
#=======================================================================
# File name: THAI.TXT
#
# Name: MacOS_Thai [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <6>)
# Date: 17 November 1995
# Authors: Peter Edberg <edberg1@applelink.apple.com>
# Judy Kettenhofen
# Contents: Map (external version) from Mac OS Thai
# character set to Unicode 3.2 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update mapping for 0xDB to use new Unicode
# 3.2 WORD JOINER instead of ZWNBSP (BOM).
# Update URLs. Matches internal utom<b3>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b2>, and Text
# Encoding Converter version 1.5.
# n07 1998-Feb-05 Update to match internal utom<n5>, ufrm<n13>
# and Text Encoding Converter version 1.3:
# Use standard Unicodes plus transcoding hints
# instead of single corporate characters; see
# details below. Also update header comments
# to new format.
# n04 1995-Nov-17 First version (after fixing some typos).
# Matches internal ufrm<n6>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -15,33 +38,47 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Thai code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Thai code (in hex as 0xNN)
# Column #2 is the corresponding Unicode or Unicode sequence
# (in hex as 0xNNNN or 0xNNNN+0xNNNN).
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Thai code order.
# The entries are in Mac OS Thai code order.
#
# Notes on MacOS Thai:
# --------------------
# Some of these mappings require the use of corporate characters.
# See the file "CORPCHAR.TXT" and notes below.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Thai character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Thai:
# ---------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Codes 0xA1-0xDA and 0xDF-0xFB are the character set from Thai
# standard TIS 620-2533, except that the following changes are
@@ -63,13 +100,65 @@
# at display time by the WorldScript software. However, using the
# Thai-DTP keyboard, the presentation forms can in fact be typed
# directly using dead keys. Thus they must be treated as real
# characters in the MacOS Thai encoding, and must be mapped to
# distinct Unicodes - corporate-zone characters in this case - to
# enable roundtrip mapping.
# characters in the Mac OS Thai encoding. They are mapped using
# variant tags; see below.
#
# Several code points are undefined and unused (they cannot be
# typed using any of the Mac OS Thai keyboard layouts): 0x90, 0x9F,
# 0xFC-0xFE. In the table below these are mapped to 0xFFFD.
# 0xFC-0xFE. These are not shown in the table below.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The goals in the Apple mappings provided here are:
# - Ensure roundtrip mapping from every character in the Mac OS Thai
# character set to Unicode and back
# - Use standard Unicode characters as much as possible, to maximize
# interchangeability of the resulting Unicode text. Whenever possible,
# avoid having content carried by private-use characters.
#
# To satisfy both goals, we use private use characters to mark variants
# that are similar to a sequence of one or more standard Unicode
# characters.
#
# Apple has defined a block of 32 corporate characters as "transcoding
# hints." These are used in combination with standard Unicode characters
# to force them to be treated in a special way for mapping to other
# encodings; they have no other effect. Sixteen of these transcoding
# hints are "grouping hints" - they indicate that the next 2-4 Unicode
# characters should be treated as a single entity for transcoding. The
# other sixteen transcoding hints are "variant tags" - they are like
# combining characters, and can follow a standard Unicode (or a sequence
# consisting of a base character and other combining characters) to
# cause it to be treated in a special way for transcoding. These always
# terminate a combining-character sequence.
#
# The transcoding coding hints used in this mapping table are four
# variant tags in the range 0xF873-75. Since these are combined with
# standard Unicode characters, some characters in the Mac OS Thai
# character set map to a sequence of two Unicodes instead of a single
# Unicode character. For example, the Mac OS Thai character at 0x83 is a
# low-left positional variant of THAI CHARACTER MAI EK (the standard
# mapping is for the abstract character at 0xE8). So 0x83 is mapped to
# 0x0E48 (THAI CHARACTER MAI EK) + 0xF875 (a variant tag).
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version b02 to version b03/c01:
#
# - Update mapping for 0xDB to use new Unicode 3.2 character U+2060
# WORD JOINER instead of U+FEFF ZERO WIDTH NO-BREAK SPACE (BOM)
#
# Changes from version n04 to version n07:
#
# - Changed mappings of the positional variants to use standard
# Unicodes + transcoding hint, instead of using single corporate
# zone characters. This affected the mappings for the following:
# 0x83-08C, 0x8F, 0x92-0x9C
#
# - Just comment out unused code points in the table, instead
# of mapping them to U+FFFD.
#
##################
@@ -172,35 +261,35 @@
0x80 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0x81 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0x82 0x2026 # HORIZONTAL ELLIPSIS
0x83 0xF88C # form for THAI CHARACTER MAI EK, low left position
0x84 0xF88F # form for THAI CHARACTER MAI THO, low left position
0x85 0xF892 # form for THAI CHARACTER MAI TRI, low left position
0x86 0xF895 # form for THAI CHARACTER MAI CHATTAWA, low left position
0x87 0xF898 # form for THAI CHARACTER THANTHAKHAT, low left position
0x88 0xF88B # form for THAI CHARACTER MAI EK, low position
0x89 0xF88E # form for THAI CHARACTER MAI THO, low position
0x8A 0xF891 # form for THAI CHARACTER MAI TRI, low position
0x8B 0xF894 # form for THAI CHARACTER MAI CHATTAWA, low position
0x8C 0xF897 # form for THAI CHARACTER THANTHAKHAT, low position
0x83 0x0E48+0xF875 # THAI CHARACTER MAI EK, low left position
0x84 0x0E49+0xF875 # THAI CHARACTER MAI THO, low left position
0x85 0x0E4A+0xF875 # THAI CHARACTER MAI TRI, low left position
0x86 0x0E4B+0xF875 # THAI CHARACTER MAI CHATTAWA, low left position
0x87 0x0E4C+0xF875 # THAI CHARACTER THANTHAKHAT, low left position
0x88 0x0E48+0xF873 # THAI CHARACTER MAI EK, low position
0x89 0x0E49+0xF873 # THAI CHARACTER MAI THO, low position
0x8A 0x0E4A+0xF873 # THAI CHARACTER MAI TRI, low position
0x8B 0x0E4B+0xF873 # THAI CHARACTER MAI CHATTAWA, low position
0x8C 0x0E4C+0xF873 # THAI CHARACTER THANTHAKHAT, low position
0x8D 0x201C # LEFT DOUBLE QUOTATION MARK
0x8E 0x201D # RIGHT DOUBLE QUOTATION MARK
0x8F 0xF899 # form for THAI CHARACTER NIKHAHIT, left position
0x90 0xFFFD # (not used, map to Unicode replacement character)
0x8F 0x0E4D+0xF874 # THAI CHARACTER NIKHAHIT, left position
#
0x91 0x2022 # BULLET
0x92 0xF884 # form for THAI CHARACTER MAI HAN-AKAT, left position
0x93 0xF889 # form for THAI CHARACTER MAITAIKHU, left position
0x94 0xF885 # form for THAI CHARACTER SARA I, left position
0x95 0xF886 # form for THAI CHARACTER SARA II, left position
0x96 0xF887 # form for THAI CHARACTER SARA UE, left position
0x97 0xF888 # form for THAI CHARACTER SARA UEE, left position
0x98 0xF88A # form for THAI CHARACTER MAI EK, left position
0x99 0xF88D # form for THAI CHARACTER MAI THO, left position
0x9A 0xF890 # form for THAI CHARACTER MAI TRI, left position
0x9B 0xF893 # form for THAI CHARACTER MAI CHATTAWA, left position
0x9C 0xF896 # form for THAI CHARACTER THANTHAKHAT, left position
0x92 0x0E31+0xF874 # THAI CHARACTER MAI HAN-AKAT, left position
0x93 0x0E47+0xF874 # THAI CHARACTER MAITAIKHU, left position
0x94 0x0E34+0xF874 # THAI CHARACTER SARA I, left position
0x95 0x0E35+0xF874 # THAI CHARACTER SARA II, left position
0x96 0x0E36+0xF874 # THAI CHARACTER SARA UE, left position
0x97 0x0E37+0xF874 # THAI CHARACTER SARA UEE, left position
0x98 0x0E48+0xF874 # THAI CHARACTER MAI EK, left position
0x99 0x0E49+0xF874 # THAI CHARACTER MAI THO, left position
0x9A 0x0E4A+0xF874 # THAI CHARACTER MAI TRI, left position
0x9B 0x0E4B+0xF874 # THAI CHARACTER MAI CHATTAWA, left position
0x9C 0x0E4C+0xF874 # THAI CHARACTER THANTHAKHAT, left position
0x9D 0x2018 # LEFT SINGLE QUOTATION MARK
0x9E 0x2019 # RIGHT SINGLE QUOTATION MARK
0x9F 0xFFFD # (not used, map to Unicode replacement character)
#
0xA0 0x00A0 # NO-BREAK SPACE
0xA1 0x0E01 # THAI CHARACTER KO KAI
0xA2 0x0E02 # THAI CHARACTER KHO KHAI
@@ -260,7 +349,7 @@
0xD8 0x0E38 # THAI CHARACTER SARA U
0xD9 0x0E39 # THAI CHARACTER SARA UU
0xDA 0x0E3A # THAI CHARACTER PHINTHU
0xDB 0xFEFF # ZERO WIDTH NO-BREAK SPACE
0xDB 0x2060 # WORD JOINER # for Unicode 3.2 and later
0xDC 0x200B # ZERO WIDTH SPACE
0xDD 0x2013 # EN DASH
0xDE 0x2014 # EM DASH
@@ -293,7 +382,3 @@
0xF9 0x0E59 # THAI DIGIT NINE
0xFA 0x00AE # REGISTERED SIGN
0xFB 0x00A9 # COPYRIGHT SIGN
0xFC 0xFFFD # (not used, map to Unicode replacement character)
0xFD 0xFFFD # (not used, map to Unicode replacement character)
0xFE 0xFFFD # (not used, map to Unicode replacement character)
0xFF 0xFFFD # (not used, map to Unicode replacement character)

View File

@@ -1,11 +1,32 @@
#=======================================================================
# File name: TURKISH.TXT
#
# Name: MacOS_Turkish [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <4>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Map (external version) from Mac OS Turkish
# character set to Unicode 2.1 and later.
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments. Matches internal xml
# <c1.1> and Text Encoding Converter 2.0.
# b3,c1 2002-Dec-19 Update URLs, notes. Matches internal
# utom<b1>.
# b02 1999-Sep-22 Update contact e-mail address. Matches
# internal utom<b1>, ufrm<b1>, and Text
# Encoding Converter version 1.5.
# n05 1998-Feb-05 Minor update to header comments
# n03 1997-Dec-14 Update to match internal utom<n5>, ufrm<n15>:
# Change standard mapping for 0xBD from U+2126
# to its canonical decomposition, U+03A9.
# n02 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<n4>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,52 +35,83 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Turkish code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# Column #1 is the Mac OS Turkish code (in hex as 0xNN)
# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
# Column #3 is a comment containing the Unicode name
#
# The entries are in MacOS Turkish code order.
# The entries are in Mac OS Turkish code order.
#
# Two of these mappings requires the use of a corporate character.
# See the file "CORPCHAR.TXT" and notes below.
#
# Control character mappings are not shown in this table, following
# the conventions of the standard UTC mapping tables. However, the
# Mac OS Turkish character set uses the standard control characters at
# 0x00-0x1F and 0x7F.
#
# Notes on Mac OS Turkish:
# ------------------------
#
# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa
# environments, it is only supported via transcoding to and from
# Unicode.
#
# Mac OS Turkish is used for Turkish.
#
# The Mac OS Turkish encoding shares the script code smRoman
# (0) with the Mac OS Roman encoding. To determine if the Turkish
# encoding is being used, you must also check if the system region
# code is 24, verTurkey.
#
# This character set is a variant of standard Mac OS Roman. It adds
# upper & lower G with breve, upper & lower S with cedilla, upper I
# with dot, and moves the dotless lower i from its position at 0xF5
# in standard Mac OS Roman to a position at 0xDD here (leaving the
# 0xF5 code point undefined in Mac OS Turkish). This gives a total
# of 7 code point differences from standard Mac OS Roman.
#
# Unicode mapping issues and notes:
# ---------------------------------
#
# The following corporate zone Unicode characters are used in this
# mapping:
#
# 0xF8A0 undefined1, used to map the single undefined code point
# in Mac OS Turkish (to obtain roundtrip fidelity for all
# code points).
# 0xF8FF Apple logo
#
# Two of these mappings requires the use of a corporate character:
# for the Apple logo character, and for the one undefined code
# point. See the file "MacOS-CorpCharList".
# NOTE: The graphic image associated with the Apple logo character
# is not authorized for use without permission of Apple, and
# unauthorized use might constitute trademark infringement.
#
# Notes on MacOS Turkish:
# -----------------------
# Details of mapping changes in each version:
# -------------------------------------------
#
# The MacOS Turkish encoding shares the script code smRoman
# (0) with the standard MacOS Roman encoding. To determine if
# the Turkish encoding is being used, you must also check if the
# system region code is 24, verTurkey.
# Changes from version n02 to version n03:
#
# This character set is a variant of standard MacOS Roman. It adds
# upper & lower G with breve, upper & lower S with cedilla, upper I
# with dot, and moves the dotless lower i from its position at 0xF5
# in standard MacOS Roman to a position at 0xDD here (leaving the
# 0xF5 code point undefined in MacTurkish). This gives a total of 7
# code point differences from standard MacOS Roman.
# - Change mapping of 0xBD from U+2126 to its canonical
# decomposition, U+03A9.
#
##################
@@ -205,7 +257,7 @@
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x00C6 # LATIN CAPITAL LIGATURE AE
0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
@@ -220,8 +272,8 @@
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
0xBD 0x2126 # OHM SIGN
0xBE 0x00E6 # LATIN SMALL LIGATURE AE
0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK

View File

@@ -1,11 +1,34 @@
#=======================================================================
# File name: UKRAINE.TXT
#
# Name: MacOS_Ukrainian [to Unicode]
# Unicode versions: 1.1, 2.0
# Table version: 0.2 (from internal ufrm version <4>)
# Date: 15 April 1995
# Author: Peter Edberg <edberg1@applelink.apple.com>
# Contents: Notes on Mac OS Ukrainian character set
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: charsets@apple.com
#
# Changes:
#
# c02 2005-Apr-05 Update header comments.
# b3,c1 2002-Dec-19 Update URLs. Matches internal utom<b1>.
# b02 1999-Sep-22 Encoding changed for Mac OS 9.0 to merge
# with Mac OS Cyrillic and support EURO SIGN;
# change mappings for 0xFF. For Mac OS 9.0
# there is no longer a separate Mac OS
# Ukrainian character set; the mappings are
# in CYRILLIC.TXT. Update contact e-mail
# address. Matches internal utom<b1>, ufrm<b1>,
# and Text Encoding Converter version 1.5.
# n04 1998-Feb-05 Update header comments to new format; no
# mapping changes. Matches internal utom<2>,
# ufrm<13>, and Text Encoding Converter
# version 1.3.
# n02 1995-Apr-15 First version (after fixing some typos).
# Matches internal ufrm<4>.
#
# Standard header:
# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -14,266 +37,70 @@
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
# Apple Computer, Inc. ("Apple") makes no warranty or representation,
# either express or implied, with respect to this document and the
# included data, its quality, accuracy, or fitness for a particular
# purpose. In no event will Apple be liable for direct, indirect,
# special, incidental, or consequential damages resulting from any
# defect or inaccuracy in this document or the included data.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
# These mapping tables and character lists are subject to change.
# The latest tables should be available from the following:
#
# Format:
# -------
# <http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the MacOS Ukrainian code (in hex as 0xNN)
# Column #2 is the Unicode (in hex as 0xNNNN)
# Column #3 is the Unicode name (follows a comment sign, '#')
# For general information about Mac OS encodings and these mapping
# tables, see the file "README.TXT".
#
# The entries are in MacOS Ukrainian code order.
# Notes on Mac OS Ukrainian and Mac OS Cyrillic:
# ----------------------------------------------
#
# Notes on MacOS Ukrainian:
# -------------------------
# Before Mac OS 9.0, there were two separate Slavic Cyrillic
# encodings for the Mac OS:
#
# The MacOS Ukrainian encoding shares the script code smCyrillic
# (7) with the standard MacOS Cyrillic encoding. To determine if
# the Ukrainian encoding is being used, you must also check if
# the system region code is 62, verUkraine.
# 1. The Cyrillic currency sign variant (used for localized Russian
# and Bulgarian systems), which had the following:
# 0xA2 U+00A2 CENT SIGN
# 0xB6 U+2202 PARTIAL DIFFERENTIAL
# 0xFF U+00A4 CURRENCY SIGN
#
# This character set is a variant of standard MacOS Cyrillic. It
# adds upper & lower GHE WITH UPTURN, for a grand total of 2 code
# point differences from standard MacOS Cyrillic.
# 2. The Ukrainian currency sign variant (used for localized Ukrainian
# systems and the pre-9.0 Cyrillic Language Kit), which had the
# following:
# 0xA2 U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN
# 0xB6 U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN
# 0xFF U+00A4 CURRENCY SIGN
#
# Before Mac OS 9.0, The Ukrainian currency sign variant shared the
# script code smCyrillic (7) with the Cyrillic currency sign variant.
# The Ukrainian currency sign variant was being used if one of the
# following was true:
# - The system region code was 62, verUkraine (indicates Ukrainian
# localized system), or
# - The system script was not 7, smCyrillic (indicates Cyrillic
# Language Kit instead of localized system).
#
# For Mac OS 9.0 and later, both currency sign variants were replaced
# with a new Euro sign version of Mac OS Cyrillic, which is similar to
# the old Ukrainian currency sign variant but changes 0xFF to EURO
# SIGN. Mappings for this are in CYRILLIC.TXT.
#
# Note: There is a common glyph variation in Ukrainian, in which the
# glyph for CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I may or
# may not have a dot above.
#
# Details of mapping changes in each version:
# -------------------------------------------
#
# Changes from version n04 to version b02:
#
# - Encoding changed for Mac OS 9.0 to merge with Mac OS Cyrillic and
# support EURO SIGN; 0xFF changed from U+00A4 to U+20AC. For Mac OS
# 9.0 there is no longer a separate Mac OS Ukrainian character set, so
# the mappings here are deleted; see the mappings in CYRILLIC.TXT.
#
##################
0x20 0x0020 # SPACE
0x21 0x0021 # EXCLAMATION MARK
0x22 0x0022 # QUOTATION MARK
0x23 0x0023 # NUMBER SIGN
0x24 0x0024 # DOLLAR SIGN
0x25 0x0025 # PERCENT SIGN
0x26 0x0026 # AMPERSAND
0x27 0x0027 # APOSTROPHE
0x28 0x0028 # LEFT PARENTHESIS
0x29 0x0029 # RIGHT PARENTHESIS
0x2A 0x002A # ASTERISK
0x2B 0x002B # PLUS SIGN
0x2C 0x002C # COMMA
0x2D 0x002D # HYPHEN-MINUS
0x2E 0x002E # FULL STOP
0x2F 0x002F # SOLIDUS
0x30 0x0030 # DIGIT ZERO
0x31 0x0031 # DIGIT ONE
0x32 0x0032 # DIGIT TWO
0x33 0x0033 # DIGIT THREE
0x34 0x0034 # DIGIT FOUR
0x35 0x0035 # DIGIT FIVE
0x36 0x0036 # DIGIT SIX
0x37 0x0037 # DIGIT SEVEN
0x38 0x0038 # DIGIT EIGHT
0x39 0x0039 # DIGIT NINE
0x3A 0x003A # COLON
0x3B 0x003B # SEMICOLON
0x3C 0x003C # LESS-THAN SIGN
0x3D 0x003D # EQUALS SIGN
0x3E 0x003E # GREATER-THAN SIGN
0x3F 0x003F # QUESTION MARK
0x40 0x0040 # COMMERCIAL AT
0x41 0x0041 # LATIN CAPITAL LETTER A
0x42 0x0042 # LATIN CAPITAL LETTER B
0x43 0x0043 # LATIN CAPITAL LETTER C
0x44 0x0044 # LATIN CAPITAL LETTER D
0x45 0x0045 # LATIN CAPITAL LETTER E
0x46 0x0046 # LATIN CAPITAL LETTER F
0x47 0x0047 # LATIN CAPITAL LETTER G
0x48 0x0048 # LATIN CAPITAL LETTER H
0x49 0x0049 # LATIN CAPITAL LETTER I
0x4A 0x004A # LATIN CAPITAL LETTER J
0x4B 0x004B # LATIN CAPITAL LETTER K
0x4C 0x004C # LATIN CAPITAL LETTER L
0x4D 0x004D # LATIN CAPITAL LETTER M
0x4E 0x004E # LATIN CAPITAL LETTER N
0x4F 0x004F # LATIN CAPITAL LETTER O
0x50 0x0050 # LATIN CAPITAL LETTER P
0x51 0x0051 # LATIN CAPITAL LETTER Q
0x52 0x0052 # LATIN CAPITAL LETTER R
0x53 0x0053 # LATIN CAPITAL LETTER S
0x54 0x0054 # LATIN CAPITAL LETTER T
0x55 0x0055 # LATIN CAPITAL LETTER U
0x56 0x0056 # LATIN CAPITAL LETTER V
0x57 0x0057 # LATIN CAPITAL LETTER W
0x58 0x0058 # LATIN CAPITAL LETTER X
0x59 0x0059 # LATIN CAPITAL LETTER Y
0x5A 0x005A # LATIN CAPITAL LETTER Z
0x5B 0x005B # LEFT SQUARE BRACKET
0x5C 0x005C # REVERSE SOLIDUS
0x5D 0x005D # RIGHT SQUARE BRACKET
0x5E 0x005E # CIRCUMFLEX ACCENT
0x5F 0x005F # LOW LINE
0x60 0x0060 # GRAVE ACCENT
0x61 0x0061 # LATIN SMALL LETTER A
0x62 0x0062 # LATIN SMALL LETTER B
0x63 0x0063 # LATIN SMALL LETTER C
0x64 0x0064 # LATIN SMALL LETTER D
0x65 0x0065 # LATIN SMALL LETTER E
0x66 0x0066 # LATIN SMALL LETTER F
0x67 0x0067 # LATIN SMALL LETTER G
0x68 0x0068 # LATIN SMALL LETTER H
0x69 0x0069 # LATIN SMALL LETTER I
0x6A 0x006A # LATIN SMALL LETTER J
0x6B 0x006B # LATIN SMALL LETTER K
0x6C 0x006C # LATIN SMALL LETTER L
0x6D 0x006D # LATIN SMALL LETTER M
0x6E 0x006E # LATIN SMALL LETTER N
0x6F 0x006F # LATIN SMALL LETTER O
0x70 0x0070 # LATIN SMALL LETTER P
0x71 0x0071 # LATIN SMALL LETTER Q
0x72 0x0072 # LATIN SMALL LETTER R
0x73 0x0073 # LATIN SMALL LETTER S
0x74 0x0074 # LATIN SMALL LETTER T
0x75 0x0075 # LATIN SMALL LETTER U
0x76 0x0076 # LATIN SMALL LETTER V
0x77 0x0077 # LATIN SMALL LETTER W
0x78 0x0078 # LATIN SMALL LETTER X
0x79 0x0079 # LATIN SMALL LETTER Y
0x7A 0x007A # LATIN SMALL LETTER Z
0x7B 0x007B # LEFT CURLY BRACKET
0x7C 0x007C # VERTICAL LINE
0x7D 0x007D # RIGHT CURLY BRACKET
0x7E 0x007E # TILDE
#
0x80 0x0410 # CYRILLIC CAPITAL LETTER A
0x81 0x0411 # CYRILLIC CAPITAL LETTER BE
0x82 0x0412 # CYRILLIC CAPITAL LETTER VE
0x83 0x0413 # CYRILLIC CAPITAL LETTER GHE
0x84 0x0414 # CYRILLIC CAPITAL LETTER DE
0x85 0x0415 # CYRILLIC CAPITAL LETTER IE
0x86 0x0416 # CYRILLIC CAPITAL LETTER ZHE
0x87 0x0417 # CYRILLIC CAPITAL LETTER ZE
0x88 0x0418 # CYRILLIC CAPITAL LETTER I
0x89 0x0419 # CYRILLIC CAPITAL LETTER SHORT I
0x8A 0x041A # CYRILLIC CAPITAL LETTER KA
0x8B 0x041B # CYRILLIC CAPITAL LETTER EL
0x8C 0x041C # CYRILLIC CAPITAL LETTER EM
0x8D 0x041D # CYRILLIC CAPITAL LETTER EN
0x8E 0x041E # CYRILLIC CAPITAL LETTER O
0x8F 0x041F # CYRILLIC CAPITAL LETTER PE
0x90 0x0420 # CYRILLIC CAPITAL LETTER ER
0x91 0x0421 # CYRILLIC CAPITAL LETTER ES
0x92 0x0422 # CYRILLIC CAPITAL LETTER TE
0x93 0x0423 # CYRILLIC CAPITAL LETTER U
0x94 0x0424 # CYRILLIC CAPITAL LETTER EF
0x95 0x0425 # CYRILLIC CAPITAL LETTER HA
0x96 0x0426 # CYRILLIC CAPITAL LETTER TSE
0x97 0x0427 # CYRILLIC CAPITAL LETTER CHE
0x98 0x0428 # CYRILLIC CAPITAL LETTER SHA
0x99 0x0429 # CYRILLIC CAPITAL LETTER SHCHA
0x9A 0x042A # CYRILLIC CAPITAL LETTER HARD SIGN
0x9B 0x042B # CYRILLIC CAPITAL LETTER YERU
0x9C 0x042C # CYRILLIC CAPITAL LETTER SOFT SIGN
0x9D 0x042D # CYRILLIC CAPITAL LETTER E
0x9E 0x042E # CYRILLIC CAPITAL LETTER YU
0x9F 0x042F # CYRILLIC CAPITAL LETTER YA
0xA0 0x2020 # DAGGER
0xA1 0x00B0 # DEGREE SIGN
0xA2 0x0490 # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
0xA3 0x00A3 # POUND SIGN
0xA4 0x00A7 # SECTION SIGN
0xA5 0x2022 # BULLET
0xA6 0x00B6 # PILCROW SIGN
0xA7 0x0406 # CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
0xA8 0x00AE # REGISTERED SIGN
0xA9 0x00A9 # COPYRIGHT SIGN
0xAA 0x2122 # TRADE MARK SIGN
0xAB 0x0402 # CYRILLIC CAPITAL LETTER DJE
0xAC 0x0452 # CYRILLIC SMALL LETTER DJE
0xAD 0x2260 # NOT EQUAL TO
0xAE 0x0403 # CYRILLIC CAPITAL LETTER GJE
0xAF 0x0453 # CYRILLIC SMALL LETTER GJE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
0xB2 0x2264 # LESS-THAN OR EQUAL TO
0xB3 0x2265 # GREATER-THAN OR EQUAL TO
0xB4 0x0456 # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
0xB5 0x00B5 # MICRO SIGN
0xB6 0x0491 # CYRILLIC SMALL LETTER GHE WITH UPTURN
0xB7 0x0408 # CYRILLIC CAPITAL LETTER JE
0xB8 0x0404 # CYRILLIC CAPITAL LETTER UKRAINIAN IE
0xB9 0x0454 # CYRILLIC SMALL LETTER UKRAINIAN IE
0xBA 0x0407 # CYRILLIC CAPITAL LETTER YI
0xBB 0x0457 # CYRILLIC SMALL LETTER YI
0xBC 0x0409 # CYRILLIC CAPITAL LETTER LJE
0xBD 0x0459 # CYRILLIC SMALL LETTER LJE
0xBE 0x040A # CYRILLIC CAPITAL LETTER NJE
0xBF 0x045A # CYRILLIC SMALL LETTER NJE
0xC0 0x0458 # CYRILLIC SMALL LETTER JE
0xC1 0x0405 # CYRILLIC CAPITAL LETTER DZE
0xC2 0x00AC # NOT SIGN
0xC3 0x221A # SQUARE ROOT
0xC4 0x0192 # LATIN SMALL LETTER F WITH HOOK
0xC5 0x2248 # ALMOST EQUAL TO
0xC6 0x2206 # INCREMENT
0xC7 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC8 0x00BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
0xC9 0x2026 # HORIZONTAL ELLIPSIS
0xCA 0x00A0 # NO-BREAK SPACE
0xCB 0x040B # CYRILLIC CAPITAL LETTER TSHE
0xCC 0x045B # CYRILLIC SMALL LETTER TSHE
0xCD 0x040C # CYRILLIC CAPITAL LETTER KJE
0xCE 0x045C # CYRILLIC SMALL LETTER KJE
0xCF 0x0455 # CYRILLIC SMALL LETTER DZE
0xD0 0x2013 # EN DASH
0xD1 0x2014 # EM DASH
0xD2 0x201C # LEFT DOUBLE QUOTATION MARK
0xD3 0x201D # RIGHT DOUBLE QUOTATION MARK
0xD4 0x2018 # LEFT SINGLE QUOTATION MARK
0xD5 0x2019 # RIGHT SINGLE QUOTATION MARK
0xD6 0x00F7 # DIVISION SIGN
0xD7 0x201E # DOUBLE LOW-9 QUOTATION MARK
0xD8 0x040E # CYRILLIC CAPITAL LETTER SHORT U
0xD9 0x045E # CYRILLIC SMALL LETTER SHORT U
0xDA 0x040F # CYRILLIC CAPITAL LETTER DZHE
0xDB 0x045F # CYRILLIC SMALL LETTER DZHE
0xDC 0x2116 # NUMERO SIGN
0xDD 0x0401 # CYRILLIC CAPITAL LETTER IO
0xDE 0x0451 # CYRILLIC SMALL LETTER IO
0xDF 0x044F # CYRILLIC SMALL LETTER YA
0xE0 0x0430 # CYRILLIC SMALL LETTER A
0xE1 0x0431 # CYRILLIC SMALL LETTER BE
0xE2 0x0432 # CYRILLIC SMALL LETTER VE
0xE3 0x0433 # CYRILLIC SMALL LETTER GHE
0xE4 0x0434 # CYRILLIC SMALL LETTER DE
0xE5 0x0435 # CYRILLIC SMALL LETTER IE
0xE6 0x0436 # CYRILLIC SMALL LETTER ZHE
0xE7 0x0437 # CYRILLIC SMALL LETTER ZE
0xE8 0x0438 # CYRILLIC SMALL LETTER I
0xE9 0x0439 # CYRILLIC SMALL LETTER SHORT I
0xEA 0x043A # CYRILLIC SMALL LETTER KA
0xEB 0x043B # CYRILLIC SMALL LETTER EL
0xEC 0x043C # CYRILLIC SMALL LETTER EM
0xED 0x043D # CYRILLIC SMALL LETTER EN
0xEE 0x043E # CYRILLIC SMALL LETTER O
0xEF 0x043F # CYRILLIC SMALL LETTER PE
0xF0 0x0440 # CYRILLIC SMALL LETTER ER
0xF1 0x0441 # CYRILLIC SMALL LETTER ES
0xF2 0x0442 # CYRILLIC SMALL LETTER TE
0xF3 0x0443 # CYRILLIC SMALL LETTER U
0xF4 0x0444 # CYRILLIC SMALL LETTER EF
0xF5 0x0445 # CYRILLIC SMALL LETTER HA
0xF6 0x0446 # CYRILLIC SMALL LETTER TSE
0xF7 0x0447 # CYRILLIC SMALL LETTER CHE
0xF8 0x0448 # CYRILLIC SMALL LETTER SHA
0xF9 0x0449 # CYRILLIC SMALL LETTER SHCHA
0xFA 0x044A # CYRILLIC SMALL LETTER HARD SIGN
0xFB 0x044B # CYRILLIC SMALL LETTER YERU
0xFC 0x044C # CYRILLIC SMALL LETTER SOFT SIGN
0xFD 0x044D # CYRILLIC SMALL LETTER E
0xFE 0x044E # CYRILLIC SMALL LETTER YU
0xFF 0x00A4 # CURRENCY SIGN
##################
# For mappings, see CYRILLIC.TXT
##################

26
unicode/vendors/ibm/IBM conversions.txt vendored Normal file
View File

@@ -0,0 +1,26 @@
IBM Conversion Mapping Tables
IBM has a repository of character data mapping tables between
many different character encodings and Unicode. The mapping of
some characters in the IBM conversion tables may vary from those of
other vendors. These differences may be due to differences in
interpretation of international or country standards. They may
also be due to historical product or platform preferences. IBM
conversion tables are available from a number of sources.
Unicode mappings can be found in the http://site.icu-project.org/International
Components\ for\ Unicode\ (ICU) which is a mature, portable set
of C/C++ and Java libraries for Unicode support, software
internationalization (I18N) and globalization (G11N), giving
applications the same results on all platforms.
Additionally, IBM has the https://www.ibm.com/downloads/cas/G01BQVRV>Character\
Data\ Representation\ Architecture (CDRA) which defines a set of
identifiers and supporting resources for data identification and
conversion. The CDRA conversion table resources can be obtained
from https://www.ibm.com/developerworks/views/java/downloads.jsp>IBM
developerWorks. The tables are found in the Downloads &
products view, under the heading Character Data Conversion
Tables. Should you require additional information on any
IBM conversion mapping tables please contact gcoc@jp.ibm.com">gcoc@jp.ibm.com.

View File

@@ -1,3 +1,5 @@
This may be obsolete: (RMK 2021/08)
The following is a listing of the differences between IBM's mappings
of certain IBM code pages and Unicode, and the mappings for IBM code
pages that came from various sources and were earlier made available

View File

@@ -0,0 +1,962 @@
CODEPAGE 1250 ;Eastern Europe - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0083
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x0088
0x89 0x2030 ;Per Mille Sign
0x8a 0x0160 ;Latin Capital Letter S With Caron
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x015a ;Latin Capital Letter S With Acute
0x8d 0x0164 ;Latin Capital Letter T With Caron
0x8e 0x017d ;Latin Capital Letter Z With Caron
0x8f 0x0179 ;Latin Capital Letter Z With Acute
0x90 0x0090
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x0098
0x99 0x2122 ;Trade Mark Sign
0x9a 0x0161 ;Latin Small Letter S With Caron
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x015b ;Latin Small Letter S With Acute
0x9d 0x0165 ;Latin Small Letter T With Caron
0x9e 0x017e ;Latin Small Letter Z With Caron
0x9f 0x017a ;Latin Small Letter Z With Acute
0xa0 0x00a0 ;No-Break Space
0xa1 0x02c7 ;Caron
0xa2 0x02d8 ;Breve
0xa3 0x0141 ;Latin Capital Letter L With Stroke
0xa4 0x00a4 ;Currency Sign
0xa5 0x0104 ;Latin Capital Letter A With Ogonek
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x015e ;Latin Capital Letter S With Cedilla
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x017b ;Latin Capital Letter Z With Dot Above
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x02db ;Ogonek
0xb3 0x0142 ;Latin Small Letter L With Stroke
0xb4 0x00b4 ;Acute Accent
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Cedilla
0xb9 0x0105 ;Latin Small Letter A With Ogonek
0xba 0x015f ;Latin Small Letter S With Cedilla
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x013d ;Latin Capital Letter L With Caron
0xbd 0x02dd ;Double Acute Accent
0xbe 0x013e ;Latin Small Letter L With Caron
0xbf 0x017c ;Latin Small Letter Z With Dot Above
0xc0 0x0154 ;Latin Capital Letter R With Acute
0xc1 0x00c1 ;Latin Capital Letter A With Acute
0xc2 0x00c2 ;Latin Capital Letter A With Circumflex
0xc3 0x0102 ;Latin Capital Letter A With Breve
0xc4 0x00c4 ;Latin Capital Letter A With Diaeresis
0xc5 0x0139 ;Latin Capital Letter L With Acute
0xc6 0x0106 ;Latin Capital Letter C With Acute
0xc7 0x00c7 ;Latin Capital Letter C With Cedilla
0xc8 0x010c ;Latin Capital Letter C With Caron
0xc9 0x00c9 ;Latin Capital Letter E With Acute
0xca 0x0118 ;Latin Capital Letter E With Ogonek
0xcb 0x00cb ;Latin Capital Letter E With Diaeresis
0xcc 0x011a ;Latin Capital Letter E With Caron
0xcd 0x00cd ;Latin Capital Letter I With Acute
0xce 0x00ce ;Latin Capital Letter I With Circumflex
0xcf 0x010e ;Latin Capital Letter D With Caron
0xd0 0x0110 ;Latin Capital Letter D With Stroke
0xd1 0x0143 ;Latin Capital Letter N With Acute
0xd2 0x0147 ;Latin Capital Letter N With Caron
0xd3 0x00d3 ;Latin Capital Letter O With Acute
0xd4 0x00d4 ;Latin Capital Letter O With Circumflex
0xd5 0x0150 ;Latin Capital Letter O With Double Acute
0xd6 0x00d6 ;Latin Capital Letter O With Diaeresis
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x0158 ;Latin Capital Letter R With Caron
0xd9 0x016e ;Latin Capital Letter U With Ring Above
0xda 0x00da ;Latin Capital Letter U With Acute
0xdb 0x0170 ;Latin Capital Letter U With Double Acute
0xdc 0x00dc ;Latin Capital Letter U With Diaeresis
0xdd 0x00dd ;Latin Capital Letter Y With Acute
0xde 0x0162 ;Latin Capital Letter T With Cedilla
0xdf 0x00df ;Latin Small Letter Sharp S
0xe0 0x0155 ;Latin Small Letter R With Acute
0xe1 0x00e1 ;Latin Small Letter A With Acute
0xe2 0x00e2 ;Latin Small Letter A With Circumflex
0xe3 0x0103 ;Latin Small Letter A With Breve
0xe4 0x00e4 ;Latin Small Letter A With Diaeresis
0xe5 0x013a ;Latin Small Letter L With Acute
0xe6 0x0107 ;Latin Small Letter C With Acute
0xe7 0x00e7 ;Latin Small Letter C With Cedilla
0xe8 0x010d ;Latin Small Letter C With Caron
0xe9 0x00e9 ;Latin Small Letter E With Acute
0xea 0x0119 ;Latin Small Letter E With Ogonek
0xeb 0x00eb ;Latin Small Letter E With Diaeresis
0xec 0x011b ;Latin Small Letter E With Caron
0xed 0x00ed ;Latin Small Letter I With Acute
0xee 0x00ee ;Latin Small Letter I With Circumflex
0xef 0x010f ;Latin Small Letter D With Caron
0xf0 0x0111 ;Latin Small Letter D With Stroke
0xf1 0x0144 ;Latin Small Letter N With Acute
0xf2 0x0148 ;Latin Small Letter N With Caron
0xf3 0x00f3 ;Latin Small Letter O With Acute
0xf4 0x00f4 ;Latin Small Letter O With Circumflex
0xf5 0x0151 ;Latin Small Letter O With Double Acute
0xf6 0x00f6 ;Latin Small Letter O With Diaeresis
0xf7 0x00f7 ;Division Sign
0xf8 0x0159 ;Latin Small Letter R With Caron
0xf9 0x016f ;Latin Small Letter U With Ring Above
0xfa 0x00fa ;Latin Small Letter U With Acute
0xfb 0x0171 ;Latin Small Letter U With Double Acute
0xfc 0x00fc ;Latin Small Letter U With Diaeresis
0xfd 0x00fd ;Latin Small Letter Y With Acute
0xfe 0x0163 ;Latin Small Letter T With Cedilla
0xff 0x02d9 ;Dot Above
WCTABLE 693
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x0083 0x83
0x0088 0x88
0x0090 0x90
0x0098 0x98
0x00a0 0xa0 ;No-Break Space
0x00a1 0x21 ;Inverted Exclamation Mark
0x00a2 0x63 ;Cent Sign
0x00a3 0x4c ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0x59 ;Yen Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00aa 0x61 ;Feminine Ordinal Indicator
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00af 0x97 ;Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0x32 ;Superscript Two
0x00b3 0x33 ;Superscript Three
0x00b4 0xb4 ;Acute Accent
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Cedilla
0x00b9 0x31 ;Superscript One
0x00ba 0x6f ;Masculine Ordinal Indicator
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00bc 0x31 ;Vulgar Fraction One Quarter
0x00bd 0x31 ;Vulgar Fraction One Half
0x00be 0x33 ;Vulgar Fraction Three Quarters
0x00bf 0x3f ;Inverted Question Mark
0x00c0 0x41 ;Latin Capital Letter A With Grave
0x00c1 0xc1 ;Latin Capital Letter A With Acute
0x00c2 0xc2 ;Latin Capital Letter A With Circumflex
0x00c3 0x41 ;Latin Capital Letter A With Tilde
0x00c4 0xc4 ;Latin Capital Letter A With Diaeresis
0x00c5 0x41 ;Latin Capital Letter A With Ring Above
0x00c6 0x41 ;Latin Capital Ligature Ae
0x00c7 0xc7 ;Latin Capital Letter C With Cedilla
0x00c8 0x45 ;Latin Capital Letter E With Grave
0x00c9 0xc9 ;Latin Capital Letter E With Acute
0x00ca 0x45 ;Latin Capital Letter E With Circumflex
0x00cb 0xcb ;Latin Capital Letter E With Diaeresis
0x00cc 0x49 ;Latin Capital Letter I With Grave
0x00cd 0xcd ;Latin Capital Letter I With Acute
0x00ce 0xce ;Latin Capital Letter I With Circumflex
0x00cf 0x49 ;Latin Capital Letter I With Diaeresis
0x00d1 0x4e ;Latin Capital Letter N With Tilde
0x00d2 0x4f ;Latin Capital Letter O With Grave
0x00d3 0xd3 ;Latin Capital Letter O With Acute
0x00d4 0xd4 ;Latin Capital Letter O With Circumflex
0x00d5 0x4f ;Latin Capital Letter O With Tilde
0x00d6 0xd6 ;Latin Capital Letter O With Diaeresis
0x00d7 0xd7 ;Multiplication Sign
0x00d8 0x4f ;Latin Capital Letter O With Stroke
0x00d9 0x55 ;Latin Capital Letter U With Grave
0x00da 0xda ;Latin Capital Letter U With Acute
0x00db 0x55 ;Latin Capital Letter U With Circumflex
0x00dc 0xdc ;Latin Capital Letter U With Diaeresis
0x00dd 0xdd ;Latin Capital Letter Y With Acute
0x00df 0xdf ;Latin Small Letter Sharp S
0x00e0 0x61 ;Latin Small Letter A With Grave
0x00e1 0xe1 ;Latin Small Letter A With Acute
0x00e2 0xe2 ;Latin Small Letter A With Circumflex
0x00e3 0x61 ;Latin Small Letter A With Tilde
0x00e4 0xe4 ;Latin Small Letter A With Diaeresis
0x00e5 0x61 ;Latin Small Letter A With Ring Above
0x00e6 0x61 ;Latin Small Ligature Ae
0x00e7 0xe7 ;Latin Small Letter C With Cedilla
0x00e8 0x65 ;Latin Small Letter E With Grave
0x00e9 0xe9 ;Latin Small Letter E With Acute
0x00ea 0x65 ;Latin Small Letter E With Circumflex
0x00eb 0xeb ;Latin Small Letter E With Diaeresis
0x00ec 0x69 ;Latin Small Letter I With Grave
0x00ed 0xed ;Latin Small Letter I With Acute
0x00ee 0xee ;Latin Small Letter I With Circumflex
0x00ef 0x69 ;Latin Small Letter I With Diaeresis
0x00f1 0x6e ;Latin Small Letter N With Tilde
0x00f2 0x6f ;Latin Small Letter O With Grave
0x00f3 0xf3 ;Latin Small Letter O With Acute
0x00f4 0xf4 ;Latin Small Letter O With Circumflex
0x00f5 0x6f ;Latin Small Letter O With Tilde
0x00f6 0xf6 ;Latin Small Letter O With Diaeresis
0x00f7 0xf7 ;Division Sign
0x00f8 0x6f ;Latin Small Letter O With Stroke
0x00f9 0x75 ;Latin Small Letter U With Grave
0x00fa 0xfa ;Latin Small Letter U With Acute
0x00fb 0x75 ;Latin Small Letter U With Circumflex
0x00fc 0xfc ;Latin Small Letter U With Diaeresis
0x00fd 0xfd ;Latin Small Letter Y With Acute
0x00ff 0x79 ;Latin Small Letter Y With Diaeresis
0x0100 0x41 ;Latin Capital Letter A With Macron
0x0101 0x61 ;Latin Small Letter A With Macron
0x0102 0xc3 ;Latin Capital Letter A With Breve
0x0103 0xe3 ;Latin Small Letter A With Breve
0x0104 0xa5 ;Latin Capital Letter A With Ogonek
0x0105 0xb9 ;Latin Small Letter A With Ogonek
0x0106 0xc6 ;Latin Capital Letter C With Acute
0x0107 0xe6 ;Latin Small Letter C With Acute
0x0108 0x43 ;Latin Capital Letter C With Circumflex
0x0109 0x63 ;Latin Small Letter C With Circumflex
0x010a 0x43 ;Latin Capital Letter C With Dot Above
0x010b 0x63 ;Latin Small Letter C With Dot Above
0x010c 0xc8 ;Latin Capital Letter C With Caron
0x010d 0xe8 ;Latin Small Letter C With Caron
0x010e 0xcf ;Latin Capital Letter D With Caron
0x010f 0xef ;Latin Small Letter D With Caron
0x0110 0xd0 ;Latin Capital Letter D With Stroke
0x0111 0xf0 ;Latin Small Letter D With Stroke
0x0112 0x45 ;Latin Capital Letter E With Macron
0x0113 0x65 ;Latin Small Letter E With Macron
0x0114 0x45 ;Latin Capital Letter E With Breve
0x0115 0x65 ;Latin Small Letter E With Breve
0x0116 0x45 ;Latin Capital Letter E With Dot Above
0x0117 0x65 ;Latin Small Letter E With Dot Above
0x0118 0xca ;Latin Capital Letter E With Ogonek
0x0119 0xea ;Latin Small Letter E With Ogonek
0x011a 0xcc ;Latin Capital Letter E With Caron
0x011b 0xec ;Latin Small Letter E With Caron
0x011c 0x47 ;Latin Capital Letter G With Circumflex
0x011d 0x67 ;Latin Small Letter G With Circumflex
0x011e 0x47 ;Latin Capital Letter G With Breve
0x011f 0x67 ;Latin Small Letter G With Breve
0x0120 0x47 ;Latin Capital Letter G With Dot Above
0x0121 0x67 ;Latin Small Letter G With Dot Above
0x0122 0x47 ;Latin Capital Letter G With Cedilla
0x0123 0x67 ;Latin Small Letter G With Cedilla
0x0124 0x48 ;Latin Capital Letter H With Circumflex
0x0125 0x68 ;Latin Small Letter H With Circumflex
0x0126 0x48 ;Latin Capital Letter H With Stroke
0x0127 0x68 ;Latin Small Letter H With Stroke
0x0128 0x49 ;Latin Capital Letter I With Tilde
0x0129 0x69 ;Latin Small Letter I With Tilde
0x012a 0x49 ;Latin Capital Letter I With Macron
0x012b 0x69 ;Latin Small Letter I With Macron
0x012c 0x49 ;Latin Capital Letter I With Breve
0x012d 0x69 ;Latin Small Letter I With Breve
0x012e 0x49 ;Latin Capital Letter I With Ogonek
0x012f 0x69 ;Latin Small Letter I With Ogonek
0x0130 0x49 ;Latin Capital Letter I With Dot Above
0x0131 0x69 ;Latin Small Letter Dotless I
0x0134 0x4a ;Latin Capital Letter J With Circumflex
0x0135 0x6a ;Latin Small Letter J With Circumflex
0x0136 0x4b ;Latin Capital Letter K With Cedilla
0x0137 0x6b ;Latin Small Letter K With Cedilla
0x0139 0xc5 ;Latin Capital Letter L With Acute
0x013a 0xe5 ;Latin Small Letter L With Acute
0x013b 0x4c ;Latin Capital Letter L With Cedilla
0x013c 0x6c ;Latin Small Letter L With Cedilla
0x013d 0xbc ;Latin Capital Letter L With Caron
0x013e 0xbe ;Latin Small Letter L With Caron
0x0141 0xa3 ;Latin Capital Letter L With Stroke
0x0142 0xb3 ;Latin Small Letter L With Stroke
0x0143 0xd1 ;Latin Capital Letter N With Acute
0x0144 0xf1 ;Latin Small Letter N With Acute
0x0145 0x4e ;Latin Capital Letter N With Cedilla
0x0146 0x6e ;Latin Small Letter N With Cedilla
0x0147 0xd2 ;Latin Capital Letter N With Caron
0x0148 0xf2 ;Latin Small Letter N With Caron
0x014c 0x4f ;Latin Capital Letter O With Macron
0x014d 0x6f ;Latin Small Letter O With Macron
0x014e 0x4f ;Latin Capital Letter O With Breve
0x014f 0x6f ;Latin Small Letter O With Breve
0x0150 0xd5 ;Latin Capital Letter O With Double Acute
0x0151 0xf5 ;Latin Small Letter O With Double Acute
0x0152 0x4f ;Latin Capital Ligature Oe
0x0153 0x6f ;Latin Small Ligature Oe
0x0154 0xc0 ;Latin Capital Letter R With Acute
0x0155 0xe0 ;Latin Small Letter R With Acute
0x0156 0x52 ;Latin Capital Letter R With Cedilla
0x0157 0x72 ;Latin Small Letter R With Cedilla
0x0158 0xd8 ;Latin Capital Letter R With Caron
0x0159 0xf8 ;Latin Small Letter R With Caron
0x015a 0x8c ;Latin Capital Letter S With Acute
0x015b 0x9c ;Latin Small Letter S With Acute
0x015c 0x53 ;Latin Capital Letter S With Circumflex
0x015d 0x73 ;Latin Small Letter S With Circumflex
0x015e 0xaa ;Latin Capital Letter S With Cedilla
0x015f 0xba ;Latin Small Letter S With Cedilla
0x0160 0x8a ;Latin Capital Letter S With Caron
0x0161 0x9a ;Latin Small Letter S With Caron
0x0162 0xde ;Latin Capital Letter T With Cedilla
0x0163 0xfe ;Latin Small Letter T With Cedilla
0x0164 0x8d ;Latin Capital Letter T With Caron
0x0165 0x9d ;Latin Small Letter T With Caron
0x0166 0x54 ;Latin Capital Letter T With Stroke
0x0167 0x74 ;Latin Small Letter T With Stroke
0x0168 0x55 ;Latin Capital Letter U With Tilde
0x0169 0x75 ;Latin Small Letter U With Tilde
0x016a 0x55 ;Latin Capital Letter U With Macron
0x016b 0x75 ;Latin Small Letter U With Macron
0x016c 0x55 ;Latin Capital Letter U With Breve
0x016d 0x75 ;Latin Small Letter U With Breve
0x016e 0xd9 ;Latin Capital Letter U With Ring Above
0x016f 0xf9 ;Latin Small Letter U With Ring Above
0x0170 0xdb ;Latin Capital Letter U With Double Acute
0x0171 0xfb ;Latin Small Letter U With Double Acute
0x0172 0x55 ;Latin Capital Letter U With Ogonek
0x0173 0x75 ;Latin Small Letter U With Ogonek
0x0174 0x57 ;Latin Capital Letter W With Circumflex
0x0175 0x77 ;Latin Small Letter W With Circumflex
0x0176 0x59 ;Latin Capital Letter Y With Circumflex
0x0177 0x79 ;Latin Small Letter Y With Circumflex
0x0178 0x59 ;Latin Capital Letter Y With Diaeresis
0x0179 0x8f ;Latin Capital Letter Z With Acute
0x017a 0x9f ;Latin Small Letter Z With Acute
0x017b 0xaf ;Latin Capital Letter Z With Dot Above
0x017c 0xbf ;Latin Small Letter Z With Dot Above
0x017d 0x8e ;Latin Capital Letter Z With Caron
0x017e 0x9e ;Latin Small Letter Z With Caron
0x0180 0x62 ;Latin Small Letter B With Stroke
0x0189 0xd0 ;Latin Capital Letter African D
0x0191 0x46 ;Latin Capital Letter F With Hook
0x0192 0x66 ;Latin Small Letter F With Hook
0x0197 0x49 ;Latin Capital Letter I With Stroke
0x019a 0x6c ;Latin Small Letter L With Bar
0x019f 0x4f ;Latin Capital Letter O With Middle Tilde
0x01a0 0x4f ;Latin Capital Letter O With Horn
0x01a1 0x6f ;Latin Small Letter O With Horn
0x01ab 0x74 ;Latin Small Letter T With Palatal Hook
0x01ae 0x54 ;Latin Capital Letter T With Retroflex Hook
0x01af 0x55 ;Latin Capital Letter U With Horn
0x01b0 0x75 ;Latin Small Letter U With Horn
0x01b6 0x7a ;Latin Small Letter Z With Stroke
0x01c0 0x7c ;Latin Letter Dental Click
0x01c3 0x21 ;Latin Letter Retroflex Click
0x01cd 0x41 ;Latin Capital Letter A With Caron
0x01ce 0x61 ;Latin Small Letter A With Caron
0x01cf 0x49 ;Latin Capital Letter I With Caron
0x01d0 0x69 ;Latin Small Letter I With Caron
0x01d1 0x4f ;Latin Capital Letter O With Caron
0x01d2 0x6f ;Latin Small Letter O With Caron
0x01d3 0x55 ;Latin Capital Letter U With Caron
0x01d4 0x75 ;Latin Small Letter U With Caron
0x01d5 0x55 ;Latin Capital Letter U With Diaeresis And Macron
0x01d6 0x75 ;Latin Small Letter U With Diaeresis And Macron
0x01d7 0x55 ;Latin Capital Letter U With Diaeresis And Acute
0x01d8 0x75 ;Latin Small Letter U With Diaeresis And Acute
0x01d9 0x55 ;Latin Capital Letter U With Diaeresis And Caron
0x01da 0x75 ;Latin Small Letter U With Diaeresis And Caron
0x01db 0x55 ;Latin Capital Letter U With Diaeresis And Grave
0x01dc 0x75 ;Latin Small Letter U With Diaeresis And Grave
0x01de 0x41 ;Latin Capital Letter A With Diaeresis And Macron
0x01df 0x61 ;Latin Small Letter A With Diaeresis And Macron
0x01e4 0x47 ;Latin Capital Letter G With Stroke
0x01e5 0x67 ;Latin Small Letter G With Stroke
0x01e6 0x47 ;Latin Capital Letter G With Caron
0x01e7 0x67 ;Latin Small Letter G With Caron
0x01e8 0x4b ;Latin Capital Letter K With Caron
0x01e9 0x6b ;Latin Small Letter K With Caron
0x01ea 0x4f ;Latin Capital Letter O With Ogonek
0x01eb 0x6f ;Latin Small Letter O With Ogonek
0x01ec 0x4f ;Latin Capital Letter O With Ogonek And Macron
0x01ed 0x6f ;Latin Small Letter O With Ogonek And Macron
0x01f0 0x6a ;Latin Small Letter J With Caron
0x0261 0x67 ;Latin Small Letter Script G
0x02b9 0x27 ;Modifier Letter Prime
0x02ba 0x22 ;Modifier Letter Double Prime
0x02bb 0x91 ;Modifier Letter Turned Comma
0x02bc 0x27 ;Modifier Letter Apostrophe
0x02c4 0x5e ;Modifier Letter Up Arrowhead
0x02c6 0x5e ;Modifier Letter Circumflex Accent
0x02c7 0xa1 ;Caron
0x02c8 0x27 ;Modifier Letter Vertical Line
0x02c9 0xaf ;Modifier Letter Macron
0x02ca 0xb4 ;Modifier Letter Acute Accent
0x02cb 0x60 ;Modifier Letter Grave Accent
0x02cd 0x5f ;Modifier Letter Low Macron
0x02d8 0xa2 ;Breve
0x02d9 0xff ;Dot Above
0x02da 0xb0 ;Ring Above
0x02db 0xb2 ;Ogonek
0x02dc 0x7e ;Small Tilde
0x02dd 0xbd ;Double Acute Accent
0x0300 0x60 ;Combining Grave Accent
0x0301 0xb4 ;Combining Acute Accent
0x0302 0x5e ;Combining Circumflex Accent
0x0303 0x7e ;Combining Tilde
0x0304 0xaf ;Combining Macron
0x0305 0xaf ;Combining Overline
0x0306 0xa2 ;Combining Breve
0x0307 0xff ;Combining Dot Above
0x0308 0xa8 ;Combining Diaeresis
0x030a 0xb0 ;Combining Ring Above
0x030c 0xa1 ;Combining Caron
0x030e 0x22 ;Combining Double Vertical Line Above
0x0327 0xb8 ;Combining Cedilla
0x0331 0x5f ;Combining Macron Below
0x0332 0x5f ;Combining Low Line
0x037e 0x3b ;Greek Question Mark
0x03b2 0xdf ;Greek Small Letter Beta
0x03bc 0xb5 ;Greek Small Letter Mu
0x04bb 0x68 ;Cyrillic Small Letter Shha
0x0589 0x3a ;Armenian Full Stop
0x066a 0x25 ;Arabic Percent Sign
0x2000 0x20 ;En Quad
0x2001 0x20 ;Em Quad
0x2002 0x20 ;En Space
0x2003 0x20 ;Em Space
0x2004 0x20 ;Three-Per-Em Space
0x2005 0x20 ;Four-Per-Em Space
0x2006 0x20 ;Six-Per-Em Space
0x2010 0x2d ;Hyphen
0x2011 0x2d ;Non-Breaking Hyphen
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2024 0x95 ;One Dot Leader
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2032 0x27 ;Prime
0x2033 0x94 ;Double Prime
0x2035 0x60 ;Reversed Prime
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x203c 0x21 ;Double Exclamation Mark
0x2044 0x2f ;Fraction Slash
0x2070 0xb0 ;Superscript Zero
0x2074 0x34 ;Superscript Four
0x2075 0x35 ;Superscript Five
0x2076 0x36 ;Superscript Six
0x2077 0x37 ;Superscript Seven
0x2078 0x38 ;Superscript Eight
0x2080 0x30 ;Subscript Zero
0x2081 0x31 ;Subscript One
0x2082 0x32 ;Subscript Two
0x2083 0x33 ;Subscript Three
0x2084 0x34 ;Subscript Four
0x2085 0x35 ;Subscript Five
0x2086 0x36 ;Subscript Six
0x2087 0x37 ;Subscript Seven
0x2088 0x38 ;Subscript Eight
0x2089 0x39 ;Subscript Nine
0x20ac 0x80 ;Euro Sign
0x20a4 0xa3 ;Lira Sign
0x2102 0x43 ;Double-Struck Capital C
0x2107 0x45 ;Euler Constant
0x210a 0x67 ;Script Small G
0x210b 0x48 ;Script Capital H
0x210c 0x48 ;Black-Letter Capital H
0x210d 0x48 ;Double-Struck Capital H
0x210e 0x68 ;Planck Constant
0x2110 0x49 ;Script Capital I
0x2111 0x49 ;Black-Letter Capital I
0x2112 0x4c ;Script Capital L
0x2113 0x6c ;Script Small L
0x2115 0x4e ;Double-Struck Capital N
0x2118 0x50 ;Script Capital P
0x2119 0x50 ;Double-Struck Capital P
0x211a 0x51 ;Double-Struck Capital Q
0x211b 0x52 ;Script Capital R
0x211c 0x52 ;Black-Letter Capital R
0x211d 0x52 ;Double-Struck Capital R
0x2122 0x99 ;Trade Mark Sign
0x2124 0x5a ;Double-Struck Capital Z
0x2128 0x5a ;Black-Letter Capital Z
0x212a 0x4b ;Kelvin Sign
0x212b 0xc5 ;Angstrom Sign
0x212c 0x42 ;Script Capital B
0x212d 0x43 ;Black-Letter Capital C
0x212e 0x65 ;Estimated Symbol
0x212f 0x65 ;Script Small E
0x2130 0x45 ;Script Capital E
0x2131 0x46 ;Script Capital F
0x2133 0x4d ;Script Capital M
0x2134 0x6f ;Script Small O
0x2190 0x8b ;Leftwards Arrow
0x2191 0x5e ;Upwards Arrow
0x2192 0x9b ;Rightwards Arrow
0x2193 0xa1 ;Downwards Arrow
0x2194 0x2d ;Left Right Arrow
0x2195 0x7c ;Up Down Arrow
0x21a8 0x7c ;Up Down Arrow With Base
0x2205 0xd8 ;Empty Set
0x2212 0x2d ;Minus Sign
0x2213 0xb1 ;Minus-Or-Plus Sign
0x2215 0x2f ;Division Slash
0x2216 0x5c ;Set Minus
0x2217 0x2a ;Asterisk Operator
0x2218 0xb0 ;Ring Operator
0x2219 0x95 ;Bullet Operator
0x221f 0x4c ;Right Angle
0x2223 0x7c ;Divides
0x2236 0x3a ;Ratio
0x223c 0x7e ;Tilde Operator
0x226a 0xab ;Much Less-Than
0x226b 0xbb ;Much Greater-Than
0x22c5 0xb7 ;Dot Operator
0x2302 0xa6 ;House
0x2303 0x5e ;Up Arrowhead
0x2329 0x3c ;Left-Pointing Angle Bracket
0x232a 0x3e ;Right-Pointing Angle Bracket
0x2500 0xa6 ;Box Drawings Light Horizontal
0x2502 0x2d ;Box Drawings Light Vertical
0x250c 0x2d ;Box Drawings Light Down And Right
0x2510 0xac ;Box Drawings Light Down And Left
0x2514 0x4c ;Box Drawings Light Up And Right
0x2518 0x2d ;Box Drawings Light Up And Left
0x251c 0x2b ;Box Drawings Light Vertical And Right
0x2524 0x2b ;Box Drawings Light Vertical And Left
0x252c 0x54 ;Box Drawings Light Down And Horizontal
0x2534 0x2b ;Box Drawings Light Up And Horizontal
0x253c 0x2b ;Box Drawings Light Vertical And Horizontal
0x2550 0x3d ;Box Drawings Double Horizontal
0x2551 0xa6 ;Box Drawings Double Vertical
0x2554 0x2d ;Box Drawings Double Down And Right
0x2557 0xac ;Box Drawings Double Down And Left
0x255a 0x4c ;Box Drawings Double Up And Right
0x255d 0x2d ;Box Drawings Double Up And Left
0x2560 0xa6 ;Box Drawings Double Vertical And Right
0x2563 0xa6 ;Box Drawings Double Vertical And Left
0x2566 0x54 ;Box Drawings Double Down And Horizontal
0x2569 0xa6 ;Box Drawings Double Up And Horizontal
0x256c 0x2b ;Box Drawings Double Vertical And Horizontal
0x2580 0x2d ;Upper Half Block
0x2584 0x2d ;Lower Half Block
0x2588 0x2d ;Full Block
0x2591 0x2d ;Light Shade
0x2592 0x2d ;Medium Shade
0x2593 0x2d ;Dark Shade
0x25a0 0xa6 ;Black Square
0x25ac 0x2d ;Black Rectangle
0x25b2 0x5e ;Black Up-Pointing Triangle
0x25ba 0x3e ;Black Right-Pointing Pointer
0x25bc 0xa1 ;Black Down-Pointing Triangle
0x25c4 0x3c ;Black Left-Pointing Pointer
0x25cb 0x30 ;White Circle
0x25d8 0x95 ;Inverse Bullet
0x25d9 0x30 ;Inverse White Circle
0x263a 0xa2 ;White Smiling Face
0x263b 0xa2 ;Black Smiling Face
0x263c 0x30 ;White Sun With Rays
0x2640 0x2b ;Female Sign
0x2642 0x3e ;Male Sign
0x2660 0xa6 ;Black Spade Suit
0x2663 0xa6 ;Black Club Suit
0x2665 0xa6 ;Black Heart Suit
0x2666 0xa6 ;Black Diamond Suit
0x266a 0x64 ;Eighth Note
0x266b 0x64 ;Beamed Eighth Notes
0x2758 0x7c ;Light Vertical Bar
0x275b 0x91 ;Heavy Single Turned Comma Quotation Mark Ornament
0x275c 0x92 ;Heavy Single Comma Quotation Mark Ornament
0x275d 0x93 ;Heavy Double Turned Comma Quotation Mark Ornament
0x275e 0x94 ;Heavy Double Comma Quotation Mark Ornament
0x3000 0x20 ;Ideographic Space
0x3008 0x3c ;Left Angle Bracket
0x3009 0x3e ;Right Angle Bracket
0x300a 0xab ;Left Double Angle Bracket
0x300b 0xbb ;Right Double Angle Bracket
0x301a 0x5b ;Left White Square Bracket
0x301b 0x5d ;Right White Square Bracket
0x301d 0x93 ;Reversed Double Prime Quotation Mark
0x301e 0x94 ;Double Prime Quotation Mark
0x301f 0x84 ;Low Double Prime Quotation Mark
0x30fb 0xb7 ;Katakana Middle Dot
0x30fc 0x97 ;Katakana-Hiragana Prolonged Sound Mark
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,909 @@
CODEPAGE 1251 ;Cyrillic - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x0402 ;Cyrillic Capital Letter Dje
0x81 0x0403 ;Cyrillic Capital Letter Gje
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0453 ;Cyrillic Small Letter Gje
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x20ac ;Euro Sign
0x89 0x2030 ;Per Mille Sign
0x8a 0x0409 ;Cyrillic Capital Letter Lje
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x040a ;Cyrillic Capital Letter Nje
0x8d 0x040c ;Cyrillic Capital Letter Kje
0x8e 0x040b ;Cyrillic Capital Letter Tshe
0x8f 0x040f ;Cyrillic Capital Letter Dzhe
0x90 0x0452 ;Cyrillic Small Letter Dje
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x0098
0x99 0x2122 ;Trade Mark Sign
0x9a 0x0459 ;Cyrillic Small Letter Lje
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x045a ;Cyrillic Small Letter Nje
0x9d 0x045c ;Cyrillic Small Letter Kje
0x9e 0x045b ;Cyrillic Small Letter Tshe
0x9f 0x045f ;Cyrillic Small Letter Dzhe
0xa0 0x00a0 ;No-Break Space
0xa1 0x040e ;Cyrillic Capital Letter Short U
0xa2 0x045e ;Cyrillic Small Letter Short U
0xa3 0x0408 ;Cyrillic Capital Letter Je
0xa4 0x00a4 ;Currency Sign
0xa5 0x0490 ;Cyrillic Capital Letter Ghe With Upturn
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x0401 ;Cyrillic Capital Letter Io
0xa9 0x00a9 ;Copyright Sign
0xaa 0x0404 ;Cyrillic Capital Letter Ukrainian Ie
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x0407 ;Cyrillic Capital Letter Yi
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x0406 ;Cyrillic Capital Letter Byelorussian-Ukrainian I
0xb3 0x0456 ;Cyrillic Small Letter Byelorussian-Ukrainian I
0xb4 0x0491 ;Cyrillic Small Letter Ghe With Upturn
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x0451 ;Cyrillic Small Letter Io
0xb9 0x2116 ;Numero Sign
0xba 0x0454 ;Cyrillic Small Letter Ukrainian Ie
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x0458 ;Cyrillic Small Letter Je
0xbd 0x0405 ;Cyrillic Capital Letter Dze
0xbe 0x0455 ;Cyrillic Small Letter Dze
0xbf 0x0457 ;Cyrillic Small Letter Yi
0xc0 0x0410 ;Cyrillic Capital Letter A
0xc1 0x0411 ;Cyrillic Capital Letter Be
0xc2 0x0412 ;Cyrillic Capital Letter Ve
0xc3 0x0413 ;Cyrillic Capital Letter Ghe
0xc4 0x0414 ;Cyrillic Capital Letter De
0xc5 0x0415 ;Cyrillic Capital Letter Ie
0xc6 0x0416 ;Cyrillic Capital Letter Zhe
0xc7 0x0417 ;Cyrillic Capital Letter Ze
0xc8 0x0418 ;Cyrillic Capital Letter I
0xc9 0x0419 ;Cyrillic Capital Letter Short I
0xca 0x041a ;Cyrillic Capital Letter Ka
0xcb 0x041b ;Cyrillic Capital Letter El
0xcc 0x041c ;Cyrillic Capital Letter Em
0xcd 0x041d ;Cyrillic Capital Letter En
0xce 0x041e ;Cyrillic Capital Letter O
0xcf 0x041f ;Cyrillic Capital Letter Pe
0xd0 0x0420 ;Cyrillic Capital Letter Er
0xd1 0x0421 ;Cyrillic Capital Letter Es
0xd2 0x0422 ;Cyrillic Capital Letter Te
0xd3 0x0423 ;Cyrillic Capital Letter U
0xd4 0x0424 ;Cyrillic Capital Letter Ef
0xd5 0x0425 ;Cyrillic Capital Letter Ha
0xd6 0x0426 ;Cyrillic Capital Letter Tse
0xd7 0x0427 ;Cyrillic Capital Letter Che
0xd8 0x0428 ;Cyrillic Capital Letter Sha
0xd9 0x0429 ;Cyrillic Capital Letter Shcha
0xda 0x042a ;Cyrillic Capital Letter Hard Sign
0xdb 0x042b ;Cyrillic Capital Letter Yeru
0xdc 0x042c ;Cyrillic Capital Letter Soft Sign
0xdd 0x042d ;Cyrillic Capital Letter E
0xde 0x042e ;Cyrillic Capital Letter Yu
0xdf 0x042f ;Cyrillic Capital Letter Ya
0xe0 0x0430 ;Cyrillic Small Letter A
0xe1 0x0431 ;Cyrillic Small Letter Be
0xe2 0x0432 ;Cyrillic Small Letter Ve
0xe3 0x0433 ;Cyrillic Small Letter Ghe
0xe4 0x0434 ;Cyrillic Small Letter De
0xe5 0x0435 ;Cyrillic Small Letter Ie
0xe6 0x0436 ;Cyrillic Small Letter Zhe
0xe7 0x0437 ;Cyrillic Small Letter Ze
0xe8 0x0438 ;Cyrillic Small Letter I
0xe9 0x0439 ;Cyrillic Small Letter Short I
0xea 0x043a ;Cyrillic Small Letter Ka
0xeb 0x043b ;Cyrillic Small Letter El
0xec 0x043c ;Cyrillic Small Letter Em
0xed 0x043d ;Cyrillic Small Letter En
0xee 0x043e ;Cyrillic Small Letter O
0xef 0x043f ;Cyrillic Small Letter Pe
0xf0 0x0440 ;Cyrillic Small Letter Er
0xf1 0x0441 ;Cyrillic Small Letter Es
0xf2 0x0442 ;Cyrillic Small Letter Te
0xf3 0x0443 ;Cyrillic Small Letter U
0xf4 0x0444 ;Cyrillic Small Letter Ef
0xf5 0x0445 ;Cyrillic Small Letter Ha
0xf6 0x0446 ;Cyrillic Small Letter Tse
0xf7 0x0447 ;Cyrillic Small Letter Che
0xf8 0x0448 ;Cyrillic Small Letter Sha
0xf9 0x0449 ;Cyrillic Small Letter Shcha
0xfa 0x044a ;Cyrillic Small Letter Hard Sign
0xfb 0x044b ;Cyrillic Small Letter Yeru
0xfc 0x044c ;Cyrillic Small Letter Soft Sign
0xfd 0x044d ;Cyrillic Small Letter E
0xfe 0x044e ;Cyrillic Small Letter Yu
0xff 0x044f ;Cyrillic Small Letter Ya
WCTABLE 640
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0098 0x98
0x00a0 0xa0 ;No-Break Space
0x00a4 0xa4 ;Currency Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a9 0xa9 ;Copyright Sign
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00c0 0x41 ;Latin Capital Letter A With Grave
0x00c1 0x41 ;Latin Capital Letter A With Acute
0x00c2 0x41 ;Latin Capital Letter A With Circumflex
0x00c3 0x41 ;Latin Capital Letter A With Tilde
0x00c4 0x41 ;Latin Capital Letter A With Diaeresis
0x00c5 0x41 ;Latin Capital Letter A With Ring Above
0x00c7 0x43 ;Latin Capital Letter C With Cedilla
0x00c8 0x45 ;Latin Capital Letter E With Grave
0x00c9 0x45 ;Latin Capital Letter E With Acute
0x00ca 0x45 ;Latin Capital Letter E With Circumflex
0x00cb 0x45 ;Latin Capital Letter E With Diaeresis
0x00cc 0x49 ;Latin Capital Letter I With Grave
0x00cd 0x49 ;Latin Capital Letter I With Acute
0x00ce 0x49 ;Latin Capital Letter I With Circumflex
0x00cf 0x49 ;Latin Capital Letter I With Diaeresis
0x00d1 0x4e ;Latin Capital Letter N With Tilde
0x00d2 0x4f ;Latin Capital Letter O With Grave
0x00d3 0x4f ;Latin Capital Letter O With Acute
0x00d4 0x4f ;Latin Capital Letter O With Circumflex
0x00d5 0x4f ;Latin Capital Letter O With Tilde
0x00d6 0x4f ;Latin Capital Letter O With Diaeresis
0x00d8 0x4f ;Latin Capital Letter O With Stroke
0x00d9 0x55 ;Latin Capital Letter U With Grave
0x00da 0x55 ;Latin Capital Letter U With Acute
0x00db 0x55 ;Latin Capital Letter U With Circumflex
0x00dc 0x55 ;Latin Capital Letter U With Diaeresis
0x00dd 0x59 ;Latin Capital Letter Y With Acute
0x00e0 0x61 ;Latin Small Letter A With Grave
0x00e1 0x61 ;Latin Small Letter A With Acute
0x00e2 0x61 ;Latin Small Letter A With Circumflex
0x00e3 0x61 ;Latin Small Letter A With Tilde
0x00e4 0x61 ;Latin Small Letter A With Diaeresis
0x00e5 0x61 ;Latin Small Letter A With Ring Above
0x00e7 0x63 ;Latin Small Letter C With Cedilla
0x00e8 0x65 ;Latin Small Letter E With Grave
0x00e9 0x65 ;Latin Small Letter E With Acute
0x00ea 0x65 ;Latin Small Letter E With Circumflex
0x00eb 0x65 ;Latin Small Letter E With Diaeresis
0x00ec 0x69 ;Latin Small Letter I With Grave
0x00ed 0x69 ;Latin Small Letter I With Acute
0x00ee 0x69 ;Latin Small Letter I With Circumflex
0x00ef 0x69 ;Latin Small Letter I With Diaeresis
0x00f1 0x6e ;Latin Small Letter N With Tilde
0x00f2 0x6f ;Latin Small Letter O With Grave
0x00f3 0x6f ;Latin Small Letter O With Acute
0x00f4 0x6f ;Latin Small Letter O With Circumflex
0x00f5 0x6f ;Latin Small Letter O With Tilde
0x00f6 0x6f ;Latin Small Letter O With Diaeresis
0x00f8 0x6f ;Latin Small Letter O With Stroke
0x00f9 0x75 ;Latin Small Letter U With Grave
0x00fa 0x75 ;Latin Small Letter U With Acute
0x00fb 0x75 ;Latin Small Letter U With Circumflex
0x00fc 0x75 ;Latin Small Letter U With Diaeresis
0x00fd 0x79 ;Latin Small Letter Y With Acute
0x00ff 0x79 ;Latin Small Letter Y With Diaeresis
0x0100 0x41 ;Latin Capital Letter A With Macron
0x0101 0x61 ;Latin Small Letter A With Macron
0x0102 0x41 ;Latin Capital Letter A With Breve
0x0103 0x61 ;Latin Small Letter A With Breve
0x0104 0x41 ;Latin Capital Letter A With Ogonek
0x0105 0x61 ;Latin Small Letter A With Ogonek
0x0106 0x43 ;Latin Capital Letter C With Acute
0x0107 0x63 ;Latin Small Letter C With Acute
0x0108 0x43 ;Latin Capital Letter C With Circumflex
0x0109 0x63 ;Latin Small Letter C With Circumflex
0x010a 0x43 ;Latin Capital Letter C With Dot Above
0x010b 0x63 ;Latin Small Letter C With Dot Above
0x010c 0x43 ;Latin Capital Letter C With Caron
0x010d 0x63 ;Latin Small Letter C With Caron
0x010e 0x44 ;Latin Capital Letter D With Caron
0x010f 0x64 ;Latin Small Letter D With Caron
0x0110 0x44 ;Latin Capital Letter D With Stroke
0x0111 0x64 ;Latin Small Letter D With Stroke
0x0112 0x45 ;Latin Capital Letter E With Macron
0x0113 0x65 ;Latin Small Letter E With Macron
0x0114 0x45 ;Latin Capital Letter E With Breve
0x0115 0x65 ;Latin Small Letter E With Breve
0x0116 0x45 ;Latin Capital Letter E With Dot Above
0x0117 0x65 ;Latin Small Letter E With Dot Above
0x0118 0x45 ;Latin Capital Letter E With Ogonek
0x0119 0x65 ;Latin Small Letter E With Ogonek
0x011a 0x45 ;Latin Capital Letter E With Caron
0x011b 0x65 ;Latin Small Letter E With Caron
0x011c 0x47 ;Latin Capital Letter G With Circumflex
0x011d 0x67 ;Latin Small Letter G With Circumflex
0x011e 0x47 ;Latin Capital Letter G With Breve
0x011f 0x67 ;Latin Small Letter G With Breve
0x0120 0x47 ;Latin Capital Letter G With Dot Above
0x0121 0x67 ;Latin Small Letter G With Dot Above
0x0122 0x47 ;Latin Capital Letter G With Cedilla
0x0123 0x67 ;Latin Small Letter G With Cedilla
0x0124 0x48 ;Latin Capital Letter H With Circumflex
0x0125 0x68 ;Latin Small Letter H With Circumflex
0x0126 0x48 ;Latin Capital Letter H With Stroke
0x0127 0x68 ;Latin Small Letter H With Stroke
0x0128 0x49 ;Latin Capital Letter I With Tilde
0x0129 0x69 ;Latin Small Letter I With Tilde
0x012a 0x49 ;Latin Capital Letter I With Macron
0x012b 0x69 ;Latin Small Letter I With Macron
0x012c 0x49 ;Latin Capital Letter I With Breve
0x012d 0x69 ;Latin Small Letter I With Breve
0x012e 0x49 ;Latin Capital Letter I With Ogonek
0x012f 0x69 ;Latin Small Letter I With Ogonek
0x0130 0x49 ;Latin Capital Letter I With Dot Above
0x0134 0x4a ;Latin Capital Letter J With Circumflex
0x0135 0x6a ;Latin Small Letter J With Circumflex
0x0136 0x4b ;Latin Capital Letter K With Cedilla
0x0137 0x6b ;Latin Small Letter K With Cedilla
0x0139 0x4c ;Latin Capital Letter L With Acute
0x013a 0x6c ;Latin Small Letter L With Acute
0x013b 0x4c ;Latin Capital Letter L With Cedilla
0x013c 0x6c ;Latin Small Letter L With Cedilla
0x013d 0x4c ;Latin Capital Letter L With Caron
0x013e 0x6c ;Latin Small Letter L With Caron
0x0141 0x4c ;Latin Capital Letter L With Stroke
0x0142 0x6c ;Latin Small Letter L With Stroke
0x0143 0x4e ;Latin Capital Letter N With Acute
0x0144 0x6e ;Latin Small Letter N With Acute
0x0145 0x4e ;Latin Capital Letter N With Cedilla
0x0146 0x6e ;Latin Small Letter N With Cedilla
0x0147 0x4e ;Latin Capital Letter N With Caron
0x0148 0x6e ;Latin Small Letter N With Caron
0x014c 0x4f ;Latin Capital Letter O With Macron
0x014d 0x6f ;Latin Small Letter O With Macron
0x014e 0x4f ;Latin Capital Letter O With Breve
0x014f 0x6f ;Latin Small Letter O With Breve
0x0150 0x4f ;Latin Capital Letter O With Double Acute
0x0151 0x6f ;Latin Small Letter O With Double Acute
0x0154 0x52 ;Latin Capital Letter R With Acute
0x0155 0x72 ;Latin Small Letter R With Acute
0x0156 0x52 ;Latin Capital Letter R With Cedilla
0x0157 0x72 ;Latin Small Letter R With Cedilla
0x0158 0x52 ;Latin Capital Letter R With Caron
0x0159 0x72 ;Latin Small Letter R With Caron
0x015a 0x53 ;Latin Capital Letter S With Acute
0x015b 0x73 ;Latin Small Letter S With Acute
0x015c 0x53 ;Latin Capital Letter S With Circumflex
0x015d 0x73 ;Latin Small Letter S With Circumflex
0x015e 0x53 ;Latin Capital Letter S With Cedilla
0x015f 0x73 ;Latin Small Letter S With Cedilla
0x0160 0x53 ;Latin Capital Letter S With Caron
0x0161 0x73 ;Latin Small Letter S With Caron
0x0162 0x54 ;Latin Capital Letter T With Cedilla
0x0163 0x74 ;Latin Small Letter T With Cedilla
0x0164 0x54 ;Latin Capital Letter T With Caron
0x0165 0x74 ;Latin Small Letter T With Caron
0x0166 0x54 ;Latin Capital Letter T With Stroke
0x0167 0x74 ;Latin Small Letter T With Stroke
0x0168 0x55 ;Latin Capital Letter U With Tilde
0x0169 0x75 ;Latin Small Letter U With Tilde
0x016a 0x55 ;Latin Capital Letter U With Macron
0x016b 0x75 ;Latin Small Letter U With Macron
0x016c 0x55 ;Latin Capital Letter U With Breve
0x016d 0x75 ;Latin Small Letter U With Breve
0x016e 0x55 ;Latin Capital Letter U With Ring Above
0x016f 0x75 ;Latin Small Letter U With Ring Above
0x0170 0x55 ;Latin Capital Letter U With Double Acute
0x0171 0x75 ;Latin Small Letter U With Double Acute
0x0172 0x55 ;Latin Capital Letter U With Ogonek
0x0173 0x75 ;Latin Small Letter U With Ogonek
0x0174 0x57 ;Latin Capital Letter W With Circumflex
0x0175 0x77 ;Latin Small Letter W With Circumflex
0x0176 0x59 ;Latin Capital Letter Y With Circumflex
0x0177 0x79 ;Latin Small Letter Y With Circumflex
0x0178 0x59 ;Latin Capital Letter Y With Diaeresis
0x0179 0x5a ;Latin Capital Letter Z With Acute
0x017a 0x7a ;Latin Small Letter Z With Acute
0x017b 0x5a ;Latin Capital Letter Z With Dot Above
0x017c 0x7a ;Latin Small Letter Z With Dot Above
0x017d 0x5a ;Latin Capital Letter Z With Caron
0x017e 0x7a ;Latin Small Letter Z With Caron
0x0180 0x62 ;Latin Small Letter B With Stroke
0x0197 0x49 ;Latin Capital Letter I With Stroke
0x019a 0x6c ;Latin Small Letter L With Bar
0x019f 0x4f ;Latin Capital Letter O With Middle Tilde
0x01a0 0x4f ;Latin Capital Letter O With Horn
0x01a1 0x6f ;Latin Small Letter O With Horn
0x01ab 0x74 ;Latin Small Letter T With Palatal Hook
0x01ae 0x54 ;Latin Capital Letter T With Retroflex Hook
0x01af 0x55 ;Latin Capital Letter U With Horn
0x01b0 0x75 ;Latin Small Letter U With Horn
0x01cd 0x41 ;Latin Capital Letter A With Caron
0x01ce 0x61 ;Latin Small Letter A With Caron
0x01cf 0x49 ;Latin Capital Letter I With Caron
0x01d0 0x69 ;Latin Small Letter I With Caron
0x01d1 0x4f ;Latin Capital Letter O With Caron
0x01d2 0x6f ;Latin Small Letter O With Caron
0x01d3 0x55 ;Latin Capital Letter U With Caron
0x01d4 0x75 ;Latin Small Letter U With Caron
0x01d5 0x55 ;Latin Capital Letter U With Diaeresis And Macron
0x01d6 0x75 ;Latin Small Letter U With Diaeresis And Macron
0x01d7 0x55 ;Latin Capital Letter U With Diaeresis And Acute
0x01d8 0x75 ;Latin Small Letter U With Diaeresis And Acute
0x01d9 0x55 ;Latin Capital Letter U With Diaeresis And Caron
0x01da 0x75 ;Latin Small Letter U With Diaeresis And Caron
0x01db 0x55 ;Latin Capital Letter U With Diaeresis And Grave
0x01dc 0x75 ;Latin Small Letter U With Diaeresis And Grave
0x01de 0x41 ;Latin Capital Letter A With Diaeresis And Macron
0x01df 0x61 ;Latin Small Letter A With Diaeresis And Macron
0x01e4 0x47 ;Latin Capital Letter G With Stroke
0x01e5 0x67 ;Latin Small Letter G With Stroke
0x01e6 0x47 ;Latin Capital Letter G With Caron
0x01e7 0x67 ;Latin Small Letter G With Caron
0x01e8 0x4b ;Latin Capital Letter K With Caron
0x01e9 0x6b ;Latin Small Letter K With Caron
0x01ea 0x4f ;Latin Capital Letter O With Ogonek
0x01eb 0x6f ;Latin Small Letter O With Ogonek
0x01ec 0x4f ;Latin Capital Letter O With Ogonek And Macron
0x01ed 0x6f ;Latin Small Letter O With Ogonek And Macron
0x01f0 0x6a ;Latin Small Letter J With Caron
0x0401 0xa8 ;Cyrillic Capital Letter Io
0x0402 0x80 ;Cyrillic Capital Letter Dje
0x0403 0x81 ;Cyrillic Capital Letter Gje
0x0404 0xaa ;Cyrillic Capital Letter Ukrainian Ie
0x0405 0xbd ;Cyrillic Capital Letter Dze
0x0406 0xb2 ;Cyrillic Capital Letter Byelorussian-Ukrainian I
0x0407 0xaf ;Cyrillic Capital Letter Yi
0x0408 0xa3 ;Cyrillic Capital Letter Je
0x0409 0x8a ;Cyrillic Capital Letter Lje
0x040a 0x8c ;Cyrillic Capital Letter Nje
0x040b 0x8e ;Cyrillic Capital Letter Tshe
0x040c 0x8d ;Cyrillic Capital Letter Kje
0x040e 0xa1 ;Cyrillic Capital Letter Short U
0x040f 0x8f ;Cyrillic Capital Letter Dzhe
0x0410 0xc0 ;Cyrillic Capital Letter A
0x0411 0xc1 ;Cyrillic Capital Letter Be
0x0412 0xc2 ;Cyrillic Capital Letter Ve
0x0413 0xc3 ;Cyrillic Capital Letter Ghe
0x0414 0xc4 ;Cyrillic Capital Letter De
0x0415 0xc5 ;Cyrillic Capital Letter Ie
0x0416 0xc6 ;Cyrillic Capital Letter Zhe
0x0417 0xc7 ;Cyrillic Capital Letter Ze
0x0418 0xc8 ;Cyrillic Capital Letter I
0x0419 0xc9 ;Cyrillic Capital Letter Short I
0x041a 0xca ;Cyrillic Capital Letter Ka
0x041b 0xcb ;Cyrillic Capital Letter El
0x041c 0xcc ;Cyrillic Capital Letter Em
0x041d 0xcd ;Cyrillic Capital Letter En
0x041e 0xce ;Cyrillic Capital Letter O
0x041f 0xcf ;Cyrillic Capital Letter Pe
0x0420 0xd0 ;Cyrillic Capital Letter Er
0x0421 0xd1 ;Cyrillic Capital Letter Es
0x0422 0xd2 ;Cyrillic Capital Letter Te
0x0423 0xd3 ;Cyrillic Capital Letter U
0x0424 0xd4 ;Cyrillic Capital Letter Ef
0x0425 0xd5 ;Cyrillic Capital Letter Ha
0x0426 0xd6 ;Cyrillic Capital Letter Tse
0x0427 0xd7 ;Cyrillic Capital Letter Che
0x0428 0xd8 ;Cyrillic Capital Letter Sha
0x0429 0xd9 ;Cyrillic Capital Letter Shcha
0x042a 0xda ;Cyrillic Capital Letter Hard Sign
0x042b 0xdb ;Cyrillic Capital Letter Yeru
0x042c 0xdc ;Cyrillic Capital Letter Soft Sign
0x042d 0xdd ;Cyrillic Capital Letter E
0x042e 0xde ;Cyrillic Capital Letter Yu
0x042f 0xdf ;Cyrillic Capital Letter Ya
0x0430 0xe0 ;Cyrillic Small Letter A
0x0431 0xe1 ;Cyrillic Small Letter Be
0x0432 0xe2 ;Cyrillic Small Letter Ve
0x0433 0xe3 ;Cyrillic Small Letter Ghe
0x0434 0xe4 ;Cyrillic Small Letter De
0x0435 0xe5 ;Cyrillic Small Letter Ie
0x0436 0xe6 ;Cyrillic Small Letter Zhe
0x0437 0xe7 ;Cyrillic Small Letter Ze
0x0438 0xe8 ;Cyrillic Small Letter I
0x0439 0xe9 ;Cyrillic Small Letter Short I
0x043a 0xea ;Cyrillic Small Letter Ka
0x043b 0xeb ;Cyrillic Small Letter El
0x043c 0xec ;Cyrillic Small Letter Em
0x043d 0xed ;Cyrillic Small Letter En
0x043e 0xee ;Cyrillic Small Letter O
0x043f 0xef ;Cyrillic Small Letter Pe
0x0440 0xf0 ;Cyrillic Small Letter Er
0x0441 0xf1 ;Cyrillic Small Letter Es
0x0442 0xf2 ;Cyrillic Small Letter Te
0x0443 0xf3 ;Cyrillic Small Letter U
0x0444 0xf4 ;Cyrillic Small Letter Ef
0x0445 0xf5 ;Cyrillic Small Letter Ha
0x0446 0xf6 ;Cyrillic Small Letter Tse
0x0447 0xf7 ;Cyrillic Small Letter Che
0x0448 0xf8 ;Cyrillic Small Letter Sha
0x0449 0xf9 ;Cyrillic Small Letter Shcha
0x044a 0xfa ;Cyrillic Small Letter Hard Sign
0x044b 0xfb ;Cyrillic Small Letter Yeru
0x044c 0xfc ;Cyrillic Small Letter Soft Sign
0x044d 0xfd ;Cyrillic Small Letter E
0x044e 0xfe ;Cyrillic Small Letter Yu
0x044f 0xff ;Cyrillic Small Letter Ya
0x0451 0xb8 ;Cyrillic Small Letter Io
0x0452 0x90 ;Cyrillic Small Letter Dje
0x0453 0x83 ;Cyrillic Small Letter Gje
0x0454 0xba ;Cyrillic Small Letter Ukrainian Ie
0x0455 0xbe ;Cyrillic Small Letter Dze
0x0456 0xb3 ;Cyrillic Small Letter Byelorussian-Ukrainian I
0x0457 0xbf ;Cyrillic Small Letter Yi
0x0458 0xbc ;Cyrillic Small Letter Je
0x0459 0x9a ;Cyrillic Small Letter Lje
0x045a 0x9c ;Cyrillic Small Letter Nje
0x045b 0x9e ;Cyrillic Small Letter Tshe
0x045c 0x9d ;Cyrillic Small Letter Kje
0x045e 0xa2 ;Cyrillic Small Letter Short U
0x045f 0x9f ;Cyrillic Small Letter Dzhe
0x0490 0xa5 ;Cyrillic Capital Letter Ghe With Upturn
0x0491 0xb4 ;Cyrillic Small Letter Ghe With Upturn
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x203c 0x21 ;Double Exclamation Mark
0x20ac 0x88 ;Euro Sign
0x2116 0xb9 ;Numero Sign
0x2122 0x99 ;Trade Mark Sign
0x2190 0x3c ;Leftwards Arrow
0x2191 0x5e ;Upwards Arrow
0x2192 0x3e ;Rightwards Arrow
0x2193 0x76 ;Downwards Arrow
0x2194 0x2d ;Left Right Arrow
0x2195 0xa6 ;Up Down Arrow
0x21a8 0xa6 ;Up Down Arrow With Base
0x2219 0x95 ;Bullet Operator
0x221a 0x76 ;Square Root
0x221f 0x4c ;Right Angle
0x2302 0xa6 ;House
0x2500 0x2d ;Box Drawings Light Horizontal
0x2502 0xa6 ;Box Drawings Light Vertical
0x250c 0x2d ;Box Drawings Light Down And Right
0x2510 0xac ;Box Drawings Light Down And Left
0x2514 0x4c ;Box Drawings Light Up And Right
0x2518 0x2d ;Box Drawings Light Up And Left
0x251c 0x2b ;Box Drawings Light Vertical And Right
0x2524 0x2b ;Box Drawings Light Vertical And Left
0x252c 0x54 ;Box Drawings Light Down And Horizontal
0x2534 0x2b ;Box Drawings Light Up And Horizontal
0x253c 0x2b ;Box Drawings Light Vertical And Horizontal
0x2550 0x3d ;Box Drawings Double Horizontal
0x2551 0xa6 ;Box Drawings Double Vertical
0x2552 0x2d ;Box Drawings Down Single And Right Double
0x2553 0xe3 ;Box Drawings Down Double And Right Single
0x2554 0xe3 ;Box Drawings Double Down And Right
0x2555 0xac ;Box Drawings Down Single And Left Double
0x2556 0xac ;Box Drawings Down Double And Left Single
0x2557 0xac ;Box Drawings Double Down And Left
0x2558 0x4c ;Box Drawings Up Single And Right Double
0x2559 0x4c ;Box Drawings Up Double And Right Single
0x255a 0x4c ;Box Drawings Double Up And Right
0x255b 0x2d ;Box Drawings Up Single And Left Double
0x255c 0x2d ;Box Drawings Up Double And Left Single
0x255d 0x2d ;Box Drawings Double Up And Left
0x255e 0xa6 ;Box Drawings Vertical Single And Right Double
0x255f 0xa6 ;Box Drawings Vertical Double And Right Single
0x2560 0xa6 ;Box Drawings Double Vertical And Right
0x2561 0xa6 ;Box Drawings Vertical Single And Left Double
0x2562 0xa6 ;Box Drawings Vertical Double And Left Single
0x2563 0xa6 ;Box Drawings Double Vertical And Left
0x2564 0x54 ;Box Drawings Down Single And Horizontal Double
0x2565 0x54 ;Box Drawings Down Double And Horizontal Single
0x2566 0x54 ;Box Drawings Double Down And Horizontal
0x2567 0xa6 ;Box Drawings Up Single And Horizontal Double
0x2568 0xa6 ;Box Drawings Up Double And Horizontal Single
0x2569 0xa6 ;Box Drawings Double Up And Horizontal
0x256a 0x2b ;Box Drawings Vertical Single And Horizontal Double
0x256b 0x2b ;Box Drawings Vertical Double And Horizontal Single
0x256c 0x2b ;Box Drawings Double Vertical And Horizontal
0x2580 0x2d ;Upper Half Block
0x2584 0x2d ;Lower Half Block
0x2588 0x2d ;Full Block
0x258c 0xa6 ;Left Half Block
0x2590 0xa6 ;Right Half Block
0x2591 0x2d ;Light Shade
0x2592 0x2d ;Medium Shade
0x2593 0x2d ;Dark Shade
0x25a0 0xa6 ;Black Square
0x25ac 0x2d ;Black Rectangle
0x25b2 0x5e ;Black Up-Pointing Triangle
0x25ba 0x3e ;Black Right-Pointing Pointer
0x25bc 0xa1 ;Black Down-Pointing Triangle
0x25c4 0x3c ;Black Left-Pointing Pointer
0x25cb 0x30 ;White Circle
0x25d8 0x95 ;Inverse Bullet
0x25d9 0x30 ;Inverse White Circle
0x263a 0x4f ;White Smiling Face
0x263b 0x4f ;Black Smiling Face
0x263c 0x30 ;White Sun With Rays
0x2640 0x2b ;Female Sign
0x2642 0x3e ;Male Sign
0x2660 0xa6 ;Black Spade Suit
0x2663 0xa6 ;Black Club Suit
0x2665 0xa6 ;Black Heart Suit
0x2666 0xa6 ;Black Diamond Suit
0x266a 0x64 ;Eighth Note
0x266b 0x64 ;Beamed Eighth Notes
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,967 @@
CODEPAGE 1252 ;Latin I - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0192 ;Latin Small Letter F With Hook
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x02c6 ;Modifier Letter Circumflex Accent
0x89 0x2030 ;Per Mille Sign
0x8a 0x0160 ;Latin Capital Letter S With Caron
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x0152 ;Latin Capital Ligature Oe
0x8d 0x008d
0x8e 0x017d ;Latin Capital Letter Z With Caron
0x8f 0x008f
0x90 0x0090
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x02dc ;Small Tilde
0x99 0x2122 ;Trade Mark Sign
0x9a 0x0161 ;Latin Small Letter S With Caron
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x0153 ;Latin Small Ligature Oe
0x9d 0x009d
0x9e 0x017e ;Latin Small Letter Z With Caron
0x9f 0x0178 ;Latin Capital Letter Y With Diaeresis
0xa0 0x00a0 ;No-Break Space
0xa1 0x00a1 ;Inverted Exclamation Mark
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x00aa ;Feminine Ordinal Indicator
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x00af ;Macron
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x00b2 ;Superscript Two
0xb3 0x00b3 ;Superscript Three
0xb4 0x00b4 ;Acute Accent
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Cedilla
0xb9 0x00b9 ;Superscript One
0xba 0x00ba ;Masculine Ordinal Indicator
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x00bc ;Vulgar Fraction One Quarter
0xbd 0x00bd ;Vulgar Fraction One Half
0xbe 0x00be ;Vulgar Fraction Three Quarters
0xbf 0x00bf ;Inverted Question Mark
0xc0 0x00c0 ;Latin Capital Letter A With Grave
0xc1 0x00c1 ;Latin Capital Letter A With Acute
0xc2 0x00c2 ;Latin Capital Letter A With Circumflex
0xc3 0x00c3 ;Latin Capital Letter A With Tilde
0xc4 0x00c4 ;Latin Capital Letter A With Diaeresis
0xc5 0x00c5 ;Latin Capital Letter A With Ring Above
0xc6 0x00c6 ;Latin Capital Ligature Ae
0xc7 0x00c7 ;Latin Capital Letter C With Cedilla
0xc8 0x00c8 ;Latin Capital Letter E With Grave
0xc9 0x00c9 ;Latin Capital Letter E With Acute
0xca 0x00ca ;Latin Capital Letter E With Circumflex
0xcb 0x00cb ;Latin Capital Letter E With Diaeresis
0xcc 0x00cc ;Latin Capital Letter I With Grave
0xcd 0x00cd ;Latin Capital Letter I With Acute
0xce 0x00ce ;Latin Capital Letter I With Circumflex
0xcf 0x00cf ;Latin Capital Letter I With Diaeresis
0xd0 0x00d0 ;Latin Capital Letter Eth
0xd1 0x00d1 ;Latin Capital Letter N With Tilde
0xd2 0x00d2 ;Latin Capital Letter O With Grave
0xd3 0x00d3 ;Latin Capital Letter O With Acute
0xd4 0x00d4 ;Latin Capital Letter O With Circumflex
0xd5 0x00d5 ;Latin Capital Letter O With Tilde
0xd6 0x00d6 ;Latin Capital Letter O With Diaeresis
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x00d8 ;Latin Capital Letter O With Stroke
0xd9 0x00d9 ;Latin Capital Letter U With Grave
0xda 0x00da ;Latin Capital Letter U With Acute
0xdb 0x00db ;Latin Capital Letter U With Circumflex
0xdc 0x00dc ;Latin Capital Letter U With Diaeresis
0xdd 0x00dd ;Latin Capital Letter Y With Acute
0xde 0x00de ;Latin Capital Letter Thorn
0xdf 0x00df ;Latin Small Letter Sharp S
0xe0 0x00e0 ;Latin Small Letter A With Grave
0xe1 0x00e1 ;Latin Small Letter A With Acute
0xe2 0x00e2 ;Latin Small Letter A With Circumflex
0xe3 0x00e3 ;Latin Small Letter A With Tilde
0xe4 0x00e4 ;Latin Small Letter A With Diaeresis
0xe5 0x00e5 ;Latin Small Letter A With Ring Above
0xe6 0x00e6 ;Latin Small Ligature Ae
0xe7 0x00e7 ;Latin Small Letter C With Cedilla
0xe8 0x00e8 ;Latin Small Letter E With Grave
0xe9 0x00e9 ;Latin Small Letter E With Acute
0xea 0x00ea ;Latin Small Letter E With Circumflex
0xeb 0x00eb ;Latin Small Letter E With Diaeresis
0xec 0x00ec ;Latin Small Letter I With Grave
0xed 0x00ed ;Latin Small Letter I With Acute
0xee 0x00ee ;Latin Small Letter I With Circumflex
0xef 0x00ef ;Latin Small Letter I With Diaeresis
0xf0 0x00f0 ;Latin Small Letter Eth
0xf1 0x00f1 ;Latin Small Letter N With Tilde
0xf2 0x00f2 ;Latin Small Letter O With Grave
0xf3 0x00f3 ;Latin Small Letter O With Acute
0xf4 0x00f4 ;Latin Small Letter O With Circumflex
0xf5 0x00f5 ;Latin Small Letter O With Tilde
0xf6 0x00f6 ;Latin Small Letter O With Diaeresis
0xf7 0x00f7 ;Division Sign
0xf8 0x00f8 ;Latin Small Letter O With Stroke
0xf9 0x00f9 ;Latin Small Letter U With Grave
0xfa 0x00fa ;Latin Small Letter U With Acute
0xfb 0x00fb ;Latin Small Letter U With Circumflex
0xfc 0x00fc ;Latin Small Letter U With Diaeresis
0xfd 0x00fd ;Latin Small Letter Y With Acute
0xfe 0x00fe ;Latin Small Letter Thorn
0xff 0x00ff ;Latin Small Letter Y With Diaeresis
WCTABLE 698
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x008d 0x8d
0x008f 0x8f
0x0090 0x90
0x009d 0x9d
0x00a0 0xa0 ;No-Break Space
0x00a1 0xa1 ;Inverted Exclamation Mark
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00aa 0xaa ;Feminine Ordinal Indicator
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00af 0xaf ;Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0xb2 ;Superscript Two
0x00b3 0xb3 ;Superscript Three
0x00b4 0xb4 ;Acute Accent
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Cedilla
0x00b9 0xb9 ;Superscript One
0x00ba 0xba ;Masculine Ordinal Indicator
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00bc 0xbc ;Vulgar Fraction One Quarter
0x00bd 0xbd ;Vulgar Fraction One Half
0x00be 0xbe ;Vulgar Fraction Three Quarters
0x00bf 0xbf ;Inverted Question Mark
0x00c0 0xc0 ;Latin Capital Letter A With Grave
0x00c1 0xc1 ;Latin Capital Letter A With Acute
0x00c2 0xc2 ;Latin Capital Letter A With Circumflex
0x00c3 0xc3 ;Latin Capital Letter A With Tilde
0x00c4 0xc4 ;Latin Capital Letter A With Diaeresis
0x00c5 0xc5 ;Latin Capital Letter A With Ring Above
0x00c6 0xc6 ;Latin Capital Ligature Ae
0x00c7 0xc7 ;Latin Capital Letter C With Cedilla
0x00c8 0xc8 ;Latin Capital Letter E With Grave
0x00c9 0xc9 ;Latin Capital Letter E With Acute
0x00ca 0xca ;Latin Capital Letter E With Circumflex
0x00cb 0xcb ;Latin Capital Letter E With Diaeresis
0x00cc 0xcc ;Latin Capital Letter I With Grave
0x00cd 0xcd ;Latin Capital Letter I With Acute
0x00ce 0xce ;Latin Capital Letter I With Circumflex
0x00cf 0xcf ;Latin Capital Letter I With Diaeresis
0x00d0 0xd0 ;Latin Capital Letter Eth
0x00d1 0xd1 ;Latin Capital Letter N With Tilde
0x00d2 0xd2 ;Latin Capital Letter O With Grave
0x00d3 0xd3 ;Latin Capital Letter O With Acute
0x00d4 0xd4 ;Latin Capital Letter O With Circumflex
0x00d5 0xd5 ;Latin Capital Letter O With Tilde
0x00d6 0xd6 ;Latin Capital Letter O With Diaeresis
0x00d7 0xd7 ;Multiplication Sign
0x00d8 0xd8 ;Latin Capital Letter O With Stroke
0x00d9 0xd9 ;Latin Capital Letter U With Grave
0x00da 0xda ;Latin Capital Letter U With Acute
0x00db 0xdb ;Latin Capital Letter U With Circumflex
0x00dc 0xdc ;Latin Capital Letter U With Diaeresis
0x00dd 0xdd ;Latin Capital Letter Y With Acute
0x00de 0xde ;Latin Capital Letter Thorn
0x00df 0xdf ;Latin Small Letter Sharp S
0x00e0 0xe0 ;Latin Small Letter A With Grave
0x00e1 0xe1 ;Latin Small Letter A With Acute
0x00e2 0xe2 ;Latin Small Letter A With Circumflex
0x00e3 0xe3 ;Latin Small Letter A With Tilde
0x00e4 0xe4 ;Latin Small Letter A With Diaeresis
0x00e5 0xe5 ;Latin Small Letter A With Ring Above
0x00e6 0xe6 ;Latin Small Ligature Ae
0x00e7 0xe7 ;Latin Small Letter C With Cedilla
0x00e8 0xe8 ;Latin Small Letter E With Grave
0x00e9 0xe9 ;Latin Small Letter E With Acute
0x00ea 0xea ;Latin Small Letter E With Circumflex
0x00eb 0xeb ;Latin Small Letter E With Diaeresis
0x00ec 0xec ;Latin Small Letter I With Grave
0x00ed 0xed ;Latin Small Letter I With Acute
0x00ee 0xee ;Latin Small Letter I With Circumflex
0x00ef 0xef ;Latin Small Letter I With Diaeresis
0x00f0 0xf0 ;Latin Small Letter Eth
0x00f1 0xf1 ;Latin Small Letter N With Tilde
0x00f2 0xf2 ;Latin Small Letter O With Grave
0x00f3 0xf3 ;Latin Small Letter O With Acute
0x00f4 0xf4 ;Latin Small Letter O With Circumflex
0x00f5 0xf5 ;Latin Small Letter O With Tilde
0x00f6 0xf6 ;Latin Small Letter O With Diaeresis
0x00f7 0xf7 ;Division Sign
0x00f8 0xf8 ;Latin Small Letter O With Stroke
0x00f9 0xf9 ;Latin Small Letter U With Grave
0x00fa 0xfa ;Latin Small Letter U With Acute
0x00fb 0xfb ;Latin Small Letter U With Circumflex
0x00fc 0xfc ;Latin Small Letter U With Diaeresis
0x00fd 0xfd ;Latin Small Letter Y With Acute
0x00fe 0xfe ;Latin Small Letter Thorn
0x00ff 0xff ;Latin Small Letter Y With Diaeresis
0x0100 0x41 ;Latin Capital Letter A With Macron
0x0101 0x61 ;Latin Small Letter A With Macron
0x0102 0x41 ;Latin Capital Letter A With Breve
0x0103 0x61 ;Latin Small Letter A With Breve
0x0104 0x41 ;Latin Capital Letter A With Ogonek
0x0105 0x61 ;Latin Small Letter A With Ogonek
0x0106 0x43 ;Latin Capital Letter C With Acute
0x0107 0x63 ;Latin Small Letter C With Acute
0x0108 0x43 ;Latin Capital Letter C With Circumflex
0x0109 0x63 ;Latin Small Letter C With Circumflex
0x010a 0x43 ;Latin Capital Letter C With Dot Above
0x010b 0x63 ;Latin Small Letter C With Dot Above
0x010c 0x43 ;Latin Capital Letter C With Caron
0x010d 0x63 ;Latin Small Letter C With Caron
0x010e 0x44 ;Latin Capital Letter D With Caron
0x010f 0x64 ;Latin Small Letter D With Caron
0x0110 0xd0 ;Latin Capital Letter D With Stroke
0x0111 0x64 ;Latin Small Letter D With Stroke
0x0112 0x45 ;Latin Capital Letter E With Macron
0x0113 0x65 ;Latin Small Letter E With Macron
0x0114 0x45 ;Latin Capital Letter E With Breve
0x0115 0x65 ;Latin Small Letter E With Breve
0x0116 0x45 ;Latin Capital Letter E With Dot Above
0x0117 0x65 ;Latin Small Letter E With Dot Above
0x0118 0x45 ;Latin Capital Letter E With Ogonek
0x0119 0x65 ;Latin Small Letter E With Ogonek
0x011a 0x45 ;Latin Capital Letter E With Caron
0x011b 0x65 ;Latin Small Letter E With Caron
0x011c 0x47 ;Latin Capital Letter G With Circumflex
0x011d 0x67 ;Latin Small Letter G With Circumflex
0x011e 0x47 ;Latin Capital Letter G With Breve
0x011f 0x67 ;Latin Small Letter G With Breve
0x0120 0x47 ;Latin Capital Letter G With Dot Above
0x0121 0x67 ;Latin Small Letter G With Dot Above
0x0122 0x47 ;Latin Capital Letter G With Cedilla
0x0123 0x67 ;Latin Small Letter G With Cedilla
0x0124 0x48 ;Latin Capital Letter H With Circumflex
0x0125 0x68 ;Latin Small Letter H With Circumflex
0x0126 0x48 ;Latin Capital Letter H With Stroke
0x0127 0x68 ;Latin Small Letter H With Stroke
0x0128 0x49 ;Latin Capital Letter I With Tilde
0x0129 0x69 ;Latin Small Letter I With Tilde
0x012a 0x49 ;Latin Capital Letter I With Macron
0x012b 0x69 ;Latin Small Letter I With Macron
0x012c 0x49 ;Latin Capital Letter I With Breve
0x012d 0x69 ;Latin Small Letter I With Breve
0x012e 0x49 ;Latin Capital Letter I With Ogonek
0x012f 0x69 ;Latin Small Letter I With Ogonek
0x0130 0x49 ;Latin Capital Letter I With Dot Above
0x0131 0x69 ;Latin Small Letter Dotless I
0x0134 0x4a ;Latin Capital Letter J With Circumflex
0x0135 0x6a ;Latin Small Letter J With Circumflex
0x0136 0x4b ;Latin Capital Letter K With Cedilla
0x0137 0x6b ;Latin Small Letter K With Cedilla
0x0139 0x4c ;Latin Capital Letter L With Acute
0x013a 0x6c ;Latin Small Letter L With Acute
0x013b 0x4c ;Latin Capital Letter L With Cedilla
0x013c 0x6c ;Latin Small Letter L With Cedilla
0x013d 0x4c ;Latin Capital Letter L With Caron
0x013e 0x6c ;Latin Small Letter L With Caron
0x0141 0x4c ;Latin Capital Letter L With Stroke
0x0142 0x6c ;Latin Small Letter L With Stroke
0x0143 0x4e ;Latin Capital Letter N With Acute
0x0144 0x6e ;Latin Small Letter N With Acute
0x0145 0x4e ;Latin Capital Letter N With Cedilla
0x0146 0x6e ;Latin Small Letter N With Cedilla
0x0147 0x4e ;Latin Capital Letter N With Caron
0x0148 0x6e ;Latin Small Letter N With Caron
0x014c 0x4f ;Latin Capital Letter O With Macron
0x014d 0x6f ;Latin Small Letter O With Macron
0x014e 0x4f ;Latin Capital Letter O With Breve
0x014f 0x6f ;Latin Small Letter O With Breve
0x0150 0x4f ;Latin Capital Letter O With Double Acute
0x0151 0x6f ;Latin Small Letter O With Double Acute
0x0152 0x8c ;Latin Capital Ligature Oe
0x0153 0x9c ;Latin Small Ligature Oe
0x0154 0x52 ;Latin Capital Letter R With Acute
0x0155 0x72 ;Latin Small Letter R With Acute
0x0156 0x52 ;Latin Capital Letter R With Cedilla
0x0157 0x72 ;Latin Small Letter R With Cedilla
0x0158 0x52 ;Latin Capital Letter R With Caron
0x0159 0x72 ;Latin Small Letter R With Caron
0x015a 0x53 ;Latin Capital Letter S With Acute
0x015b 0x73 ;Latin Small Letter S With Acute
0x015c 0x53 ;Latin Capital Letter S With Circumflex
0x015d 0x73 ;Latin Small Letter S With Circumflex
0x015e 0x53 ;Latin Capital Letter S With Cedilla
0x015f 0x73 ;Latin Small Letter S With Cedilla
0x0160 0x8a ;Latin Capital Letter S With Caron
0x0161 0x9a ;Latin Small Letter S With Caron
0x0162 0x54 ;Latin Capital Letter T With Cedilla
0x0163 0x74 ;Latin Small Letter T With Cedilla
0x0164 0x54 ;Latin Capital Letter T With Caron
0x0165 0x74 ;Latin Small Letter T With Caron
0x0166 0x54 ;Latin Capital Letter T With Stroke
0x0167 0x74 ;Latin Small Letter T With Stroke
0x0168 0x55 ;Latin Capital Letter U With Tilde
0x0169 0x75 ;Latin Small Letter U With Tilde
0x016a 0x55 ;Latin Capital Letter U With Macron
0x016b 0x75 ;Latin Small Letter U With Macron
0x016c 0x55 ;Latin Capital Letter U With Breve
0x016d 0x75 ;Latin Small Letter U With Breve
0x016e 0x55 ;Latin Capital Letter U With Ring Above
0x016f 0x75 ;Latin Small Letter U With Ring Above
0x0170 0x55 ;Latin Capital Letter U With Double Acute
0x0171 0x75 ;Latin Small Letter U With Double Acute
0x0172 0x55 ;Latin Capital Letter U With Ogonek
0x0173 0x75 ;Latin Small Letter U With Ogonek
0x0174 0x57 ;Latin Capital Letter W With Circumflex
0x0175 0x77 ;Latin Small Letter W With Circumflex
0x0176 0x59 ;Latin Capital Letter Y With Circumflex
0x0177 0x79 ;Latin Small Letter Y With Circumflex
0x0178 0x9f ;Latin Capital Letter Y With Diaeresis
0x0179 0x5a ;Latin Capital Letter Z With Acute
0x017a 0x7a ;Latin Small Letter Z With Acute
0x017b 0x5a ;Latin Capital Letter Z With Dot Above
0x017c 0x7a ;Latin Small Letter Z With Dot Above
0x017d 0x8e ;Latin Capital Letter Z With Caron
0x017e 0x9e ;Latin Small Letter Z With Caron
0x0180 0x62 ;Latin Small Letter B With Stroke
0x0189 0xd0 ;Latin Capital Letter African D
0x0191 0x83 ;Latin Capital Letter F With Hook
0x0192 0x83 ;Latin Small Letter F With Hook
0x0197 0x49 ;Latin Capital Letter I With Stroke
0x019a 0x6c ;Latin Small Letter L With Bar
0x019f 0x4f ;Latin Capital Letter O With Middle Tilde
0x01a0 0x4f ;Latin Capital Letter O With Horn
0x01a1 0x6f ;Latin Small Letter O With Horn
0x01ab 0x74 ;Latin Small Letter T With Palatal Hook
0x01ae 0x54 ;Latin Capital Letter T With Retroflex Hook
0x01af 0x55 ;Latin Capital Letter U With Horn
0x01b0 0x75 ;Latin Small Letter U With Horn
0x01b6 0x7a ;Latin Small Letter Z With Stroke
0x01c0 0x7c ;Latin Letter Dental Click
0x01c3 0x21 ;Latin Letter Retroflex Click
0x01cd 0x41 ;Latin Capital Letter A With Caron
0x01ce 0x61 ;Latin Small Letter A With Caron
0x01cf 0x49 ;Latin Capital Letter I With Caron
0x01d0 0x69 ;Latin Small Letter I With Caron
0x01d1 0x4f ;Latin Capital Letter O With Caron
0x01d2 0x6f ;Latin Small Letter O With Caron
0x01d3 0x55 ;Latin Capital Letter U With Caron
0x01d4 0x75 ;Latin Small Letter U With Caron
0x01d5 0x55 ;Latin Capital Letter U With Diaeresis And Macron
0x01d6 0x75 ;Latin Small Letter U With Diaeresis And Macron
0x01d7 0x55 ;Latin Capital Letter U With Diaeresis And Acute
0x01d8 0x75 ;Latin Small Letter U With Diaeresis And Acute
0x01d9 0x55 ;Latin Capital Letter U With Diaeresis And Caron
0x01da 0x75 ;Latin Small Letter U With Diaeresis And Caron
0x01db 0x55 ;Latin Capital Letter U With Diaeresis And Grave
0x01dc 0x75 ;Latin Small Letter U With Diaeresis And Grave
0x01de 0x41 ;Latin Capital Letter A With Diaeresis And Macron
0x01df 0x61 ;Latin Small Letter A With Diaeresis And Macron
0x01e4 0x47 ;Latin Capital Letter G With Stroke
0x01e5 0x67 ;Latin Small Letter G With Stroke
0x01e6 0x47 ;Latin Capital Letter G With Caron
0x01e7 0x67 ;Latin Small Letter G With Caron
0x01e8 0x4b ;Latin Capital Letter K With Caron
0x01e9 0x6b ;Latin Small Letter K With Caron
0x01ea 0x4f ;Latin Capital Letter O With Ogonek
0x01eb 0x6f ;Latin Small Letter O With Ogonek
0x01ec 0x4f ;Latin Capital Letter O With Ogonek And Macron
0x01ed 0x6f ;Latin Small Letter O With Ogonek And Macron
0x01f0 0x6a ;Latin Small Letter J With Caron
0x0261 0x67 ;Latin Small Letter Script G
0x02b9 0x27 ;Modifier Letter Prime
0x02ba 0x22 ;Modifier Letter Double Prime
0x02bc 0x27 ;Modifier Letter Apostrophe
0x02c4 0x5e ;Modifier Letter Up Arrowhead
0x02c6 0x88 ;Modifier Letter Circumflex Accent
0x02c8 0x27 ;Modifier Letter Vertical Line
0x02c9 0xaf ;Modifier Letter Macron
0x02ca 0xb4 ;Modifier Letter Acute Accent
0x02cb 0x60 ;Modifier Letter Grave Accent
0x02cd 0x5f ;Modifier Letter Low Macron
0x02da 0xb0 ;Ring Above
0x02dc 0x98 ;Small Tilde
0x0300 0x60 ;Combining Grave Accent
0x0301 0xb4 ;Combining Acute Accent
0x0302 0x5e ;Combining Circumflex Accent
0x0303 0x7e ;Combining Tilde
0x0304 0xaf ;Combining Macron
0x0305 0xaf ;Combining Overline
0x0308 0xa8 ;Combining Diaeresis
0x030a 0xb0 ;Combining Ring Above
0x030e 0x22 ;Combining Double Vertical Line Above
0x0327 0xb8 ;Combining Cedilla
0x0331 0x5f ;Combining Macron Below
0x0332 0x5f ;Combining Low Line
0x037e 0x3b ;Greek Question Mark
0x0393 0x47 ;Greek Capital Letter Gamma
0x0398 0x54 ;Greek Capital Letter Theta
0x03a3 0x53 ;Greek Capital Letter Sigma
0x03a6 0x46 ;Greek Capital Letter Phi
0x03a9 0x4f ;Greek Capital Letter Omega
0x03b1 0x61 ;Greek Small Letter Alpha
0x03b2 0xdf ;Greek Small Letter Beta
0x03b4 0x64 ;Greek Small Letter Delta
0x03b5 0x65 ;Greek Small Letter Epsilon
0x03bc 0xb5 ;Greek Small Letter Mu
0x03c0 0x70 ;Greek Small Letter Pi
0x03c3 0x73 ;Greek Small Letter Sigma
0x03c4 0x74 ;Greek Small Letter Tau
0x03c6 0x66 ;Greek Small Letter Phi
0x04bb 0x68 ;Cyrillic Small Letter Shha
0x0589 0x3a ;Armenian Full Stop
0x066a 0x25 ;Arabic Percent Sign
0x2000 0x20 ;En Quad
0x2001 0x20 ;Em Quad
0x2002 0x20 ;En Space
0x2003 0x20 ;Em Space
0x2004 0x20 ;Three-Per-Em Space
0x2005 0x20 ;Four-Per-Em Space
0x2006 0x20 ;Six-Per-Em Space
0x2010 0x2d ;Hyphen
0x2011 0x2d ;Non-Breaking Hyphen
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2017 0x3d ;Double Low Line
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2024 0xb7 ;One Dot Leader
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2032 0x27 ;Prime
0x2035 0x60 ;Reversed Prime
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x2044 0x2f ;Fraction Slash
0x2070 0xb0 ;Superscript Zero
0x2074 0x34 ;Superscript Four
0x2075 0x35 ;Superscript Five
0x2076 0x36 ;Superscript Six
0x2077 0x37 ;Superscript Seven
0x2078 0x38 ;Superscript Eight
0x207f 0x6e ;Superscript Latin Small Letter N
0x2080 0x30 ;Subscript Zero
0x2081 0x31 ;Subscript One
0x2082 0x32 ;Subscript Two
0x2083 0x33 ;Subscript Three
0x2084 0x34 ;Subscript Four
0x2085 0x35 ;Subscript Five
0x2086 0x36 ;Subscript Six
0x2087 0x37 ;Subscript Seven
0x2088 0x38 ;Subscript Eight
0x2089 0x39 ;Subscript Nine
0x20ac 0x80 ;Euro Sign
0x20a1 0xa2 ;Colon Sign
0x20a4 0xa3 ;Lira Sign
0x20a7 0x50 ;Peseta Sign
0x2102 0x43 ;Double-Struck Capital C
0x2107 0x45 ;Euler Constant
0x210a 0x67 ;Script Small G
0x210b 0x48 ;Script Capital H
0x210c 0x48 ;Black-Letter Capital H
0x210d 0x48 ;Double-Struck Capital H
0x210e 0x68 ;Planck Constant
0x2110 0x49 ;Script Capital I
0x2111 0x49 ;Black-Letter Capital I
0x2112 0x4c ;Script Capital L
0x2113 0x6c ;Script Small L
0x2115 0x4e ;Double-Struck Capital N
0x2118 0x50 ;Script Capital P
0x2119 0x50 ;Double-Struck Capital P
0x211a 0x51 ;Double-Struck Capital Q
0x211b 0x52 ;Script Capital R
0x211c 0x52 ;Black-Letter Capital R
0x211d 0x52 ;Double-Struck Capital R
0x2122 0x99 ;Trade Mark Sign
0x2124 0x5a ;Double-Struck Capital Z
0x2128 0x5a ;Black-Letter Capital Z
0x212a 0x4b ;Kelvin Sign
0x212b 0xc5 ;Angstrom Sign
0x212c 0x42 ;Script Capital B
0x212d 0x43 ;Black-Letter Capital C
0x212e 0x65 ;Estimated Symbol
0x212f 0x65 ;Script Small E
0x2130 0x45 ;Script Capital E
0x2131 0x46 ;Script Capital F
0x2133 0x4d ;Script Capital M
0x2134 0x6f ;Script Small O
0x2205 0xd8 ;Empty Set
0x2212 0x2d ;Minus Sign
0x2213 0xb1 ;Minus-Or-Plus Sign
0x2215 0x2f ;Division Slash
0x2216 0x5c ;Set Minus
0x2217 0x2a ;Asterisk Operator
0x2218 0xb0 ;Ring Operator
0x2219 0xb7 ;Bullet Operator
0x221a 0x76 ;Square Root
0x221e 0x38 ;Infinity
0x2223 0x7c ;Divides
0x2229 0x6e ;Intersection
0x2236 0x3a ;Ratio
0x223c 0x7e ;Tilde Operator
0x2248 0x98 ;Almost Equal To
0x2261 0x3d ;Identical To
0x2264 0x3d ;Less-Than Or Equal To
0x2265 0x3d ;Greater-Than Or Equal To
0x226a 0xab ;Much Less-Than
0x226b 0xbb ;Much Greater-Than
0x22c5 0xb7 ;Dot Operator
0x2302 0xa6 ;House
0x2303 0x5e ;Up Arrowhead
0x2310 0xac ;Reversed Not Sign
0x2320 0x28 ;Top Half Integral
0x2321 0x29 ;Bottom Half Integral
0x2329 0x3c ;Left-Pointing Angle Bracket
0x232a 0x3e ;Right-Pointing Angle Bracket
0x2500 0x2d ;Box Drawings Light Horizontal
0x2502 0xa6 ;Box Drawings Light Vertical
0x250c 0x2b ;Box Drawings Light Down And Right
0x2510 0x2b ;Box Drawings Light Down And Left
0x2514 0x2b ;Box Drawings Light Up And Right
0x2518 0x2b ;Box Drawings Light Up And Left
0x251c 0x2b ;Box Drawings Light Vertical And Right
0x2524 0xa6 ;Box Drawings Light Vertical And Left
0x252c 0x2d ;Box Drawings Light Down And Horizontal
0x2534 0x2d ;Box Drawings Light Up And Horizontal
0x253c 0x2b ;Box Drawings Light Vertical And Horizontal
0x2550 0x2d ;Box Drawings Double Horizontal
0x2551 0xa6 ;Box Drawings Double Vertical
0x2552 0x2b ;Box Drawings Down Single And Right Double
0x2553 0x2b ;Box Drawings Down Double And Right Single
0x2554 0x2b ;Box Drawings Double Down And Right
0x2555 0x2b ;Box Drawings Down Single And Left Double
0x2556 0x2b ;Box Drawings Down Double And Left Single
0x2557 0x2b ;Box Drawings Double Down And Left
0x2558 0x2b ;Box Drawings Up Single And Right Double
0x2559 0x2b ;Box Drawings Up Double And Right Single
0x255a 0x2b ;Box Drawings Double Up And Right
0x255b 0x2b ;Box Drawings Up Single And Left Double
0x255c 0x2b ;Box Drawings Up Double And Left Single
0x255d 0x2b ;Box Drawings Double Up And Left
0x255e 0xa6 ;Box Drawings Vertical Single And Right Double
0x255f 0xa6 ;Box Drawings Vertical Double And Right Single
0x2560 0xa6 ;Box Drawings Double Vertical And Right
0x2561 0xa6 ;Box Drawings Vertical Single And Left Double
0x2562 0xa6 ;Box Drawings Vertical Double And Left Single
0x2563 0xa6 ;Box Drawings Double Vertical And Left
0x2564 0x2d ;Box Drawings Down Single And Horizontal Double
0x2565 0x2d ;Box Drawings Down Double And Horizontal Single
0x2566 0x2d ;Box Drawings Double Down And Horizontal
0x2567 0x2d ;Box Drawings Up Single And Horizontal Double
0x2568 0x2d ;Box Drawings Up Double And Horizontal Single
0x2569 0x2d ;Box Drawings Double Up And Horizontal
0x256a 0x2b ;Box Drawings Vertical Single And Horizontal Double
0x256b 0x2b ;Box Drawings Vertical Double And Horizontal Single
0x256c 0x2b ;Box Drawings Double Vertical And Horizontal
0x2580 0xaf ;Upper Half Block
0x2584 0x5f ;Lower Half Block
0x2588 0xa6 ;Full Block
0x258c 0xa6 ;Left Half Block
0x2590 0xa6 ;Right Half Block
0x2591 0xa6 ;Light Shade
0x2592 0xa6 ;Medium Shade
0x2593 0xa6 ;Dark Shade
0x25a0 0xa6 ;Black Square
0x263c 0xa4 ;White Sun With Rays
0x2758 0x7c ;Light Vertical Bar
0x3000 0x20 ;Ideographic Space
0x3008 0x3c ;Left Angle Bracket
0x3009 0x3e ;Right Angle Bracket
0x300a 0xab ;Left Double Angle Bracket
0x300b 0xbb ;Right Double Angle Bracket
0x301a 0x5b ;Left White Square Bracket
0x301b 0x5d ;Right White Square Bracket
0x30fb 0xb7 ;Katakana Middle Dot
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,891 @@
CODEPAGE 1253 ;Greek - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0192 ;Latin Small Letter F With Hook
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x0088
0x89 0x2030 ;Per Mille Sign
0x8a 0x008a
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x008c
0x8d 0x008d
0x8e 0x008e
0x8f 0x008f
0x90 0x0090
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x0098
0x99 0x2122 ;Trade Mark Sign
0x9a 0x009a
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x009c
0x9d 0x009d
0x9e 0x009e
0x9f 0x009f
0xa0 0x00a0 ;No-Break Space
0xa1 0x0385 ;Greek Dialytika Tonos
0xa2 0x0386 ;Greek Capital Letter Alpha With Tonos
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0xf8f9 ;Undefined -> EUDC
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x2015 ;Horizontal Bar
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x00b2 ;Superscript Two
0xb3 0x00b3 ;Superscript Three
0xb4 0x0384 ;Greek Tonos
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x0388 ;Greek Capital Letter Epsilon With Tonos
0xb9 0x0389 ;Greek Capital Letter Eta With Tonos
0xba 0x038a ;Greek Capital Letter Iota With Tonos
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x038c ;Greek Capital Letter Omicron With Tonos
0xbd 0x00bd ;Vulgar Fraction One Half
0xbe 0x038e ;Greek Capital Letter Upsilon With Tonos
0xbf 0x038f ;Greek Capital Letter Omega With Tonos
0xc0 0x0390 ;Greek Small Letter Iota With Dialytika And Tonos
0xc1 0x0391 ;Greek Capital Letter Alpha
0xc2 0x0392 ;Greek Capital Letter Beta
0xc3 0x0393 ;Greek Capital Letter Gamma
0xc4 0x0394 ;Greek Capital Letter Delta
0xc5 0x0395 ;Greek Capital Letter Epsilon
0xc6 0x0396 ;Greek Capital Letter Zeta
0xc7 0x0397 ;Greek Capital Letter Eta
0xc8 0x0398 ;Greek Capital Letter Theta
0xc9 0x0399 ;Greek Capital Letter Iota
0xca 0x039a ;Greek Capital Letter Kappa
0xcb 0x039b ;Greek Capital Letter Lamda
0xcc 0x039c ;Greek Capital Letter Mu
0xcd 0x039d ;Greek Capital Letter Nu
0xce 0x039e ;Greek Capital Letter Xi
0xcf 0x039f ;Greek Capital Letter Omicron
0xd0 0x03a0 ;Greek Capital Letter Pi
0xd1 0x03a1 ;Greek Capital Letter Rho
0xd2 0xf8fa ;Undefined -> EUDC
0xd3 0x03a3 ;Greek Capital Letter Sigma
0xd4 0x03a4 ;Greek Capital Letter Tau
0xd5 0x03a5 ;Greek Capital Letter Upsilon
0xd6 0x03a6 ;Greek Capital Letter Phi
0xd7 0x03a7 ;Greek Capital Letter Chi
0xd8 0x03a8 ;Greek Capital Letter Psi
0xd9 0x03a9 ;Greek Capital Letter Omega
0xda 0x03aa ;Greek Capital Letter Iota With Dialytika
0xdb 0x03ab ;Greek Capital Letter Upsilon With Dialytika
0xdc 0x03ac ;Greek Small Letter Alpha With Tonos
0xdd 0x03ad ;Greek Small Letter Epsilon With Tonos
0xde 0x03ae ;Greek Small Letter Eta With Tonos
0xdf 0x03af ;Greek Small Letter Iota With Tonos
0xe0 0x03b0 ;Greek Small Letter Upsilon With Dialytika And Tonos
0xe1 0x03b1 ;Greek Small Letter Alpha
0xe2 0x03b2 ;Greek Small Letter Beta
0xe3 0x03b3 ;Greek Small Letter Gamma
0xe4 0x03b4 ;Greek Small Letter Delta
0xe5 0x03b5 ;Greek Small Letter Epsilon
0xe6 0x03b6 ;Greek Small Letter Zeta
0xe7 0x03b7 ;Greek Small Letter Eta
0xe8 0x03b8 ;Greek Small Letter Theta
0xe9 0x03b9 ;Greek Small Letter Iota
0xea 0x03ba ;Greek Small Letter Kappa
0xeb 0x03bb ;Greek Small Letter Lamda
0xec 0x03bc ;Greek Small Letter Mu
0xed 0x03bd ;Greek Small Letter Nu
0xee 0x03be ;Greek Small Letter Xi
0xef 0x03bf ;Greek Small Letter Omicron
0xf0 0x03c0 ;Greek Small Letter Pi
0xf1 0x03c1 ;Greek Small Letter Rho
0xf2 0x03c2 ;Greek Small Letter Final Sigma
0xf3 0x03c3 ;Greek Small Letter Sigma
0xf4 0x03c4 ;Greek Small Letter Tau
0xf5 0x03c5 ;Greek Small Letter Upsilon
0xf6 0x03c6 ;Greek Small Letter Phi
0xf7 0x03c7 ;Greek Small Letter Chi
0xf8 0x03c8 ;Greek Small Letter Psi
0xf9 0x03c9 ;Greek Small Letter Omega
0xfa 0x03ca ;Greek Small Letter Iota With Dialytika
0xfb 0x03cb ;Greek Small Letter Upsilon With Dialytika
0xfc 0x03cc ;Greek Small Letter Omicron With Tonos
0xfd 0x03cd ;Greek Small Letter Upsilon With Tonos
0xfe 0x03ce ;Greek Small Letter Omega With Tonos
0xff 0xf8fb ;Undefined -> EUDC
WCTABLE 622
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x0088 0x88
0x008a 0x8a
0x008c 0x8c
0x008d 0x8d
0x008e 0x8e
0x008f 0x8f
0x0090 0x90
0x0098 0x98
0x009a 0x9a
0x009c 0x9c
0x009d 0x9d
0x009e 0x9e
0x009f 0x9f
0x00a0 0xa0 ;No-Break Space
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0xb2 ;Superscript Two
0x00b3 0xb3 ;Superscript Three
0x00b4 0x2f ;Acute Accent
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00bd 0xbd ;Vulgar Fraction One Half
0x00c0 0x41 ;Latin Capital Letter A With Grave
0x00c1 0x41 ;Latin Capital Letter A With Acute
0x00c2 0x41 ;Latin Capital Letter A With Circumflex
0x00c3 0x41 ;Latin Capital Letter A With Tilde
0x00c4 0x41 ;Latin Capital Letter A With Diaeresis
0x00c5 0x41 ;Latin Capital Letter A With Ring Above
0x00c7 0x43 ;Latin Capital Letter C With Cedilla
0x00c8 0x45 ;Latin Capital Letter E With Grave
0x00c9 0x45 ;Latin Capital Letter E With Acute
0x00ca 0x45 ;Latin Capital Letter E With Circumflex
0x00cb 0x45 ;Latin Capital Letter E With Diaeresis
0x00cc 0x49 ;Latin Capital Letter I With Grave
0x00cd 0x49 ;Latin Capital Letter I With Acute
0x00ce 0x49 ;Latin Capital Letter I With Circumflex
0x00cf 0x49 ;Latin Capital Letter I With Diaeresis
0x00d1 0x4e ;Latin Capital Letter N With Tilde
0x00d2 0x4f ;Latin Capital Letter O With Grave
0x00d3 0x4f ;Latin Capital Letter O With Acute
0x00d4 0x4f ;Latin Capital Letter O With Circumflex
0x00d5 0x4f ;Latin Capital Letter O With Tilde
0x00d6 0x4f ;Latin Capital Letter O With Diaeresis
0x00d8 0x4f ;Latin Capital Letter O With Stroke
0x00d9 0x55 ;Latin Capital Letter U With Grave
0x00da 0x55 ;Latin Capital Letter U With Acute
0x00db 0x55 ;Latin Capital Letter U With Circumflex
0x00dc 0x55 ;Latin Capital Letter U With Diaeresis
0x00dd 0x59 ;Latin Capital Letter Y With Acute
0x00e0 0x61 ;Latin Small Letter A With Grave
0x00e1 0x61 ;Latin Small Letter A With Acute
0x00e2 0x61 ;Latin Small Letter A With Circumflex
0x00e3 0x61 ;Latin Small Letter A With Tilde
0x00e4 0x61 ;Latin Small Letter A With Diaeresis
0x00e5 0x61 ;Latin Small Letter A With Ring Above
0x00e7 0x63 ;Latin Small Letter C With Cedilla
0x00e8 0x65 ;Latin Small Letter E With Grave
0x00e9 0x65 ;Latin Small Letter E With Acute
0x00ea 0x65 ;Latin Small Letter E With Circumflex
0x00eb 0x65 ;Latin Small Letter E With Diaeresis
0x00ec 0x69 ;Latin Small Letter I With Grave
0x00ed 0x69 ;Latin Small Letter I With Acute
0x00ee 0x69 ;Latin Small Letter I With Circumflex
0x00ef 0x69 ;Latin Small Letter I With Diaeresis
0x00f1 0x6e ;Latin Small Letter N With Tilde
0x00f2 0x6f ;Latin Small Letter O With Grave
0x00f3 0x6f ;Latin Small Letter O With Acute
0x00f4 0x6f ;Latin Small Letter O With Circumflex
0x00f5 0x6f ;Latin Small Letter O With Tilde
0x00f6 0x6f ;Latin Small Letter O With Diaeresis
0x00f8 0x6f ;Latin Small Letter O With Stroke
0x00f9 0x75 ;Latin Small Letter U With Grave
0x00fa 0x75 ;Latin Small Letter U With Acute
0x00fb 0x75 ;Latin Small Letter U With Circumflex
0x00fc 0x75 ;Latin Small Letter U With Diaeresis
0x00fd 0x79 ;Latin Small Letter Y With Acute
0x00ff 0x79 ;Latin Small Letter Y With Diaeresis
0x0100 0x41 ;Latin Capital Letter A With Macron
0x0101 0x61 ;Latin Small Letter A With Macron
0x0102 0x41 ;Latin Capital Letter A With Breve
0x0103 0x61 ;Latin Small Letter A With Breve
0x0104 0x41 ;Latin Capital Letter A With Ogonek
0x0105 0x61 ;Latin Small Letter A With Ogonek
0x0106 0x43 ;Latin Capital Letter C With Acute
0x0107 0x63 ;Latin Small Letter C With Acute
0x0108 0x43 ;Latin Capital Letter C With Circumflex
0x0109 0x63 ;Latin Small Letter C With Circumflex
0x010a 0x43 ;Latin Capital Letter C With Dot Above
0x010b 0x63 ;Latin Small Letter C With Dot Above
0x010c 0x43 ;Latin Capital Letter C With Caron
0x010d 0x63 ;Latin Small Letter C With Caron
0x010e 0x44 ;Latin Capital Letter D With Caron
0x010f 0x64 ;Latin Small Letter D With Caron
0x0110 0x44 ;Latin Capital Letter D With Stroke
0x0111 0x64 ;Latin Small Letter D With Stroke
0x0112 0x45 ;Latin Capital Letter E With Macron
0x0113 0x65 ;Latin Small Letter E With Macron
0x0114 0x45 ;Latin Capital Letter E With Breve
0x0115 0x65 ;Latin Small Letter E With Breve
0x0116 0x45 ;Latin Capital Letter E With Dot Above
0x0117 0x65 ;Latin Small Letter E With Dot Above
0x0118 0x45 ;Latin Capital Letter E With Ogonek
0x0119 0x65 ;Latin Small Letter E With Ogonek
0x011a 0x45 ;Latin Capital Letter E With Caron
0x011b 0x65 ;Latin Small Letter E With Caron
0x011c 0x47 ;Latin Capital Letter G With Circumflex
0x011d 0x67 ;Latin Small Letter G With Circumflex
0x011e 0x47 ;Latin Capital Letter G With Breve
0x011f 0x67 ;Latin Small Letter G With Breve
0x0120 0x47 ;Latin Capital Letter G With Dot Above
0x0121 0x67 ;Latin Small Letter G With Dot Above
0x0122 0x47 ;Latin Capital Letter G With Cedilla
0x0123 0x67 ;Latin Small Letter G With Cedilla
0x0124 0x48 ;Latin Capital Letter H With Circumflex
0x0125 0x68 ;Latin Small Letter H With Circumflex
0x0126 0x48 ;Latin Capital Letter H With Stroke
0x0127 0x68 ;Latin Small Letter H With Stroke
0x0128 0x49 ;Latin Capital Letter I With Tilde
0x0129 0x69 ;Latin Small Letter I With Tilde
0x012a 0x49 ;Latin Capital Letter I With Macron
0x012b 0x69 ;Latin Small Letter I With Macron
0x012c 0x49 ;Latin Capital Letter I With Breve
0x012d 0x69 ;Latin Small Letter I With Breve
0x012e 0x49 ;Latin Capital Letter I With Ogonek
0x012f 0x69 ;Latin Small Letter I With Ogonek
0x0130 0x49 ;Latin Capital Letter I With Dot Above
0x0134 0x4a ;Latin Capital Letter J With Circumflex
0x0135 0x6a ;Latin Small Letter J With Circumflex
0x0136 0x4b ;Latin Capital Letter K With Cedilla
0x0137 0x6b ;Latin Small Letter K With Cedilla
0x0139 0x4c ;Latin Capital Letter L With Acute
0x013a 0x6c ;Latin Small Letter L With Acute
0x013b 0x4c ;Latin Capital Letter L With Cedilla
0x013c 0x6c ;Latin Small Letter L With Cedilla
0x013d 0x4c ;Latin Capital Letter L With Caron
0x013e 0x6c ;Latin Small Letter L With Caron
0x0141 0x4c ;Latin Capital Letter L With Stroke
0x0142 0x6c ;Latin Small Letter L With Stroke
0x0143 0x4e ;Latin Capital Letter N With Acute
0x0144 0x6e ;Latin Small Letter N With Acute
0x0145 0x4e ;Latin Capital Letter N With Cedilla
0x0146 0x6e ;Latin Small Letter N With Cedilla
0x0147 0x4e ;Latin Capital Letter N With Caron
0x0148 0x6e ;Latin Small Letter N With Caron
0x014c 0x4f ;Latin Capital Letter O With Macron
0x014d 0x6f ;Latin Small Letter O With Macron
0x014e 0x4f ;Latin Capital Letter O With Breve
0x014f 0x6f ;Latin Small Letter O With Breve
0x0150 0x4f ;Latin Capital Letter O With Double Acute
0x0151 0x6f ;Latin Small Letter O With Double Acute
0x0154 0x52 ;Latin Capital Letter R With Acute
0x0155 0x72 ;Latin Small Letter R With Acute
0x0156 0x52 ;Latin Capital Letter R With Cedilla
0x0157 0x72 ;Latin Small Letter R With Cedilla
0x0158 0x52 ;Latin Capital Letter R With Caron
0x0159 0x72 ;Latin Small Letter R With Caron
0x015a 0x53 ;Latin Capital Letter S With Acute
0x015b 0x73 ;Latin Small Letter S With Acute
0x015c 0x53 ;Latin Capital Letter S With Circumflex
0x015d 0x73 ;Latin Small Letter S With Circumflex
0x015e 0x53 ;Latin Capital Letter S With Cedilla
0x015f 0x73 ;Latin Small Letter S With Cedilla
0x0160 0x53 ;Latin Capital Letter S With Caron
0x0161 0x73 ;Latin Small Letter S With Caron
0x0162 0x54 ;Latin Capital Letter T With Cedilla
0x0163 0x74 ;Latin Small Letter T With Cedilla
0x0164 0x54 ;Latin Capital Letter T With Caron
0x0165 0x74 ;Latin Small Letter T With Caron
0x0166 0x54 ;Latin Capital Letter T With Stroke
0x0167 0x74 ;Latin Small Letter T With Stroke
0x0168 0x55 ;Latin Capital Letter U With Tilde
0x0169 0x75 ;Latin Small Letter U With Tilde
0x016a 0x55 ;Latin Capital Letter U With Macron
0x016b 0x75 ;Latin Small Letter U With Macron
0x016c 0x55 ;Latin Capital Letter U With Breve
0x016d 0x75 ;Latin Small Letter U With Breve
0x016e 0x55 ;Latin Capital Letter U With Ring Above
0x016f 0x75 ;Latin Small Letter U With Ring Above
0x0170 0x55 ;Latin Capital Letter U With Double Acute
0x0171 0x75 ;Latin Small Letter U With Double Acute
0x0172 0x55 ;Latin Capital Letter U With Ogonek
0x0173 0x75 ;Latin Small Letter U With Ogonek
0x0174 0x57 ;Latin Capital Letter W With Circumflex
0x0175 0x77 ;Latin Small Letter W With Circumflex
0x0176 0x59 ;Latin Capital Letter Y With Circumflex
0x0177 0x79 ;Latin Small Letter Y With Circumflex
0x0178 0x59 ;Latin Capital Letter Y With Diaeresis
0x0179 0x5a ;Latin Capital Letter Z With Acute
0x017a 0x7a ;Latin Small Letter Z With Acute
0x017b 0x5a ;Latin Capital Letter Z With Dot Above
0x017c 0x7a ;Latin Small Letter Z With Dot Above
0x017d 0x5a ;Latin Capital Letter Z With Caron
0x017e 0x7a ;Latin Small Letter Z With Caron
0x0180 0x62 ;Latin Small Letter B With Stroke
0x0191 0x83 ;Latin Capital Letter F With Hook
0x0192 0x83 ;Latin Small Letter F With Hook
0x0197 0x49 ;Latin Capital Letter I With Stroke
0x019a 0x6c ;Latin Small Letter L With Bar
0x019f 0x4f ;Latin Capital Letter O With Middle Tilde
0x01a0 0x4f ;Latin Capital Letter O With Horn
0x01a1 0x6f ;Latin Small Letter O With Horn
0x01ab 0x74 ;Latin Small Letter T With Palatal Hook
0x01ae 0x54 ;Latin Capital Letter T With Retroflex Hook
0x01af 0x55 ;Latin Capital Letter U With Horn
0x01b0 0x75 ;Latin Small Letter U With Horn
0x01cd 0x41 ;Latin Capital Letter A With Caron
0x01ce 0x61 ;Latin Small Letter A With Caron
0x01cf 0x49 ;Latin Capital Letter I With Caron
0x01d0 0x69 ;Latin Small Letter I With Caron
0x01d1 0x4f ;Latin Capital Letter O With Caron
0x01d2 0x6f ;Latin Small Letter O With Caron
0x01d3 0x55 ;Latin Capital Letter U With Caron
0x01d4 0x75 ;Latin Small Letter U With Caron
0x01d5 0x55 ;Latin Capital Letter U With Diaeresis And Macron
0x01d6 0x75 ;Latin Small Letter U With Diaeresis And Macron
0x01d7 0x55 ;Latin Capital Letter U With Diaeresis And Acute
0x01d8 0x75 ;Latin Small Letter U With Diaeresis And Acute
0x01d9 0x55 ;Latin Capital Letter U With Diaeresis And Caron
0x01da 0x75 ;Latin Small Letter U With Diaeresis And Caron
0x01db 0x55 ;Latin Capital Letter U With Diaeresis And Grave
0x01dc 0x75 ;Latin Small Letter U With Diaeresis And Grave
0x01de 0x41 ;Latin Capital Letter A With Diaeresis And Macron
0x01df 0x61 ;Latin Small Letter A With Diaeresis And Macron
0x01e4 0x47 ;Latin Capital Letter G With Stroke
0x01e5 0x67 ;Latin Small Letter G With Stroke
0x01e6 0x47 ;Latin Capital Letter G With Caron
0x01e7 0x67 ;Latin Small Letter G With Caron
0x01e8 0x4b ;Latin Capital Letter K With Caron
0x01e9 0x6b ;Latin Small Letter K With Caron
0x01ea 0x4f ;Latin Capital Letter O With Ogonek
0x01eb 0x6f ;Latin Small Letter O With Ogonek
0x01ec 0x4f ;Latin Capital Letter O With Ogonek And Macron
0x01ed 0x6f ;Latin Small Letter O With Ogonek And Macron
0x01f0 0x6a ;Latin Small Letter J With Caron
0x030d 0xb4 ;Combining Vertical Line Above
0x037e 0x3b ;Greek Question Mark(Best-fit)(#28094)
0x0384 0xb4 ;Greek Tonos
0x0385 0xa1 ;Greek Dialytika Tonos
0x0386 0xa2 ;Greek Capital Letter Alpha With Tonos
0x0388 0xb8 ;Greek Capital Letter Epsilon With Tonos
0x0389 0xb9 ;Greek Capital Letter Eta With Tonos
0x038a 0xba ;Greek Capital Letter Iota With Tonos
0x038c 0xbc ;Greek Capital Letter Omicron With Tonos
0x038e 0xbe ;Greek Capital Letter Upsilon With Tonos
0x038f 0xbf ;Greek Capital Letter Omega With Tonos
0x0390 0xc0 ;Greek Small Letter Iota With Dialytika And Tonos
0x0391 0xc1 ;Greek Capital Letter Alpha
0x0392 0xc2 ;Greek Capital Letter Beta
0x0393 0xc3 ;Greek Capital Letter Gamma
0x0394 0xc4 ;Greek Capital Letter Delta
0x0395 0xc5 ;Greek Capital Letter Epsilon
0x0396 0xc6 ;Greek Capital Letter Zeta
0x0397 0xc7 ;Greek Capital Letter Eta
0x0398 0xc8 ;Greek Capital Letter Theta
0x0399 0xc9 ;Greek Capital Letter Iota
0x039a 0xca ;Greek Capital Letter Kappa
0x039b 0xcb ;Greek Capital Letter Lamda
0x039c 0xcc ;Greek Capital Letter Mu
0x039d 0xcd ;Greek Capital Letter Nu
0x039e 0xce ;Greek Capital Letter Xi
0x039f 0xcf ;Greek Capital Letter Omicron
0x03a0 0xd0 ;Greek Capital Letter Pi
0x03a1 0xd1 ;Greek Capital Letter Rho
0x03a3 0xd3 ;Greek Capital Letter Sigma
0x03a4 0xd4 ;Greek Capital Letter Tau
0x03a5 0xd5 ;Greek Capital Letter Upsilon
0x03a6 0xd6 ;Greek Capital Letter Phi
0x03a7 0xd7 ;Greek Capital Letter Chi
0x03a8 0xd8 ;Greek Capital Letter Psi
0x03a9 0xd9 ;Greek Capital Letter Omega
0x03aa 0xda ;Greek Capital Letter Iota With Dialytika
0x03ab 0xdb ;Greek Capital Letter Upsilon With Dialytika
0x03ac 0xdc ;Greek Small Letter Alpha With Tonos
0x03ad 0xdd ;Greek Small Letter Epsilon With Tonos
0x03ae 0xde ;Greek Small Letter Eta With Tonos
0x03af 0xdf ;Greek Small Letter Iota With Tonos
0x03b0 0xe0 ;Greek Small Letter Upsilon With Dialytika And Tonos
0x03b1 0xe1 ;Greek Small Letter Alpha
0x03b2 0xe2 ;Greek Small Letter Beta
0x03b3 0xe3 ;Greek Small Letter Gamma
0x03b4 0xe4 ;Greek Small Letter Delta
0x03b5 0xe5 ;Greek Small Letter Epsilon
0x03b6 0xe6 ;Greek Small Letter Zeta
0x03b7 0xe7 ;Greek Small Letter Eta
0x03b8 0xe8 ;Greek Small Letter Theta
0x03b9 0xe9 ;Greek Small Letter Iota
0x03ba 0xea ;Greek Small Letter Kappa
0x03bb 0xeb ;Greek Small Letter Lamda
0x03bc 0xec ;Greek Small Letter Mu
0x03bd 0xed ;Greek Small Letter Nu
0x03be 0xee ;Greek Small Letter Xi
0x03bf 0xef ;Greek Small Letter Omicron
0x03c0 0xf0 ;Greek Small Letter Pi
0x03c1 0xf1 ;Greek Small Letter Rho
0x03c2 0xf2 ;Greek Small Letter Final Sigma
0x03c3 0xf3 ;Greek Small Letter Sigma
0x03c4 0xf4 ;Greek Small Letter Tau
0x03c5 0xf5 ;Greek Small Letter Upsilon
0x03c6 0xf6 ;Greek Small Letter Phi
0x03c7 0xf7 ;Greek Small Letter Chi
0x03c8 0xf8 ;Greek Small Letter Psi
0x03c9 0xf9 ;Greek Small Letter Omega
0x03ca 0xfa ;Greek Small Letter Iota With Dialytika
0x03cb 0xfb ;Greek Small Letter Upsilon With Dialytika
0x03cc 0xfc ;Greek Small Letter Omicron With Tonos
0x03cd 0xfd ;Greek Small Letter Upsilon With Tonos
0x03ce 0xfe ;Greek Small Letter Omega With Tonos
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2015 0xaf ;Horizontal Bar
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x203c 0x21 ;Double Exclamation Mark
0x20ac 0x80 ;Euro Sign
0x2122 0x99 ;Trade Mark Sign
0x2190 0x3c ;Leftwards Arrow
0x2191 0x5e ;Upwards Arrow
0x2192 0x3e ;Rightwards Arrow
0x2193 0x76 ;Downwards Arrow
0x2194 0x2d ;Left Right Arrow
0x2195 0xa6 ;Up Down Arrow
0x21a8 0xa6 ;Up Down Arrow With Base
0x221f 0x4c ;Right Angle
0x2302 0xa6 ;House
0x2500 0x2d ;Box Drawings Light Horizontal
0x2502 0xa6 ;Box Drawings Light Vertical
0x250c 0x2d ;Box Drawings Light Down And Right
0x2510 0xac ;Box Drawings Light Down And Left
0x2514 0x4c ;Box Drawings Light Up And Right
0x2518 0x2d ;Box Drawings Light Up And Left
0x251c 0x2b ;Box Drawings Light Vertical And Right
0x2524 0x2b ;Box Drawings Light Vertical And Left
0x252c 0x54 ;Box Drawings Light Down And Horizontal
0x2534 0x2b ;Box Drawings Light Up And Horizontal
0x253c 0x2b ;Box Drawings Light Vertical And Horizontal
0x2550 0x3d ;Box Drawings Double Horizontal
0x2551 0xa6 ;Box Drawings Double Vertical
0x2554 0x2d ;Box Drawings Double Down And Right
0x2557 0xac ;Box Drawings Double Down And Left
0x255a 0x4c ;Box Drawings Double Up And Right
0x255d 0x2d ;Box Drawings Double Up And Left
0x2560 0xa6 ;Box Drawings Double Vertical And Right
0x2563 0xa6 ;Box Drawings Double Vertical And Left
0x2566 0x54 ;Box Drawings Double Down And Horizontal
0x2569 0xa6 ;Box Drawings Double Up And Horizontal
0x256c 0x2b ;Box Drawings Double Vertical And Horizontal
0x2580 0x2d ;Upper Half Block
0x2584 0x2d ;Lower Half Block
0x2588 0x2d ;Full Block
0x2591 0x2d ;Light Shade
0x2592 0x2d ;Medium Shade
0x2593 0x2d ;Dark Shade
0x25a0 0xa6 ;Black Square
0x25ac 0x2d ;Black Rectangle
0x25b2 0x5e ;Black Up-Pointing Triangle
0x25ba 0x3e ;Black Right-Pointing Pointer
0x25bc 0xa1 ;Black Down-Pointing Triangle
0x25c4 0x3c ;Black Left-Pointing Pointer
0x25cb 0x30 ;White Circle
0x25d8 0x95 ;Inverse Bullet
0x25d9 0x30 ;Inverse White Circle
0x263a 0x4f ;White Smiling Face
0x263b 0x4f ;Black Smiling Face
0x263c 0x30 ;White Sun With Rays
0x2640 0x2b ;Female Sign
0x2642 0x3e ;Male Sign
0x2660 0xa6 ;Black Spade Suit
0x2663 0xa6 ;Black Club Suit
0x2665 0xa6 ;Black Heart Suit
0x2666 0xa6 ;Black Diamond Suit
0x266a 0x64 ;Eighth Note
0x266b 0x64 ;Beamed Eighth Notes
0xf8f9 0xaa ;EUDC -> Undefined
0xf8fa 0xd2 ;EUDC -> Undefined
0xf8fb 0xff ;EUDC -> Undefined
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,963 @@
CODEPAGE 1254 ;Turkish - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0192 ;Latin Small Letter F With Hook
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x02c6 ;Modifier Letter Circumflex Accent
0x89 0x2030 ;Per Mille Sign
0x8a 0x0160 ;Latin Capital Letter S With Caron
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x0152 ;Latin Capital Ligature Oe
0x8d 0x008d
0x8e 0x008e
0x8f 0x008f
0x90 0x0090
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x02dc ;Small Tilde
0x99 0x2122 ;Trade Mark Sign
0x9a 0x0161 ;Latin Small Letter S With Caron
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x0153 ;Latin Small Ligature Oe
0x9d 0x009d
0x9e 0x009e
0x9f 0x0178 ;Latin Capital Letter Y With Diaeresis
0xa0 0x00a0 ;No-Break Space
0xa1 0x00a1 ;Inverted Exclamation Mark
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x00aa ;Feminine Ordinal Indicator
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x00af ;Macron
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x00b2 ;Superscript Two
0xb3 0x00b3 ;Superscript Three
0xb4 0x00b4 ;Acute Accent
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Cedilla
0xb9 0x00b9 ;Superscript One
0xba 0x00ba ;Masculine Ordinal Indicator
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x00bc ;Vulgar Fraction One Quarter
0xbd 0x00bd ;Vulgar Fraction One Half
0xbe 0x00be ;Vulgar Fraction Three Quarters
0xbf 0x00bf ;Inverted Question Mark
0xc0 0x00c0 ;Latin Capital Letter A With Grave
0xc1 0x00c1 ;Latin Capital Letter A With Acute
0xc2 0x00c2 ;Latin Capital Letter A With Circumflex
0xc3 0x00c3 ;Latin Capital Letter A With Tilde
0xc4 0x00c4 ;Latin Capital Letter A With Diaeresis
0xc5 0x00c5 ;Latin Capital Letter A With Ring Above
0xc6 0x00c6 ;Latin Capital Ligature Ae
0xc7 0x00c7 ;Latin Capital Letter C With Cedilla
0xc8 0x00c8 ;Latin Capital Letter E With Grave
0xc9 0x00c9 ;Latin Capital Letter E With Acute
0xca 0x00ca ;Latin Capital Letter E With Circumflex
0xcb 0x00cb ;Latin Capital Letter E With Diaeresis
0xcc 0x00cc ;Latin Capital Letter I With Grave
0xcd 0x00cd ;Latin Capital Letter I With Acute
0xce 0x00ce ;Latin Capital Letter I With Circumflex
0xcf 0x00cf ;Latin Capital Letter I With Diaeresis
0xd0 0x011e ;Latin Capital Letter G With Breve
0xd1 0x00d1 ;Latin Capital Letter N With Tilde
0xd2 0x00d2 ;Latin Capital Letter O With Grave
0xd3 0x00d3 ;Latin Capital Letter O With Acute
0xd4 0x00d4 ;Latin Capital Letter O With Circumflex
0xd5 0x00d5 ;Latin Capital Letter O With Tilde
0xd6 0x00d6 ;Latin Capital Letter O With Diaeresis
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x00d8 ;Latin Capital Letter O With Stroke
0xd9 0x00d9 ;Latin Capital Letter U With Grave
0xda 0x00da ;Latin Capital Letter U With Acute
0xdb 0x00db ;Latin Capital Letter U With Circumflex
0xdc 0x00dc ;Latin Capital Letter U With Diaeresis
0xdd 0x0130 ;Latin Capital Letter I With Dot Above
0xde 0x015e ;Latin Capital Letter S With Cedilla
0xdf 0x00df ;Latin Small Letter Sharp S
0xe0 0x00e0 ;Latin Small Letter A With Grave
0xe1 0x00e1 ;Latin Small Letter A With Acute
0xe2 0x00e2 ;Latin Small Letter A With Circumflex
0xe3 0x00e3 ;Latin Small Letter A With Tilde
0xe4 0x00e4 ;Latin Small Letter A With Diaeresis
0xe5 0x00e5 ;Latin Small Letter A With Ring Above
0xe6 0x00e6 ;Latin Small Ligature Ae
0xe7 0x00e7 ;Latin Small Letter C With Cedilla
0xe8 0x00e8 ;Latin Small Letter E With Grave
0xe9 0x00e9 ;Latin Small Letter E With Acute
0xea 0x00ea ;Latin Small Letter E With Circumflex
0xeb 0x00eb ;Latin Small Letter E With Diaeresis
0xec 0x00ec ;Latin Small Letter I With Grave
0xed 0x00ed ;Latin Small Letter I With Acute
0xee 0x00ee ;Latin Small Letter I With Circumflex
0xef 0x00ef ;Latin Small Letter I With Diaeresis
0xf0 0x011f ;Latin Small Letter G With Breve
0xf1 0x00f1 ;Latin Small Letter N With Tilde
0xf2 0x00f2 ;Latin Small Letter O With Grave
0xf3 0x00f3 ;Latin Small Letter O With Acute
0xf4 0x00f4 ;Latin Small Letter O With Circumflex
0xf5 0x00f5 ;Latin Small Letter O With Tilde
0xf6 0x00f6 ;Latin Small Letter O With Diaeresis
0xf7 0x00f7 ;Division Sign
0xf8 0x00f8 ;Latin Small Letter O With Stroke
0xf9 0x00f9 ;Latin Small Letter U With Grave
0xfa 0x00fa ;Latin Small Letter U With Acute
0xfb 0x00fb ;Latin Small Letter U With Circumflex
0xfc 0x00fc ;Latin Small Letter U With Diaeresis
0xfd 0x0131 ;Latin Small Letter Dotless I
0xfe 0x015f ;Latin Small Letter S With Cedilla
0xff 0x00ff ;Latin Small Letter Y With Diaeresis
WCTABLE 694
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x008d 0x8d
0x008e 0x8e
0x008f 0x8f
0x0090 0x90
0x009d 0x9d
0x009e 0x9e
0x00a0 0xa0 ;No-Break Space
0x00a1 0xa1 ;Inverted Exclamation Mark
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00aa 0xaa ;Feminine Ordinal Indicator
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00af 0xaf ;Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0xb2 ;Superscript Two
0x00b3 0xb3 ;Superscript Three
0x00b4 0xb4 ;Acute Accent
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Cedilla
0x00b9 0xb9 ;Superscript One
0x00ba 0xba ;Masculine Ordinal Indicator
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00bc 0xbc ;Vulgar Fraction One Quarter
0x00bd 0xbd ;Vulgar Fraction One Half
0x00be 0xbe ;Vulgar Fraction Three Quarters
0x00bf 0xbf ;Inverted Question Mark
0x00c0 0xc0 ;Latin Capital Letter A With Grave
0x00c1 0xc1 ;Latin Capital Letter A With Acute
0x00c2 0xc2 ;Latin Capital Letter A With Circumflex
0x00c3 0xc3 ;Latin Capital Letter A With Tilde
0x00c4 0xc4 ;Latin Capital Letter A With Diaeresis
0x00c5 0xc5 ;Latin Capital Letter A With Ring Above
0x00c6 0xc6 ;Latin Capital Ligature Ae
0x00c7 0xc7 ;Latin Capital Letter C With Cedilla
0x00c8 0xc8 ;Latin Capital Letter E With Grave
0x00c9 0xc9 ;Latin Capital Letter E With Acute
0x00ca 0xca ;Latin Capital Letter E With Circumflex
0x00cb 0xcb ;Latin Capital Letter E With Diaeresis
0x00cc 0xcc ;Latin Capital Letter I With Grave
0x00cd 0xcd ;Latin Capital Letter I With Acute
0x00ce 0xce ;Latin Capital Letter I With Circumflex
0x00cf 0xcf ;Latin Capital Letter I With Diaeresis
0x00d1 0xd1 ;Latin Capital Letter N With Tilde
0x00d2 0xd2 ;Latin Capital Letter O With Grave
0x00d3 0xd3 ;Latin Capital Letter O With Acute
0x00d4 0xd4 ;Latin Capital Letter O With Circumflex
0x00d5 0xd5 ;Latin Capital Letter O With Tilde
0x00d6 0xd6 ;Latin Capital Letter O With Diaeresis
0x00d7 0xd7 ;Multiplication Sign
0x00d8 0xd8 ;Latin Capital Letter O With Stroke
0x00d9 0xd9 ;Latin Capital Letter U With Grave
0x00da 0xda ;Latin Capital Letter U With Acute
0x00db 0xdb ;Latin Capital Letter U With Circumflex
0x00dc 0xdc ;Latin Capital Letter U With Diaeresis
0x00dd 0x59 ;Latin Capital Letter Y With Acute
0x00df 0xdf ;Latin Small Letter Sharp S
0x00e0 0xe0 ;Latin Small Letter A With Grave
0x00e1 0xe1 ;Latin Small Letter A With Acute
0x00e2 0xe2 ;Latin Small Letter A With Circumflex
0x00e3 0xe3 ;Latin Small Letter A With Tilde
0x00e4 0xe4 ;Latin Small Letter A With Diaeresis
0x00e5 0xe5 ;Latin Small Letter A With Ring Above
0x00e6 0xe6 ;Latin Small Ligature Ae
0x00e7 0xe7 ;Latin Small Letter C With Cedilla
0x00e8 0xe8 ;Latin Small Letter E With Grave
0x00e9 0xe9 ;Latin Small Letter E With Acute
0x00ea 0xea ;Latin Small Letter E With Circumflex
0x00eb 0xeb ;Latin Small Letter E With Diaeresis
0x00ec 0xec ;Latin Small Letter I With Grave
0x00ed 0xed ;Latin Small Letter I With Acute
0x00ee 0xee ;Latin Small Letter I With Circumflex
0x00ef 0xef ;Latin Small Letter I With Diaeresis
0x00f1 0xf1 ;Latin Small Letter N With Tilde
0x00f2 0xf2 ;Latin Small Letter O With Grave
0x00f3 0xf3 ;Latin Small Letter O With Acute
0x00f4 0xf4 ;Latin Small Letter O With Circumflex
0x00f5 0xf5 ;Latin Small Letter O With Tilde
0x00f6 0xf6 ;Latin Small Letter O With Diaeresis
0x00f7 0xf7 ;Division Sign
0x00f8 0xf8 ;Latin Small Letter O With Stroke
0x00f9 0xf9 ;Latin Small Letter U With Grave
0x00fa 0xfa ;Latin Small Letter U With Acute
0x00fb 0xfb ;Latin Small Letter U With Circumflex
0x00fc 0xfc ;Latin Small Letter U With Diaeresis
0x00fd 0x79 ;Latin Small Letter Y With Acute
0x00ff 0xff ;Latin Small Letter Y With Diaeresis
0x0100 0x41 ;Latin Capital Letter A With Macron
0x0101 0x61 ;Latin Small Letter A With Macron
0x0102 0x41 ;Latin Capital Letter A With Breve
0x0103 0x61 ;Latin Small Letter A With Breve
0x0104 0x41 ;Latin Capital Letter A With Ogonek
0x0105 0x61 ;Latin Small Letter A With Ogonek
0x0106 0x43 ;Latin Capital Letter C With Acute
0x0107 0x63 ;Latin Small Letter C With Acute
0x0108 0x43 ;Latin Capital Letter C With Circumflex
0x0109 0x63 ;Latin Small Letter C With Circumflex
0x010a 0x43 ;Latin Capital Letter C With Dot Above
0x010b 0x63 ;Latin Small Letter C With Dot Above
0x010c 0x43 ;Latin Capital Letter C With Caron
0x010d 0x63 ;Latin Small Letter C With Caron
0x010e 0x44 ;Latin Capital Letter D With Caron
0x010f 0x64 ;Latin Small Letter D With Caron
0x0110 0x44 ;Latin Capital Letter D With Stroke
0x0111 0x64 ;Latin Small Letter D With Stroke
0x0112 0x45 ;Latin Capital Letter E With Macron
0x0113 0x65 ;Latin Small Letter E With Macron
0x0114 0x45 ;Latin Capital Letter E With Breve
0x0115 0x65 ;Latin Small Letter E With Breve
0x0116 0x45 ;Latin Capital Letter E With Dot Above
0x0117 0x65 ;Latin Small Letter E With Dot Above
0x0118 0x45 ;Latin Capital Letter E With Ogonek
0x0119 0x65 ;Latin Small Letter E With Ogonek
0x011a 0x45 ;Latin Capital Letter E With Caron
0x011b 0x65 ;Latin Small Letter E With Caron
0x011c 0x47 ;Latin Capital Letter G With Circumflex
0x011d 0x67 ;Latin Small Letter G With Circumflex
0x011e 0xd0 ;Latin Capital Letter G With Breve
0x011f 0xf0 ;Latin Small Letter G With Breve
0x0120 0x47 ;Latin Capital Letter G With Dot Above
0x0121 0x67 ;Latin Small Letter G With Dot Above
0x0122 0x47 ;Latin Capital Letter G With Cedilla
0x0123 0x67 ;Latin Small Letter G With Cedilla
0x0124 0x48 ;Latin Capital Letter H With Circumflex
0x0125 0x68 ;Latin Small Letter H With Circumflex
0x0126 0x48 ;Latin Capital Letter H With Stroke
0x0127 0x68 ;Latin Small Letter H With Stroke
0x0128 0x49 ;Latin Capital Letter I With Tilde
0x0129 0x69 ;Latin Small Letter I With Tilde
0x012a 0x49 ;Latin Capital Letter I With Macron
0x012b 0x69 ;Latin Small Letter I With Macron
0x012c 0x49 ;Latin Capital Letter I With Breve
0x012d 0x69 ;Latin Small Letter I With Breve
0x012e 0x49 ;Latin Capital Letter I With Ogonek
0x012f 0x69 ;Latin Small Letter I With Ogonek
0x0130 0xdd ;Latin Capital Letter I With Dot Above
0x0131 0xfd ;Latin Small Letter Dotless I
0x0134 0x4a ;Latin Capital Letter J With Circumflex
0x0135 0x6a ;Latin Small Letter J With Circumflex
0x0136 0x4b ;Latin Capital Letter K With Cedilla
0x0137 0x6b ;Latin Small Letter K With Cedilla
0x0139 0x4c ;Latin Capital Letter L With Acute
0x013a 0x6c ;Latin Small Letter L With Acute
0x013b 0x4c ;Latin Capital Letter L With Cedilla
0x013c 0x6c ;Latin Small Letter L With Cedilla
0x013d 0x4c ;Latin Capital Letter L With Caron
0x013e 0x6c ;Latin Small Letter L With Caron
0x0141 0x4c ;Latin Capital Letter L With Stroke
0x0142 0x6c ;Latin Small Letter L With Stroke
0x0143 0x4e ;Latin Capital Letter N With Acute
0x0144 0x6e ;Latin Small Letter N With Acute
0x0145 0x4e ;Latin Capital Letter N With Cedilla
0x0146 0x6e ;Latin Small Letter N With Cedilla
0x0147 0x4e ;Latin Capital Letter N With Caron
0x0148 0x6e ;Latin Small Letter N With Caron
0x014c 0x4f ;Latin Capital Letter O With Macron
0x014d 0x6f ;Latin Small Letter O With Macron
0x014e 0x4f ;Latin Capital Letter O With Breve
0x014f 0x6f ;Latin Small Letter O With Breve
0x0150 0x4f ;Latin Capital Letter O With Double Acute
0x0151 0x6f ;Latin Small Letter O With Double Acute
0x0152 0x8c ;Latin Capital Ligature Oe
0x0153 0x9c ;Latin Small Ligature Oe
0x0154 0x52 ;Latin Capital Letter R With Acute
0x0155 0x72 ;Latin Small Letter R With Acute
0x0156 0x52 ;Latin Capital Letter R With Cedilla
0x0157 0x72 ;Latin Small Letter R With Cedilla
0x0158 0x52 ;Latin Capital Letter R With Caron
0x0159 0x72 ;Latin Small Letter R With Caron
0x015a 0x53 ;Latin Capital Letter S With Acute
0x015b 0x73 ;Latin Small Letter S With Acute
0x015c 0x53 ;Latin Capital Letter S With Circumflex
0x015d 0x73 ;Latin Small Letter S With Circumflex
0x015e 0xde ;Latin Capital Letter S With Cedilla
0x015f 0xfe ;Latin Small Letter S With Cedilla
0x0160 0x8a ;Latin Capital Letter S With Caron
0x0161 0x9a ;Latin Small Letter S With Caron
0x0162 0x54 ;Latin Capital Letter T With Cedilla
0x0163 0x74 ;Latin Small Letter T With Cedilla
0x0164 0x54 ;Latin Capital Letter T With Caron
0x0165 0x74 ;Latin Small Letter T With Caron
0x0166 0x54 ;Latin Capital Letter T With Stroke
0x0167 0x74 ;Latin Small Letter T With Stroke
0x0168 0x55 ;Latin Capital Letter U With Tilde
0x0169 0x75 ;Latin Small Letter U With Tilde
0x016a 0x55 ;Latin Capital Letter U With Macron
0x016b 0x75 ;Latin Small Letter U With Macron
0x016c 0x55 ;Latin Capital Letter U With Breve
0x016d 0x75 ;Latin Small Letter U With Breve
0x016e 0x55 ;Latin Capital Letter U With Ring Above
0x016f 0x75 ;Latin Small Letter U With Ring Above
0x0170 0x55 ;Latin Capital Letter U With Double Acute
0x0171 0x75 ;Latin Small Letter U With Double Acute
0x0172 0x55 ;Latin Capital Letter U With Ogonek
0x0173 0x75 ;Latin Small Letter U With Ogonek
0x0174 0x57 ;Latin Capital Letter W With Circumflex
0x0175 0x77 ;Latin Small Letter W With Circumflex
0x0176 0x59 ;Latin Capital Letter Y With Circumflex
0x0177 0x79 ;Latin Small Letter Y With Circumflex
0x0178 0x9f ;Latin Capital Letter Y With Diaeresis
0x0179 0x5a ;Latin Capital Letter Z With Acute
0x017a 0x7a ;Latin Small Letter Z With Acute
0x017b 0x5a ;Latin Capital Letter Z With Dot Above
0x017c 0x7a ;Latin Small Letter Z With Dot Above
0x017d 0x5a ;Latin Capital Letter Z With Caron
0x017e 0x7a ;Latin Small Letter Z With Caron
0x0180 0x62 ;Latin Small Letter B With Stroke
0x0189 0x44 ;Latin Capital Letter African D
0x0191 0x83 ;Latin Capital Letter F With Hook
0x0192 0x83 ;Latin Small Letter F With Hook
0x0197 0x49 ;Latin Capital Letter I With Stroke
0x019a 0x6c ;Latin Small Letter L With Bar
0x019f 0x4f ;Latin Capital Letter O With Middle Tilde
0x01a0 0x4f ;Latin Capital Letter O With Horn
0x01a1 0x6f ;Latin Small Letter O With Horn
0x01ab 0x74 ;Latin Small Letter T With Palatal Hook
0x01ae 0x54 ;Latin Capital Letter T With Retroflex Hook
0x01af 0x55 ;Latin Capital Letter U With Horn
0x01b0 0x75 ;Latin Small Letter U With Horn
0x01b6 0x7a ;Latin Small Letter Z With Stroke
0x01c0 0x7c ;Latin Letter Dental Click
0x01c3 0x21 ;Latin Letter Retroflex Click
0x01cd 0x41 ;Latin Capital Letter A With Caron
0x01ce 0x61 ;Latin Small Letter A With Caron
0x01cf 0x49 ;Latin Capital Letter I With Caron
0x01d0 0x69 ;Latin Small Letter I With Caron
0x01d1 0x4f ;Latin Capital Letter O With Caron
0x01d2 0x6f ;Latin Small Letter O With Caron
0x01d3 0x55 ;Latin Capital Letter U With Caron
0x01d4 0x75 ;Latin Small Letter U With Caron
0x01d5 0x55 ;Latin Capital Letter U With Diaeresis And Macron
0x01d6 0x75 ;Latin Small Letter U With Diaeresis And Macron
0x01d7 0x55 ;Latin Capital Letter U With Diaeresis And Acute
0x01d8 0x75 ;Latin Small Letter U With Diaeresis And Acute
0x01d9 0x55 ;Latin Capital Letter U With Diaeresis And Caron
0x01da 0x75 ;Latin Small Letter U With Diaeresis And Caron
0x01db 0x55 ;Latin Capital Letter U With Diaeresis And Grave
0x01dc 0x75 ;Latin Small Letter U With Diaeresis And Grave
0x01de 0x41 ;Latin Capital Letter A With Diaeresis And Macron
0x01df 0x61 ;Latin Small Letter A With Diaeresis And Macron
0x01e4 0x47 ;Latin Capital Letter G With Stroke
0x01e5 0x67 ;Latin Small Letter G With Stroke
0x01e6 0x47 ;Latin Capital Letter G With Caron
0x01e7 0x67 ;Latin Small Letter G With Caron
0x01e8 0x4b ;Latin Capital Letter K With Caron
0x01e9 0x6b ;Latin Small Letter K With Caron
0x01ea 0x4f ;Latin Capital Letter O With Ogonek
0x01eb 0x6f ;Latin Small Letter O With Ogonek
0x01ec 0x4f ;Latin Capital Letter O With Ogonek And Macron
0x01ed 0x6f ;Latin Small Letter O With Ogonek And Macron
0x01f0 0x6a ;Latin Small Letter J With Caron
0x0261 0x67 ;Latin Small Letter Script G
0x02b9 0x27 ;Modifier Letter Prime
0x02ba 0x22 ;Modifier Letter Double Prime
0x02bb 0x91 ;Modifier Letter Turned Comma
0x02bc 0x27 ;Modifier Letter Apostrophe
0x02c4 0x5e ;Modifier Letter Up Arrowhead
0x02c6 0x88 ;Modifier Letter Circumflex Accent
0x02c7 0x5e ;Caron
0x02c8 0x27 ;Modifier Letter Vertical Line
0x02c9 0xaf ;Modifier Letter Macron
0x02ca 0xb4 ;Modifier Letter Acute Accent
0x02cb 0x60 ;Modifier Letter Grave Accent
0x02cd 0x5f ;Modifier Letter Low Macron
0x02d8 0x5e ;Circumflex Accent
0x02d9 0x27 ;Dot Above
0x02da 0xb0 ;Ring Above
0x02db 0xb8 ;Ogonek
0x02dc 0x98 ;Small Tilde
0x02dd 0xa8 ;Double Acute Accent
0x0300 0x60 ;Combining Grave Accent
0x0301 0xb4 ;Combining Acute Accent
0x0302 0x5e ;Combining Circumflex Accent
0x0303 0x98 ;Combining Tilde
0x0304 0xaf ;Combining Macron
0x0305 0xaf ;Combining Overline
0x0306 0x88 ;Combining Breve
0x0307 0xb7 ;Combining Dot Above
0x0308 0xa8 ;Combining Diaeresis
0x030a 0xa7 ;Combining Ring Above
0x030c 0x88 ;Combining Caron
0x030e 0xa8 ;Combining Double Vertical Line Above
0x0327 0xb8 ;Combining Cedilla
0x0331 0x5f ;Combining Macron Below
0x0332 0x5f ;Combining Low Line
0x03b2 0xdf ;Greek Small Letter Beta
0x03bc 0xb5 ;Greek Small Letter Mu
0x04bb 0x68 ;Cyrillic Small Letter Shha
0x0589 0x3a ;Armenian Full Stop
0x066a 0x25 ;Arabic Percent Sign
0x2000 0x20 ;En Quad
0x2001 0x20 ;Em Quad
0x2002 0x20 ;En Space
0x2003 0x20 ;Em Space
0x2004 0x20 ;Three-Per-Em Space
0x2005 0x20 ;Four-Per-Em Space
0x2006 0x20 ;Six-Per-Em Space
0x2010 0x2d ;Hyphen
0x2011 0x2d ;Non-Breaking Hyphen
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2024 0x95 ;One Dot Leader
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2032 0x27 ;Prime
0x2033 0xa8 ;Double Prime
0x2035 0x60 ;Reversed Prime
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x203c 0x21 ;Double Exclamation Mark
0x2044 0x2f ;Fraction Slash
0x2070 0xb0 ;Superscript Zero
0x2074 0x34 ;Superscript Four
0x2075 0x35 ;Superscript Five
0x2076 0x36 ;Superscript Six
0x2077 0x37 ;Superscript Seven
0x2078 0x38 ;Superscript Eight
0x2080 0xb0 ;Subscript Zero
0x2081 0x30 ;Subscript One
0x2082 0xb2 ;Subscript Two
0x2083 0xb3 ;Subscript Three
0x2084 0x34 ;Subscript Four
0x2085 0x35 ;Subscript Five
0x2086 0x36 ;Subscript Six
0x2087 0x37 ;Subscript Seven
0x2088 0x38 ;Subscript Eight
0x2089 0x39 ;Subscript Nine
0x20ac 0x80 ;Euro Sign
0x20a4 0xa3 ;Lira Sign
0x2102 0x43 ;Double-Struck Capital C
0x2107 0x45 ;Euler Constant
0x210a 0x67 ;Script Small G
0x210b 0x48 ;Script Capital H
0x210c 0x48 ;Black-Letter Capital H
0x210d 0x48 ;Double-Struck Capital H
0x210e 0x68 ;Planck Constant
0x2110 0x49 ;Script Capital I
0x2111 0x49 ;Black-Letter Capital I
0x2112 0x4c ;Script Capital L
0x2113 0x6c ;Script Small L
0x2115 0x4e ;Double-Struck Capital N
0x2118 0x50 ;Script Capital P
0x2119 0x50 ;Double-Struck Capital P
0x211a 0x51 ;Double-Struck Capital Q
0x211b 0x52 ;Script Capital R
0x211c 0x52 ;Black-Letter Capital R
0x211d 0x52 ;Double-Struck Capital R
0x2122 0x99 ;Trade Mark Sign
0x2124 0x5a ;Double-Struck Capital Z
0x2128 0x5a ;Black-Letter Capital Z
0x212a 0x4b ;Kelvin Sign
0x212b 0xc5 ;Angstrom Sign
0x212c 0x42 ;Script Capital B
0x212d 0x43 ;Black-Letter Capital C
0x212e 0x65 ;Estimated Symbol
0x212f 0x65 ;Script Small E
0x2130 0x45 ;Script Capital E
0x2131 0x46 ;Script Capital F
0x2133 0x4d ;Script Capital M
0x2134 0x6f ;Script Small O
0x2190 0x8b ;Leftwards Arrow
0x2191 0x5e ;Upwards Arrow
0x2192 0x9b ;Rightwards Arrow
0x2193 0x76 ;Downwards Arrow
0x2194 0x2d ;Left Right Arrow
0x2195 0x7c ;Up Down Arrow
0x21a8 0x7c ;Up Down Arrow With Base
0x2205 0xd8 ;Empty Set
0x2212 0x2d ;Minus Sign
0x2213 0xb1 ;Minus-Or-Plus Sign
0x2215 0x2f ;Division Slash
0x2216 0x5c ;Set Minus
0x2217 0x2a ;Asterisk Operator
0x2218 0xb0 ;Ring Operator
0x2219 0x95 ;Bullet Operator
0x221f 0x4c ;Right Angle
0x2223 0x7c ;Divides
0x2236 0x3a ;Ratio
0x223c 0x7e ;Tilde Operator
0x226a 0xab ;Much Less-Than
0x226b 0xbb ;Much Greater-Than
0x22c5 0xb7 ;Dot Operator
0x2302 0xa6 ;House
0x2303 0x5e ;Up Arrowhead
0x2329 0x3c ;Left-Pointing Angle Bracket
0x232a 0x3e ;Right-Pointing Angle Bracket
0x2500 0xa6 ;Box Drawings Light Horizontal
0x2502 0x2d ;Box Drawings Light Vertical
0x250c 0x2d ;Box Drawings Light Down And Right
0x2510 0xac ;Box Drawings Light Down And Left
0x2514 0x4c ;Box Drawings Light Up And Right
0x2518 0x2d ;Box Drawings Light Up And Left
0x251c 0x2b ;Box Drawings Light Vertical And Right
0x2524 0x2b ;Box Drawings Light Vertical And Left
0x252c 0x54 ;Box Drawings Light Down And Horizontal
0x2534 0x2b ;Box Drawings Light Up And Horizontal
0x253c 0x2b ;Box Drawings Light Vertical And Horizontal
0x2550 0x3d ;Box Drawings Double Horizontal
0x2551 0xa6 ;Box Drawings Double Vertical
0x2554 0x2d ;Box Drawings Double Down And Right
0x2557 0xac ;Box Drawings Double Down And Left
0x255a 0x4c ;Box Drawings Double Up And Right
0x255d 0x2d ;Box Drawings Double Up And Left
0x2560 0xa6 ;Box Drawings Double Vertical And Right
0x2563 0xa6 ;Box Drawings Double Vertical And Left
0x2566 0x54 ;Box Drawings Double Down And Horizontal
0x2569 0xa6 ;Box Drawings Double Up And Horizontal
0x256c 0x2b ;Box Drawings Double Vertical And Horizontal
0x2580 0x2d ;Upper Half Block
0x2584 0x2d ;Lower Half Block
0x2588 0x2d ;Full Block
0x2591 0x2d ;Light Shade
0x2592 0x2d ;Medium Shade
0x2593 0x2d ;Dark Shade
0x25a0 0xa6 ;Black Square
0x25ac 0x2d ;Black Rectangle
0x25b2 0x5e ;Black Up-Pointing Triangle
0x25ba 0x3e ;Black Right-Pointing Pointer
0x25bc 0xa1 ;Black Down-Pointing Triangle
0x25c4 0x3c ;Black Left-Pointing Pointer
0x25cb 0x30 ;White Circle
0x25d8 0x95 ;Inverse Bullet
0x25d9 0x30 ;Inverse White Circle
0x263a 0x4f ;White Smiling Face
0x263b 0x4f ;Black Smiling Face
0x263c 0x30 ;White Sun With Rays
0x2640 0x2b ;Female Sign
0x2642 0x3e ;Male Sign
0x2660 0xa6 ;Black Spade Suit
0x2663 0xa6 ;Black Club Suit
0x2665 0xa6 ;Black Heart Suit
0x2666 0xa6 ;Black Diamond Suit
0x266a 0x64 ;Eighth Note
0x266b 0x64 ;Beamed Eighth Notes
0x2758 0x7c ;Light Vertical Bar
0x275b 0x91 ;Heavy Single Turned Comma Quotation Mark Ornament
0x275c 0x92 ;Heavy Single Comma Quotation Mark Ornament
0x275d 0x93 ;Heavy Double Turned Comma Quotation Mark Ornament
0x275e 0x94 ;Heavy Double Comma Quotation Mark Ornament
0x3000 0x20 ;Ideographic Space
0x3008 0x3c ;Left Angle Bracket
0x3009 0x3e ;Right Angle Bracket
0x300a 0xab ;Left Double Angle Bracket
0x300b 0xbb ;Right Double Angle Bracket
0x301a 0x5b ;Left White Square Bracket
0x301b 0x3d ;Right White Square Bracket
0x301d 0x22 ;Reversed Double Prime Quotation Mark
0x301e 0x22 ;Double Prime Quotation Mark
0x301f 0x84 ;Low Double Prime Quotation Mark
0x30fb 0xb7 ;Katakana Middle Dot
0x30fc 0x97 ;Katakana-Hiragana Prolonged Sound Mark
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,620 @@
CODEPAGE 1255 ;Hebrew - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe-Quote
0x28 0x0028 ;Opening Parenthesis
0x29 0x0029 ;Closing Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Period
0x2f 0x002f ;Slash
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Opening Square Bracket
0x5c 0x005c ;Backslash
0x5d 0x005d ;Closing Square Bracket
0x5e 0x005e ;Spacing Circumflex
0x5f 0x005f ;Spacing Underscore
0x60 0x0060 ;Spacing Grave
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Opening Curly Bracket
0x7c 0x007c ;Vertical Bar
0x7d 0x007d ;Closing Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081 ;Undefined -> Control
0x82 0x201a ;Low Single Comma Quotation Mark
0x83 0x0192 ;Latin Small Letter Script F
0x84 0x201e ;Low Double Comma Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x02c6 ;Modifier Letter Circumflex
0x89 0x2030 ;Per Mille Sign
0x8a 0x008a ;Undefined -> Control
0x8b 0x2039 ;Left Pointing Single Guillemet
0x8c 0x008c ;Undefined -> Control
0x8d 0x008d ;Undefined -> Control
0x8e 0x008e ;Undefined -> Control
0x8f 0x008f ;Undefined -> Control
0x90 0x0090 ;Undefined -> Control
0x91 0x2018 ;Single Turned Comma Quotation Mark
0x92 0x2019 ;Single Comma Quotation Mark
0x93 0x201c ;Double Turned Comma Quotation Mark
0x94 0x201d ;Double Comma Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x02dc ;Spacing Tilde
0x99 0x2122 ;Trademark
0x9a 0x009a ;Undefined -> Control
0x9b 0x203a ;Right Pointing Single Guillemet
0x9c 0x009c ;Undefined -> Control
0x9d 0x009d ;Undefined -> Control
0x9e 0x009e ;Undefined -> Control
0x9f 0x009f ;Undefined -> Control
0xa0 0x00a0 ;Non-Breaking Space
0xa1 0x00a1 ;Inverted Exclamation Mark
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x20aa ;New Sheqel Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Vertical Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Spacing Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x00d7 ;Multiplication Sign
0xab 0x00ab ;Left Pointing Guillemet
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Trade Mark Sign
0xaf 0x00af ;Spacing Macron
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Or-Minus Sign
0xb2 0x00b2 ;Superscript Digit Two
0xb3 0x00b3 ;Superscript Digit Three
0xb4 0x00b4 ;Spacing Acute
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Paragraph Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Spacing Cedilla
0xb9 0x00b9 ;Superscript Digit One
0xba 0x00f7 ;Division Sign
0xbb 0x00bb ;Right Pointing Guillemet
0xbc 0x00bc ;Fraction One Quarter
0xbd 0x00bd ;Fraction One Half
0xbe 0x00be ;Fraction Three Quarters
0xbf 0x00bf ;Inverted Question Mark
0xc0 0x05b0 ;Hebrew Point Sheva
0xc1 0x05b1 ;Hebrew Point Hataf Segol
0xc2 0x05b2 ;Hebrew Point Hataf Patah
0xc3 0x05b3 ;Hebrew Point Hataf Qamats
0xc4 0x05b4 ;Hebrew Point Hiriq
0xc5 0x05b5 ;Hebrew Point Tsere
0xc6 0x05b6 ;Hebrew Point Segol
0xc7 0x05b7 ;Hebrew Point Patah
0xc8 0x05b8 ;Hebrew Point Qamats
0xc9 0x05b9 ;Hebrew Point Holam
0xca 0x05ba ;Hebrew Point ????
0xcb 0x05bb ;Hebrew Point Qubuts
0xcc 0x05bc ;Hebrew Point Dagesh
0xcd 0x05bd ;Hebrew Point Meteg
0xce 0x05be ;Hebrew Punctuation Maqaf
0xcf 0x05bf ;Hebrew Point Rafe
0xd0 0x05c0 ;Hebrew Point Paseq
0xd1 0x05c1 ;Hebrew Point Shin Dot
0xd2 0x05c2 ;Hebrew Point Sin Dot
0xd3 0x05c3 ;Hebrew Punctuation Sof Pasuq
0xd4 0x05f0 ;Hebrew Ligature Yiddish Double Vav
0xd5 0x05f1 ;Hebrew Ligature Yiddish Vav Yod
0xd6 0x05f2 ;Hebrew Ligature Yiddish Double Yod
0xd7 0x05f3 ;Hebrew Punctuation Geresh
0xd8 0x05f4 ;Hebrew Punctuation Gershayim
0xd9 0xf88d ;Undefined -> EUDC
0xda 0xf88e ;Undefined -> EUDC
0xdb 0xf88f ;Undefined -> EUDC
0xdc 0xf890 ;Undefined -> EUDC
0xdd 0xf891 ;Undefined -> EUDC
0xde 0xf892 ;Undefined -> EUDC
0xdf 0xf893 ;Undefined -> EUDC
0xe0 0x05d0 ;Hebrew Letter Alef
0xe1 0x05d1 ;Hebrew Letter Bet
0xe2 0x05d2 ;Hebrew Letter Gimel
0xe3 0x05d3 ;Hebrew Letter Dalet
0xe4 0x05d4 ;Hebrew Letter He
0xe5 0x05d5 ;Hebrew Letter Vav
0xe6 0x05d6 ;Hebrew Letter Zayin
0xe7 0x05d7 ;Hebrew Letter Het
0xe8 0x05d8 ;Hebrew Letter Tet
0xe9 0x05d9 ;Hebrew Letter Yod
0xea 0x05da ;Hebrew Letter Final Kaf
0xeb 0x05db ;Hebrew Letter Kaf
0xec 0x05dc ;Hebrew Letter Lamed
0xed 0x05dd ;Hebrew Letter Final Mem
0xee 0x05de ;Hebrew Letter Mem
0xef 0x05df ;Hebrew Letter Final Nun
0xf0 0x05e0 ;Hebrew Letter Nun
0xf1 0x05e1 ;Hebrew Letter Samekh
0xf2 0x05e2 ;Hebrew Letter Ayin
0xf3 0x05e3 ;Hebrew Letter Final Pe
0xf4 0x05e4 ;Hebrew Letter Pe
0xf5 0x05e5 ;Hebrew Letter Final Tsadi
0xf6 0x05e6 ;Hebrew Letter Tsadi
0xf7 0x05e7 ;Hebrew Letter Qof
0xf8 0x05e8 ;Hebrew Letter Resh
0xf9 0x05e9 ;Hebrew Letter Shin
0xfa 0x05ea ;Hebrew Letter Tav
0xfb 0xf894 ;Undefined -> EUDC
0xfc 0xf895 ;Undefined -> EUDC
0xfd 0x200e ;Left-To-Right Mark
0xfe 0x200f ;Right-To-Left Mark
0xff 0xf896 ;Undefined -> EUDC
WCTABLE 352
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe-Quote
0x0028 0x28 ;Opening Parenthesis
0x0029 0x29 ;Closing Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Period
0x002f 0x2f ;Slash
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Opening Square Bracket
0x005c 0x5c ;Backslash
0x005d 0x5d ;Closing Square Bracket
0x005e 0x5e ;Spacing Circumflex
0x005f 0x5f ;Spacing Underscore
0x0060 0x60 ;Spacing Grave
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Opening Curly Bracket
0x007c 0x7c ;Vertical Bar
0x007d 0x7d ;Closing Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81 ;Control -> Undefined
0x008a 0x8a ;Control -> Undefined
0x008c 0x8c ;Control -> Undefined
0x008d 0x8d ;Control -> Undefined
0x008e 0x8e ;Control -> Undefined
0x008f 0x8f ;Control -> Undefined
0x0090 0x90 ;Control -> Undefined
0x009a 0x9a ;Control -> Undefined
0x009c 0x9c ;Control -> Undefined
0x009d 0x9d ;Control -> Undefined
0x009e 0x9e ;Control -> Undefined
0x009f 0x9f ;Control -> Undefined
0x00a0 0xa0 ;Non-Breaking Space
0x00a1 0xa1 ;Inverted Exclamation Mark
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Best-fit International Currency to Sheqel
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Vertical Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Spacing Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00ab 0xab ;Left Pointing Guillemet
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Trade Mark Sign
0x00af 0xaf ;Spacing Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Or-Minus Sign
0x00b2 0xb2 ;Superscript Digit Two
0x00b3 0xb3 ;Superscript Digit Three
0x00b4 0xb4 ;Spacing Acute
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Paragraph Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Spacing Cedilla
0x00b9 0xb9 ;Superscript Digit One
0x00bb 0xbb ;Right Pointing Guillemet
0x00bc 0xbc ;Fraction One Quarter
0x00bd 0xbd ;Fraction One Half
0x00be 0xbe ;Fraction Three Quarters
0x00bf 0xbf ;Inverted Question Mark
0x00d7 0xaa ;Multiplication Sign
0x00f7 0xba ;Division Sign
0x0191 0x46 ;F Hook -> F(#25995)
0x0192 0x83 ;Latin Small Letter Script F
0x02c6 0x88 ;Modifier Letter Circumflex
0x02dc 0x98 ;Spacing Tilde
0x05b0 0xc0 ;Hebrew Point Sheva
0x05b1 0xc1 ;Hebrew Point Hataf Segol
0x05b2 0xc2 ;Hebrew Point Hataf Patah
0x05b3 0xc3 ;Hebrew Point Hataf Qamats
0x05b4 0xc4 ;Hebrew Point Hiriq
0x05b5 0xc5 ;Hebrew Point Tsere
0x05b6 0xc6 ;Hebrew Point Segol
0x05b7 0xc7 ;Hebrew Point Patah
0x05b8 0xc8 ;Hebrew Point Qamats
0x05b9 0xc9 ;Hebrew Point Holam
0x05ba 0xca ;Hebrew Point ????
0x05bb 0xcb ;Hebrew Point Qubuts
0x05bc 0xcc ;Hebrew Point Dagesh
0x05bd 0xcd ;Hebrew Point Meteg
0x05be 0xce ;Hebrew Punctuation Maqaf
0x05bf 0xcf ;Hebrew Point Rafe
0x05c0 0xd0 ;Hebrew Point Paseq
0x05c1 0xd1 ;Hebrew Point Shin Dot
0x05c2 0xd2 ;Hebrew Point Sin Dot
0x05c3 0xd3 ;Hebrew Punctuation Sof Pasuq
0x05d0 0xe0 ;Hebrew Letter Alef
0x05d1 0xe1 ;Hebrew Letter Bet
0x05d2 0xe2 ;Hebrew Letter Gimel
0x05d3 0xe3 ;Hebrew Letter Dalet
0x05d4 0xe4 ;Hebrew Letter He
0x05d5 0xe5 ;Hebrew Letter Vav
0x05d6 0xe6 ;Hebrew Letter Zayin
0x05d7 0xe7 ;Hebrew Letter Het
0x05d8 0xe8 ;Hebrew Letter Tet
0x05d9 0xe9 ;Hebrew Letter Yod
0x05da 0xea ;Hebrew Letter Final Kaf
0x05db 0xeb ;Hebrew Letter Kaf
0x05dc 0xec ;Hebrew Letter Lamed
0x05dd 0xed ;Hebrew Letter Final Mem
0x05de 0xee ;Hebrew Letter Mem
0x05df 0xef ;Hebrew Letter Final Nun
0x05e0 0xf0 ;Hebrew Letter Nun
0x05e1 0xf1 ;Hebrew Letter Samekh
0x05e2 0xf2 ;Hebrew Letter Ayin
0x05e3 0xf3 ;Hebrew Letter Final Pe
0x05e4 0xf4 ;Hebrew Letter Pe
0x05e5 0xf5 ;Hebrew Letter Final Tsadi
0x05e6 0xf6 ;Hebrew Letter Tsadi
0x05e7 0xf7 ;Hebrew Letter Qof
0x05e8 0xf8 ;Hebrew Letter Resh
0x05e9 0xf9 ;Hebrew Letter Shin
0x05ea 0xfa ;Hebrew Letter Tav
0x05f0 0xd4 ;Hebrew Ligature Yiddish Double Vav
0x05f1 0xd5 ;Hebrew Ligature Yiddish Vav Yod
0x05f2 0xd6 ;Hebrew Ligature Yiddish Double Yod
0x05f3 0xd7 ;Hebrew Punctuation Geresh
0x05f4 0xd8 ;Hebrew Punctuation Gershayim
0x200e 0xfd ;Left-To-Right Mark
0x200f 0xfe ;Right-To-Left Mark
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Single Turned Comma Quotation Mark
0x2019 0x92 ;Single Comma Quotation Mark
0x201a 0x82 ;Low Single Comma Quotation Mark
0x201c 0x93 ;Double Turned Comma Quotation Mark
0x201d 0x94 ;Double Comma Quotation Mark
0x201e 0x84 ;Low Double Comma Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Left Pointing Single Guillemet
0x203a 0x9b ;Right Pointing Single Guillemet
0x2122 0x99 ;Trademark
0x20ac 0x80 ;Euro Sign
0x20aa 0xa4 ;New Sheqel Sign
0xf88d 0xd9 ;EUDC -> Undefined
0xf88e 0xda ;EUDC -> Undefined
0xf88f 0xdb ;EUDC -> Undefined
0xf890 0xdc ;EUDC -> Undefined
0xf891 0xdd ;EUDC -> Undefined
0xf892 0xde ;EUDC -> Undefined
0xf893 0xdf ;EUDC -> Undefined
0xf894 0xfb ;EUDC -> Undefined
0xf895 0xfc ;EUDC -> Undefined
0xf896 0xff ;EUDC -> Undefined
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE

View File

@@ -0,0 +1,812 @@
CODEPAGE 1256 ;Arabic - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe-Quote
0x28 0x0028 ;Opening Parenthesis
0x29 0x0029 ;Closing Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Period
0x2f 0x002f ;Slash
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Opening Square Bracket
0x5c 0x005c ;Backslash
0x5d 0x005d ;Closing Square Bracket
0x5e 0x005e ;Spacing Circumflex
0x5f 0x005f ;Spacing Underscore
0x60 0x0060 ;Spacing Grave
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Opening Curly Bracket
0x7c 0x007c ;Vertical Bar
0x7d 0x007d ;Closing Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x067e ;Arabic Taa with Three Dots Below
0x82 0x201a ;Low Single Comma Quotation Mark
0x83 0x0192 ;Latin Small Letter Script F
0x84 0x201e ;Low Double Comma Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x02c6 ;Modifier Letter Circumflex
0x89 0x2030 ;Per Mille Sign
0x8a 0x0679 ;Arabic Letter Tteh
0x8b 0x2039 ;Left Pointing Single Guillemet
0x8c 0x0152 ;Latin Capital Letter O E
0x8d 0x0686 ;Arabic Haa with Middle Three Dots Downward
0x8e 0x0698 ;Arabic Ra with Three Dots Above
0x8f 0x0688 ;Arabic Letter Ddal
0x90 0x06af ;Arabic Gaf
0x91 0x2018 ;Single Turned Comma Quotation Mark
0x92 0x2019 ;Single Comma Quotation Mark
0x93 0x201c ;Double Turned Comma Quotation Mark
0x94 0x201d ;Double Comma Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x06a9 ;Arabic Letter Keheh
0x99 0x2122 ;Trademark
0x9a 0x0691 ;Arabic Letter Rreh
0x9b 0x203a ;Right Pointing Single Guillemet
0x9c 0x0153 ;Latin Small Letter O E
0x9d 0x200c ;Zero Width Non-Joiner
0x9e 0x200d ;Zero Width Joiner
0x9f 0x06ba ;Arabic Letter Noon Ghunna
0xa0 0x00a0 ;Non-Breaking Space
0xa1 0x060c ;Arabic Comma
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Vertical Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Spacing Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x06be ;Arabic Letter Heh Doachashmee
0xab 0x00ab ;Left Pointing Guillemet
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Trade Mark Sign
0xaf 0x00af ;Spacing Macron
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Or-Minus Sign
0xb2 0x00b2 ;Superscript Digit Two
0xb3 0x00b3 ;Superscript Digit Three
0xb4 0x00b4 ;Spacing Acute
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Paragraph Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Spacing Cedilla
0xb9 0x00b9 ;Superscript Digit One
0xba 0x061b ;Arabic Semicolon
0xbb 0x00bb ;Right Pointing Guillemet
0xbc 0x00bc ;Fraction One Quarter
0xbd 0x00bd ;Fraction One Half
0xbe 0x00be ;Fraction Three Quarters
0xbf 0x061f ;Arabic Question Mark
0xc0 0x06c1 ;Arabic Letter Heh Goal
0xc1 0x0621 ;Arabic Letter Hamzah
0xc2 0x0622 ;Arabic Letter Maddah On Alef
0xc3 0x0623 ;Arabic Letter Hamzah On Alef
0xc4 0x0624 ;Arabic Letter Hamzah On Waw
0xc5 0x0625 ;Arabic Letter Hamzah Under Alef
0xc6 0x0626 ;Arabic Letter Hamzah On Ya
0xc7 0x0627 ;Arabic Letter Alef
0xc8 0x0628 ;Arabic Letter Baa
0xc9 0x0629 ;Arabic Letter Taa Marbutah
0xca 0x062a ;Arabic Letter Taa
0xcb 0x062b ;Arabic Letter Thaa
0xcc 0x062c ;Arabic Letter Jeem
0xcd 0x062d ;Arabic Letter Haa
0xce 0x062e ;Arabic Letter Khaa
0xcf 0x062f ;Arabic Letter Dal
0xd0 0x0630 ;Arabic Letter Thal
0xd1 0x0631 ;Arabic Letter Ra
0xd2 0x0632 ;Arabic Letter Zain
0xd3 0x0633 ;Arabic Letter Seen
0xd4 0x0634 ;Arabic Letter Sheen
0xd5 0x0635 ;Arabic Letter Sad
0xd6 0x0636 ;Arabic Letter Dad
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x0637 ;Arabic Letter Tah
0xd9 0x0638 ;Arabic Letter Dhah
0xda 0x0639 ;Arabic Letter Ain
0xdb 0x063a ;Arabic Letter Ghain
0xdc 0x0640 ;Arabic Tatweel
0xdd 0x0641 ;Arabic Letter Fa
0xde 0x0642 ;Arabic Letter Qaf
0xdf 0x0643 ;Arabic Letter Caf
0xe0 0x00e0 ;Latin Small Letter A Grave
0xe1 0x0644 ;Arabic Letter Lam
0xe2 0x00e2 ;Latin Small Letter A Circumflex
0xe3 0x0645 ;Arabic Letter Meem
0xe4 0x0646 ;Arabic Letter Noon
0xe5 0x0647 ;Arabic Letter Ha
0xe6 0x0648 ;Arabic Letter Waw
0xe7 0x00e7 ;Latin Small Letter C Cedilla
0xe8 0x00e8 ;Latin Small Letter E Grave
0xe9 0x00e9 ;Latin Small Letter E Acute
0xea 0x00ea ;Latin Small Letter E Circumflex
0xeb 0x00eb ;Latin Small Letter E Diaeresis
0xec 0x0649 ;Arabic Letter Alef Maqsurah
0xed 0x064a ;Arabic Letter Ya
0xee 0x00ee ;Latin Small Letter I Circumflex
0xef 0x00ef ;Latin Small Letter I Diaeresis
0xf0 0x064b ;Arabic Fathatan
0xf1 0x064c ;Arabic Dammatan
0xf2 0x064d ;Arabic Kasratan
0xf3 0x064e ;Arabic Fathah
0xf4 0x00f4 ;Latin Small Letter O Circumflex
0xf5 0x064f ;Arabic Dammah
0xf6 0x0650 ;Arabic Kasrah
0xf7 0x00f7 ;Division Sign
0xf8 0x0651 ;Arabic Shaddah
0xf9 0x00f9 ;Latin Small Letter U Grave
0xfa 0x0652 ;Arabic Sukun
0xfb 0x00fb ;Latin Small Letter U Circumflex
0xfc 0x00fc ;Latin Small Letter U Diaeresis
0xfd 0x200e ;Left-To-Right Mark
0xfe 0x200f ;Right-To-Left Mark
0xff 0x06d2 ;Arabic Letter Yeh Barree
WCTABLE 544
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe-Quote
0x0028 0x28 ;Opening Parenthesis
0x0029 0x29 ;Closing Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Period
0x002f 0x2f ;Slash
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Opening Square Bracket
0x005c 0x5c ;Backslash
0x005d 0x5d ;Closing Square Bracket
0x005e 0x5e ;Spacing Circumflex
0x005f 0x5f ;Spacing Underscore
0x0060 0x60 ;Spacing Grave
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Opening Curly Bracket
0x007c 0x7c ;Vertical Bar
0x007d 0x7d ;Closing Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x00a0 0xa0 ;Non-Breaking Space
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Vertical Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Spacing Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00ab 0xab ;Left Pointing Guillemet
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Trade Mark Sign
0x00af 0xaf ;Spacing Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Or-Minus Sign
0x00b2 0xb2 ;Superscript Digit Two
0x00b3 0xb3 ;Superscript Digit Three
0x00b4 0xb4 ;Spacing Acute
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Paragraph Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Spacing Cedilla
0x00b9 0xb9 ;Superscript Digit One
0x00bb 0xbb ;Right Pointing Guillemet
0x00bc 0xbc ;Fraction One Quarter
0x00bd 0xbd ;Fraction One Half
0x00be 0xbe ;Fraction Three Quarters
0x00c0 0x41 ;A Grave -> A(#25995)
0x00c2 0x41 ;A Circumflex -> A(#25995)
0x00c7 0x43 ;C Cedilla -> C(#25995)
0x00c8 0x45 ;E Grave -> E(#25995)
0x00c9 0x45 ;E Acute -> E(#25995)
0x00ca 0x45 ;E Circumflex -> E(#25995)
0x00cb 0x45 ;E Diaeresis -> E(#25995)
0x00ce 0x49 ;I Circumflex -> I(#25995)
0x00cf 0x49 ;I Diaeresis -> I(#25995)
0x00d4 0x4f ;O Circumflex -> O(#25995)
0x00d7 0xd7 ;Multiplication Sign
0x00d9 0x55 ;U Grave -> U(#25995)
0x00db 0x55 ;U Circumflex -> U(#25995)
0x00dc 0x55 ;U Diaeresis -> U(#25995)
0x00e0 0xe0 ;Latin Small Letter A Grave
0x00e2 0xe2 ;Latin Small Letter A Circumflex
0x00e7 0xe7 ;Latin Small Letter C Cedilla
0x00e8 0xe8 ;Latin Small Letter E Grave
0x00e9 0xe9 ;Latin Small Letter E Acute
0x00ea 0xea ;Latin Small Letter E Circumflex
0x00eb 0xeb ;Latin Small Letter E Diaeresis
0x00ee 0xee ;Latin Small Letter I Circumflex
0x00ef 0xef ;Latin Small Letter I Diaeresis
0x00f4 0xf4 ;Latin Small Letter O Circumflex
0x00f7 0xf7 ;Division Sign
0x00f9 0xf9 ;Latin Small Letter U Grave
0x00fb 0xfb ;Latin Small Letter U Circumflex
0x00fc 0xfc ;Latin Small Letter U Diaeresis
0x0152 0x8c ;Latin Capital Letter O E
0x0153 0x9c ;Latin Small Letter O E
0x0191 0x46 ;F Hook -> F(#25995)
0x0192 0x83 ;Latin Small Letter Script F
0x02c6 0x88 ;Modifier Letter Circumflex
0x060c 0xa1 ;Arabic Comma
0x061b 0xba ;Arabic Semicolon
0x061f 0xbf ;Arabic Question Mark
0x0621 0xc1 ;Arabic Letter Hamzah
0x0622 0xc2 ;Arabic Letter Maddah On Alef
0x0623 0xc3 ;Arabic Letter Hamzah On Alef
0x0624 0xc4 ;Arabic Letter Hamzah On Waw
0x0625 0xc5 ;Arabic Letter Hamzah Under Alef
0x0626 0xc6 ;Arabic Letter Hamzah On Ya
0x0627 0xc7 ;Arabic Letter Alef
0x0628 0xc8 ;Arabic Letter Baa
0x0629 0xc9 ;Arabic Letter Taa Marbutah
0x062a 0xca ;Arabic Letter Taa
0x062b 0xcb ;Arabic Letter Thaa
0x062c 0xcc ;Arabic Letter Jeem
0x062d 0xcd ;Arabic Letter Haa
0x062e 0xce ;Arabic Letter Khaa
0x062f 0xcf ;Arabic Letter Dal
0x0630 0xd0 ;Arabic Letter Thal
0x0631 0xd1 ;Arabic Letter Ra
0x0632 0xd2 ;Arabic Letter Zain
0x0633 0xd3 ;Arabic Letter Seen
0x0634 0xd4 ;Arabic Letter Sheen
0x0635 0xd5 ;Arabic Letter Sad
0x0636 0xd6 ;Arabic Letter Dad
0x0637 0xd8 ;Arabic Letter Tah
0x0638 0xd9 ;Arabic Letter Dhah
0x0639 0xda ;Arabic Letter Ain
0x063a 0xdb ;Arabic Letter Ghain
0x0640 0xdc ;Arabic Tatweel
0x0641 0xdd ;Arabic Letter Fa
0x0642 0xde ;Arabic Letter Qaf
0x0643 0xdf ;Arabic Letter Caf
0x0644 0xe1 ;Arabic Letter Lam
0x0645 0xe3 ;Arabic Letter Meem
0x0646 0xe4 ;Arabic Letter Noon
0x0647 0xe5 ;Arabic Letter Ha
0x0648 0xe6 ;Arabic Letter Waw
0x0649 0xec ;Arabic Letter Alef Maqsurah
0x064a 0xed ;Arabic Letter Ya
0x064b 0xf0 ;Arabic Fathatan
0x064c 0xf1 ;Arabic Dammatan
0x064d 0xf2 ;Arabic Kasratan
0x064e 0xf3 ;Arabic Fathah
0x064f 0xf5 ;Arabic Dammah
0x0650 0xf6 ;Arabic Kasrah
0x0651 0xf8 ;Arabic Shaddah
0x0652 0xfa ;Arabic Sukun
0x0660 0x30 ;Best fit: Arabic-Indic Zero -> Digit Zero
0x0661 0x31 ;Best fit: Arabic-Indic One -> Digit One
0x0662 0x32 ;Best fit: Arabic-Indic Two -> Digit Two
0x0663 0x33 ;Best fit: Arabic-Indic Three -> Digit Three
0x0664 0x34 ;Best fit: Arabic-Indic Four -> Digit Four
0x0665 0x35 ;Best fit: Arabic-Indic Five -> Digit Five
0x0666 0x36 ;Best fit: Arabic-Indic Six -> Digit Six
0x0667 0x37 ;Best fit: Arabic-Indic Seven -> Digit Seven
0x0668 0x38 ;Best fit: Arabic-Indic Eight -> Digit Eight
0x0669 0x39 ;Best fit: Arabic-Indic Nine -> Digit Nine
0x0679 0x8a ;Arabic Letter Tteh
0x067e 0x81 ;Arabic Taa with Three Dots Below
0x0686 0x8d ;Arabic Haa with Middle Three Dots Downward
0x0688 0x8f ;Arabic Letter Ddal
0x0691 0x9a ;Arabic Letter Rreh
0x0698 0x8e ;Arabic Ra with Three Dots Above
0x06a9 0x98 ;Arabic Letter Keheh
0x06af 0x90 ;Arabic Gaf
0x06ba 0x9f ;Arabic Letter Noon Ghunna
0x06be 0xaa ;Arabic Letter Heh Doachashmee
0x06c1 0xc0 ;Arabic Letter Heh Goal
0x06cc 0xed ;Best-fit : Farsi Yeh -> Arabic Yeh (U+064a)
0x06d2 0xff ;Arabic Letter Yeh Barree
0x200c 0x9d ;Zero Width Non-Joiner
0x200d 0x9e ;Zero Width Joiner
0x200e 0xfd ;Left-To-Right Mark
0x200f 0xfe ;Right-To-Left Mark
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Single Turned Comma Quotation Mark
0x2019 0x92 ;Single Comma Quotation Mark
0x201a 0x82 ;Low Single Comma Quotation Mark
0x201c 0x93 ;Double Turned Comma Quotation Mark
0x201d 0x94 ;Double Comma Quotation Mark
0x201e 0x84 ;Low Double Comma Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Left Pointing Single Guillemet
0x203a 0x9b ;Right Pointing Single Guillemet
0x20ac 0x80 ;Euro Sign
0x2122 0x99 ;Trademark
0xfb56 0x81 ;Arabic Letter Peh Isolated
0xfb57 0x81 ;Arabic Letter Peh Final
0xfb58 0x81 ;Arabic Letter Peh Initial
0xfb59 0x81 ;Arabic Letter Peh Medial
0xfb66 0x8a ;Arabic Letter Tteh Isolated
0xfb67 0x8a ;Arabic Letter Tteh Final
0xfb68 0x8a ;Arabic Letter Tteh Initial
0xfb69 0x8a ;Arabic Letter Tteh Medial
0xfb7a 0x8d ;Arabic Letter Tcheh Isolated
0xfb7b 0x8d ;Arabic Letter Tcheh Final
0xfb7c 0x8d ;Arabic Letter Tcheh Initial
0xfb7d 0x8d ;Arabic Letter Tcheh Medial
0xfb88 0x8f ;Arabic Letter Ddal Isolated
0xfb89 0x8f ;Arabic Letter Ddal Final
0xfb8a 0x8e ;Arabic Letter Jeh Isolated
0xfb8b 0x8e ;Arabic Letter Jeh Final
0xfb8c 0x9a ;Arabic Letter Rreh Isolated
0xfb8d 0x9a ;Arabic Letter Rreh Final
0xfb8e 0x98 ;Arabic Letter Keheh Isolated
0xfb8f 0x98 ;Arabic Letter Keheh Final
0xfb90 0x98 ;Arabic Letter Keheh Initial
0xfb91 0x98 ;Arabic Letter Keheh Medial
0xfb92 0x90 ;Arabic Letter Gaf Isolated
0xfb93 0x90 ;Arabic Letter Gaf Final
0xfb94 0x90 ;Arabic Letter Gaf Initial
0xfb95 0x90 ;Arabic Letter Gaf Medial
0xfb9e 0x9f ;Arabic Letter Noon Ghunna Isolated
0xfb9f 0x9f ;Arabic Letter Noon Ghunna Final
0xfba6 0xc0 ;Arabic Letter Heh Goal Isolated
0xfba7 0xc0 ;Arabic Letter Heh Goal Final
0xfba8 0xc0 ;Arabic Letter Heh Goal Initial
0xfba9 0xc0 ;Arabic Letter Heh Goal Medial
0xfbaa 0xaa ;Arabic Letter Heh Doachashmee Isolated
0xfbab 0xaa ;Arabic Letter Heh Doachashmee Final
0xfbac 0xaa ;Arabic Letter Heh Doachashmee Initial
0xfbad 0xaa ;Arabic Letter Heh Doachashmee Medial
0xfbae 0xff ;Arabic Letter Yeh Barree Isolated
0xfbaf 0xff ;Arabic Letter Yeh Barree Final
0xfe70 0xf0 ;Arabic Spacing Fathatan
0xfe71 0xf0 ;Arabic Fathatan On Tatweel
0xfe72 0xf1 ;Arabic Spacing Dammatan
0xfe74 0xf2 ;Arabic Spacing Kasratan
0xfe76 0xf3 ;Arabic Spacing Fathah
0xfe77 0xf3 ;Arabic Fathah On Tatweel
0xfe78 0xf5 ;Arabic Spacing Dammah
0xfe79 0xf5 ;Arabic Dammah On Tatweel
0xfe7a 0xf6 ;Arabic Spacing Kasrah
0xfe7b 0xf6 ;Arabic Kasrah On Tatweel
0xfe7c 0xf8 ;Arabic Spacing Shaddah
0xfe7d 0xf8 ;Arabic Shaddah On Tatweel
0xfe7e 0xfa ;Arabic Spacing Sukun
0xfe7f 0xfa ;Arabic Sukun On Tatweel
0xfe80 0xc1 ;Glyph For Isolate Arabic Hamzah
0xfe81 0xc2 ;Glyph For Isolate Arabic Maddah On Alef
0xfe82 0xc2 ;Glyph For Final Arabic Maddah On Alef
0xfe83 0xc3 ;Glyph For Isolate Arabic Hamzah On Alef
0xfe84 0xc3 ;Glyph For Final Arabic Hamzah On Alef
0xfe85 0xc4 ;Glyph For Isolate Arabic Hamzah On Waw
0xfe86 0xc4 ;Glyph For Final Arabic Hamzah On Waw
0xfe87 0xc5 ;Glyph For Isolate Arabic Hamzah Under Alef
0xfe88 0xc5 ;Glyph For Final Arabic Hamzah Under Alef
0xfe89 0xc6 ;Glyph For Isolate Arabic Hamzah On Ya
0xfe8a 0xc6 ;Glyph For Final Arabic Hamzah On Ya
0xfe8b 0xc6 ;Glyph For Initial Arabic Hamzah On Ya
0xfe8c 0xc6 ;Glyph For Medial Arabic Hamzah On Ya
0xfe8d 0xc7 ;Glyph For Isolate Arabic Alef
0xfe8e 0xc7 ;Glyph For Final Arabic Alef
0xfe8f 0xc8 ;Glyph For Isolate Arabic Baa
0xfe90 0xc8 ;Glyph For Final Arabic Baa
0xfe91 0xc8 ;Glyph For Initial Arabic Baa
0xfe92 0xc8 ;Glyph For Medial Arabic Baa
0xfe93 0xc9 ;Glyph For Isolate Arabic Taa Marbutah
0xfe94 0xc9 ;Glyph For Final Arabic Taa Marbutah
0xfe95 0xca ;Glyph For Isolate Arabic Taa
0xfe96 0xca ;Glyph For Final Arabic Taa
0xfe97 0xca ;Glyph For Initial Arabic Taa
0xfe98 0xca ;Glyph For Medial Arabic Taa
0xfe99 0xcb ;Glyph For Isolate Arabic Thaa
0xfe9a 0xcb ;Glyph For Final Arabic Thaa
0xfe9b 0xcb ;Glyph For Initial Arabic Thaa
0xfe9c 0xcb ;Glyph For Medial Arabic Thaa
0xfe9d 0xcc ;Glyph For Isolate Arabic Jeem
0xfe9e 0xcc ;Glyph For Final Arabic Jeem
0xfe9f 0xcc ;Glyph For Initial Arabic Jeem
0xfea0 0xcc ;Glyph For Medial Arabic Jeem
0xfea1 0xcd ;Glyph For Isolate Arabic Haa
0xfea2 0xcd ;Glyph For Final Arabic Haa
0xfea3 0xcd ;Glyph For Initial Arabic Haa
0xfea4 0xcd ;Glyph For Medial Arabic Haa
0xfea5 0xce ;Glyph For Isolate Arabic Khaa
0xfea6 0xce ;Glyph For Final Arabic Khaa
0xfea7 0xce ;Glyph For Initial Arabic Khaa
0xfea8 0xce ;Glyph For Medial Arabic Khaa
0xfea9 0xcf ;Glyph For Isolate Arabic Dal
0xfeaa 0xcf ;Glyph For Final Arabic Dal
0xfeab 0xd0 ;Glyph For Isolate Arabic Thal
0xfeac 0xd0 ;Glyph For Final Arabic Thal
0xfead 0xd1 ;Glyph For Isolate Arabic Ra
0xfeae 0xd1 ;Glyph For Final Arabic Ra
0xfeaf 0xd2 ;Glyph For Isolate Arabic Zain
0xfeb0 0xd2 ;Glyph For Final Arabic Zain
0xfeb1 0xd3 ;Glyph For Isolate Arabic Seen
0xfeb2 0xd3 ;Glyph For Final Arabic Seen
0xfeb3 0xd3 ;Glyph For Initial Arabic Seen
0xfeb4 0xd3 ;Glyph For Medial Arabic Seen
0xfeb5 0xd4 ;Glyph For Isolate Arabic Sheen
0xfeb6 0xd4 ;Glyph For Final Arabic Sheen
0xfeb7 0xd4 ;Glyph For Initial Arabic Sheen
0xfeb8 0xd4 ;Glyph For Medial Arabic Sheen
0xfeb9 0xd5 ;Glyph For Isolate Arabic Sad
0xfeba 0xd5 ;Glyph For Final Arabic Sad
0xfebb 0xd5 ;Glyph For Initial Arabic Sad
0xfebc 0xd5 ;Glyph For Medial Arabic Sad
0xfebd 0xd6 ;Glyph For Isolate Arabic Dad
0xfebe 0xd6 ;Glyph For Final Arabic Dad
0xfebf 0xd6 ;Glyph For Initial Arabic Dad
0xfec0 0xd6 ;Glyph For Medial Arabic Dad
0xfec1 0xd8 ;Glyph For Isolate Arabic Tah
0xfec2 0xd8 ;Glyph For Final Arabic Tah
0xfec3 0xd8 ;Glyph For Initial Arabic Tah
0xfec4 0xd8 ;Glyph For Medial Arabic Tah
0xfec5 0xd9 ;Glyph For Isolate Arabic Dhah
0xfec6 0xd9 ;Glyph For Final Arabic Dhah
0xfec7 0xd9 ;Glyph For Initial Arabic Dhah
0xfec8 0xd9 ;Glyph For Medial Arabic Dhah
0xfec9 0xda ;Glyph For Isolate Arabic Ain
0xfeca 0xda ;Glyph For Final Arabic Ain
0xfecb 0xda ;Glyph For Initial Arabic Ain
0xfecc 0xda ;Glyph For Medial Arabic Ain
0xfecd 0xdb ;Glyph For Isolate Arabic Ghain
0xfece 0xdb ;Glyph For Final Arabic Ghain
0xfecf 0xdb ;Glyph For Initial Arabic Ghain
0xfed0 0xdb ;Glyph For Medial Arabic Ghain
0xfed1 0xdd ;Glyph For Isolate Arabic Fa
0xfed2 0xdd ;Glyph For Final Arabic Fa
0xfed3 0xdd ;Glyph For Initial Arabic Fa
0xfed4 0xdd ;Glyph For Medial Arabic Fa
0xfed5 0xde ;Glyph For Isolate Arabic Qaf
0xfed6 0xde ;Glyph For Final Arabic Qaf
0xfed7 0xde ;Glyph For Initial Arabic Qaf
0xfed8 0xde ;Glyph For Medial Arabic Qaf
0xfed9 0xdf ;Glyph For Isolate Arabic Caf
0xfeda 0xdf ;Glyph For Final Arabic Caf
0xfedb 0xdf ;Glyph For Initial Arabic Caf
0xfedc 0xdf ;Glyph For Medial Arabic Caf
0xfedd 0xe1 ;Glyph For Isolate Arabic Lam
0xfede 0xe1 ;Glyph For Final Arabic Lam
0xfedf 0xe1 ;Glyph For Initial Arabic Lam
0xfee0 0xe1 ;Glyph For Medial Arabic Lam
0xfee1 0xe3 ;Glyph For Isolate Arabic Meem
0xfee2 0xe3 ;Glyph For Final Arabic Meem
0xfee3 0xe3 ;Glyph For Initial Arabic Meem
0xfee4 0xe3 ;Glyph For Medial Arabic Meem
0xfee5 0xe4 ;Glyph For Isolate Arabic Noon
0xfee6 0xe4 ;Glyph For Final Arabic Noon
0xfee7 0xe4 ;Glyph For Initial Arabic Noon
0xfee8 0xe4 ;Glyph For Medial Arabic Noon
0xfee9 0xe5 ;Glyph For Isolate Arabic Ha
0xfeea 0xe5 ;Glyph For Final Arabic Ha
0xfeeb 0xe5 ;Glyph For Initial Arabic Ha
0xfeec 0xe5 ;Glyph For Medial Arabic Ha
0xfeed 0xe6 ;Glyph For Isolate Arabic Waw
0xfeee 0xe6 ;Glyph For Final Arabic Waw
0xfeef 0xec ;Glyph For Isolate Arabic Alef Maqsurah
0xfef0 0xec ;Glyph For Final Arabic Alef Maqsurah
0xfef1 0xed ;Glyph For Isolate Arabic Ya
0xfef2 0xed ;Glyph For Final Arabic Ya
0xfef3 0xed ;Glyph For Initial Arabic Ya
0xfef4 0xed ;Glyph For Medial Arabic Ya
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE

View File

@@ -0,0 +1,619 @@
CODEPAGE 1257 ;Baltic - ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit 0
0x31 0x0031 ;Digit 1
0x32 0x0032 ;Digit 2
0x33 0x0033 ;Digit 3
0x34 0x0034 ;Digit 4
0x35 0x0035 ;Digit 5
0x36 0x0036 ;Digit 6
0x37 0x0037 ;Digit 7
0x38 0x0038 ;Digit 8
0x39 0x0039 ;Digit 9
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;A
0x42 0x0042 ;B
0x43 0x0043 ;C
0x44 0x0044 ;D
0x45 0x0045 ;E
0x46 0x0046 ;F
0x47 0x0047 ;G
0x48 0x0048 ;H
0x49 0x0049 ;I
0x4a 0x004a ;J
0x4b 0x004b ;K
0x4c 0x004c ;L
0x4d 0x004d ;M
0x4e 0x004e ;N
0x4f 0x004f ;O
0x50 0x0050 ;P
0x51 0x0051 ;Q
0x52 0x0052 ;R
0x53 0x0053 ;S
0x54 0x0054 ;T
0x55 0x0055 ;U
0x56 0x0056 ;V
0x57 0x0057 ;W
0x58 0x0058 ;X
0x59 0x0059 ;Y
0x5a 0x005a ;Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Backslash
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave
0x61 0x0061 ;a
0x62 0x0062 ;b
0x63 0x0063 ;c
0x64 0x0064 ;d
0x65 0x0065 ;e
0x66 0x0066 ;f
0x67 0x0067 ;g
0x68 0x0068 ;h
0x69 0x0069 ;i
0x6a 0x006a ;j
0x6b 0x006b ;k
0x6c 0x006c ;l
0x6d 0x006d ;m
0x6e 0x006e ;n
0x6f 0x006f ;o
0x70 0x0070 ;p
0x71 0x0071 ;q
0x72 0x0072 ;r
0x73 0x0073 ;s
0x74 0x0074 ;t
0x75 0x0075 ;u
0x76 0x0076 ;v
0x77 0x0077 ;w
0x78 0x0078 ;x
0x79 0x0079 ;y
0x7a 0x007a ;z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x201a ;Low Single Comma Quotation Mark
0x83 0x0083 ;Not Used
0x84 0x201e ;Low Double Comma Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x0088
0x89 0x2030 ;Per Mille Sign
0x8a 0x008a
0x8b 0x2039 ;Left Pointing Single Guillement
0x8c 0x008c
0x8d 0x00a8 ;Diaeresis
0x8e 0x02c7 ;Hacek
0x8f 0x00b8 ;Cedilla
0x90 0x0090
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x0098 ;Not Used
0x99 0x2122 ;Trade Mark Sign
0x9a 0x009a
0x9b 0x203a ;Right Pointing Single Guillement
0x9c 0x009c
0x9d 0x00af ;Macron
0x9e 0x02db ;Ogonek
0x9f 0x009f
0xa0 0x00a0 ;No-Break Space
0xa1 0xf8fc ;Undefined -> EUDC
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0xf8fd ;Undefined -> EUDC
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00d8 ;O Stroke
0xa9 0x00a9 ;Copyright Sign
0xaa 0x0156 ;R Cedilla
0xab 0x00ab ;Left Pointing Guillement
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x00c6 ;AE
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x00b2 ;Superscript 2
0xb3 0x00b3 ;Superscript 3
0xb4 0x00b4 ;Acute
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00f8 ;o Stroke
0xb9 0x00b9 ;Superscript 1
0xba 0x0157 ;r Cedilla
0xbb 0x00bb ;Right Pointing Guillement
0xbc 0x00bc ;Fraction 1/4
0xbd 0x00bd ;Fraction 1/2
0xbe 0x00be ;Fraction 3/4
0xbf 0x00e6 ;ae
0xc0 0x0104 ;A Ogonek
0xc1 0x012e ;I Ogonek
0xc2 0x0100 ;A Macron
0xc3 0x0106 ;C Acute
0xc4 0x00c4 ;A Diaeresis
0xc5 0x00c5 ;A Ring Above
0xc6 0x0118 ;E Ogonek
0xc7 0x0112 ;E Macron
0xc8 0x010c ;C Hacek
0xc9 0x00c9 ;E Acute
0xca 0x0179 ;Z Acute
0xcb 0x0116 ;E Dot Above
0xcc 0x0122 ;G Cedilla
0xcd 0x0136 ;K Cedilla
0xce 0x012a ;I Macron
0xcf 0x013b ;L Cedilla
0xd0 0x0160 ;S Hacek
0xd1 0x0143 ;N Acute
0xd2 0x0145 ;N Cedilla
0xd3 0x00d3 ;O Acute
0xd4 0x014c ;O Macron
0xd5 0x00d5 ;O Tilde
0xd6 0x00d6 ;O Diaeresis
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x0172 ;U Ogonek
0xd9 0x0141 ;L Stroke
0xda 0x015a ;S Acute
0xdb 0x016a ;U Macron
0xdc 0x00dc ;U Diaeresis
0xdd 0x017b ;Z Dot Above
0xde 0x017d ;Z Hacek
0xdf 0x00df ;Sharp ss
0xe0 0x0105 ;a Ogonek
0xe1 0x012f ;i Ogonek
0xe2 0x0101 ;a Macron
0xe3 0x0107 ;c Acute
0xe4 0x00e4 ;a Diaeresis
0xe5 0x00e5 ;a Ring Above
0xe6 0x0119 ;e Ogonek
0xe7 0x0113 ;e Macron
0xe8 0x010d ;c Hacek
0xe9 0x00e9 ;e Acute
0xea 0x017a ;z Acute
0xeb 0x0117 ;e Dot Above
0xec 0x0123 ;g Cedilla
0xed 0x0137 ;k Cedilla
0xee 0x012b ;i Macron
0xef 0x013c ;l Cedilla
0xf0 0x0161 ;s Hacek
0xf1 0x0144 ;n Acute
0xf2 0x0146 ;n Cedilla
0xf3 0x00f3 ;o Acute
0xf4 0x014d ;o Macron
0xf5 0x00f5 ;o Tilde
0xf6 0x00f6 ;o Diaeresis
0xf7 0x00f7 ;Division Sign
0xf8 0x0173 ;u Ogonek
0xf9 0x0142 ;l Stroke
0xfa 0x015b ;s Acute
0xfb 0x016b ;u Macron
0xfc 0x00fc ;u Diaeresis
0xfd 0x017c ;z Dot Above
0xfe 0x017e ;z Hacek
0xff 0x02d9 ;Dot Above
WCTABLE 350
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit 0
0x0031 0x31 ;Digit 1
0x0032 0x32 ;Digit 2
0x0033 0x33 ;Digit 3
0x0034 0x34 ;Digit 4
0x0035 0x35 ;Digit 5
0x0036 0x36 ;Digit 6
0x0037 0x37 ;Digit 7
0x0038 0x38 ;Digit 8
0x0039 0x39 ;Digit 9
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;A
0x0042 0x42 ;B
0x0043 0x43 ;C
0x0044 0x44 ;D
0x0045 0x45 ;E
0x0046 0x46 ;F
0x0047 0x47 ;G
0x0048 0x48 ;H
0x0049 0x49 ;I
0x004a 0x4a ;J
0x004b 0x4b ;K
0x004c 0x4c ;L
0x004d 0x4d ;M
0x004e 0x4e ;N
0x004f 0x4f ;O
0x0050 0x50 ;P
0x0051 0x51 ;Q
0x0052 0x52 ;R
0x0053 0x53 ;S
0x0054 0x54 ;T
0x0055 0x55 ;U
0x0056 0x56 ;V
0x0057 0x57 ;W
0x0058 0x58 ;X
0x0059 0x59 ;Y
0x005a 0x5a ;Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Backslash
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave
0x0061 0x61 ;a
0x0062 0x62 ;b
0x0063 0x63 ;c
0x0064 0x64 ;d
0x0065 0x65 ;e
0x0066 0x66 ;f
0x0067 0x67 ;g
0x0068 0x68 ;h
0x0069 0x69 ;i
0x006a 0x6a ;j
0x006b 0x6b ;k
0x006c 0x6c ;l
0x006d 0x6d ;m
0x006e 0x6e ;n
0x006f 0x6f ;o
0x0070 0x70 ;p
0x0071 0x71 ;q
0x0072 0x72 ;r
0x0073 0x73 ;s
0x0074 0x74 ;t
0x0075 0x75 ;u
0x0076 0x76 ;v
0x0077 0x77 ;w
0x0078 0x78 ;x
0x0079 0x79 ;y
0x007a 0x7a ;z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x0083 0x83 ;Not Used
0x0088 0x88
0x008a 0x8a
0x008c 0x8c
0x0090 0x90
0x0098 0x98 ;Not Used
0x009a 0x9a
0x009c 0x9c
0x009f 0x9f
0x00a0 0xa0 ;No-Break Space
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0x8d ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00ab 0xab ;Left Pointing Guillement
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00af 0x9d ;Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0xb2 ;Superscript 2
0x00b3 0xb3 ;Superscript 3
0x00b4 0xb4 ;Acute
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0x8f ;Cedilla
0x00b9 0xb9 ;Superscript 1
0x00bb 0xbb ;Right Pointing Guillement
0x00bc 0xbc ;Fraction 1/4
0x00bd 0xbd ;Fraction 1/2
0x00be 0xbe ;Fraction 3/4
0x00c4 0xc4 ;A Diaeresis
0x00c5 0xc5 ;A Ring Above
0x00c6 0xaf ;AE
0x00c9 0xc9 ;E Acute
0x00d3 0xd3 ;O Acute
0x00d5 0xd5 ;O Tilde
0x00d6 0xd6 ;O Diaeresis
0x00d7 0xd7 ;Multiplication Sign
0x00d8 0xa8 ;O Stroke
0x00dc 0xdc ;U Diaeresis
0x00df 0xdf ;Sharp ss
0x00e4 0xe4 ;a Diaeresis
0x00e5 0xe5 ;a Ring Above
0x00e6 0xbf ;ae
0x00e9 0xe9 ;e Acute
0x00f3 0xf3 ;o Acute
0x00f5 0xf5 ;o Tilde
0x00f6 0xf6 ;o Diaeresis
0x00f7 0xf7 ;Division Sign
0x00f8 0xb8 ;o Stroke
0x00fc 0xfc ;u Diaeresis
0x0100 0xc2 ;A Macron
0x0101 0xe2 ;a Macron
0x0104 0xc0 ;A Ogonek
0x0105 0xe0 ;a Ogonek
0x0106 0xc3 ;C Acute
0x0107 0xe3 ;c Acute
0x010c 0xc8 ;C Hacek
0x010d 0xe8 ;c Hacek
0x0112 0xc7 ;E Macron
0x0113 0xe7 ;e Macron
0x0116 0xcb ;E Dot Above
0x0117 0xeb ;e Dot Above
0x0118 0xc6 ;E Ogonek
0x0119 0xe6 ;e Ogonek
0x0122 0xcc ;G Cedilla
0x0123 0xec ;g Cedilla
0x012a 0xce ;I Macron
0x012b 0xee ;i Macron
0x012e 0xc1 ;I Ogonek
0x012f 0xe1 ;i Ogonek
0x0136 0xcd ;K Cedilla
0x0137 0xed ;k Cedilla
0x013b 0xcf ;L Cedilla
0x013c 0xef ;l Cedilla
0x0141 0xd9 ;L Stroke
0x0142 0xf9 ;l Stroke
0x0143 0xd1 ;N Acute
0x0144 0xf1 ;n Acute
0x0145 0xd2 ;N Cedilla
0x0146 0xf2 ;n Cedilla
0x014c 0xd4 ;O Macron
0x014d 0xf4 ;o Macron
0x0156 0xaa ;R Cedilla
0x0157 0xba ;r Cedilla
0x015a 0xda ;S Acute
0x015b 0xfa ;s Acute
0x0160 0xd0 ;S Hacek
0x0161 0xf0 ;s Hacek
0x016a 0xdb ;U Macron
0x016b 0xfb ;u Macron
0x0172 0xd8 ;U Ogonek
0x0173 0xf8 ;u Ogonek
0x0179 0xca ;Z Acute
0x017a 0xea ;z Acute
0x017b 0xdd ;Z Dot Above
0x017c 0xfd ;z Dot Above
0x017d 0xde ;Z Hacek
0x017e 0xfe ;z Hacek
0x02c7 0x8e ;Hacek
0x02d9 0xff ;Dot Above
0x02db 0x9e ;Ogonek
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Low Single Comma Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Low Double Comma Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Left Pointing Single Guillement
0x203a 0x9b ;Right Pointing Single Guillement
0x20ac 0x80 ;Euro Sign
0x2122 0x99 ;Trade Mark Sign
0xf8fc 0xa1 ;EUDC -> Undefined
0xf8fd 0xa5 ;EUDC -> Undefined
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


View File

@@ -0,0 +1,619 @@
CODEPAGE 1258 ;Viet Nam - ANSI, OEM
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control One
0x12 0x0012 ;Device Control Two
0x13 0x0013 ;Device Control Three
0x14 0x0014 ;Device Control Four
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe
0x28 0x0028 ;Left Parenthesis
0x29 0x0029 ;Right Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Full Stop
0x2f 0x002f ;Solidus
0x30 0x0030 ;Digit Zero
0x31 0x0031 ;Digit One
0x32 0x0032 ;Digit Two
0x33 0x0033 ;Digit Three
0x34 0x0034 ;Digit Four
0x35 0x0035 ;Digit Five
0x36 0x0036 ;Digit Six
0x37 0x0037 ;Digit Seven
0x38 0x0038 ;Digit Eight
0x39 0x0039 ;Digit Nine
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;Latin Capital Letter A
0x42 0x0042 ;Latin Capital Letter B
0x43 0x0043 ;Latin Capital Letter C
0x44 0x0044 ;Latin Capital Letter D
0x45 0x0045 ;Latin Capital Letter E
0x46 0x0046 ;Latin Capital Letter F
0x47 0x0047 ;Latin Capital Letter G
0x48 0x0048 ;Latin Capital Letter H
0x49 0x0049 ;Latin Capital Letter I
0x4a 0x004a ;Latin Capital Letter J
0x4b 0x004b ;Latin Capital Letter K
0x4c 0x004c ;Latin Capital Letter L
0x4d 0x004d ;Latin Capital Letter M
0x4e 0x004e ;Latin Capital Letter N
0x4f 0x004f ;Latin Capital Letter O
0x50 0x0050 ;Latin Capital Letter P
0x51 0x0051 ;Latin Capital Letter Q
0x52 0x0052 ;Latin Capital Letter R
0x53 0x0053 ;Latin Capital Letter S
0x54 0x0054 ;Latin Capital Letter T
0x55 0x0055 ;Latin Capital Letter U
0x56 0x0056 ;Latin Capital Letter V
0x57 0x0057 ;Latin Capital Letter W
0x58 0x0058 ;Latin Capital Letter X
0x59 0x0059 ;Latin Capital Letter Y
0x5a 0x005a ;Latin Capital Letter Z
0x5b 0x005b ;Left Square Bracket
0x5c 0x005c ;Reverse Solidus
0x5d 0x005d ;Right Square Bracket
0x5e 0x005e ;Circumflex Accent
0x5f 0x005f ;Low Line
0x60 0x0060 ;Grave Accent
0x61 0x0061 ;Latin Small Letter A
0x62 0x0062 ;Latin Small Letter B
0x63 0x0063 ;Latin Small Letter C
0x64 0x0064 ;Latin Small Letter D
0x65 0x0065 ;Latin Small Letter E
0x66 0x0066 ;Latin Small Letter F
0x67 0x0067 ;Latin Small Letter G
0x68 0x0068 ;Latin Small Letter H
0x69 0x0069 ;Latin Small Letter I
0x6a 0x006a ;Latin Small Letter J
0x6b 0x006b ;Latin Small Letter K
0x6c 0x006c ;Latin Small Letter L
0x6d 0x006d ;Latin Small Letter M
0x6e 0x006e ;Latin Small Letter N
0x6f 0x006f ;Latin Small Letter O
0x70 0x0070 ;Latin Small Letter P
0x71 0x0071 ;Latin Small Letter Q
0x72 0x0072 ;Latin Small Letter R
0x73 0x0073 ;Latin Small Letter S
0x74 0x0074 ;Latin Small Letter T
0x75 0x0075 ;Latin Small Letter U
0x76 0x0076 ;Latin Small Letter V
0x77 0x0077 ;Latin Small Letter W
0x78 0x0078 ;Latin Small Letter X
0x79 0x0079 ;Latin Small Letter Y
0x7a 0x007a ;Latin Small Letter Z
0x7b 0x007b ;Left Curly Bracket
0x7c 0x007c ;Vertical Line
0x7d 0x007d ;Right Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081 ;Undefined -> Control
0x82 0x201a ;Single Low-9 Quotation Mark
0x83 0x0192 ;Latin Small Letter F With Hook
0x84 0x201e ;Double Low-9 Quotation Mark
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x2020 ;Dagger
0x87 0x2021 ;Double Dagger
0x88 0x02c6 ;Modifier Letter Circumflex Accent
0x89 0x2030 ;Per Mille Sign
0x8a 0x008a ;Undefined -> Control
0x8b 0x2039 ;Single Left-Pointing Angle Quotation Mark
0x8c 0x0152 ;Latin Capital Ligature Oe
0x8d 0x008d ;Undefined -> Control
0x8e 0x008e ;Undefined -> Control
0x8f 0x008f ;Undefined -> Control
0x90 0x0090 ;Undefined -> Control
0x91 0x2018 ;Left Single Quotation Mark
0x92 0x2019 ;Right Single Quotation Mark
0x93 0x201c ;Left Double Quotation Mark
0x94 0x201d ;Right Double Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x02dc ;Small Tilde
0x99 0x2122 ;Trade Mark Sign
0x9a 0x009a ;Undefined -> Control
0x9b 0x203a ;Single Right-Pointing Angle Quotation Mark
0x9c 0x0153 ;Latin Small Ligature Oe
0x9d 0x009d ;Undefined -> Control
0x9e 0x009e ;Undefined -> Control
0x9f 0x0178 ;Latin Capital Letter Y With Diaeresis
0xa0 0x00a0 ;No-Break Space
0xa1 0x00a1 ;Inverted Exclamation Mark
0xa2 0x00a2 ;Cent Sign
0xa3 0x00a3 ;Pound Sign
0xa4 0x00a4 ;Currency Sign
0xa5 0x00a5 ;Yen Sign
0xa6 0x00a6 ;Broken Bar
0xa7 0x00a7 ;Section Sign
0xa8 0x00a8 ;Diaeresis
0xa9 0x00a9 ;Copyright Sign
0xaa 0x00aa ;Feminine Ordinal Indicator
0xab 0x00ab ;Left-Pointing Double Angle Quotation Mark
0xac 0x00ac ;Not Sign
0xad 0x00ad ;Soft Hyphen
0xae 0x00ae ;Registered Sign
0xaf 0x00af ;Macron
0xb0 0x00b0 ;Degree Sign
0xb1 0x00b1 ;Plus-Minus Sign
0xb2 0x00b2 ;Superscript Two
0xb3 0x00b3 ;Superscript Three
0xb4 0x00b4 ;Acute Accent
0xb5 0x00b5 ;Micro Sign
0xb6 0x00b6 ;Pilcrow Sign
0xb7 0x00b7 ;Middle Dot
0xb8 0x00b8 ;Cedilla
0xb9 0x00b9 ;Superscript One
0xba 0x00ba ;Masculine Ordinal Indicator
0xbb 0x00bb ;Right-Pointing Double Angle Quotation Mark
0xbc 0x00bc ;Vulgar Fraction One Quarter
0xbd 0x00bd ;Vulgar Fraction One Half
0xbe 0x00be ;Vulgar Fraction Three Quarters
0xbf 0x00bf ;Inverted Question Mark
0xc0 0x00c0 ;Latin Capital Letter A With Grave
0xc1 0x00c1 ;Latin Capital Letter A With Acute
0xc2 0x00c2 ;Latin Capital Letter A With Circumflex
0xc3 0x0102 ;Latin Capital Letter A With Breve
0xc4 0x00c4 ;Latin Capital Letter A With Diaeresis
0xc5 0x00c5 ;Latin Capital Letter A With Ring Above
0xc6 0x00c6 ;Latin Capital Ligature Ae
0xc7 0x00c7 ;Latin Capital Letter C With Cedilla
0xc8 0x00c8 ;Latin Capital Letter E With Grave
0xc9 0x00c9 ;Latin Capital Letter E With Acute
0xca 0x00ca ;Latin Capital Letter E With Circumflex
0xcb 0x00cb ;Latin Capital Letter E With Diaeresis
0xcc 0x0300 ;Combining Grave Accent
0xcd 0x00cd ;Latin Capital Letter I With Acute
0xce 0x00ce ;Latin Capital Letter I With Circumflex
0xcf 0x00cf ;Latin Capital Letter I With Diaeresis
0xd0 0x0110 ;Latin Capital Letter D Bar
0xd1 0x00d1 ;Latin Capital Letter N With Tilde
0xd2 0x0309 ;Combining Hook Above
0xd3 0x00d3 ;Latin Capital Letter O With Acute
0xd4 0x00d4 ;Latin Capital Letter O With Circumflex
0xd5 0x01a0 ;Latin Capital Letter O With Horn
0xd6 0x00d6 ;Latin Capital Letter O With Diaeresis
0xd7 0x00d7 ;Multiplication Sign
0xd8 0x00d8 ;Latin Capital Letter O With Stroke
0xd9 0x00d9 ;Latin Capital Letter U With Grave
0xda 0x00da ;Latin Capital Letter U With Acute
0xdb 0x00db ;Latin Capital Letter U With Circumflex
0xdc 0x00dc ;Latin Capital Letter U With Diaeresis
0xdd 0x01af ;Latin Capital Letter U With Horn
0xde 0x0303 ;Combining Tilde
0xdf 0x00df ;Latin Small Letter Sharp S
0xe0 0x00e0 ;Latin Small Letter A With Grave
0xe1 0x00e1 ;Latin Small Letter A With Acute
0xe2 0x00e2 ;Latin Small Letter A With Circumflex
0xe3 0x0103 ;Latin Small Letter A With Breve
0xe4 0x00e4 ;Latin Small Letter A With Diaeresis
0xe5 0x00e5 ;Latin Small Letter A With Ring Above
0xe6 0x00e6 ;Latin Small Ligature Ae
0xe7 0x00e7 ;Latin Small Letter C With Cedilla
0xe8 0x00e8 ;Latin Small Letter E With Grave
0xe9 0x00e9 ;Latin Small Letter E With Acute
0xea 0x00ea ;Latin Small Letter E With Circumflex
0xeb 0x00eb ;Latin Small Letter E With Diaeresis
0xec 0x0301 ;Combining Acute Accent
0xed 0x00ed ;Latin Small Letter I With Acute
0xee 0x00ee ;Latin Small Letter I With Circumflex
0xef 0x00ef ;Latin Small Letter I With Diaeresis
0xf0 0x0111 ;Latin Small Letter D Bar
0xf1 0x00f1 ;Latin Small Letter N With Tilde
0xf2 0x0323 ;Combining Dot Below
0xf3 0x00f3 ;Latin Small Letter O With Acute
0xf4 0x00f4 ;Latin Small Letter O With Circumflex
0xf5 0x01a1 ;Latin Small Letter O With Horn
0xf6 0x00f6 ;Latin Small Letter O With Diaeresis
0xf7 0x00f7 ;Division Sign
0xf8 0x00f8 ;Latin Small Letter O With Stroke
0xf9 0x00f9 ;Latin Small Letter U With Grave
0xfa 0x00fa ;Latin Small Letter U With Acute
0xfb 0x00fb ;Latin Small Letter U With Circumflex
0xfc 0x00fc ;Latin Small Letter U With Diaeresis
0xfd 0x01b0 ;Latin Small Letter U With Horn
0xfe 0x20ab ;Dong Sign
0xff 0x00ff ;Latin Small Letter Y With Diaeresis
WCTABLE 350
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control One
0x0012 0x12 ;Device Control Two
0x0013 0x13 ;Device Control Three
0x0014 0x14 ;Device Control Four
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe
0x0028 0x28 ;Left Parenthesis
0x0029 0x29 ;Right Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Full Stop
0x002f 0x2f ;Solidus
0x0030 0x30 ;Digit Zero
0x0031 0x31 ;Digit One
0x0032 0x32 ;Digit Two
0x0033 0x33 ;Digit Three
0x0034 0x34 ;Digit Four
0x0035 0x35 ;Digit Five
0x0036 0x36 ;Digit Six
0x0037 0x37 ;Digit Seven
0x0038 0x38 ;Digit Eight
0x0039 0x39 ;Digit Nine
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;Latin Capital Letter A
0x0042 0x42 ;Latin Capital Letter B
0x0043 0x43 ;Latin Capital Letter C
0x0044 0x44 ;Latin Capital Letter D
0x0045 0x45 ;Latin Capital Letter E
0x0046 0x46 ;Latin Capital Letter F
0x0047 0x47 ;Latin Capital Letter G
0x0048 0x48 ;Latin Capital Letter H
0x0049 0x49 ;Latin Capital Letter I
0x004a 0x4a ;Latin Capital Letter J
0x004b 0x4b ;Latin Capital Letter K
0x004c 0x4c ;Latin Capital Letter L
0x004d 0x4d ;Latin Capital Letter M
0x004e 0x4e ;Latin Capital Letter N
0x004f 0x4f ;Latin Capital Letter O
0x0050 0x50 ;Latin Capital Letter P
0x0051 0x51 ;Latin Capital Letter Q
0x0052 0x52 ;Latin Capital Letter R
0x0053 0x53 ;Latin Capital Letter S
0x0054 0x54 ;Latin Capital Letter T
0x0055 0x55 ;Latin Capital Letter U
0x0056 0x56 ;Latin Capital Letter V
0x0057 0x57 ;Latin Capital Letter W
0x0058 0x58 ;Latin Capital Letter X
0x0059 0x59 ;Latin Capital Letter Y
0x005a 0x5a ;Latin Capital Letter Z
0x005b 0x5b ;Left Square Bracket
0x005c 0x5c ;Reverse Solidus
0x005d 0x5d ;Right Square Bracket
0x005e 0x5e ;Circumflex Accent
0x005f 0x5f ;Low Line
0x0060 0x60 ;Grave Accent
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
0x0064 0x64 ;Latin Small Letter D
0x0065 0x65 ;Latin Small Letter E
0x0066 0x66 ;Latin Small Letter F
0x0067 0x67 ;Latin Small Letter G
0x0068 0x68 ;Latin Small Letter H
0x0069 0x69 ;Latin Small Letter I
0x006a 0x6a ;Latin Small Letter J
0x006b 0x6b ;Latin Small Letter K
0x006c 0x6c ;Latin Small Letter L
0x006d 0x6d ;Latin Small Letter M
0x006e 0x6e ;Latin Small Letter N
0x006f 0x6f ;Latin Small Letter O
0x0070 0x70 ;Latin Small Letter P
0x0071 0x71 ;Latin Small Letter Q
0x0072 0x72 ;Latin Small Letter R
0x0073 0x73 ;Latin Small Letter S
0x0074 0x74 ;Latin Small Letter T
0x0075 0x75 ;Latin Small Letter U
0x0076 0x76 ;Latin Small Letter V
0x0077 0x77 ;Latin Small Letter W
0x0078 0x78 ;Latin Small Letter X
0x0079 0x79 ;Latin Small Letter Y
0x007a 0x7a ;Latin Small Letter Z
0x007b 0x7b ;Left Curly Bracket
0x007c 0x7c ;Vertical Line
0x007d 0x7d ;Right Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81 ;Undefined -> Control
0x008a 0x8a ;Undefined -> Control
0x008d 0x8d ;Undefined -> Control
0x008e 0x8e ;Undefined -> Control
0x008f 0x8f ;Undefined -> Control
0x0090 0x90 ;Undefined -> Control
0x009a 0x9a ;Undefined -> Control
0x009d 0x9d ;Undefined -> Control
0x009e 0x9e ;Undefined -> Control
0x00a0 0xa0 ;No-Break Space
0x00a1 0xa1 ;Inverted Exclamation Mark
0x00a2 0xa2 ;Cent Sign
0x00a3 0xa3 ;Pound Sign
0x00a4 0xa4 ;Currency Sign
0x00a5 0xa5 ;Yen Sign
0x00a6 0xa6 ;Broken Bar
0x00a7 0xa7 ;Section Sign
0x00a8 0xa8 ;Diaeresis
0x00a9 0xa9 ;Copyright Sign
0x00aa 0xaa ;Feminine Ordinal Indicator
0x00ab 0xab ;Left-Pointing Double Angle Quotation Mark
0x00ac 0xac ;Not Sign
0x00ad 0xad ;Soft Hyphen
0x00ae 0xae ;Registered Sign
0x00af 0xaf ;Macron
0x00b0 0xb0 ;Degree Sign
0x00b1 0xb1 ;Plus-Minus Sign
0x00b2 0xb2 ;Superscript Two
0x00b3 0xb3 ;Superscript Three
0x00b4 0xb4 ;Acute Accent
0x00b5 0xb5 ;Micro Sign
0x00b6 0xb6 ;Pilcrow Sign
0x00b7 0xb7 ;Middle Dot
0x00b8 0xb8 ;Cedilla
0x00b9 0xb9 ;Superscript One
0x00ba 0xba ;Masculine Ordinal Indicator
0x00bb 0xbb ;Right-Pointing Double Angle Quotation Mark
0x00bc 0xbc ;Vulgar Fraction One Quarter
0x00bd 0xbd ;Vulgar Fraction One Half
0x00be 0xbe ;Vulgar Fraction Three Quarters
0x00bf 0xbf ;Inverted Question Mark
0x00c0 0xc0 ;Latin Capital Letter A With Grave
0x00c1 0xc1 ;Latin Capital Letter A With Acute
0x00c2 0xc2 ;Latin Capital Letter A With Circumflex
0x00c4 0xc4 ;Latin Capital Letter A With Diaeresis
0x00c5 0xc5 ;Latin Capital Letter A With Ring Above
0x00c6 0xc6 ;Latin Capital Ligature Ae
0x00c7 0xc7 ;Latin Capital Letter C With Cedilla
0x00c8 0xc8 ;Latin Capital Letter E With Grave
0x00c9 0xc9 ;Latin Capital Letter E With Acute
0x00ca 0xca ;Latin Capital Letter E With Circumflex
0x00cb 0xcb ;Latin Capital Letter E With Diaeresis
0x00cd 0xcd ;Latin Capital Letter I With Acute
0x00ce 0xce ;Latin Capital Letter I With Circumflex
0x00cf 0xcf ;Latin Capital Letter I With Diaeresis
0x00d1 0xd1 ;Latin Capital Letter N With Tilde
0x00d3 0xd3 ;Latin Capital Letter O With Acute
0x00d4 0xd4 ;Latin Capital Letter O With Circumflex
0x00d6 0xd6 ;Latin Capital Letter O With Diaeresis
0x00d7 0xd7 ;Multiplication Sign
0x00d8 0xd8 ;Latin Capital Letter O With Stroke
0x00d9 0xd9 ;Latin Capital Letter U With Grave
0x00da 0xda ;Latin Capital Letter U With Acute
0x00db 0xdb ;Latin Capital Letter U With Circumflex
0x00dc 0xdc ;Latin Capital Letter U With Diaeresis
0x00df 0xdf ;Latin Small Letter Sharp S
0x00e0 0xe0 ;Latin Small Letter A With Grave
0x00e1 0xe1 ;Latin Small Letter A With Acute
0x00e2 0xe2 ;Latin Small Letter A With Circumflex
0x00e4 0xe4 ;Latin Small Letter A With Diaeresis
0x00e5 0xe5 ;Latin Small Letter A With Ring Above
0x00e6 0xe6 ;Latin Small Ligature Ae
0x00e7 0xe7 ;Latin Small Letter C With Cedilla
0x00e8 0xe8 ;Latin Small Letter E With Grave
0x00e9 0xe9 ;Latin Small Letter E With Acute
0x00ea 0xea ;Latin Small Letter E With Circumflex
0x00eb 0xeb ;Latin Small Letter E With Diaeresis
0x00ed 0xed ;Latin Small Letter I With Acute
0x00ee 0xee ;Latin Small Letter I With Circumflex
0x00ef 0xef ;Latin Small Letter I With Diaeresis
0x00f1 0xf1 ;Latin Small Letter N With Tilde
0x00f3 0xf3 ;Latin Small Letter O With Acute
0x00f4 0xf4 ;Latin Small Letter O With Circumflex
0x00f6 0xf6 ;Latin Small Letter O With Diaeresis
0x00f7 0xf7 ;Division Sign
0x00f8 0xf8 ;Latin Small Letter O With Stroke
0x00f9 0xf9 ;Latin Small Letter U With Grave
0x00fa 0xfa ;Latin Small Letter U With Acute
0x00fb 0xfb ;Latin Small Letter U With Circumflex
0x00fc 0xfc ;Latin Small Letter U With Diaeresis
0x00ff 0xff ;Latin Small Letter Y With Diaeresis
0x0102 0xc3 ;Latin Capital Letter A With Breve
0x0103 0xe3 ;Latin Small Letter A With Breve
0x0110 0xd0 ;Latin Capital Letter D Bar
0x0111 0xf0 ;Latin Small Letter D Bar
0x0152 0x8c ;Latin Capital Ligature Oe
0x0153 0x9c ;Latin Small Ligature Oe
0x0178 0x9f ;Latin Capital Letter Y With Diaeresis
0x0192 0x83 ;Latin Small Letter F With Hook
0x01a0 0xd5 ;Latin Capital Letter O With Horn
0x01a1 0xf5 ;Latin Small Letter O With Horn
0x01af 0xdd ;Latin Capital Letter U With Horn
0x01b0 0xfd ;Latin Small Letter U With Horn
0x02c6 0x88 ;Modifier Letter Circumflex Accent
0x02dc 0x98 ;Small Tilde
0x0300 0xcc ;Combining Grave Accent
0x0301 0xec ;Combining Acute Accent
0x0303 0xde ;Combining Tilde
0x0309 0xd2 ;Combining Hook Above
0x0323 0xf2 ;Combining Dot Below
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Left Single Quotation Mark
0x2019 0x92 ;Right Single Quotation Mark
0x201a 0x82 ;Single Low-9 Quotation Mark
0x201c 0x93 ;Left Double Quotation Mark
0x201d 0x94 ;Right Double Quotation Mark
0x201e 0x84 ;Double Low-9 Quotation Mark
0x2020 0x86 ;Dagger
0x2021 0x87 ;Double Dagger
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x2030 0x89 ;Per Mille Sign
0x2039 0x8b ;Single Left-Pointing Angle Quotation Mark
0x203a 0x9b ;Single Right-Pointing Angle Quotation Mark
0x20ac 0x80 ;Euro Sign
0x20ab 0xfe ;Dong Sign
0x2122 0x99 ;Trade Mark Sign
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,663 @@
CODEPAGE 874 ;Thai - OEM ANSI
CPINFO 1 0x3f 0x003f ;Single Byte CP, Default Char = Question Mark
MBTABLE 256
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
0x02 0x0002 ;Start Of Text
0x03 0x0003 ;End Of Text
0x04 0x0004 ;End Of Transmission
0x05 0x0005 ;Enquiry
0x06 0x0006 ;Acknowledge
0x07 0x0007 ;Bell
0x08 0x0008 ;Backspace
0x09 0x0009 ;Horizontal Tabulation
0x0a 0x000a ;Line Feed
0x0b 0x000b ;Vertical Tabulation
0x0c 0x000c ;Form Feed
0x0d 0x000d ;Carriage Return
0x0e 0x000e ;Shift Out
0x0f 0x000f ;Shift In
0x10 0x0010 ;Data Link Escape
0x11 0x0011 ;Device Control 1
0x12 0x0012 ;Device Control 2
0x13 0x0013 ;Device Control 3
0x14 0x0014 ;Device Control 4
0x15 0x0015 ;Negative Acknowledge
0x16 0x0016 ;Synchronous Idle
0x17 0x0017 ;End Of Transmission Block
0x18 0x0018 ;Cancel
0x19 0x0019 ;End Of Medium
0x1a 0x001a ;Substitute
0x1b 0x001b ;Escape
0x1c 0x001c ;File Separator
0x1d 0x001d ;Group Separator
0x1e 0x001e ;Record Separator
0x1f 0x001f ;Unit Separator
0x20 0x0020 ;Space
0x21 0x0021 ;Exclamation Mark
0x22 0x0022 ;Quotation Mark
0x23 0x0023 ;Number Sign
0x24 0x0024 ;Dollar Sign
0x25 0x0025 ;Percent Sign
0x26 0x0026 ;Ampersand
0x27 0x0027 ;Apostrophe-Quote
0x28 0x0028 ;Opening Parenthesis
0x29 0x0029 ;Closing Parenthesis
0x2a 0x002a ;Asterisk
0x2b 0x002b ;Plus Sign
0x2c 0x002c ;Comma
0x2d 0x002d ;Hyphen-Minus
0x2e 0x002e ;Period
0x2f 0x002f ;Slash
0x30 0x0030 ;Digit 0
0x31 0x0031 ;Digit 1
0x32 0x0032 ;Digit 2
0x33 0x0033 ;Digit 3
0x34 0x0034 ;Digit 4
0x35 0x0035 ;Digit 5
0x36 0x0036 ;Digit 6
0x37 0x0037 ;Digit 7
0x38 0x0038 ;Digit 8
0x39 0x0039 ;Digit 9
0x3a 0x003a ;Colon
0x3b 0x003b ;Semicolon
0x3c 0x003c ;Less-Than Sign
0x3d 0x003d ;Equals Sign
0x3e 0x003e ;Greater-Than Sign
0x3f 0x003f ;Question Mark
0x40 0x0040 ;Commercial At
0x41 0x0041 ;A
0x42 0x0042 ;B
0x43 0x0043 ;C
0x44 0x0044 ;D
0x45 0x0045 ;E
0x46 0x0046 ;F
0x47 0x0047 ;G
0x48 0x0048 ;H
0x49 0x0049 ;I
0x4a 0x004a ;J
0x4b 0x004b ;K
0x4c 0x004c ;L
0x4d 0x004d ;M
0x4e 0x004e ;N
0x4f 0x004f ;O
0x50 0x0050 ;P
0x51 0x0051 ;Q
0x52 0x0052 ;R
0x53 0x0053 ;S
0x54 0x0054 ;T
0x55 0x0055 ;U
0x56 0x0056 ;V
0x57 0x0057 ;W
0x58 0x0058 ;X
0x59 0x0059 ;Y
0x5a 0x005a ;Z
0x5b 0x005b ;Opening Square Bracket
0x5c 0x005c ;Backslash
0x5d 0x005d ;Closing Square Bracket
0x5e 0x005e ;Spacing Circumflex
0x5f 0x005f ;Spacing Underscore
0x60 0x0060 ;Spacing Grave
0x61 0x0061 ;a
0x62 0x0062 ;b
0x63 0x0063 ;c
0x64 0x0064 ;d
0x65 0x0065 ;e
0x66 0x0066 ;f
0x67 0x0067 ;g
0x68 0x0068 ;h
0x69 0x0069 ;i
0x6a 0x006a ;j
0x6b 0x006b ;k
0x6c 0x006c ;l
0x6d 0x006d ;m
0x6e 0x006e ;n
0x6f 0x006f ;o
0x70 0x0070 ;p
0x71 0x0071 ;q
0x72 0x0072 ;r
0x73 0x0073 ;s
0x74 0x0074 ;t
0x75 0x0075 ;u
0x76 0x0076 ;v
0x77 0x0077 ;w
0x78 0x0078 ;x
0x79 0x0079 ;y
0x7a 0x007a ;z
0x7b 0x007b ;Opening Curly Bracket
0x7c 0x007c ;Vertical Bar
0x7d 0x007d ;Closing Curly Bracket
0x7e 0x007e ;Tilde
0x7f 0x007f ;Delete
0x80 0x20ac ;Euro Sign
0x81 0x0081
0x82 0x0082
0x83 0x0083
0x84 0x0084
0x85 0x2026 ;Horizontal Ellipsis
0x86 0x0086
0x87 0x0087
0x88 0x0088
0x89 0x0089
0x8a 0x008a
0x8b 0x008b
0x8c 0x008c
0x8d 0x008d
0x8e 0x008e
0x8f 0x008f
0x90 0x0090
0x91 0x2018 ;Single Turned Comma Quotation Mark
0x92 0x2019 ;Single Comma Quotation Mark
0x93 0x201c ;Double Turned Comma Quotation Mark
0x94 0x201d ;Double Comma Quotation Mark
0x95 0x2022 ;Bullet
0x96 0x2013 ;En Dash
0x97 0x2014 ;Em Dash
0x98 0x0098
0x99 0x0099
0x9a 0x009a
0x9b 0x009b
0x9c 0x009c
0x9d 0x009d
0x9e 0x009e
0x9f 0x009f
0xa0 0x00a0 ;Non-Breaking Space
0xa1 0x0e01 ;Thai Letter Ko Kai
0xa2 0x0e02 ;Thai Letter Kho Khai
0xa3 0x0e03 ;Thai Letter Kho Khuat
0xa4 0x0e04 ;Thai Letter Kho Khwai
0xa5 0x0e05 ;Thai Letter Kho Khon
0xa6 0x0e06 ;Thai Letter Kho Rakhang
0xa7 0x0e07 ;Thai Letter Ngo Ngu
0xa8 0x0e08 ;Thai Letter Cho Chan
0xa9 0x0e09 ;Thai Letter Cho Ching
0xaa 0x0e0a ;Thai Letter Cho Chang
0xab 0x0e0b ;Thai Letter So So
0xac 0x0e0c ;Thai Letter Cho Choe
0xad 0x0e0d ;Thai Letter Yo Ying
0xae 0x0e0e ;Thai Letter Do Chada
0xaf 0x0e0f ;Thai Letter To Patak
0xb0 0x0e10 ;Thai Letter Tho Than
0xb1 0x0e11 ;Thai Letter Tho Nangmontho
0xb2 0x0e12 ;Thai Letter Tho Phuthao
0xb3 0x0e13 ;Thai Letter No Nen
0xb4 0x0e14 ;Thai Letter Do Dek
0xb5 0x0e15 ;Thai Letter To Tao
0xb6 0x0e16 ;Thai Letter Tho Thung
0xb7 0x0e17 ;Thai Letter Tho Thahan
0xb8 0x0e18 ;Thai Letter Tho Thong
0xb9 0x0e19 ;Thai Letter No Nu
0xba 0x0e1a ;Thai Letter Bo Baimai
0xbb 0x0e1b ;Thai Letter Po Pla
0xbc 0x0e1c ;Thai Letter Pho Phung
0xbd 0x0e1d ;Thai Letter Fo Fa
0xbe 0x0e1e ;Thai Letter Pho Phan
0xbf 0x0e1f ;Thai Letter Fo Fan
0xc0 0x0e20 ;Thai Letter Pho Samphao
0xc1 0x0e21 ;Thai Letter Mo Ma
0xc2 0x0e22 ;Thai Letter Yo Yak
0xc3 0x0e23 ;Thai Letter Ro Rua
0xc4 0x0e24 ;Thai Letter Ru
0xc5 0x0e25 ;Thai Letter Lo Ling
0xc6 0x0e26 ;Thai Letter Lu
0xc7 0x0e27 ;Thai Letter Wo Waen
0xc8 0x0e28 ;Thai Letter So Sala
0xc9 0x0e29 ;Thai Letter So Rusi
0xca 0x0e2a ;Thai Letter So Sua
0xcb 0x0e2b ;Thai Letter Ho Hip
0xcc 0x0e2c ;Thai Letter Lo Chula
0xcd 0x0e2d ;Thai Letter O Ang
0xce 0x0e2e ;Thai Letter Ho Nok Huk
0xcf 0x0e2f ;Thai Pai Yan Noi
0xd0 0x0e30 ;Thai Vowel Sign Sara A
0xd1 0x0e31 ;Thai Vowel Sign Mai Han-Akat
0xd2 0x0e32 ;Thai Vowel Sign Sara Aa
0xd3 0x0e33 ;Thai Vowel Sign Sara Am
0xd4 0x0e34 ;Thai Vowel Sign Sara I
0xd5 0x0e35 ;Thai Vowel Sign Sara Ii
0xd6 0x0e36 ;Thai Vowel Sign Sara Ue
0xd7 0x0e37 ;Thai Vowel Sign Sara Uee
0xd8 0x0e38 ;Thai Vowel Sign Sara U
0xd9 0x0e39 ;Thai Vowel Sign Sara Uu
0xda 0x0e3a ;Thai Vowel Sign Phinthu
0xdb 0xf8c1 ;Undefined -> EUDC
0xdc 0xf8c2 ;Undefined -> EUDC
0xdd 0xf8c3 ;Undefined -> EUDC
0xde 0xf8c4 ;Undefined -> EUDC
0xdf 0x0e3f ;Thai Baht Sign
0xe0 0x0e40 ;Thai Vowel Sign Sara E
0xe1 0x0e41 ;Thai Vowel Sign Sara Ae
0xe2 0x0e42 ;Thai Vowel Sign Sara O
0xe3 0x0e43 ;Thai Vowel Sign Sara Mai Muan
0xe4 0x0e44 ;Thai Vowel Sign Sara Mai Malai
0xe5 0x0e45 ;Thai Lak Khang Yao
0xe6 0x0e46 ;Thai Mai Yamok
0xe7 0x0e47 ;Thai Vowel Sign Mai Tai Khu
0xe8 0x0e48 ;Thai Tone Mai Ek
0xe9 0x0e49 ;Thai Tone Mai Tho
0xea 0x0e4a ;Thai Tone Mai Tri
0xeb 0x0e4b ;Thai Tone Mai Chattawa
0xec 0x0e4c ;Thai Thanthakhat
0xed 0x0e4d ;Thai Nikkhahit
0xee 0x0e4e ;Thai Yamakkan
0xef 0x0e4f ;Thai Fongman
0xf0 0x0e50 ;Thai Digit 0
0xf1 0x0e51 ;Thai Digit 1
0xf2 0x0e52 ;Thai Digit 2
0xf3 0x0e53 ;Thai Digit 3
0xf4 0x0e54 ;Thai Digit 4
0xf5 0x0e55 ;Thai Digit 5
0xf6 0x0e56 ;Thai Digit 6
0xf7 0x0e57 ;Thai Digit 7
0xf8 0x0e58 ;Thai Digit 8
0xf9 0x0e59 ;Thai Digit 9
0xfa 0x0e5a ;Thai Angkhankhu
0xfb 0x0e5b ;Thai Khomut
0xfc 0xf8c5 ;Undefined -> EUDC
0xfd 0xf8c6 ;Undefined -> EUDC
0xfe 0xf8c7 ;Undefined -> EUDC
0xff 0xf8c8 ;Undefined -> EUDC
WCTABLE 394
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
0x0002 0x02 ;Start Of Text
0x0003 0x03 ;End Of Text
0x0004 0x04 ;End Of Transmission
0x0005 0x05 ;Enquiry
0x0006 0x06 ;Acknowledge
0x0007 0x07 ;Bell
0x0008 0x08 ;Backspace
0x0009 0x09 ;Horizontal Tabulation
0x000a 0x0a ;Line Feed
0x000b 0x0b ;Vertical Tabulation
0x000c 0x0c ;Form Feed
0x000d 0x0d ;Carriage Return
0x000e 0x0e ;Shift Out
0x000f 0x0f ;Shift In
0x0010 0x10 ;Data Link Escape
0x0011 0x11 ;Device Control 1
0x0012 0x12 ;Device Control 2
0x0013 0x13 ;Device Control 3
0x0014 0x14 ;Device Control 4
0x0015 0x15 ;Negative Acknowledge
0x0016 0x16 ;Synchronous Idle
0x0017 0x17 ;End Of Transmission Block
0x0018 0x18 ;Cancel
0x0019 0x19 ;End Of Medium
0x001a 0x1a ;Substitute
0x001b 0x1b ;Escape
0x001c 0x1c ;File Separator
0x001d 0x1d ;Group Separator
0x001e 0x1e ;Record Separator
0x001f 0x1f ;Unit Separator
0x0020 0x20 ;Space
0x0021 0x21 ;Exclamation Mark
0x0022 0x22 ;Quotation Mark
0x0023 0x23 ;Number Sign
0x0024 0x24 ;Dollar Sign
0x0025 0x25 ;Percent Sign
0x0026 0x26 ;Ampersand
0x0027 0x27 ;Apostrophe-Quote
0x0028 0x28 ;Opening Parenthesis
0x0029 0x29 ;Closing Parenthesis
0x002a 0x2a ;Asterisk
0x002b 0x2b ;Plus Sign
0x002c 0x2c ;Comma
0x002d 0x2d ;Hyphen-Minus
0x002e 0x2e ;Period
0x002f 0x2f ;Slash
0x0030 0x30 ;Digit 0
0x0031 0x31 ;Digit 1
0x0032 0x32 ;Digit 2
0x0033 0x33 ;Digit 3
0x0034 0x34 ;Digit 4
0x0035 0x35 ;Digit 5
0x0036 0x36 ;Digit 6
0x0037 0x37 ;Digit 7
0x0038 0x38 ;Digit 8
0x0039 0x39 ;Digit 9
0x003a 0x3a ;Colon
0x003b 0x3b ;Semicolon
0x003c 0x3c ;Less-Than Sign
0x003d 0x3d ;Equals Sign
0x003e 0x3e ;Greater-Than Sign
0x003f 0x3f ;Question Mark
0x0040 0x40 ;Commercial At
0x0041 0x41 ;A
0x0042 0x42 ;B
0x0043 0x43 ;C
0x0044 0x44 ;D
0x0045 0x45 ;E
0x0046 0x46 ;F
0x0047 0x47 ;G
0x0048 0x48 ;H
0x0049 0x49 ;I
0x004a 0x4a ;J
0x004b 0x4b ;K
0x004c 0x4c ;L
0x004d 0x4d ;M
0x004e 0x4e ;N
0x004f 0x4f ;O
0x0050 0x50 ;P
0x0051 0x51 ;Q
0x0052 0x52 ;R
0x0053 0x53 ;S
0x0054 0x54 ;T
0x0055 0x55 ;U
0x0056 0x56 ;V
0x0057 0x57 ;W
0x0058 0x58 ;X
0x0059 0x59 ;Y
0x005a 0x5a ;Z
0x005b 0x5b ;Opening Square Bracket
0x005c 0x5c ;Backslash
0x005d 0x5d ;Closing Square Bracket
0x005e 0x5e ;Spacing Circumflex
0x005f 0x5f ;Spacing Underscore
0x0060 0x60 ;Spacing Grave
0x0061 0x61 ;a
0x0062 0x62 ;b
0x0063 0x63 ;c
0x0064 0x64 ;d
0x0065 0x65 ;e
0x0066 0x66 ;f
0x0067 0x67 ;g
0x0068 0x68 ;h
0x0069 0x69 ;i
0x006a 0x6a ;j
0x006b 0x6b ;k
0x006c 0x6c ;l
0x006d 0x6d ;m
0x006e 0x6e ;n
0x006f 0x6f ;o
0x0070 0x70 ;p
0x0071 0x71 ;q
0x0072 0x72 ;r
0x0073 0x73 ;s
0x0074 0x74 ;t
0x0075 0x75 ;u
0x0076 0x76 ;v
0x0077 0x77 ;w
0x0078 0x78 ;x
0x0079 0x79 ;y
0x007a 0x7a ;z
0x007b 0x7b ;Opening Curly Bracket
0x007c 0x7c ;Vertical Bar
0x007d 0x7d ;Closing Curly Bracket
0x007e 0x7e ;Tilde
0x007f 0x7f ;Delete
0x0081 0x81
0x0082 0x82
0x0083 0x83
0x0084 0x84
0x0086 0x86
0x0087 0x87
0x0088 0x88
0x0089 0x89
0x008a 0x8a
0x008b 0x8b
0x008c 0x8c
0x008d 0x8d
0x008e 0x8e
0x008f 0x8f
0x0090 0x90
0x0098 0x98
0x0099 0x99
0x009a 0x9a
0x009b 0x9b
0x009c 0x9c
0x009d 0x9d
0x009e 0x9e
0x009f 0x9f
0x00a0 0xa0 ;Non-Breaking Space
0x00a7 0x15 ;Section Sign
0x00b6 0x14 ;Pilcrow Sign
0x0e01 0xa1 ;Thai Letter Ko Kai
0x0e02 0xa2 ;Thai Letter Kho Khai
0x0e03 0xa3 ;Thai Letter Kho Khuat
0x0e04 0xa4 ;Thai Letter Kho Khwai
0x0e05 0xa5 ;Thai Letter Kho Khon
0x0e06 0xa6 ;Thai Letter Kho Rakhang
0x0e07 0xa7 ;Thai Letter Ngo Ngu
0x0e08 0xa8 ;Thai Letter Cho Chan
0x0e09 0xa9 ;Thai Letter Cho Ching
0x0e0a 0xaa ;Thai Letter Cho Chang
0x0e0b 0xab ;Thai Letter So So
0x0e0c 0xac ;Thai Letter Cho Choe
0x0e0d 0xad ;Thai Letter Yo Ying
0x0e0e 0xae ;Thai Letter Do Chada
0x0e0f 0xaf ;Thai Letter To Patak
0x0e10 0xb0 ;Thai Letter Tho Than
0x0e11 0xb1 ;Thai Letter Tho Nangmontho
0x0e12 0xb2 ;Thai Letter Tho Phuthao
0x0e13 0xb3 ;Thai Letter No Nen
0x0e14 0xb4 ;Thai Letter Do Dek
0x0e15 0xb5 ;Thai Letter To Tao
0x0e16 0xb6 ;Thai Letter Tho Thung
0x0e17 0xb7 ;Thai Letter Tho Thahan
0x0e18 0xb8 ;Thai Letter Tho Thong
0x0e19 0xb9 ;Thai Letter No Nu
0x0e1a 0xba ;Thai Letter Bo Baimai
0x0e1b 0xbb ;Thai Letter Po Pla
0x0e1c 0xbc ;Thai Letter Pho Phung
0x0e1d 0xbd ;Thai Letter Fo Fa
0x0e1e 0xbe ;Thai Letter Pho Phan
0x0e1f 0xbf ;Thai Letter Fo Fan
0x0e20 0xc0 ;Thai Letter Pho Samphao
0x0e21 0xc1 ;Thai Letter Mo Ma
0x0e22 0xc2 ;Thai Letter Yo Yak
0x0e23 0xc3 ;Thai Letter Ro Rua
0x0e24 0xc4 ;Thai Letter Ru
0x0e25 0xc5 ;Thai Letter Lo Ling
0x0e26 0xc6 ;Thai Letter Lu
0x0e27 0xc7 ;Thai Letter Wo Waen
0x0e28 0xc8 ;Thai Letter So Sala
0x0e29 0xc9 ;Thai Letter So Rusi
0x0e2a 0xca ;Thai Letter So Sua
0x0e2b 0xcb ;Thai Letter Ho Hip
0x0e2c 0xcc ;Thai Letter Lo Chula
0x0e2d 0xcd ;Thai Letter O Ang
0x0e2e 0xce ;Thai Letter Ho Nok Huk
0x0e2f 0xcf ;Thai Pai Yan Noi
0x0e30 0xd0 ;Thai Vowel Sign Sara A
0x0e31 0xd1 ;Thai Vowel Sign Mai Han-Akat
0x0e32 0xd2 ;Thai Vowel Sign Sara Aa
0x0e33 0xd3 ;Thai Vowel Sign Sara Am
0x0e34 0xd4 ;Thai Vowel Sign Sara I
0x0e35 0xd5 ;Thai Vowel Sign Sara Ii
0x0e36 0xd6 ;Thai Vowel Sign Sara Ue
0x0e37 0xd7 ;Thai Vowel Sign Sara Uee
0x0e38 0xd8 ;Thai Vowel Sign Sara U
0x0e39 0xd9 ;Thai Vowel Sign Sara Uu
0x0e3a 0xda ;Thai Vowel Sign Phinthu
0x0e3f 0xdf ;Thai Baht Sign
0x0e40 0xe0 ;Thai Vowel Sign Sara E
0x0e41 0xe1 ;Thai Vowel Sign Sara Ae
0x0e42 0xe2 ;Thai Vowel Sign Sara O
0x0e43 0xe3 ;Thai Vowel Sign Sara Mai Muan
0x0e44 0xe4 ;Thai Vowel Sign Sara Mai Malai
0x0e45 0xe5 ;Thai Lak Khang Yao
0x0e46 0xe6 ;Thai Mai Yamok
0x0e47 0xe7 ;Thai Vowel Sign Mai Tai Khu
0x0e48 0xe8 ;Thai Tone Mai Ek
0x0e49 0xe9 ;Thai Tone Mai Tho
0x0e4a 0xea ;Thai Tone Mai Tri
0x0e4b 0xeb ;Thai Tone Mai Chattawa
0x0e4c 0xec ;Thai Thanthakhat
0x0e4d 0xed ;Thai Nikkhahit
0x0e4e 0xee ;Thai Yamakkan
0x0e4f 0xef ;Thai Fongman
0x0e50 0xf0 ;Thai Digit 0
0x0e51 0xf1 ;Thai Digit 1
0x0e52 0xf2 ;Thai Digit 2
0x0e53 0xf3 ;Thai Digit 3
0x0e54 0xf4 ;Thai Digit 4
0x0e55 0xf5 ;Thai Digit 5
0x0e56 0xf6 ;Thai Digit 6
0x0e57 0xf7 ;Thai Digit 7
0x0e58 0xf8 ;Thai Digit 8
0x0e59 0xf9 ;Thai Digit 9
0x0e5a 0xfa ;Thai Angkhankhu
0x0e5b 0xfb ;Thai Khomut
0x2013 0x96 ;En Dash
0x2014 0x97 ;Em Dash
0x2018 0x91 ;Single Turned Comma Quotation Mark
0x2019 0x92 ;Single Comma Quotation Mark
0x201c 0x93 ;Double Turned Comma Quotation Mark
0x201d 0x94 ;Double Comma Quotation Mark
0x2022 0x95 ;Bullet
0x2026 0x85 ;Horizontal Ellipsis
0x203c 0x13 ;Double Exclamation Mark
0x20ac 0x80 ;Euro Sign
0x2190 0x1b ;Leftwards Arrow
0x2191 0x18 ;Upwards Arrow
0x2192 0x1a ;Rightwards Arrow
0x2193 0x19 ;Downwards Arrow
0x2194 0x1d ;Left Right Arrow
0x2195 0x12 ;Up Down Arrow
0x21a8 0x17 ;Up Down Arrow With Base
0x221f 0x1c ;Right Angle
0x2302 0x7f ;House
0x2500 0xc4 ;Box Drawings Light Horizontal ; cp437 heck for Thai text mode setup
0x2502 0xb3 ;Box Drawings Light Vertical ; cp437 heck for Thai text mode setup
0x250c 0xda ;Box Drawings Light Down And Right ; cp437 heck for Thai text mode setup
0x2510 0xbf ;Box Drawings Light Down And Left ; cp437 heck for Thai text mode setup
0x2514 0xc0 ;Box Drawings Light Up And Right ; cp437 heck for Thai text mode setup
0x2518 0xd9 ;Box Drawings Light Up And Left ; cp437 heck for Thai text mode setup
0x2550 0xcd ;Box Drawings Double Horizontal ; cp437 heck for Thai text mode setup
0x2551 0xba ;Box Drawings Double Vertical ; cp437 heck for Thai text mode setup
0x2554 0xc9 ;Box Drawings Double Down And Right ; cp437 heck for Thai text mode setup
0x2557 0xbb ;Box Drawings Double Down And Left ; cp437 heck for Thai text mode setup
0x255a 0xc8 ;Box Drawings Double Up And Right ; cp437 heck for Thai text mode setup
0x255d 0xbc ;Box Drawings Double Up And Left ; cp437 heck for Thai text mode setup
0x25ac 0x16 ;Black Rectangle
0x25b2 0x1e ;Black Up-Pointing Triangle
0x25ba 0x10 ;Black Right-Pointing Pointer
0x25bc 0x1f ;Black Down-Pointing Triangle
0x25c4 0x11 ;Black Left-Pointing Pointer
0x25cb 0x09 ;White Circle
0x25d8 0x08 ;Inverse Bullet
0x25d9 0x0a ;Inverse White Circle
0x263a 0x01 ;White Smiling Face
0x263b 0x02 ;Black Smiling Face
0x263c 0x0f ;White Sun With Rays
0x2640 0x0c ;Female Sign
0x2642 0x0b ;Male Sign
0x2660 0x06 ;Black Spade Suit
0x2663 0x05 ;Black Club Suit
0x2665 0x03 ;Black Heart Suit
0x2666 0x04 ;Black Diamond Suit
0x266a 0x0d ;Eighth Note
0x266b 0x0e ;Beamed Eighth Notes
0x2584 0xdc ;Lower Half Block ; cp437 heck for Thai text mode setup
0xf8c1 0xdb ;EUDC -> Undefined
0xf8c2 0xdc ;EUDC -> Undefined
0xf8c3 0xdd ;EUDC -> Undefined
0xf8c4 0xde ;EUDC -> Undefined
0xf8c5 0xfc ;EUDC -> Undefined
0xf8c6 0xfd ;EUDC -> Undefined
0xf8c7 0xfe ;EUDC -> Undefined
0xf8c8 0xff ;EUDC -> Undefined
0xff01 0x21 ;Fullwidth Exclamation Mark
0xff02 0x22 ;Fullwidth Quotation Mark
0xff03 0x23 ;Fullwidth Number Sign
0xff04 0x24 ;Fullwidth Dollar Sign
0xff05 0x25 ;Fullwidth Percent Sign
0xff06 0x26 ;Fullwidth Ampersand
0xff07 0x27 ;Fullwidth Apostrophe
0xff08 0x28 ;Fullwidth Left Parenthesis
0xff09 0x29 ;Fullwidth Right Parenthesis
0xff0a 0x2a ;Fullwidth Asterisk
0xff0b 0x2b ;Fullwidth Plus Sign
0xff0c 0x2c ;Fullwidth Comma
0xff0d 0x2d ;Fullwidth Hyphen-Minus
0xff0e 0x2e ;Fullwidth Full Stop
0xff0f 0x2f ;Fullwidth Solidus
0xff10 0x30 ;Fullwidth Digit Zero
0xff11 0x31 ;Fullwidth Digit One
0xff12 0x32 ;Fullwidth Digit Two
0xff13 0x33 ;Fullwidth Digit Three
0xff14 0x34 ;Fullwidth Digit Four
0xff15 0x35 ;Fullwidth Digit Five
0xff16 0x36 ;Fullwidth Digit Six
0xff17 0x37 ;Fullwidth Digit Seven
0xff18 0x38 ;Fullwidth Digit Eight
0xff19 0x39 ;Fullwidth Digit Nine
0xff1a 0x3a ;Fullwidth Colon
0xff1b 0x3b ;Fullwidth Semicolon
0xff1c 0x3c ;Fullwidth Less-Than Sign
0xff1d 0x3d ;Fullwidth Equals Sign
0xff1e 0x3e ;Fullwidth Greater-Than Sign
0xff1f 0x3f ;Fullwidth Question Mark
0xff20 0x40 ;Fullwidth Commercial At
0xff21 0x41 ;Fullwidth Latin Capital Letter A
0xff22 0x42 ;Fullwidth Latin Capital Letter B
0xff23 0x43 ;Fullwidth Latin Capital Letter C
0xff24 0x44 ;Fullwidth Latin Capital Letter D
0xff25 0x45 ;Fullwidth Latin Capital Letter E
0xff26 0x46 ;Fullwidth Latin Capital Letter F
0xff27 0x47 ;Fullwidth Latin Capital Letter G
0xff28 0x48 ;Fullwidth Latin Capital Letter H
0xff29 0x49 ;Fullwidth Latin Capital Letter I
0xff2a 0x4a ;Fullwidth Latin Capital Letter J
0xff2b 0x4b ;Fullwidth Latin Capital Letter K
0xff2c 0x4c ;Fullwidth Latin Capital Letter L
0xff2d 0x4d ;Fullwidth Latin Capital Letter M
0xff2e 0x4e ;Fullwidth Latin Capital Letter N
0xff2f 0x4f ;Fullwidth Latin Capital Letter O
0xff30 0x50 ;Fullwidth Latin Capital Letter P
0xff31 0x51 ;Fullwidth Latin Capital Letter Q
0xff32 0x52 ;Fullwidth Latin Capital Letter R
0xff33 0x53 ;Fullwidth Latin Capital Letter S
0xff34 0x54 ;Fullwidth Latin Capital Letter T
0xff35 0x55 ;Fullwidth Latin Capital Letter U
0xff36 0x56 ;Fullwidth Latin Capital Letter V
0xff37 0x57 ;Fullwidth Latin Capital Letter W
0xff38 0x58 ;Fullwidth Latin Capital Letter X
0xff39 0x59 ;Fullwidth Latin Capital Letter Y
0xff3a 0x5a ;Fullwidth Latin Capital Letter Z
0xff3b 0x5b ;Fullwidth Left Square Bracket
0xff3c 0x5c ;Fullwidth Reverse Solidus
0xff3d 0x5d ;Fullwidth Right Square Bracket
0xff3e 0x5e ;Fullwidth Circumflex Accent
0xff3f 0x5f ;Fullwidth Low Line
0xff40 0x60 ;Fullwidth Grave Accent
0xff41 0x61 ;Fullwidth Latin Small Letter A
0xff42 0x62 ;Fullwidth Latin Small Letter B
0xff43 0x63 ;Fullwidth Latin Small Letter C
0xff44 0x64 ;Fullwidth Latin Small Letter D
0xff45 0x65 ;Fullwidth Latin Small Letter E
0xff46 0x66 ;Fullwidth Latin Small Letter F
0xff47 0x67 ;Fullwidth Latin Small Letter G
0xff48 0x68 ;Fullwidth Latin Small Letter H
0xff49 0x69 ;Fullwidth Latin Small Letter I
0xff4a 0x6a ;Fullwidth Latin Small Letter J
0xff4b 0x6b ;Fullwidth Latin Small Letter K
0xff4c 0x6c ;Fullwidth Latin Small Letter L
0xff4d 0x6d ;Fullwidth Latin Small Letter M
0xff4e 0x6e ;Fullwidth Latin Small Letter N
0xff4f 0x6f ;Fullwidth Latin Small Letter O
0xff50 0x70 ;Fullwidth Latin Small Letter P
0xff51 0x71 ;Fullwidth Latin Small Letter Q
0xff52 0x72 ;Fullwidth Latin Small Letter R
0xff53 0x73 ;Fullwidth Latin Small Letter S
0xff54 0x74 ;Fullwidth Latin Small Letter T
0xff55 0x75 ;Fullwidth Latin Small Letter U
0xff56 0x76 ;Fullwidth Latin Small Letter V
0xff57 0x77 ;Fullwidth Latin Small Letter W
0xff58 0x78 ;Fullwidth Latin Small Letter X
0xff59 0x79 ;Fullwidth Latin Small Letter Y
0xff5a 0x7a ;Fullwidth Latin Small Letter Z
0xff5b 0x7b ;Fullwidth Left Curly Bracket
0xff5c 0x7c ;Fullwidth Vertical Line
0xff5d 0x7d ;Fullwidth Right Curly Bracket
0xff5e 0x7e ;Fullwidth Tilde
ENDCODEPAGE


File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,183 @@
The following files describe windows code page behavior for the "ansi" code pages provided by Microsoft.
File Code Page Description
bestfit874.txt windows-874 ANSI/OEM Thai (same as 28605, ISO 8859-15); Thai (Windows)
bestfit932.txt shift_jis ANSI/OEM Japanese; Japanese (Shift-JIS)
bestfit936.txt gb2312 ANSI/OEM Simplified Chinese (PRC, Singapore); Chinese Simplified (GB2312)
bestfit949.txt ks_c_5601-1987 ANSI/OEM Korean (Unified Hangul Code)
bestfit950.txt big5 ANSI/OEM Traditional Chinese (Taiwan; Hong Kong SAR, PRC); Chinese Traditional (Big5)
bestfit1250.txt windows-1250 ANSI Central European; Central European (Windows)
bestfit1251.txt windows-1251 ANSI Cyrillic; Cyrillic (Windows)
bestfit1252.txt windows-1252 ANSI Latin 1; Western European (Windows)
bestfit1253.txt windows-1253 ANSI Greek; Greek (Windows)
bestfit1254.txt windows-1254 ANSI Turkish; Turkish (Windows)
bestfit1255.txt windows-1255 ANSI Hebrew; Hebrew (Windows)
bestfit1256.txt windows-1256 ANSI Arabic; Arabic (Windows)
bestfit1257.txt windows-1257 ANSI Baltic; Baltic (Windows)
bestfit1258.txt windows-1258 ANSI/OEM Vietnamese; Vietnamese (Windows)
These tables include "best fit" behavior which is not present in the other files. Examples of best fit
are converting fullwidth letters to their counterparts when converting to single byte code pages, and
mapping the Infinity character to the number 8.
932, 936, 949 and 950 are all double byte code pages. The remainder are single byte code pages. Each file
is encoded in the code page it describes, eg: bestfit1252.txt is encoded in the windows-1252 encoding. The only
non-ASCII characters however are in the comments so these files may be read by an ASCII parser if necessary.
Each file has sections of key word tags and records. Any text after a ; is ignored as are blank lines. Fields are
delimited by one or more space or tab characters. Each section begins one of the following tags:
CODEPAGE
CPINFO
MBTABLE
WCTABLE
DBCSRANGE (double byte code pages only)
DBSCTABLE (double byte code pages only)
Descriptions of each tag are:
CODEPAGE 932 ; Japanese - ANSI, OEM
The CODEPAGE tag contains 1 field and marks the start of the code page file.
Field 1 -- The only field is the decimal windows code page number for this code page.
CPINFO 2 0x3f 0x30fb ; DBCS CP, Unic Default Char = Katakana Middle Dot
The CPINFO tag describes the code page with 3 fields:
Field 1 -- "1" for a single byte code page, "2" for a double byte code page.
Field 2 -- Replacement characters for unassigned Unicode code points when written to this
code page (currently always ?)
Field 3 -- Replacement characters for illegal or unassigned code page values when converting to Unicode.
This is Katakana middle dot for 932 and ? for all other code pages.
MBTABLE 256
The MBTABLE tag marks the start of the "Multibyte" code page to Unicode conversion table. It has 1 field.
Field 1 -- This field contains the number of following records of code page to Unicode mappings. Note that
lead bytes don’t have mappings, so this is not always 256. For 932 for example it is 196.
MultiByte Mapping Records:
0x00 0x0000 ;Null
0x01 0x0001 ;Start Of Heading
...
0x30 0x0030 ;Digit 0
0x31 0x0031 ;Digit 1
...
Each record consists of two fields to map from the code page to Unicode.
Field 1 -- The code page byte that is being mapped to Unicode, eg "0x3f"
Field 2 -- The Unicode UTF-16 code point that this byte maps to, eg "0x003f"
DBCSRANGE 2 ;2 DBCS Lead Byte Ranges: 0x81-0x9f and 0xe0-0xfc
The DBSCRANGE describes the number of double byte ranges for double byte code page. Ranges are consecutive
lead byte values such as 0x81-0x9f
Field 1 -- This field contains the number of double byte ranges. The next record is the 1st lead byte range
0x81 0x9f ;Lead Byte Range
This record describes the first lead byte range. It is the first record after DBCSRANGE and is followed by one
DBCSTABLE record for each lead byte in the range. If there are additional ranges, another Lead Byte Range record
will follow the last DBCSTABLE in the previous range.
Field 1 -- This field is the first lead byte used in this range.
Field 2 -- This field is the last lead byte used in this range
DBCSTABLE 147 ;LeadByte = 0x81
The DBCSTABLE record describes the mappings available for a particular lead byte. The comment is ignored but
descriptive. The lead byte of the first DBCSTABLE is the first lead byte of the previous Lead Byte Range
record. Each subsequent DBCSTABLE is for the next consecutive lead byte value.
Field 1 -- This field is the number of trail byte mappings following.
Double byte mapping records:
0x40 0x3000 ; Ideographic Space
0x41 0x3001 ; Ideographic Comma
...
Field 1 -- This field is the trail byte to map from.
Field 2 -- This field is the Unicode UTF-16 code point that this lead byte/trail byte combination map to.
Example:
DBCSRANGE 2 ;2 DBCS Lead Byte Ranges: 0x81-0x9f and 0xe0-0xfc
0x81 0x9f ;Lead Byte Range
DBCSTABLE 147 ;LeadByte = 0x81
0x40 0x3000 ; Ideographic Space
0x41 0x3001 ; Ideographic Comma
...
The preceeding example would map the byte sequences 0x81 0x40 to U+3000 and 0x81 0x41 to U+3001.
WCTABLE 698
The WCTABLE tag marks the start of the Unicode UTF-16 (WideChar) to "MultiByte" bytes. It has 1 field.
Field 1 -- This field contains the number of records of Unicode to byte mappings. Note that this is often
more than the number of round trip mappings supported by the code page due to windows "Best Fit" behavior.
Unicode UTF-16 (WideChar) Mapping Records:
These take two forms, differing between single byte and double byte code pages. Both forms have 2 fields:
Single byte WCTABLE records:
0x0000 0x00 ;Null
0x0001 0x01 ;Start Of Heading
...
0x0061 0x61 ;Latin Small Letter A
0x0062 0x62 ;Latin Small Letter B
0x0063 0x63 ;Latin Small Letter C
...
0x221e 0x38 ;Infinity << Best Fit Mapping
...
0xff41 0x61 ;Fullwidth Latin Small Letter A << Best Fit Mapping
0xff42 0x62 ;Fullwidth Latin Small Letter B << Best Fit Mapping
0xff43 0x63 ;Fullwidth Latin Small Letter C << Best Fit Mapping
...
Field 1 -- The Unicode UTF-16 code point for the character being converted.
Field 2 -- The single byte that this UTF-16 code point maps to. If a reverse mapping does not in the MBTABLE,
then this is a Best Fit mapping.
Multibyte WCTABLE records:
0x0000 0x0000 ; Null
0x0001 0x0001 ; Start Of Heading
...
0x0061 0x0061 ; a
0x0062 0x0062 ; b
0x0063 0x0063 ; c
...
0x221e 0x8187 ; Infinity
...
0xff41 0x8281 ; Fullwidth a
0xff42 0x8282 ; Fullwidth b
0xff43 0x8283 ; Fullwidth c
...
Field 1 -- The Unicode UTF-16 code point for the character being converted.
Field 2 -- The byte or bytes that this code point maps to as a 16 bit value. The high byte is the lead byte,
and the low byte is the trail byte. If the high byte is 0, then this is a single byte code point, with
the value of the low byte and no lead byte is emitted.
ENDCODEPAGE
This tag marks the end of the code page data. Anything after this marker is ignored.

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

View File

@@ -4,8 +4,8 @@
# Table version: 2.00
# Table format: Format A
# Date: 04/24/96
# Authors: Lori Brownell <loribr@microsoft.com>
# K.D. Chang <a-kchang@microsoft.com>
# Contact: Shawn.Steele@microsoft.com
#
# General notes: none
#
# Format: Three tab-separated columns

Some files were not shown because too many files have changed in this diff Show More