old-www/HOWTO/Hellenic-HOWTO-10.html

347 lines
17 KiB
HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
<META NAME="GENERATOR" CONTENT="SGML-Tools 1.0.9">
<TITLE>HELLENIC HOWTO: Pivakes xaraktnrwv, character sets</TITLE>
<LINK HREF="Hellenic-HOWTO-11.html" REL=next>
<LINK HREF="Hellenic-HOWTO-9.html" REL=previous>
<LINK HREF="Hellenic-HOWTO.html#toc10" REL=contents>
</HEAD>
<BODY>
<A HREF="Hellenic-HOWTO-11.html">Next</A>
<A HREF="Hellenic-HOWTO-9.html">Previous</A>
<A HREF="Hellenic-HOWTO.html#toc10">Contents</A>
<HR>
<H2><A NAME="greekcharsets"></A> <A NAME="s10">10. Pivakes xaraktnrwv, character sets</A></H2>
<P>O H/U gia va parastnsei ta glwssika sumbola xrnsimopoiei
1byte=8bit, dnladn exoume 2^8=256 diaforetika grammata.
O kwdikas ASCII (American Standard Code for Information Interchange)
orizei austnra movo ta prwta 128 sumbola (7bit).
Ta alla misa sumbola xrnsimopoiouvtai gia apeikovisn eidikwv sumbolwv
allwv glwsswv alla kai grafikwv sumbolwv.
Avti8eta, me alles eurwpaikes glwsses ta ellnvika eivai e3' oloklnrou sta 8bit.
O profavns logos eivai ta polla diaforetika sumbola pou exei n ellnvikn glwssa
se sxesn me tis upoloipes.
<P>Pros8etes plnrofories gia ta ellnvika sto Diadiktuo, mporeite va breite sto
RFC 1947, "Greek Character Encoding for Electronic Mail Messages".
Deite sxetika
<A HREF="http://andrew2.andrew.cmu.edu/rfc/rfc1947.html">http://andrew2.andrew.cmu.edu/rfc/rfc1947.html</A><P>
<H2><A NAME="allgreekcodepages"></A> <A NAME="ss10.1">10.1 Tupopoinseis twv ellnvikwv</A>
</H2>
<P>Ta ellnvika uparxouv se polles diaforetikes tupopoinseis.
Oi pio suvn8ismeves apo autes, eivai ta 737 kai ta 928.
Amfotera eivai gia movotovika ellnvika.
Ta mev 737 xrnsimopoiouvtai apo to DOS, ta de 928 apo ola ta UNIX kai Windows (me mikres parallages).
To Linux exei sav kuria kwdikoselida ta 928.
To oti exoume gia Ellnvika duo kai pleov protupa, fusika, eivai megalo problnma,
pou 3epervietai me eidikous metatropeis, gia allagn apo to eva set sto allo.
<P>Apo tnv
<A HREF="http://www.oracle.com/download/Linux/oracle8/html/instguide/server805/a66251/appc.htm">tekmnriwsn tns Oracle gia to Linux</A> kai ta
<A HREF="http://wpi.wpi.edu/server/doc/SRF73/ch4.htm#nls01">egxeiridia tou server</A>, mporei kaveis va brei
ta diadedomeva ellnvika protupa pou xrnsimopoiouvtai se baseis dedomevwv
(ara kai sta pio snmavtika sustnmata H/U) kai tous tupopoinmevous (ma pali;)
kwdikous tous:
<UL>
<LI>EL8ISO8859P7, Dnl. to ISO8859-7 gia to UNIX &amp; Internet. Eivai gvwsto kai ws 928 &amp; Latin7.</LI>
<LI>EL8MSWIN1253, Ta Windows ellnvika</LI>
<LI>EL8PC737, Ta DOS ellnvika</LI>
<LI>EL8MACGREEK, O Macintosh xrnsimopoiei ELOT-823 (IBM 851).</LI>
<LI>EL8MACGREEKS</LI>
<LI>EL8PC437S</LI>
<LI>EEC8EUROPA3</LI>
<LI>EL8EBCDIC875, IBM mainframes greek character set</LI>
<LI>EL8DEC, Auta eivai logika ta DEC, VAX/VMS ellnvika. (kaveis palios???)</LI>
</UL>
Episns, to OS/2 xrnsimopoiei tnv kwdikoselida 869 kai 851 gia ta ellnvika.
<P>
<H2><A NAME="737"></A> <A NAME="ss10.2">10.2 737</A>
</H2>
<P>Ta 737 eivai episns gvwsta kai ws 437G (=437Greek),
giati proekuyav apo tropopoinsn twv amerikavikwv 437.
Ta 737 prwtoemfavistnkav stis ellnvikes EPROM twv MDA kai Hercules kartwv grafikwv
twv prwtwv PC, opou briskovtav dnladn sto HARDWARE.
Xrnsimopoin8nkav kata korov sto DOS, kai gia auto ola ta arxeia pou proerxovtai
apo ekei avamevetai va eivai 737.
Epeidn ta 737 8ewrouvtai pleov kataloipo tou DOS, eivai kalutera
va metatreyete ta arxeia pou eivai 737 se 928, bl.
<A HREF="#convertgreek">convertgreek</A>
.
Sto Linux, n kwdikoselida 737 upostnrizetai plnrws movo stnv kovsola (text-mode),
alla uparxouv kai merikes grammatoseires gia X-Windows.
<P>
<H3><A NAME="kernel737"></A> Tropopoinsn purnva gia upostnri3n 737</H3>
<P>Exouv avafer8ei periptwseis, opou to "d" (DELTA mikro) dev plnktrologeitai se
kapoious purnves kai auto sumbaivei giati sumpiptei me to 128+ESC (128+27=155=asc("d")).
Pngaivete sto /usr/src/linux/drivers/char/console.c, kapou leei:
<PRE>
&amp;&amp; (c != 127 || disp_ctrl)
&amp;&amp; (c != 128+27);
alla3te se
&amp;&amp; (c != 127 || disp_ctrl)
/* &amp;&amp; (c != 128+27)*/;
</PRE>
kai kavete compile eva veo purnva.
<P>
<H3><A NAME="737inxwindows"></A> 737 se X-windows</H3>
<P>Ta 737 upostnrizovtai se merikes apo tis fixed grammatoseires
pou eivai sto paketo Grafis:
<A HREF="Hellenic-HOWTO-3.html#graphis">graphis</A>
.
<PRE>
[ah@computer.org]'s report for names (from xlsfonts):
-misc-grfixed-medium-r-normal--0-0-75-75-c-0-grpc-737
-misc-grfixed-medium-r-normal--0-0-85-85-m-0-grpc-737
-misc-grfixed-medium-r-normal--14-110-75-75-c-75-grpc-737
-misc-grfixed-medium-r-normal--16-120-75-75-c-75-grpc-737
-misc-grfixed-medium-r-normal--23-179-85-85-m-120-grpc-737
-misc-grfixed-medium-r-semicondensed--0-0-75-75-c-0-grpc-737
-misc-grfixed-medium-r-semicondensed--10-100-75-75-c-60-grpc-737
-misc-grfixed-medium-r-semicondensed--13-120-75-75-c-60-grpc-737
-misc-grvga-medium-r-normal--0-0-75-75-c-0-grpc-737
-misc-grvga-medium-r-normal--13-120-75-75-c-60-grpc-737
(nomizw kapoia exoyn bugs kai exw skopo na ta diorthwsw se next release).
</PRE>
<P>
<H2><A NAME="928"></A> <A NAME="ss10.3">10.3 928</A>
</H2>
<P>Ta ellnvika 928 eivai n pio sugxrovn kai diadedomevn tupopoinsn
kai ka8ierw8nke arxika apo tov ELOT.
Argotera egivav apodekta kai apo tov ISO ws ISO-Latin-8859-7, n apla Latin7,
akoma kai n UNICODE upostnri3n ellnvikwv basizetai se auta.
Ta 928 xrnsimopoiouvtai se oles tis efarmoges twv UNIX, sto Internet
kai apotelouv to snmerivo protupo kai gia to Linux.
To protupo 928 upostnrizetai, kai stnv kovsola (text-mode), kai se grafiko
periballov (X-Windows).
<P>
<H3><A NAME="Windows-1253"></A> Windows-1253</H3>
<P>H kuria apoklisn twv Windows ellnvikwv (Windows-1253) apo tnv tupopoinsn ELOT 928,
eivai o xaraktnras "A", (A tovoumevo) tou 928 o opoios sta Windows avtistoixei sto
Paragraph mark. Apo ta Windows-1253 leipouv episns n avw teleia,
kai ta ellnvika omoiwmatika &lt;&lt; kai >>.
Epeidn moiraia 8a prepei va apodextoume tov periorismo auto pou mas 8etouv ta MS-Windows,
kai epeidn arketoi xrnstes xrnsimopoiouv wintel platforma ergasias, kalo 8a eivai
va apofeugetai to &lt; A tovoumevo > kata tnv apostoln e-mails, postings, klp.
Evallaktika mporeite va xrnsimopoieite to 'A ( ' = SHIFT+" )
Paromoia problnmata uparxouv kai me ta 'E kai 'O.
Gia eukolia sas, auta eivai ola ta tovoumeva kata 928: AEHIOYO.
<P>
<H2><A NAME="Unicode"></A> <A NAME="ss10.4">10.4 Unicode</A>
</H2>
<P>Ta UNICODE (ISO 10646) eivai 16bit (dnl. 65536 suvdiasmoi) kai perilambavouv polles glwsses,
mazi me ta vea ellnvika, pou exouv offset #370 kai ta arxaia ellnvika me offset #1F00.
Upostnrizovtai apo ta vea mexri ta arxaia (polutovika) ellnvika kai Grammikn B!
To Linux upostnrizei eswterika ta UNICODE, alla akoma n xrnsn tous dev eivai diadedomevn,
giati e3artatai kai apo tnv uio8etnsn tous apo tis efarmoges.
Gia perissotera deite:
<A HREF="http://linuxdoc.org/HOWTO/Unicode-HOWTO.html">http://linuxdoc.org/HOWTO/Unicode-HOWTO.html</A><P>
<PRE>
====================================================================
Vasilis Vasaitis &lt;vas@hal.csd.auth.gr>:
Av kai dev exw asxoln8ei ektevws me to avtikeimevo, mporw va suveisferw
kapoia gvwsn pou exw epi tou 8ematos. Loipov, exoume kai leme:
Kapoia stigmn, se avupopto xrovo, eixa katebasei eva Unicode fixed font
gia ta X windows. Epeidn duskola sbnvw auta pou katebazw, to brnka va
ka8etai akoma sto disko mou. H grammatoseira autn dev periexei to plnres
Unicode, afou auto apoteleitai apo perissoterous apo 38000 xaraktnres, apo
tous opoious oi perissoteroi eivai Kivezika/Iapwvika/Koreatika, pou etsi ki
alliws sto 6x13 tou fixed dev mpaivouv. Omws me peripou 2800 xaraktnres (n
ekdosn pou exw egw toulaxistov) kaluptei plnrws tnv lativikn, ellnvikn,
kurillikn, armevikn, gewrgiavn kai ebraikn grafn, suv kapoia texvika kai
ma8nmatika sumbola. H grammatoseira autn mporei va xrnsimopoin8ei ws protupo
apo opoiovdnpote evdiaferetai va sxediasei grammatoseires me pollous
xaraktnres sxetika me pio praktikes efarmoges, deite parakatw. H selida tou
tupou pou tnv eftia3e, av eivai akoma n idia, eivai:
http://www.cl.cam.ac.uk/~mgk25/
Upostnri3n stnv kovsola:
H kovsola upostnrizei Unicode edw kai kati aiwves, mesw bebaia tou UTF8
(gia osous dev 3erouv, to UTF8 eivai mia avaparastasn tou UniCode me
metablnto mege8os, to opoio gia gia tous 128 prwtous xaraktnres exei tnv
idia morfn me to ASCII). To 8ema eivai oti etsi ki alliws n upostnri3n tns
VGA gia xaraktnres pou emfavizovtai sugxrovws eivai polu periorismevn (256,
512 xwris to avabosbnma).
Upostnri3n sta X:
H grammatoseira pou avaferw parapavw douleuei mia xara, kai n teleutaia
fora pou tn dokimasa ntav priv polu kairo. Episns, tuxaivei va exw evav X
server me evswmatwmevn upostnri3n TrueType fonts (dev fortwvw font server),
kai blepw oti kai ta TrueType douleuouv mia xara. Gia osous dev 3erouv, ta
XFree86 4.0 8a erxovtai me evswmatwmevn upostnri3n TrueType. H Microsoft
(dev exw apo alln etaireia) xrnsimopoiei stis grammatoseires tns to Windows
Glyph List 4 (WGL4), to opoio eivai uposuvolo tou ISO 10646-1 (ligo polu
auto pou exei n grammatoseira pou periegraya arxika).
Efarmoges:
Edw katarreouv ola. Autn tn stigmn uparxouv kava duo programmata pou
kavouv metatropn apo/pros UTF8, to yudit kai to Netscape pou mazeuouv apo
edw ki apo ekei gia va brouv arketa sumbola tou Unicode, kai apo ekei kai
pera to xaos. Pavtws kalov 8a eivai va arxisei prospa8eia gia ta fonts, kai
favtazomai oti oi efarmoges 8a koita3ouv va akolou8nsouv.
---------------
Report apo Panagioti Vrioni:
Gnwrizw oti o Giannis Gyftomitros &lt;yang@hellug.gr> exei hdh arxisei
na asxoleitai me thn dunatothta dhmiourgias Unicode grammatoseirwn
pou na periexoun kai ta ellhhnika (Project Grafis, bl. GRArial k.l.),
isws na exei proxwrhsei kai parapera...
Apo thn ekdosh 6.0, o XFS pou periexetai sto Red Hat exei patch wste
na mporei na emfanisei Trye Type Fonts. Bl. sxetiko "White Paper" stho
"support" ths http://www.redhat.com/ . An balete Unicode TTFonts
(px. ths M$) auta paizoun, me thn ennoia oti fainontai dia8esima
ta fonts me xilia-duo diaforetika encodings. Den kserw omws an paizoun
kai san unicode grammatoseira, px. gia na dei kapoios ena keimeno
me ellhnika, agglika kai kinezika tautoxrona sto Netscape.
=====================================================================
</PRE>
<P>
<H3><A NAME="Unicode Links"></A> Unicode Links</H3>
<P>Uparxei mia fixed grammatoseira gia Xwindows, deite sxetika:
<A HREF="http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html">http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html</A><P>Uparxei kai evas text editor gia Unicode, me to ovoma Yudit,
<A HREF="ftp://metalab.unc.edu/pub/Linux/apps/editors/X/yudit-1.1.tar.gz">ftp://metalab.unc.edu/pub/Linux/apps/editors/X/yudit-1.1.tar.gz</A><P>To protupo UTF-8 eivai pleov standard sto Internet, deite to sxetiko RFC:
<A HREF="http://andrew2.andrew.cmu.edu/rfc/rfc2279.html">http://andrew2.andrew.cmu.edu/rfc/rfc2279.html</A><P>Perissotera gia ta vea ellnvika sta Unicode edw:
<A HREF="http://charts.unicode.org/Unicode.charts/normal/U0370.html">http://charts.unicode.org/Unicode.charts/normal/U0370.html</A><P>
<H2><A NAME="convertgreek"></A> <A NAME="ss10.5">10.5 Metatropeis ellnvikwv</A>
</H2>
<P>
<H3><A NAME="gr2gr"></A> gr2gr</H3>
<P>O Aggelos Xaritsns &lt;
<A HREF="mailto:ah@computer.org">ah@computer.org</A>&gt;
exei grayei tov metatropea autov:
<A HREF="ftp://ftp.hri.org/pub/greek/programs/gr2gr.prl">ftp://ftp.hri.org/pub/greek/programs/gr2gr.prl</A>
Trexei me perl (5 n 4). Suvepws douleuei se opoio leitourgiko
sustnma exei egkatasta8ei perl (unix, dos, win32, os2, mac, vms ...).
<P>Upostnrizei polla diaforetika ellnvika, opws:
<UL>
<LI>928: ELOT 928</LI>
<LI>437: IBM 437 (*default* input)</LI>
<LI>lat: latin greek form, aka greeklish (*default* output)</LI>
<LI>437b: IBM 437b</LI>
<LI>win: Windows</LI>
<LI>mac: Macintosh</LI>
<LI>851: 851</LI>
<LI>869: 869</LI>
<LI>quad: quadtek</LI>
<LI>sym: standard Symbol font codes (English garbled)</LI>
<LI>wgr: WinGreek (Windows shareware prg) encoding</LI>
<LI>troff: troff symbol font escape sequences, no diacritics (dialytika)</LI>
<LI>kdtex: Dryllerakis TeX (only conversions _to_ kdtex work)</LI>
<LI>ibytex: ibygrk TeX (only conversions _to_ ibygrk work)</LI>
</UL>
<P>
<H3><A NAME="grfilter"></A> grfilter</H3>
<P>Sto Ivstitouto Texvologias Upologistwv uparxei to grfilter:
<A HREF="ftp://ftp.cti.gr/pub/src/grfilter.tar">ftp://ftp.cti.gr/pub/src/grfilter.tar</A><P>
<H3><A NAME="greek2lat"></A> greek2lat</H3>
<P>Sto directory
<A HREF="ftp://corfu.forthnet.gr/pub/greek2lat">ftp://corfu.forthnet.gr/pub/greek2lat</A>
uparxei evas metatropeas apo 928 se greeklish, katallnlos kai gia WEB sites.
<P>
<H3><A NAME="trans120.tar.gz"></A> trans120.tar.gz</H3>
<P>O Kwstas Kwstns,
&lt;
<A HREF="mailto:kosta@kostis.net">kosta@kostis.net</A> &gt;
exei grayei episns autov tov metatropea, pou upostnrizei kai polla
ellnvika, alla kai alles glwsses:
<A HREF="http://www.kostis.net/freeware/trans120.tar.gz">http://www.kostis.net/freeware/trans120.tar.gz</A><P>
<H3><A NAME="gkconv"></A> gkconv</H3>
<P>Uparxei kai eva programma tou Giwrgou Spnliwtn, metatrepei 437, Win95, X win.
H dieu8uvsn tou agvoeitai.
<P>
<H3><A NAME="recode"></A> recode</H3>
<P>Auto eivai eva programmataki gevikns xrnsns apo to GNU project,
to opoio upostnrizei metatropeis gia polles diaforetikes glwsses (kai ellnvika).
Isws 8a eprepe ola ta upoloipa programmata kapoia stigmn va evswmatw8ouv se auto.
Deite stnv dieu8uvsn
<A HREF="http://www.delorie.com/gnu/docs/recode/recode_toc.html">http://www.delorie.com/gnu/docs/recode/recode_toc.html</A><P>
<H2><A NAME="filetypes"></A> <A NAME="ss10.6">10.6 Tupoi arxeiwv kai metatropn tous</A>
</H2>
<P>
<DL>
<DT><B> .txt, .doc</B><DD><P>Avaloga me tnv periptwsn, blepe
<A HREF="#convertgreek">convertgreek</A>
<DT><B> .dbf </B><DD><P>Suvn8ws eivai 737, 8elouv prosoxn stnv metatropn,
afnste to gia kava guru.
<DT><B> .diz, </B><DD><P>Suvn8ws eivai 737, blepe
<A HREF="#convertgreek">convertgreek</A>
<DT><B> .html, </B><DD><P>Prepei va eivai 928, kai faivovtai kavovika.
<DT><B> .mov, .avi</B><DD><P>Av exei upotitlous sta ellnvika, 8a eivai OK :-)
<DT><B> .exe, .com</B><DD><P>peta3te ta
</DL>
<P>
<H2><A NAME="InternetandGreek"></A> <A NAME="ss10.7">10.7 Ti uparxei akoma sto Internet sxetika me ellnvika;</A>
</H2>
<P>Xrnsimoi suvdesmoi:
<UL>
<LI>Ellnviko Fortune:
<A HREF="http://kronos.eng.auth.gr/~arvan/fortunes/">http://kronos.eng.auth.gr/~arvan/fortunes/</A> </LI>
<LI>Virgo help on greek:
<A HREF="http://www.virgo.gr/baza/greek.html">http://www.virgo.gr/baza/greek.html</A> </LI>
<LI>Hellenic Resources Institute (HRI):
<A HREF="http://www.hri.org">http://www.hri.org</A> </LI>
<LI>Font pivakes sto HRI:
<A HREF="http://www.hri.org/fonts/unix/pinakec.html">http://www.hri.org/fonts/unix/pinakec.html</A> </LI>
<LI>fonts@argeas:
<A HREF="ftp://argeas.hellug.gr/pub/unix/linux/GREEK/fonts/">ftp://argeas.hellug.gr/pub/unix/linux/GREEK/fonts/</A></LI>
<LI>fonts@HRI:
<A HREF="ftp://ftp.hri.org/pub/greek/fonts/x-win/">ftp://ftp.hri.org/pub/greek/fonts/x-win/</A></LI>
<LI>Selida tou Pavagiwtn Bruwvn gia grammatoseires:
<A HREF="http://users.hol.gr/~vrypan/cactus/grfonts-1.html">http://users.hol.gr/~vrypan/cactus/grfonts-1.html</A></LI>
<LI>Unix Greek Language Software (old):
<A HREF="http://www.cs.columbia.edu/~akonstan/en/greek/software.html">http://www.cs.columbia.edu/~akonstan/en/greek/software.html</A> </LI>
<LI>Unicode Organization:
<A HREF="http://www.unicode.org">http://www.unicode.org</A> </LI>
<LI>I18N FAQ:
<A HREF="http://www.vlsivie.tuwien.ac.at/mike/i18n.html">http://www.vlsivie.tuwien.ac.at/mike/i18n.html</A> </LI>
<LI>ISO fonts:
<A HREF="ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/ISO-fonts">ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/ISO-fonts</A> </LI>
<LI>International fonts:
<A HREF="ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/i18n-fonts">ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/i18n-fonts</A> </LI>
<LI>Using 8 bit characters:
<A HREF="ftp://ftp.ulg.ac.be/pub/docs/iso8859/">ftp://ftp.ulg.ac.be/pub/docs/iso8859/</A> </LI>
<LI>ISO-8859 sets:
<A HREF="http://wwwwbs.cs.tu-berlin.de/~czyborra/charsets/">http://wwwwbs.cs.tu-berlin.de/~czyborra/charsets/</A></LI>
<LI>Much charactersets info (kermit?):
<A HREF="ftp://kermit.columbia.edu/kermit/charsets/">ftp://kermit.columbia.edu/kermit/charsets/</A></LI>
<LI>
<A HREF="http://www.ora.com/homepages/comp.fonts/">http://www.ora.com/homepages/comp.fonts/</A></LI>
</UL>
Yaxte va breite oti xreiazeste me auto to search engine:
<A HREF="http://www.google.org">http://www.google.org</A>
<P>
<HR>
<A HREF="Hellenic-HOWTO-11.html">Next</A>
<A HREF="Hellenic-HOWTO-9.html">Previous</A>
<A HREF="Hellenic-HOWTO.html#toc10">Contents</A>
</BODY>
</HTML>