[windev] Converting from greek....
Serge Wautier
serge at wautier.net
Wed Mar 25 10:58:49 GMT 2009
Roberto,
Would you care to tell us why you want to implement transliteration? It's
not every day that someone comes up with such a requirement ;-)
Is it only to index the data? If so, why not use a Unicode encoding?
UTF-8 (which is ASCII-friendly) might be a good candidate.
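
To illustrate what "ASCII-friendly" means here, a minimal Python sketch
(the helper name utf8_bytes is made up for this example): plain ASCII text
is byte-for-byte unchanged in UTF-8, while each Greek letter simply takes
two bytes and survives a round trip, so no transliteration is needed just
to store or index it.

```python
def utf8_bytes(text):
    """Return the UTF-8 encoding of text as a bytes object."""
    return text.encode("utf-8")

ascii_word = "ELLAS"
greek_word = "ΕΛΛΑΣ"  # the same word written in Greek capital letters

# ASCII characters encode to exactly the same bytes as in plain ASCII.
print(utf8_bytes(ascii_word) == ascii_word.encode("ascii"))

# Each of these Greek letters needs two bytes in UTF-8.
print(len(greek_word), len(utf8_bytes(greek_word)))

# Decoding gives back the original text unchanged.
print(utf8_bytes(greek_word).decode("utf-8") == greek_word)
```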
I have never investigated tools that perform accurate transliteration, which
is indeed not a trivial job.
But you can fairly easily roll your own if you don't need too much accuracy,
using the first 2 columns of this table:
<http://www.unicode.org/cldr/data/charts/transforms/Latin-Greek.html>
Note that this table maps one Unicode code point to another. It doesn't
use 'ANSI' codepages. Depending on your needs, you may have to convert the
source from codepage 1253 to Unicode and/or the result from Unicode to
codepage 1252. The latter may prevent you from using some of the
'advanced' characters in the table, because codepage 1252 cannot
represent them.
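
A rough Python sketch of the roll-your-own approach, under stated
assumptions: the mapping below is a small illustrative table that I made up
for this example (it is NOT copied from the CLDR chart, so replace it with
the columns you actually want), and the function names are hypothetical.
It decodes codepage 1253 to Unicode, strips accents, maps each Greek letter
to Latin, and encodes the result as codepage 1252.

```python
import unicodedata

# Illustrative, simplified table (an assumption, not the CLDR data).
GREEK_TO_LATIN = {
    "α": "a", "β": "v", "γ": "g", "δ": "d", "ε": "e", "ζ": "z",
    "η": "i", "θ": "th", "ι": "i", "κ": "k", "λ": "l", "μ": "m",
    "ν": "n", "ξ": "x", "ο": "o", "π": "p", "ρ": "r", "σ": "s",
    "ς": "s", "τ": "t", "υ": "y", "φ": "f", "χ": "ch", "ψ": "ps",
    "ω": "o",
}

def strip_accents(text):
    """Decompose characters (NFD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def transliterate(text):
    """Map Greek letters to Latin; leave every other character as-is."""
    out = []
    for ch in strip_accents(text):
        latin = GREEK_TO_LATIN.get(ch.lower())
        if latin is None:
            out.append(ch)             # not a Greek letter we know about
        elif ch.isupper():
            out.append(latin.upper())  # keep the case of the source letter
        else:
            out.append(latin)
    return "".join(out)

def transliterate_cp1253(raw):
    """Codepage 1253 bytes in, codepage 1252 bytes out."""
    text = raw.decode("cp1253")
    # Anything cp1252 cannot represent is replaced by '?'.
    return transliterate(text).encode("cp1252", errors="replace")

print(transliterate("ΕΛΛΑΣ"))
print(transliterate_cp1253("Ελλάς".encode("cp1253")))
```

Note that the strict decode raises an error on bytes that codepage 1253
leaves undefined, and that stripping accents also affects accented Latin
text, which may or may not be what you want for indexing.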
HTH,
Serge.
http://www.apptranslator.com
> -----Original Message-----
> From: windev-bounces at windev.org [mailto:windev-bounces at windev.org] On
> Behalf Of Roberto Tirabassi
> Sent: Wednesday, March 25, 2009 10:50
> To: Windows Developers (Mailing List)
> Subject: [windev] Converting from greek....
>
> Hi Gurus...
>
> I'm facing this problem... I have to extract and index text from XML
> files that can contain many different languages, even Greek. I'm not
> facing Middle Eastern languages yet... I'll be satisfied if I can
> transliterate Greek...
>
> A better explanation: the letters of most European languages can be
> expressed as ASCII chars plus a modifier (I mean an accent and so on).
> As everybody knows, http://www.zakynthian.com/index1
> [image: ellas_flag.jpg]
> can be written as ELLAS. I tried using iconv in many ways but did not
> find a way to convert such values. Should I do it on my own, or is
> there a way (using libiconv or not) to obtain such a transliteration?
> Thanks.
>
> Roberto Tirabassi.
>
> --
> "The question of whether a computer can think is no more interesting
> than the question of whether a submarine can swim"
> [Edsger W. Dijkstra, Dutch computing scientist]
>
>
>
> Skype: roberto.tirabassi
>
> 3D Informatica, Via Speranza 35,
> 40068, S.Lazzaro di Savena - Bologna, Italy
> Voice: +39051450844, Fax: +39051451942
> WWW: http://www.3di.it
> Documentation: http://www.3di.it/manuali/ - http://wiki.3di.it
> FTP: ftp://ftp@ftp.3di.it, Download:/3di, Upload:/incoming
> --
>
>
>