??ࡱ?>?? ??????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????)   !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~?Root Entry????????@???k???,PowerPoint Document(???? |PersistentStorage Directory8 ????????>Current User????????????????????????  !"#$%&'()*+,-./0123456789:;<=>?@AB????DEFGHIJKLMNOPQRSTUVWXYZ????????]^_`abcdefghijklmnopqrstuvwxyz{|}~???? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1?? ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ???? b? (?????b,??? ???5 ????????? 8?????b???G?:?b????$ģbd0???f??? XML/DOM LDAP and X.500 (?) ???/?/?/-=?/-?/-?/-??/-?/-????0??0?40????dd ?40UuAd?40UuAd?40UuAd?40????dd?40?dd>?40UuAdE?40UuAd?40?dd?40?dd???(1?1?1??????????8??????????0 p????%?8?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????8? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??! ? ??]?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????-?!5XML: Language and Charset tagging???</?,/!?/-???X0?H0!?40????dd???(1?1!?1?????? ????  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????t?85Character is atomic unit of text All ISO 10646 characters + TAB, CR, LF The mechanism for Encoding can vary for different characters All XML processors must accept UTF-8 and UTF-16 Character Encoding in Entities (XML 4.3.3) EncodingDecl : : = S ?encoding? Eq ? ? ? EncName ? ? ? | ? ? ? EncName ? ? ? ) Autodetection of Character Encoding Language identification (XML 2.12) Tag for identification of languages LanguageID : : = Langcode (?-? Subcode) Langcode : : = ISO639Code | IanaCode | UserCode????/??/!?/-'?/-m?/-+?/-??/-$?/-#?/-$?/-Y?/-???(0?0!?40????dd'?40?dd=?40UuAd0?40UuAd+?40?dd??40UuAd$?40?dd#?40????dd$?40?dd)?40UuAd0?40UuAd???(1?18?1????????8?+??????????0 p????%?+?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????+? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??X?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????(?5Charset and Language tagging???</?,/?/-???X0?H0?40????dd???(1?1?1?????? ??;?  b? (?????b6???????? ?????? 8?????b?????9?b???c?$ģbd0???o???????? w???? ?42?dd?3??5MIME types text, img, audio, video Charset = Character Set + Character Encoding Scheme Transfer Encoding Scheme base64 quoted-printable Language RFC 1766 ISO639-2 ????/??/ ?/-e?/-?/- ?/-?/-????0??0 ?40????dd?40?dd4?40?dd?40?dd?40UuAd?40UuAd ?40????dd ?40?dd ?40?dd???(1?1??1????????@ ?7??????????0 p????%?7?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????7? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??b?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????2?&5Language Definition in DC Metadata set???</?,/&?/-???X0?H0&?40????dd???(1?1&?1?????? ??9?  b? (?????b6???????? ?????? 8?????b?????9?b???a?$ģbd0???o???????? w???? ??5 ???</?,/??/-????0??0?40????dd!?40????dd?40????dd?40????dd?40????dd"?40????dd???(1?1??1????????e?9??????????0 p????%?9?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????9? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??? ? ??X?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????(?5Multilingual Subject Gateway???</?,/?/-???X0?H0?40????dd???(1?1?1?????? ??h?  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????8?5Developing multilingual subject gateways (SOSIG as example) SOSIG accept any languages evaluated for quality Translation should be coherent and checked Different language version should be equally well maintained SOSIG Cataloguing rules TITLE will be displayed in the first language ALTERNATIVE TITLE in other languages DESCRIPTION will mention different languages in which resource is available URI of all language versions Labeling URI language Library standards for multilingual provision NISO Z39.53 Language codes USMARC Language codes????/??/<?/-??/-??/--?/-0?/-????0??0<?40????dd1?40?dd+?40?dd=?40?dd?40?dd.?40UuAd%?40UuAdL?40UuAd?40UuAd?40UuAd-?40????dd?40?dd?40?dd???(1?1?1????????m?:??????????0 p????%?:?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????:? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??? ? ??v?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????F?:5Multilingual provision in popular Internet Search Engines???</?,/:?/-???X0?H0:?40????dd???(1?1:?1?????? ??R?  b? (?????b6???????? ?????? 8?????b?????9?b???z?$ģbd0???o???????? w????"??5AltaVista Search in 25 languages Documents indexed as is Automatic translation - very simple and naive Other sites that have dedicated national sites interface language language resoures no special language policy Euroseek Excite Lycos Infoseek???D/?4/ ?/-?/-?/-.?/-0?/-@?/-?/-???p0?`0 ?40????dd?40?dd?40UuAd.?40?dd0?40????dd?40UuAd?40UuAd?40UuAd ?40?dd?40?dd?40?dd?40?dd???(1?1??1???????? ?;??????????0 p????%?;?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????;? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??v?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????F?:5New Developments in Subject Gateways, Indexing, Searching???</?,/:?/-???X0?H0:?40????dd???(1?1:?1?????? ????  b? (?????b6???????? ?????? 8?????b???R?9?b???"?$ģbd0???o???????? w??????r5NRENs projects Subject gateways Commercial Search Engines Multilingual Text Retrieval and Processing TUSTEP system???h/?X/e?/- ?/-???x0?h0?40????dd?40????dd?40????dd+?40????dd ?40?dd???(1?1r?1??????????=??????????0 p????%?=?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????=? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ?? ? ??I?  b? (?????b??????? ????????? 8?????b?????9?b???q?$ģbd0???A???$???? l???????? 5NREN projects???</?,/ ?/-???X0?H0 ?40????dd???(1?1 ?1?????? ????  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????k??5Social Science Information Gateway http://sosig.esrc.bris.ac.uk/ ROADS Project Software/Documentation Server - http://www.roads.lut.ac.uk/ CHIP-Pilot (Clearing House for Internet Projects) - http://www.terena.nl/chip/ IMesh - International Collaboration on Internet Subject Gateways - http://www.desire.org/html/subjectgateways/community/imesh/ DFN Indexing and Searching projects - http://www.dfn.de/links/suchen.html X.500 Directory E-mail Addresses Search (AMBIX-D) - http://ambix.uni-tuebingen.de:8889 TUSTEP Munltilingual Textdata Processing and Fuzzy Searching - http://www.uni-tuebingen.de/zdv/tustep/tdv_eng.html IKEM Toolkit - http://bikit.rug.ac.be:80/ikem/ DRUID Classification Tools, University of Twente - http://twentyone.tpd.tno.nl/druid/???h/?X/??/-"?/-????0??0A?40????ddJ?40????ddO?40????dd?40????ddJ?40????ddX?40????ddt?40????dd/?40????ddU?40????dd???(1?1??1?????????<??????????0 p????%?<?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????<? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??O?  b? (?????b??????? ????????? 8?????b?????9?b???w?$ģbd0???A???$???? l????????5Search Engines news???</?,/?/-???X0?H0?40????dd???(1?1?1?????? ??$?  b? (?????b6???????? ?????? 8?????b???|?9?b???L?$ģbd0???o???????? w???????5CLEVER project at IBM Almaden Research Center - http://www.almaden.ibm.com/cs/k53/clever.html Cora Search Engine - http://www.cora.justresearch.com/about.html Google Search Engine - http://www.google.com/why_use.html Free AltaVista Search Intranet v2.3A Entry Level Software http://www.altavista.software.digital.com/search/intranet/free_3k/index.asp Ultraseek Server for Linux Platforms http://software.infoseek.com/products/ultraseek/linux/ultrareq.htm???</?,/??/-???x0?h0^?40????ddB?40????dd;?40????dd??40????ddg?40????dd???(1?1??1??????????,??????????0 p????%?,?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????,? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??+ ? ??p?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????@?45TUSTEP TUebingen System of Text Processing Programs???</?,/4?/-???X0?H04?40????dd???(1?14?1?????? ????  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????k?42?dd?3?/51. File structure 2. Multilingual capabilities 3. Internal data presentation 4. Database publishing/output data presentation 5. CGI 6. Sample implementation http://lddv.zdv.uni-tuebingen.de/cgi-bin/opac/zdvlit Try entries like Smith or Meier or... http://lddv.zdv.uni-tuebingen.de/cgi-bin/km/npquery ????/??/??/-7?/-&?/-5?/-????0??0?40????dd?40????dd?40????dd0?40????dd?40????dd?40????dd7?40?dd&?40UuAd5?40?dd???(1?1/?1????????? ?-??????????0 p????%?-?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????-? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??? ??R?  b? (?????b??????? ????????? 8?????b?????9?b???z?$ģbd0???A???$???? l???????"?5TUSTEP: File structure???</?,/?/-???X0?H0?40????dd???(1?1?1?????? ????  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????_?#5TUSTEP can handle basically all kinds of (explicitely or implicitely) structured text files) Special support for XML "Databases" (i. e. files with a repeated and regular structure) are only a special case of this. Fuzzy search and other retrieval actions can then be used to access the data????/??/^?/-y?/-L?/-???00? 0^?40?dd?40UuAda?40UuAdL?40?dd???(1?1#?1????????t?.??????????0 p????%?.?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????.? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??? ? ??]?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????-?!5TUSTEP: Multilingual capabilities???</?,/!?/-???X0?H0!?40????dd???(1?1!?1?????? ??r?  b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????B?&5TUSTEP supports the following scripts: - Latin - Cyrillic - Greek (classical and modern) - Hebrew (with support for Yiddish) - Arabic - Estrangelo - Coptic - Old Church Slavonic More: Phonetics, Egyptian hieroglyphs allows use of combining diacritics Experimental: Indic scripts and Armenian ????/??/'?/-??/-?/-C?/-)?/-????0??0'?40?dd?40UuAd ?40UuAd?40UuAd$?40UuAd ?40UuAd ?40UuAd ?40UuAd?40UuAd?40?dd ?40UuAd#?40UuAd)?40?dd???(1?1&?1?????????/??????????0 p????%?/?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????/? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??r?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????B?65TUSTEP: Internal data presentation and transformation???</?,/6?/-???X0?H06?40????dd???(1?16?1?????? ????  b? (?????b6???????? ?????? 8?????b???U?9?b???%?$ģbd0???o???????? w???????5TUSTEP uses internally a script tagging system with transliteration into ASCII which allows all data to be encoded in a human-readable and easily transmittable form TUSTEP has a module for importing from and exporting into the UCS (UTF8 and UTF16) Example: #r+Novij rafiqnij clovnik ykra^ins^bko%:^i movi#r- Transformation module allows use of other tagging systems and other transliteration schemes????/??/??/-=?/-[?/-???00? 0??40?ddT?40?dd=?40UuAd[?40?dd???(1?1??1????????2 ?0??????????0 p????%?0?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????0? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ???? ??W?  b? (?????b??????? ????????? 8?????b?????9?b????$ģbd0???A???$???? l???????'?5TUSTEP: Database publishing???</?,/?/-???X0?H0?40????dd???(1?1?1?????? ??6?  b? (?????b6???????? ?????? 8?????b?????9?b???^?$ģbd0???o???????? w??????5TUSTEP's typesetting module offers a high-quality, fast and easy way of publishing all or part of the database in paper (or pdf) form???h/?X/?/-i?/-????0??0?40????ddi?40?dd???(1?1??1????????? ?1??????????0 p????%?1?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????1? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??? ??G?  b? (?????b??????? ????????? 8?????b?????9?b???o?$ģbd0???A???$???? l???????? 5TUSTEP: CGI???</?,/ ?/-???X0?H0 ?40????dd???(1?1 ?1?????? ????  b? (?????b6???????? ?????? 8?????b??? ?9?b?????$ģbd0???o???????? w???????5Complete control over input and output forms Possibility to configure exactly the kind of search(es), e.g. exact matches only SoundEX "intelligent" fuzzy search "brute" fuzzy search that allows a number of different letters.???h/?X/m?/-w?/-????0??0.?40?dd??40?dd?40UuAd ?40UuAd?40UuAd??40UuAd???(1?1??1??????????3??????????0 p????%?3?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????3? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??j ? ??d?  b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????4?(5Multilinguality framework of the project???</?,/(?/-???X0?H0(?40????dd???(1?1(?1?????? ????  b? (?????b6???????? ?????? 8?????b???>?9?b????$ģbd0???o???????? w???????5Multiple language indexing multiple language documents/indexes Cross-language Searching Multiple language indexes/documents Automatic Query forwarding based on thesauri Automatic translation Multilingual information retrieval Translation Request Protocol Language and Character Encoding tagging XML as internal presentation of data Using XML language and charset tagging Metadata DC.Language definition????/??/?/-$?/-?/-Q?/-?/-@?/-(?/-%?/-'?/- ?/-?/-????0??0?40????dd$?40?dd?40????dd$?40?dd-?40?dd?40????dd#?40?dd?40?dd(?40????dd%?40?dd'?40UuAd ?40?dd?40UuAd???(1?1??1????????? ?4??????????0 p????%?4?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????4? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??X? ???  b? (?????b??????? ????????? 8?????b???t?9?b???D?$ģbd0???A???$???? l?????????42????dd????3?5???/?/???0?0???1?1?? ???  b? (?????b6???????? ?????? 8?????b???t?9?b???D?$ģbd0???o???????? w??????42????dd????3?5???/?/???0?0???1?1????? ?5??????????0 p????%?5?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????5? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??X? ???  b? (?????b??????? ????????? 8?????b???t?9?b???D?$ģbd0???A???$???? l?????????42????dd????3?5???/?/???0?0???1?1?? ???  b? (?????b6???????? ?????? 8?????b???t?9?b???D?$ģbd0???o???????? w??????42????dd????3?5???/?/???0?0???1?1????? ?6??????????0 p????%?6?????????L0 ???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ??(? ???? b? (?????b[????????????????? 8???????b???D?????6? ??? b? (?????b???o???X ????? 8?????b???t?:?b???D?$D?bd0???:????????; ??????42????dd?3 ?5???/?/???0?0???1?1??  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???b? ??X? ???  b? (?????b??????? ????????? 8?????b???t?9?b???D?$ģbd0???A???$???? l?????????42????dd????3?5???/?/???0?0???1?1?? ???  b? (?????b6???????? ?????? 8?????b???t?9?b???D?$ģbd0???o???????? w??????42????dd????3?5???/?/???0?0???1?1?????& ????????????????0 p!??$7????????f???????$7???3??????????????$7???___????????????? ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???????b? ??5? ???? (?????b?????' o????? 8???b ? ??\? b? (?????b??????? ????????? 8?????b?????9?b?????$ģbd0???A???$???? l???????,? 5Click to edit Master title style???</?,/ ?/-???X0?H0 ?40????dd???(1?1 ?1?????? ??^? b? (?????b6???????? ?????? 8?????b?????9?b?????$ģbd0???o???????? w????.?R5Click to edit Master text styles Second Level Third Level Fourth Level Fifth Level????/??/!?/- ?/- ?/- ?/- ?/-???x0?h0!?40????dd ?40?dd ?40UuAd ?40???dd ?40???dd???(1?1R?1?????? ???? b? (?????bk???????p????? 8?????b?????9?b?????$??bd0????????F???T????R?5?*???h/?X/?/ -?/ ?-???X0?H0?40????dd???@1?01?1?????1?? ??=? b? (?????b/?????p????? 8?????b?????9?b???e?$??bd0???h?????T???? ?5*???</?,/?/ ?-???X0?H0?40????dd???(1?1?1?? ???? b? (?????b??? p????? 8?????b??? ?9?b?????$??bd0?????` T??????5Slide2_*????/??/?/ -?/-?/?-???X0?H0?40????dd???@1?01?1?????1?? ??X? ???4?? ???? (?????b???????? 7???????? 8???b? ???? (?????b????Z???? i???????? 8???b? ???? ?????? ??? (?????????????????????????????????b? ? 8??b??????b? ??rz??????????????????????2???????????????????????????????2?????????????????????x???????????????????????z???????? ?? ? (????????v???????????????v??????? ?b? ? 8??b??????b? ??j z???????:??????????????? ???????????????u??????????????? ??????????????:???????y???????????????z???????? ?? ? (???????8??????????????8??????? ?b? ? 8??b??????b? ??j ???c???????????????????????????5???`???????????7???0??????????????????????????????0??????????????c???? ?? ? (???????????????????????????????? ?b? ? 8??b??????????b? ??j ??????????????????????????????????????????????????????????????????????????????????????????????????????????????????????0 p?  ????????.1999. Yu.Demchenko. TERENA???30Multilinguality in Indexing, Searching and Metadata???4x?$x??????????f??????? ???? (?????b????????0 p? 8???????b? ??. ? ???? (?????b????????/ o????? 8???????b ? ???? (?????b????/ o????? 8???b ? ??\? b? (?????b????`???\ 0???????? 8?????b?????9?b?????$ģbd0???????|???# ???????,? 5Click to edit Master title style???</?,/ ?/-???X0?H0 ?40????dd???(1?1 ?1?????? ??_?  b? (?????b*???????:?????? 8?????b?????9?b?????$ģbd0???c????????????/?#5Click to edit Master subtitle style???</?,/#?/-???X0?H0#?40????dd???(1?1#?1?????? ??=? b? (?????b????P????p????? 8?????b?????9?b???e?$??bd0???????l???T???? ?5*???</?,/?/?-???X0?H0?40????dd???(1?1?1?? ??=? b? (?????b#???o?p????? 8?????b?????9?b???e?$??bd0???\?????T???? ?5*???</?,/?/?-???X0?H0?40????dd???(1?1?1?? ???? b? (?????bjc~ p????? 8?????b?????9?b?????$??bd0????E T????Y? 5Slide 2_*???h/?X/?/-?/?-???X0?H0 ?40????dd???@1?01?1?????1????'???????????L0 c0???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???????b? ???? ??=?  b? (?????b???????????????????? 8?????b????? ?b???e?$ģbd0?????????????????????? ?5*???</?,/?/ ?-???X0?H0?40????dd???(1?1?1?? ??=? b? (?????b ????@???????? 8?????b????? ?b???e?$ģbd0???,????4??????? ?5*???</?,/?/ ?-???X0?H0?40????dd???(1?1?1?? ???? b? (?????b[????????????????? 8???????b???D??????? ??^? b? (?????b???o???X ????? 8?????b?????:?b?????$ģbd0???:????????; ????.?R5Click to edit Master text styles Second level Third level Fourth level Fifth level????/??/!?/ - ?/ - ?/ - ?/ - ?/ -???x0?h0!?40????dd ?40????dd ?40????dd ?40????dd ?40????dd???(1?1R?1?????? ??=? b? (?????b????? ????< ????? 8?????b????? #?b???e?$ģb#d0???????? ????< ???? ?5*???</?,/?/ ?-???X0?H0?40????dd???(1?1?1?? ??=? b? (?????b ? @< ????? 8?????b????? #?b???e?$ģb#d0???,? 4< ???? ?5*???</?,/?/ ?-???X0?H0?40????dd???(1?1?1????d?????????L0 ?-???H????.???/???0???4x?$x??????̙33???????? ???? (?????b????????L0 ? 8???b? ?????? ??????????0???42d2d>$ ????n???b ?b??$d???e?,? :"PowerPoint DocumentYuri DemchenkoMicrosoft (R) PowerPoint (R) Windows ?_????? ?????Oh??+'??0? px??? ? ( 4 @ LX`?? HTTP и CGI TP=D:\msoffice\Templates\Presentation Designs\International.potiYuri Demchenkop114MicrHeader????????9PP97_DUALSTORAGE" ??d?O?????)?????巾???巾Pictures???? ????????PowerPoint Document(???? ?????a???;?(`??` ??/? 0????DTimes New Roman??b??b$?b??0H?bH?b??0?DSymbolew Roman??b??b$?b??0H?bH?b??0 ?DMonotype Sorts??b??b$?b??0H?bH?b??0?  ??@  @@``???  @?n???" dd@?????????  @@``?? ?????dY?.*&3   !#% "$&)+-/*,.0  S ?~??????????1???????????0? ??????n?@???????8???????g??42d2dT?b??0L?b>$ ????????p?pp?0 ? <?4BdBd??b??b>? ???????-?41999. Yu.Demchenko. TERENA ?fMultilinguality in Indexing, Searching and MetadataO? ?=??1???/Multilinguality and cross-language searching?40/++?!??QMultilingual aspects in Indexing, Searching and Metadata (Resource Description)?RR?&??9Multilingual aspects in Indexing, Searching and Metadata?:: ?0 ??KIETF Model of Multilingual support in Internet Applications Electronic Mail Interactive applications Charset and Language tagging MIME types XML Language and Charset tagging DC language definition Metadata and RDF DC.Language Existing solutions TUSTEP Search Engines and Subject Gateways Multilingual framework for the REIS Project??<)C +,<)C +,?>e2 ~?*??;IETF Model of Multilingual support in Internet Applications??Electronic Mail Language Character Encoding Scheme Transfer Encoding Scheme Interactive applications WWW: HTTP/HTML http-equiv="Content-Type" Content="text/html; charset=euc-jp" XML/DOM LDAP and X.500 (?) ?~=?=???z# #?(8??!XML: Language and Charset tagging???pCharacter is atomic unit of text All ISO 10646 characters + TAB, CR, LF The mechanism for Encoding can vary for different characters All XML processors must accept UTF-8 and UTF-16 Character Encoding in Entities (XML 4.3.3) EncodingDecl : : = S  encoding Eq    EncName    |    EncName    ) <?xml encoding+ UTF-8 ?> <?xml encoding+ EUC-JP ?> Autodetection of Character Encoding Language identification (XML 2.12) Tag for identification of languages LanguageID : : = Langcode ( - Subcode) Langcode : : = ISO639Code | IanaCode | UserCode??!'m+?$#$Z!'m+?$#$Z???  ]    ? +??Charset and Language tagging????MIME types text, img, audio, video Charset = Character Set + Character Encoding Scheme Transfer Encoding Scheme base64 quoted-printable Language RFC 1766 ISO639-2 ?l e  e ???&7??&Language Definition in DC Metadata set? ??*<meta name =  DC.language scheme=  rfc1766  ISO639-2 content=  es > <meta name =  DC.title lang =  es content=  La Mesa y Silla Roja >??B?*9??Multilingual Subject Gateway??Developing multilingual subject gateways (SOSIG as example) SOSIG accept any languages evaluated for quality Translation should be coherent and checked Different language version should be equally well maintained SOSIG Cataloguing rules TITLE will be displayed in the first language ALTERNATIVE TITLE in other languages DESCRIPTION will mention different languages in which resource is available URI of all language versions Labeling URI language Library standards for multilingual provision NISO Z39.53 Language codes USMARC Language codes?Z<??-1<??-1?? k?+:??:Multilingual provision in popular Internet Search Engines???AltaVista Search in 25 languages Documents indexed as is Automatic translation - very simple and naive Other sites that have dedicated national sites interface language language resoures no special language policy Euroseek Excite Lycos Infoseek?~ .0@ .0@?6 ?  ?.;??:New Developments in Subject Gateways, Indexing, Searching??rNRENs projects Subject gateways Commercial Search Engines Multilingual Text Retrieval and Processing TUSTEP system?$ee? s ?0=?? NREN projects???Social Science Information Gateway http://sosig.esrc.bris.ac.uk/ ROADS Project Software/Documentation Server - http://www.roads.lut.ac.uk/ CHIP-Pilot (Clearing House for Internet Projects) - http://www.terena.nl/chip/ IMesh - International Collaboration on Internet Subject Gateways - http://www.desire.org/html/subjectgateways/community/imesh/ DFN Indexing and Searching projects - http://www.dfn.de/links/suchen.html X.500 Directory E-mail Addresses Search (AMBIX-D) - http://ambix.uni-tuebingen.de:8889 TUSTEP Munltilingual Textdata Processing and Fuzzy Searching - http://www.uni-tuebingen.de/zdv/tustep/tdv_eng.html IKEM Toolkit - http://bikit.rug.ac.be:80/ikem/ DRUID Classification Tools, University of Twente - http://twentyone.tpd.tno.nl/druid/???#?4*AAe 3 B . ,          ?1<??Search Engines news???CLEVER project at IBM Almaden Research Center - http://www.almaden.ibm.com/cs/k53/clever.html Cora Search Engine - http://www.cora.justresearch.com/about.html Google Search Engine - http://www.google.com/why_use.html Free AltaVista Search Intranet v2.3A Entry Level Software http://www.altavista.software.digital.com/search/intranet/free_3k/index.asp Ultraseek Server for Linux Platforms http://software.infoseek.com/products/ultraseek/linux/ultrareq.htm?D 2  7 8   ? ,??4TUSTEP TUebingen System of Text Processing Programs? $??/1. File structure 2. Multilingual capabilities 3. Internal data presentation 4. Database publishing/output data presentation 5. CGI 6. Sample implementation http://lddv.zdv.uni-tuebingen.de/cgi-bin/opac/zdvlit Try entries like Smith or Meier or... http://lddv.zdv.uni-tuebingen.de/cgi-bin/km/npquery ?Z?7&5?7&5??? 0  ?-??TUSTEP: File structure??#TUSTEP can handle basically all kinds of (explicitely or implicitely) structured text files) Special support for XML "Databases" (i. e. files with a repeated and regular structure) are only a special case of this. Fuzzy search and other retrieval actions can then be used to access the data?6^yM^yM?,*  ??.??!TUSTEP: Multilingual capabilities??&TUSTEP supports the following scripts: - Latin - Cyrillic - Greek (classical and modern) - Hebrew (with support for Yiddish) - Arabic - Estrangelo - Coptic - Old Church Slavonic More: Phonetics, Egyptian hieroglyphs allows use of combining diacritics Experimental: Indic scripts and Armenian ?Z'?C*'?C*?,? x?/??6TUSTEP: Internal data presentation and transformation???TUSTEP uses internally a script tagging system with transliteration into ASCII which allows all data to be encoded in a human-readable and easily transmittable form TUSTEP has a module for importing from and exporting into the UCS (UTF8 and UTF16) Example: #r+Novij rafiqnij clovnik ykra^ins^bko%:^i movi#r- Transformation module allows use of other tagging systems and other transliteration schemes?6?=\?=\?>`?0??TUSTEP: Database publishing???TUSTEP's typesetting module offers a high-quality, fast and easy way of publishing all or part of the database in paper (or pdf) form?$jj?$t?1?? TUSTEP: CGI???Complete control over input and output forms Possibility to configure exactly the kind of search(es), e.g. exact matches only SoundEX "intelligent" fuzzy search "brute" fuzzy search that allows a number of different letters.?$mxmx?,b [?3??(Multilinguality framework of the project???Multiple language indexing multiple language documents/indexes Cross-language Searching Multiple language indexes/documents Automatic Query forwarding based on thesauri Automatic translation Multilingual information retrieval Translation Request Protocol Language and Character Encoding tagging XML as internal presentation of data Using XML language and charset tagging Metadata DC.Language definition??$Q@(%' $Q@(%' ?,b ?4?? ?? ? 5??  ??  ?!6??  ??  /?L? ? ???????? ?" ?# ?$ ?%?'?)?,?-?/?2?3P???b???? 0`? ????????f??????`? ???3?????????????`? ???___?????????????>???" dd=??????????????" dd?=?????????????uA?4? d?O?" ?i ?n???" dd??????????   @@``P?P   4 O i`? p?@??@   _ ?W ?)?? ?( ? ??p ? ? ?H??????d???? ?'W??? ? ? ?Z??v?a????a?????????? ??x8???? v ?T?? Click to edit Master title style?!? !?: ? ? ?T?D v?a????a????????? ??Sf??? v ???RClick to edit Master text styles Second Level Third Level Fourth Level Fifth Level?!    ? S?, ?  ?`?? v?a????a??????????? ?? ????? v ?????*? ????=44OOii?  ?   ?`?!v?a????a??????????? ?? _???  v ?a??*? ???=44OOii?J ?!  ?`??!v?a????a??????????? ??!????? v ???Slide2_*?(  ??$??=44OOii?Z?F ?1?lY ?$ ??~???~ ?" ? ?N?????????2?????1?l$?~ ?# ? ?N?????????2?????1IlY??F ??? ?) ???c?8 ?% s ?B?C{DE?8F?@??????????????????@????????F??h??=?Zhz?zFz?\F3? @???????????????????0 ?& s ?B?C?DE?4F?<??????????????????@????? ????i??<?????<??#i?????@???????????????g?5?0 ?' s ?B-C?DE?4F?<??????????????????@????? ??o?????*l??,J??????Jz?o@???????????????Arn*? ?( ? ??BKCoDE?4F?<?????????? ??(%+(J27JQ+E%nEQ7@???????????????????H ? ? ?0??@??޽h??? ?? ??????????f?????? ?"International.pot?&? ?0 ??? ?% ?f?( ??4p? ~?p? ? ?^ ? ? ?6??????? ?@_??p ? ? ?H??????d???? ??_??? ? ? ?Z?$???a????a?????????? ???????? v ?T?? Click to edit Master title style?!? !?? ? ? ?Z?䵳?a????a?????????? ??HZjG ?? v ?W??#Click to edit Master subtitle style?$? $? ?  ?`?D???a????a??????????? ???????? v ?[??*????=44OOii? ?  ?`?????a????a??????????? ???S ???  v ?]??*????=44OOii?9 ?  ?`?d???a????a??????????? ???????? v ??? Slide 2_*?  ??$??=44OOii?H ? ? ?0??@??޽h??? ?? ??????????f?????????? 0 ??0??N?( ? ??  ? ? ?T?ķ??jJ??jJ??????? ???? ,G??  v ?q??*? ??? ??? ? ? ?T?$???jJ??jJ??????? ????l ?G?? v ?s??*? ??? ???p ? ? ?0?????1? ????? ?? v?: ? ? ?T??????g?ֳ??g?ֳ?????? ??? LL??? v ???RClick to edit Master text styles Second level Third level Fourth level Fifth level?!    ? S? ? ? ?Z?丳?jJ??jJ???????? ?? ,l??  v ?q??*? ??? ??? ? ? ?Z?D???jJ??jJ???????? ??l ?l?? v ?s??*? ??? ???H ? ? ?0??b?f?@???? ?? ??????̙33????????? ??@??0?( ? ??H ? ? ?0???b?f?@??? ?? ??????̙33???????????0 0?(P????( ? ??? ? # ?l?????g????g????????????? ? ??\e$?? v ? ??? ? # ?l????g????g????????????? ? ?| ??_?? v ? ??H ? ? ?0???@??޽h?? ?? ??????????f????????? ?0 0?(p?????( ? ???? ?? # ?l?ĺ??a????a????????????? ??x8????  v ? ??? ?? # ?l?$???a????a????????????? ??Sf??? v ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 0?(??????( ?? ???? ?? # ?l?D???a????a????????????? ??x8????  ? ? ??? ?? # ?l?????a????a????????????? ??Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ?????4??( ?% ?4?l ?4 C ??d?????x8????  ? ? ??l ?4 C ??Ľ????Sf??? ? ? ??H ?4 ? ?0???@??޽h?? ?? ??????????f????????? ?0 ????????( ?o  ???l ?? C ???????x8????  ? ? ??l ?? C ??$a????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ?????,??( ? ?,?l ?, C ???a????x8????  ? ? ??l ?, C ??Db????Sf??? ? ? ??H ?, ? ?0???@??޽h?? ?? ??????????f????????? ?0 ????<??( ? ?<?l ?< C ??dc????x8????  ? ? ??l ?< C ???c????Sf??? ? ? ??H ?< ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???0?@??( ?w@#@ ?@?l ?@ C ??De????x8????  ? ? ??l ?@ C ???e????Sf??? ? ? ??H ?@ ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???P?L??( ? ?L?l ?L C ??df????x8????  ? ? ??l ?L C ???f????Sf??? ? ? ??H ?L ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???p?X??( ?w@#@ ?X?l ?X C ???g????x8????  ? ? ??l ?X C ??Dh????Sf??? ? ? ??H ?X ? ?0???@??޽h?? ?? ??????????f????????? ?0 ?????T??( ? ?T?l ?T C ??i????x8????  ? ? ??l ?T C ??di????Sf??? ? ? ??H ?T ? ?0???@??޽h?? ?? ??????????f????????? ?0 ????????( ? ???l ?? C ???j????x8????  ? ? ??l ?? C ???j????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ????????( ? ???l ?? C ??l????x8????  ? ? ??l ?? C ??dl????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ?????     ????1 !"#$%&'(*????+,-./02????3???????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????( ? ???l ?? C ??d????x8????  ? ? ??l ?? C ???????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???????( ??? ???l ?? C ??D????x8????  ? ? ??l ?? C ???????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???0????( ?4?@@S@ ???l ?? C ??d?????x8????  ? ? ??l ?? C ????????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ? 0 ???P????( ?4?@@S@ ???l ?? C ????????x8????  ? ? ??l ?? C ????????Sf??? ? ? ??H ?? ? ?0???@??޽h?? ?? ??????????f????????? ? 0 ????? ??( ???@? ? ?l ?  C ??$?????x8????  ? ? ??l ?  C ????????Sf??? ? ? ??H ?  ? ?0???@??޽h?? ?? ??????????f????????? ? 0 ???????( ?4?@@S@ ??l ? C ????????x8????  ? ? ??l ? C ??$?????Sf??? ? ? ??H ? ? ?0???@??޽h?? ?? ??????????f????????? ? 0 ???????( ?w@#@ ??l ? C ????????x8????  ? ? ??l ? C ???????Sf??? ? ? ??H ? ? ?0???@??޽h?? ?? ??????????f????????? ?0 ???????( ?w@#@ ??l ? C ????????x8????  ? ? ??l ? C ??T?????Sf??? ? ? ??H ? ? ?0???@??޽h?? ?? ??????????f????????? 0 ??z`????( ????? ???R ?? 3 ??????? ??  v?? ?? C ??d?????? LL???  v ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????& 0 ??z?????( ??f?????? ???R ?? 3 ??????? ??  ??? ?? C ??仳???? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????* 0 ??z?????( ? ???R ?? 3 ??????? ??  ??? ?? C ???????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????+ 0 ??z?????( ? ???R ?? 3 ??????? ??  ??? ?? C ???a????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????, 0 ??z?????( ?8 ???R ?? 3 ??????? ??  ??? ?? C ???k????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????- 0 ??z?????( ??8 ?a  ???R ?? 3 ??????? ??  ??? ?? C ???l????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????. 0 ??z????( ????? ???R ?? 3 ??????? ??  ??? ?? C ???????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????/ 0 ??z ????( ? ???R ?? 3 ??????? ??  ??? ?? C ???????? LL???  ? ? ??  ?H ?? ? ?0???b?f?@??? ?? ??????̙33??????????0 0 ??z@???( ?8?  h ??R ? 3 ??????? ??  ??? ? C ??$?????? LL???  ? ? ??  ?H ? ? ?0???b?f?@??? ?? ??????̙33??????????1 0 ??z`???( ?@w@ ??R ? 3 ??????? ??  ??? ? C ??D?????? LL???  ? ? ??  ?H ? ? ?0???b?f?@??? ?? ??????̙33??????????3 0 ??z????( ? ??R ? 3 ??????? ??  ??? ? C ??d?????? LL???  ? ? ??  ?H ? ? ?0???b?f?@??? ?? ??????̙33??????????4 0 ??z?? ??( ? ? ?R ?  3 ??????? ??  ??? ?  C ??D?????? LL???  ? ? ??  ?H ?  ? ?0???b?f?@??? ?? ??????̙33??????????5 0 ??z??$??( ?  ?$?R ?$ 3 ??????? ??  ??? ?$ C ????????? LL???  ? ? ??  ?H ?$ ? ?0???b?f?@??? ?? ??????̙33??????????6 0 ??z?(??( ? ?(?R ?( 3 ??????? ??  ??? ?( C ????????? LL???  ? ? ??  ?H ?( ? ?0???b?f?@??? ?? ??????̙33??????????7 0 ??z?0??( ? ?0?R ?0 3 ??????? ??  ??? ?0 C ??c????? LL???  ? ? ??  ?H ?0 ? ?0???b?f?@??? ?? ??????̙33??????????8 0 ??z??8??( ??? ?8?R ?8 3 ??????? ??  ??? ?8 C ????????? LL???  ? ? ??  ?H ?8 ? ?0???b?f?@??? ?? ??????̙33??????????9 0 ??z ?D??( ? ?D?R ?D 3 ??????? ??  ??? ?D C ???d????? LL???  ? ? ??  ?H ?D ? ?0???b?f?@??? ?? ??????̙33??????????: 0 ??z@?H??( ?????<? ?H?R ?H 3 ??????? ??  ??? ?H C ??f????? LL???  ? ? ??  ?H ?H ? ?0???b?f?@??? ?? ??????̙33??????????; 0 ??z`?P??( ?| ?P?R ?P 3 ??????? ??  ??? ?P C ???g????? LL???  ? ? ??  ?H ?P ? ?0???b?f?@??? ?? ??????̙33??????????= 0 ??z??\??( ??O? ?\?R ?\ 3 ??????? ??  ??? ?\ C ???h????? LL???  ? ? ??  ?H ?\ ? ?0???b?f?@??? ?? ??????̙33??????????< 0 ??z??`??( ??? ?`?R ?` 3 ??????? ??  ??? ?` C ??$j????? LL???  ? ? ??  ?H ?` ? ?0???b?f?@??? ?? ??????̙33???????r???S?Z?;oL{[^?`?dr Pӄ??g?1???`?sō?u?w?ys{@??Y?#??`S}3??????K???fߝc???h?js?=?sl?Sn3pѦ???:? e?30SummaryInformation(?????DocumentSummaryInformation8????????????C?Current User????????????[.SummaryInformation(????????????\?osoft PowerPoint 7.0sen@???@??G?}Y?@`??X7?@?N??巾?G^????oZ  'T&?????? &????&#????TNPP??0D v & TNPP? &????&TNPP   ?? ????- ????"-- !???-- ?"---- !?S?E---&????t??&????-?-????- $t?t?vv????-? $v?v?xx????-? $x?x?{{????-? ${?{?}}????-? $}?}?????-? $????????-? $??????????-? $??????????-? $??????????-? $??????????-? $???????jj?-? $???????UU?-? $???????CC?-? $???????,,?-? $????????-? $??????-?--&????&????&????Gk?&???? ?- ????Times New Roman-? ???..2 ?Q1999. Yu.Demchenko. TERENA        .&????Gpy?&???? ??? ???.T2 ?+3Multilinguality in Indexing, Searching and Metadata           .&?????n??&???? ??? ???.2 ?ZSlide 2  . ???. 2 ??_ . ???. 2 ??1 .--BL?6-- ???????Times New Roman?-? .2 ?Multilinguality3.????Times New Roman?-? . 2 ??and.????Times New Roman?-? .-2 1?cross-language searching .--7 vj-- ????Times New Roman?-? .%2 ?.Multilingual aspects    . .02 ??in Indexing, Searching and       . .2 ?8Metadata  . .12 ?(i.e., Resource Description)         .--??"System-?&TNPP &?????? ??՜.??+,??D??՜.??+,??,??????? ? ?? ????  ??A4 Paper (210x297 mm)Noa??? Times New RomanSymbolMonotype SortsInternational.pot0Multilinguality and cross-language searching:Multilingual aspects in Indexing, Searching and Metadata<IETF Model of Multilingual support in Internet Applications"XML: Language and Charset taggingCharset and Language tagging'Language Definition in DC Metadata setMultilingual Subject Gateway;Multilingual provision in popular Internet Search Engines;New Developments in Subject Gateways, Indexing, SearchingNREN projectsSearch Engines news5TUSTEP TUebingen System of Text Processing ProgramsTUSTEP: File structure"TUSTEP: Multilingual capabilities7TUSTEP: Internal data presentation and transformationTUSTEP: Database publishing TUSTEP: CGI)Multilinguality framework of the projectNo Slide TitleNo Slide TitleNo Slide Title  Fonts UsedDesign Template Slide Titles? 6> _PID_GUID?AN{BCDA83A0-1BA4-11D3-B55F-006097FEF696}?&_???=??bYuri Demchenko?? ?????Oh??+'??0? px??? ? ( 4 @ LX`? HTTP ? CGIs TP=D:\msoffice\Templates\Presentation Designs\International.potoYuri Demchenkop114Microsoft PowerPoint 7.0sen@?&?@??G?}Y?@`??X7?@???巾?G^????oZ  'T&?????? &????&#????TNPP??0D v & TNPP? &????&TNPP   ?? ????- ????"-- !???-- ?"---??????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????- !?S?E---&????t??&????-?-????- $t?t?vv????-? $v?v?xx????-? $x?x?{{????-? ${?{?}}????-? $}?}?????-? $????????-? $??????????-? $??????????-? $??????????-? $??????????-? $???????jj?-? $???????UU?-? $???????CC?-? $???????,,?-? $????????-? $??????-?--&????&????&????Gk?&???? ?- ????Times New Roman-? ???..2 ?Q1999. Yu.Demchenko. TERENA        .&????Gpy?&???? ??? ???.T2 ?+3Multilinguality in Indexing, Searching and Metadata           .&?????n??&???? ??? ???.2 ?ZSlide 2  . ???. 2 ??_ . ???. 2 ??1 .--BL?6-- ???????Times New Roman?-? .2 ?Multilinguality3.????Times New Roman?-? . 2 ??and.????Times New Roman?-? .-2 1?cross-language searching .--7 vj-- ????Times New Roman?-? .%2 ?.Multilingual aspects    . .02 ??in Indexing, Searching and       . .2 ?8Metadata  . .12 ?(i.e., Resource Description)         .--??"System-?&TNPP &????sc?? ??՜.??+,??0? hp???? ? ? ?? ? X?A4 Paper (210x297 mm).0? DocumentSummaryInformation8??????????????????????????????????????????????????Times New RomanSymbolMonotype SortsInternational.pot0Multilinguality and cross-language searching:Multilingual aspects in Indexing, Searching and Metadata<IETF Model of Multilingual support in Internet Applications"XML: Language and Charset taggingCharset and Language tagging'Language Definition in DC Metadata setMultilingual Subject Gateway;Multilingual provision in popular Internet Search Engines;New Developments in Subject Gateways, Indexing, SearchingNREN projectsSearch Engines news5TUSTEP TUebingen System of Text Processing ProgramsTUSTEP: File structure"TUSTEP: Multilingual capabilities7TUSTEP: Internal data presentation and transformationTUSTEP: Database publishing TUSTEP: CGI)Multilinguality framework of the projectNo Slide TitleNo Slide TitleNo Slide Title  Fonts UsedDesign Template Slide Titles