unicode + HTML

NET_Stranger

кто баловался с unicode?.. как думаете, можно ли написать HTML страничку с Unicode символами, которая будет доступна(см. ниже) 80% пользователей интернета? Речь идет о русскоязычных пользователях windows, а задача состоит в том, чтобы прочитать HTML на старорусском языке?..

maggi14

Нет. Если ты имеешь в виду старославянский или, тем более, глаголицу, то учти, что у многих пользователей нет соответствующих уникодовских шрифтов. Точнее, диапазонов. Хотя нормальные символы (греческие, кана, большая часть диакритики, кириллица, в т.ч. западнославянская, большая часть таблицы IPA, еврейское квадратное письмо и т.д.) у большинства отображаются, со старославянским, грузинским, деванагари и рядом других популярных алфавитов часто возникают проблемы. Глаголицы вообще нет в стандартных шрифтах, по-моему, она не предусмотрена Уникодом.

artimon

Единственный шрифт со старославянскими буквами, который у меня есть — Arial Unicode MS, который поставляется с MS Office.
Так что вряд ли.

Dasar

В unicode про Cyrillic написанно следующее:
Cyrillic: U+0400–U+04FF
The Cyrillic script is one of several scripts that were derived from the Greek script. Cyrillic
has traditionally been used for writing various Slavic languages, among which Russian is
predominant. In the nineteenth and early twentieth centuries, Cyrillic was extended to
write the non-Slavic minority languages of Russia and neighboring countries. The Cyrillic
script is written in linear sequence from left to right with the occasional use of nonspacing
marks. Cyrillic letters come in upper- and lowercase pairs, with the exception of U+04C0
  , which has no lowercase form.
Standards. The Cyrillic block of the Unicode Standard is based on ISO/IEC 8859-5. The
Unicode Standard encodes Cyrillic characters in the same relative positions as in ISO/IEC
8859-5.
Historic Letters. The historical form of the Cyrillic alphabet is treated as a font style varia-tion
of modern Cyrillic because the historical forms are relatively close to the modern
appearance, and because some of them are still in modern use in languages other than Rus-sian
(for example, U+0406 “I”    is used in modern Ukrainian and
Byelorussian). Some of the letters in this range were used in modern typefaces in Russian
and Bulgarian. Prior to 1917, Russian made use of yat, fita, and izhitsa; prior to 1945, Bul-garia
made use of these three as well as the big yus.
Extended Cyrillic. These letters are used in alphabets for Turkic languages such as Azer-baijani,
Bashkir, Kazakh, and Tatar; for Caucasian languages such as Abkhasian, Avar, and
Chechen; and for Uralic languages such as Mari, Khanty, and Kildin Sami. The orthogra-phies
of some of these languages have often been revised in the past; some of them have
switched from Arabic to Latin to Cyrillic, and back again. Azerbaijani, for instance, is now
officially using a Turkish-based Latin script.
Glagolitic. The history of the creation of the Slavic scripts and their relationship has been
lost. The Unicode Standard regards Glagolitic as a separate script from Cyrillic, not as a font
change from Cyrillic. This position is taken primarily because Glagolitic appears unrecog-nizably
different from Cyrillic, and secondarily because Glagolitic has not grown to match
the expansion of Cyrillic. The Glagolitic script is not currently supported by the Unicode
Standard.

Cyrillic Supplement: U+0500–U+052F
Komi. The characters in the range U+0500..U+050F are found in ISO 10754, and were
used in Komi Cyrillic orthography from 1919 to about 1940. These letters use glyphs that
differ structurally from other characters in the Unicode Standard that represent similar
sounds—namely Serbian v and w , which are ligatures of base letters x and y with a palatal-izing
soft sign z . The Molodtsov orthography made use of a different kind of palatalization
hook for Komi { , | , } , ~ , and so on.

NET_Stranger

я со знакомым решили, что для реализации этого проекта нужно купить фонт на заказ, пользователь перед прочтением текста будет инсталлировать себе фонт, скачав его с сайта.

maggi14

Ну, так люди и делают. В принципе, еще можно заставить пользователя установить Ариал Уникод. Но с глаголицей все равно облом. А шрифты (разные) я могу предоставить.
Оставить комментарий
Имя или ник:
Комментарий: