|
jawbin
高级用户
   
积分 994
发帖 444
注册 2005-1-29
状态 离线
|
|
2006-5-25 13:19 |
|
|
hj_18
中级用户
  
积分 255
发帖 57
注册 2003-6-26
状态 离线
|
『第 17 楼』:
那么就ttf2pcf好了
使用 LLM 解释/回答一下
找个有完整的韩文字符的ttf字库,用ttf2bdf转换一下好了,Unix/Linux下还有别的转换字体的工具,比ttf2bdf要好,比如snftobdf,google一下好了,我还没试过,不过效果不一定比手工制作的好。
Looking for a complete Korean character TTF font library, then convert it with ttf2bdf. Under Unix/Linux, there are other tools to convert fonts, which are better than ttf2bdf, such as snftobdf. Just Google it. I haven't tried it yet, but the effect may not be better than handmade ones.
|
|
2006-5-26 10:31 |
|
|
asbai
高级用户
   
积分 653
发帖 252
注册 2006-4-16
状态 离线
|
『第 18 楼』:
使用 LLM 解释/回答一下
Originally posted by hj_18 at 2006-5-26 10:31:
找个有完整的韩文字符的ttf字库,用ttf2bdf转换一下好了,Unix/Linux下还有别的转换字体的工具,比ttf2bdf要好,比如snftobdf,google一下好了,我还没试过@...
。。。。sigh,hj兄,俺说韩文只是举个例子而已 <img src="images/smilies/face-wink.png" align="absmiddle" border="0">
其实俺想要的是支持完整unicode字符集的,没有版权问题的字库。转换工具是比较好找,实在不行就只好放弃版权问题用M$的Unicode字库啦,呵呵
Originally posted by hj_18 at 2006-5-26 10:31:
Find a complete Korean character ttf font, then convert it with ttf2bdf. Under Unix/Linux, there are other font conversion tools better than ttf2bdf, such as snftobdf. Just Google it. I haven't tried it yet @...
......sigh, Brother hj, I just used Korean as an example ;)
Actually, what I want is a font that supports the complete Unicode character set and has no copyright issues. Conversion tools are relatively easy to find. If it really doesn't work, I'll have to give up the copyright issue and use Microsoft's Unicode fonts, heh heh
|
|
2006-5-27 06:31 |
|
|
hj_18
中级用户
  
积分 255
发帖 57
注册 2003-6-26
状态 离线
|
『第 19 楼』:
使用 LLM 解释/回答一下
没有什么“支持完整unicode字符集的,没有版权问题的字库”。
顶多用几个没有版权问题的字库,拼凑一个足够大的字库。
unicode只规定字符如何编码,与字符是如何显示的之间并无关系。
There is no such thing as "a font library that supports the complete Unicode character set and has no copyright issues." At most, use a few font libraries with no copyright issues to piece together a sufficiently large font library. Unicode only specifies how characters are encoded, and there is no relationship between how characters are displayed.
|
|
2006-5-30 08:05 |
|
|
asbai
高级用户
   
积分 653
发帖 252
注册 2006-4-16
状态 离线
|
『第 20 楼』:
使用 LLM 解释/回答一下
Originally posted by hj_18 at 2006-5-30 08:05:
没有什么“支持完整unicode字符集的,没有版权问题的字库”。
顶多用几个没有版权问题的字库,拼凑一个足够大的字库。
unicode只规定字符如何编砠...
不知道hj_18兄专门题字符集编码是啥意思?俺只说“unicode字符集”没有提到字符集编码的问题,呵呵。
unicode字符集编码一般有 ucs-2/ucs-4,utf-8/utf-16/utf-32 几种,由于没有规定字节序所以有多了几种变体。
一般支持完整 unicode 4.0 字符集的商用字体都至少用 ucs-4 编码。不过由于 unicode 字符集编码在设计时考虑的比较周到,在几种编码之间转换非常高效而且不需要查表~
我说至少的原因是很多字体格式可以为每个字符提供多种相应编码,比如为一个CJK汉字同时提供它的 ucs-4/GBK/Shift-JIS 对应编码等等。
PS: 如果hj兄见过不使用 ucs/utf 编码来映射,但是支持完整 unicode 字符集的字体,不防贡献出来分享一下。不用4.0,能完整 cover unicode 2.0 俺就知足了,呵呵。
Last edited by asbai on 2006-5-30 at 13:54 ]
Originally posted by hj_18 at 2006-5-30 08:05:
There is no "font library that supports the complete Unicode character set and has no copyright issues".
At most, a sufficiently large font library is pieced together using several font libraries with no copyright issues.
Unicode only stipulates how characters are encoded...
I don't know what brother hj_18 specifically means by character set encoding? I only mentioned the "Unicode character set" and didn't mention the issue of character set encoding, heh heh.
Unicode character set encodings generally include UCS-2/UCS-4, UTF-8/UTF-16/UTF-32, etc. Due to not stipulating the byte order, there are several more variants.
Generally, commercial fonts that support the complete Unicode 4.0 character set use at least UCS-4 encoding. However, since the Unicode character set encoding was considered relatively thoughtful during design, conversion between several encodings is very efficient and no look-up table is needed~
The reason I said at least is that many font formats can provide multiple corresponding encodings for each character, such as providing its UCS-4/GBK/Shift-JIS corresponding encodings for a CJK Chinese character, etc.
PS: If Brother hj has seen a font that supports the complete Unicode character set without using UCS/UTF encoding, feel free to contribute and share it. Even if it doesn't use 4.0 and can fully cover Unicode 2.0, I'll be satisfied, heh heh.
Last edited by asbai on 2006-5-30 at 13:54 ]
|
|
2006-5-30 13:50 |
|
|
Roy
管理员
          專業島民
积分 4869
发帖 1633
注册 2002-12-10
状态 离线
|
『第 21 楼』:
使用 LLM 解释/回答一下
Unicode字形的用ISO 10646儲存的啊
ISO 10646標準
ISO 10646是一個國際標準編號,該標準的英文全稱為:Information Technology - Universal Multiple - Octet Coded Character Set,簡稱UCS。中文全稱為:信息技術--通用多八位編碼字符集,亦稱大字符集。這一標準為世界各種主要語文的字符(包括繁體及簡體的中文字)及附加符號,編訂統一的內碼。
ISO 10646標準由國際標準化組織ISO頒布,用來實現全球所有文種的統一編碼。該標準被廣泛應用於電子化地表示、傳輸、交換、處理、儲存、輸入及顯現世界上各種語言的書面形式以及附加符號。
國際標準化組織於一九九三年發表 ISO 10646 國際編碼標準的首個版本,全名是 ISO/IEC 10646 第一部分「ISO/IEC 10646-1:1993」。它收錄了 20902 個表意字符。二零零零年十月發表了第一部分的新版,新增收了6,582個表意字符於擴展區A裡。ISO/IEC 10646 的第二部分在 2001 年發表,增收了 42711個表意字符於擴展區B裡。
統一碼與ISO 10646
統一碼(Unicode)是Universal Code的縮寫,由一個名為 Unicode 學術學會的機構制訂的字符編碼系統,以支持世界主要語言的書面文本的交換、處理及顯示。統一碼亦支持不同語言的傳統性 及歷史性原文。
統一碼與ISO 10646國際編碼標準從內容上來說是同步一致的。在一九九一年,Unicode學術學會與ISO國際標準化組織決定共同制訂一套適用於多種語言文本的通用編碼標準。統一碼與ISO10646國際編碼標準於一九九二年一月正式合作發展一套通用編碼標準。自此以後,兩個組織便一直緊密合作,同步發展統一碼及 ISO 10646 國際編碼標準。
統一碼3.0版本與ISO 10646 國際編碼標準所包含的字符及使用的編碼是相同的,包括東亞的漢字字符。統一碼3.0版本已於二零零零年二月由Unicode學術學會正式推出,當中收納了 49,194個來自世界各地不同語言的字符,其中包含27,484個漢字字符。此外,國際標準化組織提供 ISO 10646 國際編碼標準內的字符及編碼資料,Unicode 學術學會則對這些字符及編碼資料提出應用的方法以及語義資料作補充。
在統一碼3.1版已於2001年發表,除了Unicode 3.0 版中的49194個字符,又增加了44,946個新字符,其中42,711個為漢字字符。
Unicode glyphs stored with ISO 10646
ISO 10646 Standard
ISO 10646 is an international standard number. The full English name of this standard is: Information Technology - Universal Multiple - Octet Coded Character Set, abbreviated as UCS. The full Chinese name is: Information Technology - Universal Multiple - Octet Coded Character Set, also known as the large character set. This standard formulates unified internal codes for characters (including traditional and simplified Chinese characters) and additional symbols of various major languages in the world.
The ISO 10646 standard is promulgated by the International Organization for Standardization (ISO) to achieve unified coding of all scripts in the world. This standard is widely used in the electronic representation, transmission, exchange, processing, storage, input and display of written forms of various languages in the world and additional symbols.
The International Organization for Standardization published the first version of the ISO 10646 international coding standard in 1993. The full name is ISO/IEC 10646 Part 1 "ISO/IEC 10646-1:1993". It contains 20,902 ideographic characters. In October 2000, a new version of Part 1 was released, adding 6,582 ideographic characters to Extension A. Part 2 of ISO/IEC 10646 was released in 2001, adding 42,711 ideographic characters to Extension B.
Unicode and ISO 10646
Unicode is an abbreviation of Universal Code. It is a character coding system formulated by an organization called the Unicode Consortium to support the exchange, processing and display of written texts in major languages of the world. Unicode also supports traditional and historical original texts in different languages.
In terms of content, Unicode and the ISO 10646 international coding standard are synchronized and consistent. In 1991, the Unicode Consortium and the ISO International Organization for Standardization decided to jointly formulate a general coding standard suitable for text in multiple languages. Unicode and the ISO 10646 international coding standard formally cooperated to develop a general coding standard in January 1992. Since then, the two organizations have been closely cooperating and synchronously developing the Unicode and ISO 10646 international coding standards.
The Unicode version 3.0 and the ISO 10646 international coding standard contain the same characters and use the same coding, including East Asian Chinese character characters. The Unicode version 3.0 was officially launched by the Unicode Consortium in February 2000, which contains 49,194 characters from different languages around the world, including 27,484 Chinese character characters. In addition, the International Organization for Standardization provides the characters and coding data in the ISO 10646 international coding standard, and the Unicode Consortium supplements the application methods and semantic data for these characters and coding data.
The Unicode version 3.1 was released in 2001. In addition to the 49,194 characters in the Unicode version 3.0, 44,946 new characters were added, of which 42,711 are Chinese character characters.
|

我的網站:http://mw16.2ya.com/ 我的網誌: http://scrappedblog.blogspot.com/
~
我的Winamp正在播放的歌曲:
 |
|
2006-5-30 23:30 |
|
|
fdsiuha
高级用户
    闷
积分 587
发帖 302
注册 2005-7-25
状态 离线
|
|
2006-6-21 23:50 |
|
|