China DOS Union

-- Unite DOS · Advance DOS · Grow DOS --

Union site: www.cn-dos.net Forum site: www.cn-dos.net/forum
DOS stands for freedom, openness and progress. Let us work hard, learn from the openness and GNU spirit of FreeDOS and Linux, and together build and grow a free GNU GPL world!

National Standard GB18030-2005 "Information Technology - Chinese Coding Character Set"
Original Poster Posted 2016-06-25 20:14 · Haikou, Hainan, China (China Telecom)
Super Moderator
★★★★
Credits 3,673
Posts 2,020
Joined 2016-02-01 00:00
10-year member
UID 181465
Gender Male
Status Offline
Language Text Tools
GB18030 coding dictionary (one needs to be compiled that can circulate online, is searchable, and can convert between number systems)
Xinhua Dictionary
Chinese Dictionary
China Encyclopedia
Chinese Character Research Category
The main focus is help articles and entries that are expressed simply and accurately, leaving little or no room for ambiguity when the meanings of characters and words are converted, or that give direct definitions and new meanings.
Use GB18030 to build a text tool?

Hu Zheng's StarDict and its dictionaries: add a GB18030 translation dictionary?

Chinese Standard Name: Information Technology - Chinese Coding Character Set
http://www.gb688.cn/bzgk/gb/newGbInfo?hcno=C344D8D120B341A8DD328954A9B27A99

Total number of code points: 128 (1-byte) + 23,940 (2-byte) + 1,587,600 (4-byte) = 1,611,668
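The three terms of that sum can be recomputed from the byte ranges given in the encyclopedia text quoted later in this thread. The following is only a sketch; the function names are made up for illustration and are not part of any library.

```c
#include <stdio.h>

/* Code positions per byte-length class, derived from the GB18030 byte ranges. */
long gb18030_one_byte(void)
{
    return 0x7F - 0x00 + 1;              /* 0x00..0x7F: 128 positions */
}

long gb18030_two_byte(void)
{
    long lead  = 0xFE - 0x81 + 1;        /* 126 possible first bytes */
    long trail = (0x7E - 0x40 + 1)       /* 63 second bytes in 0x40..0x7E */
               + (0xFE - 0x80 + 1);      /* plus 127 in 0x80..0xFE: 190 */
    return lead * trail;
}

long gb18030_four_byte(void)
{
    long wide   = 0xFE - 0x81 + 1;       /* bytes 1 and 3: 126 values each */
    long narrow = 0x39 - 0x30 + 1;       /* bytes 2 and 4: 10 values each */
    return wide * narrow * wide * narrow;
}

/* Prints the three classes and their sum on one line. */
void gb18030_print_totals(void)
{
    long a = gb18030_one_byte();
    long b = gb18030_two_byte();
    long c = gb18030_four_byte();
    printf("%ld + %ld + %ld = %ld\n", a, b, c, a + b + c);
}
```

Calling gb18030_print_totals() from a main function prints the sum in the same order as the line above.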
GB: Interactive query of Chinese characters and codes
http://www.qqxiuzi.cn/bianma/zifuji.php

Online Number System Conversion
http://tool.oschina.net/hexconvert/

http://upload.cn-dos.net/img/2189.rar

Online Chinese Dictionary
http://xh.5156edu.com/html3/8416.html

Digging Deeper into Programming : Jacky Lin
https://zhuanlan.zhihu.com/paogenjiudi

[ Last edited by zzz19760225 on 2018-7-19 at 23:31 ]
1<word>, 2, 3/paragraph\, 4{section}, 5(chapter).
Floor 2 Posted 2016-06-25 21:20 · Haikou, Hainan, China (China Telecom)
No.  Char  Gloss     GB18030  GB2312  BIG5  GBK   Unicode
1    前    Front     C7B0     C7B0    AB65  C7B0  524D
2    后    Back      BAF3     BAF3    A65A  BAF3  540E
3    左    Left      D7F3     D7F3    A5AA  D7F3  5DE6
4    右    Right     D3D2     D3D2    A56B  D3D2  53F3
5    上    Up        C9CF     C9CF    A457  C9CF  4E0A
6    下    Down      CFC2     CFC2    A455  CFC2  4E0B
7    内    Inside    C4DA     C4DA    None  C4DA  5185
8    外    Outside   CDE2     CDE2    A57E  CDE2  5916
9    中    Middle    D6D0     D6D0    A4A4  D6D0  4E2D

10   添    Add       CCED     CCED    B24B  CCED  6DFB
11   存    Save      B4E6     B4E6    A673  B4E6  5B58
12   清    Clear     C7E5     C7E5    B24D  C7E5  6E05
13   序    Sequence  D0F2     D0F2    A7C7  D0F2  5E8F
14   行    Line      D0D0     D0D0    A6E6  D0D0  884C
15   列    Column    C1D0     C1D0    A643  C1D0  5217
16   位    Position  CEBB     CEBB    A6EC  CEBB  4F4D
17   显    Display   CFD4     CFD4    None  CFD4  663E
18   搜    Search    CBD1     CBD1    B76A  CBD1  641C

19   是    Is        CAC7     CAC7    AC4F  CAC7  662F
20   否    Whether   B7F1     B7F1    A75F  B7F1  5426
21   或    Or        BBF2     BBF2    A9CE  BBF2  6216
22   时    Time      CAB1     CAB1    None  CAB1  65F6
23   道    Way       B5C0     B5C0    B944  B5C0  9053
24   名    Name      C3FB     C3FB    A657  C3FB  540D
25   实    Real      CAB5     CAB5    None  CAB5  5B9E
26   一    One       D2BB     D2BB    A440  D2BB  4E00
27

Merge the relative relationships of front and back.
Front = -Back
Left = -Right
Up = -Down
Inside is not equal to -Outside?
One person's relative concepts have to be reconciled with another person's: self-interest, good and evil, advantages and disadvantages, nature itself. Nature is a kind of surface flow of appearances that represents the overall order of energy and structure behind it; nature and the natural merge, and things form naturally. Chaos against chaos? Each small cycle also carries some of the names and realities of the whole universe. The enumerated relative concepts are merged into positive and negative forms, together with the existence of real time and space.
The 26 entries can be mapped directly onto the 26 English letter keys; no adjustment is required.
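One possible way to hold the 26 entries is a small array, with entry n placed on the n-th letter key. That key assignment is only an assumption about what the post intends, and the names table26, key_for_entry and entry_for_key are hypothetical. The codes are the GB18030 and Unicode values listed above.

```c
#include <stddef.h>

/* One row of the list above: two-byte GB18030 code and Unicode code point. */
struct entry {
    unsigned gb;
    unsigned ucs;
};

static const struct entry table26[26] = {
    {0xC7B0, 0x524D}, {0xBAF3, 0x540E}, {0xD7F3, 0x5DE6}, {0xD3D2, 0x53F3},
    {0xC9CF, 0x4E0A}, {0xCFC2, 0x4E0B}, {0xC4DA, 0x5185}, {0xCDE2, 0x5916},
    {0xD6D0, 0x4E2D}, {0xCCED, 0x6DFB}, {0xB4E6, 0x5B58}, {0xC7E5, 0x6E05},
    {0xD0F2, 0x5E8F}, {0xD0D0, 0x884C}, {0xC1D0, 0x5217}, {0xCEBB, 0x4F4D},
    {0xCFD4, 0x663E}, {0xCBD1, 0x641C}, {0xCAC7, 0x662F}, {0xB7F1, 0x5426},
    {0xBBF2, 0x6216}, {0xCAB1, 0x65F6}, {0xB5C0, 0x9053}, {0xC3FB, 0x540D},
    {0xCAB5, 0x5B9E}, {0xD2BB, 0x4E00}
};

/* Assumed assignment: entry 1 on 'a', entry 2 on 'b', ..., entry 26 on 'z'.
   Relies on the letters being contiguous, as they are in ASCII. */
char key_for_entry(int n)
{
    return (n >= 1 && n <= 26) ? (char)('a' + n - 1) : '\0';
}

/* Returns the entry bound to a lowercase letter key, or NULL for other keys. */
const struct entry *entry_for_key(char key)
{
    if (key < 'a' || key > 'z')
        return NULL;
    return &table26[key - 'a'];
}
```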

[ Last edited by zzz19760225 on 2017-11-28 at 11:54 ]
Floor 3 Posted 2016-06-26 18:27 · Haikou, Hainan, China (China Telecom)
gb18030

The national standard GB18030-2005 "Information Technology Chinese Coding Character Set" is the most important Chinese character coding standard in China after GB2312-1980 and GB13000.1-1993, and it is one of the basic standards that China's computer systems must follow. There are two versions of GB18030: GB18030-2000 and GB18030-2005. GB18030-2000 is the replacement version of GBK. Its main feature is that it adds the Chinese characters of CJK Unified Ideographs Extension A on the basis of GBK. The main feature of GB18030-2005 is that it adds the Chinese characters of CJK Unified Ideographs Extension B on the basis of GB18030-2000.

Chinese Name: "Information Technology Chinese Coding Character Set"
National Standard Number: GB18030
Release Date: 2000-3-17; 2005-11-8
Issuing Institution: State Bureau of Quality and Technical Supervision

Table of Contents

1 Concept Introduction
2 Version Differences
3 Scope of Application
4 Requirements Met
5 Overall Structure

6 Character Set
▪ GB18030-2000 Character Set
▪ GB18030-2005 Character Set
7 Chinese Characters
▪ GB18030-2000 Chinese Characters
▪ GB18030-2005 Chinese Characters

8 Version Changes
9 Formulating Standards
10 Standard Process

Concept Introduction
The national standard GB18030-2000 "Information Technology - Chinese Character Coded Character Set for Information Interchange - Extension of the Basic Set" is the most important Chinese character coding standard in China after GB2312-1980 and GB13000.1-1993, and it is one of the basic standards that China's computer systems must follow.
The GB18030-2000 coding standard was jointly released by the Ministry of Information Industry and the State Bureau of Quality and Technical Supervision on March 17, 2000, and was scheduled to be formally enforced as a national standard from January 2001.
GB18030-2005 "Information Technology Chinese Coding Character Set" is a mandatory standard for an ultra-large Chinese coding character set independently developed in China, mainly containing Chinese characters and including various ethnic minority characters in China (such as Tibetan, Mongolian, Dai, Yi, Korean, Uyghur, etc.), and more than 70,000 Chinese characters are included.
Version Differences
The latest version of GB18030 is GB18030-2005. The coding system structures of GB18030-2005 and GB18030-2000 are completely the same. Then, what are the differences and connections between the 2000 version and the 2005 version of GB18030?
GB18030-2000 was released in 2000 under the full title "Information Technology - Chinese Character Coded Character Set for Information Interchange - Extension of the Basic Set". It specifies only the coding of commonly used non-Chinese symbols and 27,533 Chinese characters (including radicals, components, etc.).
GB18030-2000 is mandatory in full, and products sold on the market must comply with it. GB18030-2005, released in 2005, adds the coding of 42,711 Chinese characters and of various ethnic minority scripts in China on top of GB18030-2000. The added content is recommended rather than mandatory, while the content carried over from GB18030-2000 remains mandatory and products sold on the market must comply with it. GB18030-2005 is therefore a partially mandatory standard, and it replaced GB18030-2000 from its date of release. The mandatory parts of GB18030-2005 are the single-byte part, the double-byte part, and the CJK Unified Ideographs Extension A portion of the four-byte part (0x8139EE39 to 0x82358738).
Scope of Application
Article 1 of the GB 18030 standard stipulates: "This standard is applicable to the processing, exchange, storage, transmission, display, input, and output of graphic character information." Simply put, GB18030 can be used for all information processing products that handle Chinese (including Chinese characters and ethnic minority scripts) information, especially Chinese character information. The GB18030-2005 standard can be applied to software products for Chinese processing, such as operating systems, databases, middleware, office software, financial software, CAD software, spreadsheet software, education software, font libraries, etc. The GB18030-2005 standard can also be applied to hardware products with Chinese character processing functions, such as printers, mobile phones, PDA products, etc.
Requirements Met
Q: What requirements must a product meet to be in compliance with GB 18030?
A: A product that meets both of the following requirements complies with the mandatory part of GB 18030-2005:
(1) The product can correctly input, output, and process all Chinese characters specified in the mandatory part of GB 18030-2005;
(2) The product can correctly identify the codes corresponding to all Chinese characters specified in the mandatory part of GB 18030-2005.
Overall Structure
The standard codes characters in three ways: single-byte, double-byte, and four-byte. The single-byte part adopts the coding structure and rules of GB/T 11383 and uses code positions 0x00 to 0x7F (the same positions as ASCII). In the double-byte part, the first byte ranges from 0x81 to 0xFE, and the second byte ranges from 0x40 to 0x7E or from 0x80 to 0xFE. The four-byte part uses 0x30 to 0x39, which is not used in GB/T 11383, as the suffix that extends the double-byte coding. The extended four-byte coding ranges from 0x81308130 to 0xFE39FE39: the first and third bytes both range from 0x81 to 0xFE, and the second and fourth bytes both range from 0x30 to 0x39. The overall code-position structure is shown in a figure on the original page.
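The byte ranges above are enough to tell how long a sequence is from its first two bytes. A minimal sketch, assuming the caller passes a pointer to the start of a sequence and the number of bytes available; gb18030_seq_len is a hypothetical name, and the function checks structure only, not whether the code position is actually assigned.

```c
#include <stddef.h>

/* Returns 1, 2 or 4 for a structurally valid GB18030 sequence starting at p,
   or 0 if the bytes are out of range or the buffer ends too early. */
int gb18030_seq_len(const unsigned char *p, size_t n)
{
    if (n == 0)
        return 0;
    if (p[0] <= 0x7F)                      /* single-byte part */
        return 1;
    if (p[0] < 0x81 || p[0] > 0xFE)        /* 0x80 and 0xFF never start a code */
        return 0;
    if (n < 2)
        return 0;
    if ((p[1] >= 0x40 && p[1] <= 0x7E) || (p[1] >= 0x80 && p[1] <= 0xFE))
        return 2;                          /* double-byte part */
    if (p[1] >= 0x30 && p[1] <= 0x39) {    /* four-byte part */
        if (n < 4)
            return 0;
        if (p[2] >= 0x81 && p[2] <= 0xFE && p[3] >= 0x30 && p[3] <= 0x39)
            return 4;
    }
    return 0;
}
```

A text tool that walks a GB18030 buffer can advance by the returned length and treat 0 as a decoding error.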
Character Set
GB18030-2000 Character Set
The character set part of GB18030-2000 is written as follows:
The characters included in this standard are coded in single-byte, double-byte, and four-byte respectively.
1. Single-byte part
In this standard, the single-byte part includes all 128 characters from 0x00 to 0x7F of GB 11383 and the single-byte coded euro symbol.
2. Double-byte part
In this standard, the double-byte part includes the following content:
All CJK Unified Ideographs of GB 13000.1.
21 Chinese characters selected from the CJK compatibility area of GB 13000.1.
139 graphic characters used in Taiwan region of China that are included in GB 13000.1 but not included in GB 2312.
31 other characters included in GB 13000.1.
Non-Chinese symbols in GB 2312.
19 vertical punctuation marks of GB 12345.
10 lowercase Roman numerals not included in GB 2312.
5 accented pinyin letters not included in GB 2312 and ɑ and ɡ.
Chinese character numeral "〇".
13 ideographic description characters.
80 supplemented Chinese characters and radicals/components.
Double-byte coded euro symbol.
3. Four-byte part
The four-byte part of this standard includes all characters in GB 13000.1, including CJK Unified Ideographs Extension A, except for the above double-byte characters.
GB18030-2005 Character Set
The character set part of GB18030-2005 is written as follows:
The characters included in this standard are coded in single-byte, double-byte, or four-byte respectively.
1. Single-byte part
In this standard, the single-byte part includes all 128 characters from 0x00 to 0x7F of GB/T 11383-1989.
2. Double-byte part
In this standard, the double-byte part includes the following content:
All CJK Unified Ideographs of GB 13000.1-1993.
21 Chinese characters selected from the CJK compatibility area of GB 13000.1-1993.
139 graphic characters used in Taiwan region of China that are included in GB 13000.1-1993 but not included in GB 2312.
31 other characters included in GB 13000.1-1993.
Non-Chinese symbols in GB 2312.
19 vertical punctuation marks of GB 12345.
10 lowercase Roman numerals not included in GB 2312.
5 accented pinyin letters not included in GB 2312 and ɑ and ɡ.
Chinese character numeral "〇".
13 ideographic description characters.
80 Chinese characters and radicals/components supplemented to GB 13000.1-1993.
Double-byte coded euro symbol.
3. Four-byte part
The four-byte part of this standard includes the characters of CJK Unified Ideographs Extension A, CJK Unified Ideographs Extension B, and ethnic minority characters in China that have been coded in GB13000, except for the above double-byte characters.
The most significant change in GB18030-2005 is the addition of CJK Unified Ideographs Extension B. It also removes the single-byte coded euro symbol (0x80).
GB18030 has 1,611,668 code positions, and 76,556 characters are defined in GB18030-2005. As China's research on Chinese character arrangement and coding continues to deepen, and with the continuous development of the international standard ISO/IEC 10646, the characters included in GB18030 will be increased in new versions.
Chinese Characters
GB18030-2000 Chinese Characters
As shown in the following table, GB18030-2000 includes 27,533 Chinese characters:
Category          Byte ranges (first / second / third / fourth)  Code positions  Characters  Character type
Double-byte part  0xB0-0xF7 / 0xA1-0xFE                          6768            6763        Chinese characters
Double-byte part  0x81-0xA0 / 0x40-0xFE                          6080            6080        Chinese characters
Double-byte part  0xAA-0xFE / 0x40-0xA0                          8160            8160        Chinese characters
Four-byte part    0x81-0x82 / 0x30-0x39 / 0x81-0xFE / 0x30-0x39  25200           6530        CJK Unified Ideographs Extension A
27533 is 6763+6080+8160+6530. The 6763+6080+8160 = 21,003 Chinese characters in the double-byte part are the 21,003 Chinese characters of GBK.
In Unicode, there are 6,582 Chinese characters in CJK Unified Ideographs Extension A. Why are there only 6,530 Chinese characters here?
This is because 52 Chinese characters of CJK Unified Ideographs Extension A have already been included in the double-byte part in the GBK era, so there are still 6,530 Chinese characters left.
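The figure of 6,530 can be cross-checked against the Extension A range quoted earlier in this article, 0x8139EE39 to 0x82358738, by numbering the four-byte codes consecutively. A sketch, assuming 0x81308130 is numbered 0 and using a hypothetical function name:

```c
/* Linear position of a four-byte GB18030 code, counting from 0x81308130 = 0.
   Bytes 1 and 3 have 126 possible values, bytes 2 and 4 have 10. */
long gb18030_four_byte_index(unsigned long code)
{
    long b1 = (long)((code >> 24) & 0xFF) - 0x81;
    long b2 = (long)((code >> 16) & 0xFF) - 0x30;
    long b3 = (long)((code >> 8) & 0xFF) - 0x81;
    long b4 = (long)(code & 0xFF) - 0x30;
    return ((b1 * 10 + b2) * 126 + b3) * 10 + b4;
}
```

The two endpoints number 12439 and 18968, so the range holds 18968 - 12439 + 1 = 6530 code positions, which matches the number of Extension A characters left for the four-byte part.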
GB18030-2005 Chinese Characters
As shown in the following table, GB18030-2005 includes 70,244 Chinese characters:
Category          Byte ranges (first / second / third / fourth)  Code positions  Characters  Character type
Double-byte part  0xB0-0xF7 / 0xA1-0xFE                          6768            6763        Chinese characters
Double-byte part  0x81-0xA0 / 0x40-0xFE                          6080            6080        Chinese characters
Double-byte part  0xAA-0xFE / 0x40-0xA0                          8160            8160        Chinese characters
Four-byte part    0x81-0x82 / 0x30-0x39 / 0x81-0xFE / 0x30-0x39  25200           6530        CJK Unified Ideographs Extension A
Four-byte part    0x95-0x98 / 0x30-0x39 / 0x81-0xFE / 0x30-0x39  50400           42711       CJK Unified Ideographs Extension B
70244 is 6763+6080+8160+6530+42711.
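The "number of code positions" column can be reproduced from the byte ranges, remembering that 0x7F is never a second byte. A sketch with hypothetical helper names:

```c
/* Code positions in a double-byte region; 0x7F is excluded as a second byte. */
long two_byte_positions(int lead_lo, int lead_hi, int trail_lo, int trail_hi)
{
    long trails = 0;
    for (int t = trail_lo; t <= trail_hi; t++)
        if (t != 0x7F)
            trails++;
    return (long)(lead_hi - lead_lo + 1) * trails;
}

/* Code positions in a four-byte region with the full 0x30-0x39, 0x81-0xFE,
   0x30-0x39 ranges for bytes 2 to 4. */
long four_byte_positions(int lead_lo, int lead_hi)
{
    return (long)(lead_hi - lead_lo + 1) * 10 * 126 * 10;
}
```

For example, two_byte_positions(0x81, 0xA0, 0x40, 0xFE) gives 32 x 190 = 6080, and four_byte_positions(0x95, 0x98) gives 4 x 12600 = 50400.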
Version Changes
The main changes of GB18030-2005 compared with GB18030-2000 are as follows:
1. The glyphs of CJK Unified Ideographs Extension B and ethnic minority characters in China that have been coded in GB13000 are added in the four-byte character table. In fact, GB18030-2000 has mapped these code positions, but GB18030-2000 did not give the glyphs of these characters.
2. The coding of the character ḿ (U+1E3F) is adjusted.
3. The single-byte coded euro symbol (0x80) is removed.
(Correction: In fact, neither GBK nor GB18030-2000 has the single-byte coded euro symbol. Microsoft's CP936 has this symbol and defines it at 0x80)
Formulating Standards
According to international practice, the following standards are adopted in the formulation of GB 18030:
GB 18030-2000 includes 27,533 Chinese characters, and GB 18030-2005 includes 70,244 Chinese characters. The total coding space of GB18030 exceeds 1.5 million code positions, providing a solution for the use of personal names and place names, and providing a unified information platform foundation for fields such as Chinese character research and ancient book collation.
Most computer systems in China still use GB 2312 coding. GB 18030 is compatible with GB 2312, so it handles the conversion from the old system to the new one well and keeps the cost of the transition relatively low. From the perspective of the development of China's information technology and information industry, and considering the needs of Chinese users, compatibility with existing systems, and support for multiple operating systems, GB 18030 is the better choice within China, while GB 13000.1 is better suited to future international information exchange. To address compatibility between GB 18030 and GB 13000.1, the standard drafting group has prepared a code mapping table between the two, enabling free conversion between the two coding systems. A basic GB 18030 dot matrix font library has also been developed.
Many countries and regions in the world have formulated corresponding coding standards and internal code systems from the perspective of facilitating the application of their own countries and ethnic groups, such as Japan's JIS X 0208 and JIS X 0212, South Korea's KS C 5601 and KS C 5657, etc. This is a common practice internationally. Formulating GB 18030 also conforms to international practice. It is fully compatible with GB 2312 and compatible with GB 13000.1 in character set, making full use of existing resources, ensuring compatibility between different systems, sharing resources to the greatest extent, and leaving a huge development space for China's software industry. It can be believed that the implementation of GB 18030 will be conducive to the development and scale formation of domestic software, and will take China's Chinese information technology to a new level.
Standard Process
In 1980, China promulgated its first Chinese character coded character set standard, GB 2312-80 "Chinese Character Coded Character Set for Information Interchange - Basic Set". This standard includes 6,763 Chinese characters and common symbols, and it laid the foundation for Chinese information processing.
With the expansion of international exchanges and cooperation, information processing applications have put forward requirements for character sets with multiple scripts, large character quantities, and multiple purposes. In 1993, the International Organization for Standardization issued ISO/IEC 10646-1 "Information Technology Universal Multiple-Octet Coded Character Set Part 1 Architecture and Basic Multilingual Plane". China equivalently adopted this standard to formulate GB 13000.1-1993. This standard adopts a brand-new multilingual coding system and includes 20,902 Chinese characters from China, Japan, and South Korea. It is the future development direction of the coding system. Due to its new coding system being incompatible with most existing operating systems and external devices, its implementation still needs a process and cannot completely solve the urgent needs of China's current applications.
Considering that full implementation of GB 13000 will take some time, and in order to preserve the continuity of the GB 2312 coding system and make effective use of existing resources and systems during the transition, we chose to extend GB 2312 (GB 2311) while remaining compatible with GB 13000.1 in character set. The result is a new standard, the Extension of the Basic Set of the Chinese Character Coded Character Set, which improves on GB 2312 to meet the urgent needs of applications such as China's postal, household registration, financial, and geographic information systems.
This project was included in the national standard formulation plan for 1998. In October 1998, a standard drafting group was formed by technical personnel from the Fourth Research Institute of the Ministry of Information Industry, the Computer Technology Research Institute of Peking University, the Founder Group, the Xintian Di Company, the Sitong New Century Company, the Software Institute of the Chinese Academy of Sciences, the Great Wall Software Company, the China Software Corporation, the Kingsoft Software Company, and the Lenovo Company. During research and development, the National Information Technology Standardization Technical Committee repeatedly convened the drafting group and well-known companies to study and review the draft in depth, and specially invited companies such as Microsoft, Hewlett-Packard, Sun, and IBM to take part so that opinions could be solicited widely. After repeated consideration and verification, the drafting group settled on the formulation principle: be compatible with the de facto internal code standard corresponding to the GB 2312 information interchange code, and support in character set all CJK Unified Ideographs and all CJK Extension A characters of GB 13000.1. It also determined the coding system and 27,484 Chinese characters, producing a plan that is compatible, extensible, and forward-looking.
The Ministry of Information Industry and the former State Bureau of Quality and Technical Supervision jointly released this standard on March 17, 2000 as GB 18030-2000 "Information Technology - Chinese Character Coded Character Set for Information Interchange - Extension of the Basic Set". It is a national mandatory standard, in force from the date of release, with a transition period until August 31, 2001.
During this period, the National Information Technology Standardization Technical Committee formulated and released the "Chinese Character Extension Specification GBK 1.0", which is widely used in systems such as MS Windows 9x/Me/NT/2000 and IBM OS/2. GB18030 is a national standard and is technically a superset of GBK that remains compatible with it, so GBK will come to the end of its historical mission.


[ Last edited by zzz19760225 on 2016-6-27 at 05:50 ]
Floor 4 Posted 2016-06-26 18:27 · Haikou, Hainan, China (China Telecom)
GAOYANWEI520
Personal Signature: Seeking Happiness
http://jingyan.baidu.com/article/11c17a2c77dd62f447e39d57.html

Simple Text Editor Implemented in C Language

Updated: 2014-06-10 20:51

Below is a data structure course design I did during my college years, a simple text editor implemented in C language. Looking back, it was relatively simple, for the reference of younger schoolmates.

1. Design Purpose
A simple text editor is a user-oriented system service program, widely used for a series of operations such as input, deletion, replacement, search, modification of source programs, and even for editing and typesetting of newspapers and books, as well as drafting and polishing of official documents and letters in offices. It is a relatively practical application software. This program is specially designed for users to perform text editing and can meet most requirements of text operations.

2. Design Content and Requirements
(1) Read a (text) file to establish a new string
(2) View the content and composition information of the source (text) file
(3) View the current string
(4) View the composition information of the current string
(5) Search for a string of specified length starting from a specified position
(6) Search for the position of a specified string
(7) Insert a string at a specified position in the current string
(8) Delete a specified substring from the current string
(9) Replace a specified substring in the current string
(10) Save the modifications to the (text) file
(11) Save as a (text) file
(12) Exit the program

3. Data Structures Adopted in This Design
The only data structure used in this program is a linked list.
The basic operations and storage structures used in the program both adopt the linked storage structure of strings, and corresponding operation functions are defined.

typedef struct _String        // Define the linked string structure
{
    char character;           // Character composing the string (data field)
    struct _String* next;     // Pointer to the next string structure (link field)
} String;                     // String type
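To make the one-character-per-node layout concrete, here is a minimal sketch of three of the operations on it. The function names follow the prototypes listed in this write-up, but the bodies are a guess at a straightforward implementation, not the author's code. The struct tag is written String rather than _String because identifiers that start with an underscore and a capital letter are reserved in C.

```c
#include <stdlib.h>

typedef struct String {
    char character;            /* data field */
    struct String *next;       /* link field */
} String;

/* Frees every node and leaves *S as NULL. */
void DestroyString(String **S)
{
    while (*S != NULL) {
        String *next = (*S)->next;
        free(*S);
        *S = next;
    }
}

/* Builds a linked string holding the characters of cString.
   On allocation failure the partial list is freed and *S is NULL. */
void CreatStringFromArray(String **S, const char *cString)
{
    String **tail = S;         /* always points at the link to fill next */
    *S = NULL;
    for (; *cString != '\0'; cString++) {
        String *node = malloc(sizeof *node);
        if (node == NULL) {
            DestroyString(S);
            return;
        }
        node->character = *cString;
        node->next = NULL;
        *tail = node;
        tail = &node->next;
    }
}

/* Counts the nodes of the linked string. */
int StringLength(const String *S)
{
    int n = 0;
    for (; S != NULL; S = S->next)
        n++;
    return n;
}
```

An empty input yields a NULL list, so an empty string and a failed allocation look the same to the caller in this sketch.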

4. Detailed Design of Functional Modules
4.1 Detailed Design Idea
The basic operations and storage structures used in this program adopt the linked storage structure of strings, and corresponding operation functions are defined. The operations of text editing call the following functions and combine with library functions to implement the graphical interface of text editing.

Define the linked storage structure:

typedef struct _String        // Define the linked string structure
{
    char character;           // Character composing the string (data field)
    struct _String* next;     // Pointer to the next string structure (link field)
} String;                     // String type

Define the string information structure:

typedef struct _StringInformation   // Define the string information structure
{
    String* str;                    // First node of the linked string
    int letters_count;              // Number of letters
    int figures_count;              // Number of digits
    int space_count;                // Number of spaces
    int endline_count;              // Number of newlines
    int others_count;               // Number of other characters
    int characters_count;           // Total number of characters
} StringInformation;                // String information type
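The counters in this structure can be filled in with one pass over the list. The write-up does not show how the original program does it, so the helper below, CountComposition, is hypothetical and only illustrates the idea.

```c
#include <ctype.h>
#include <stddef.h>

typedef struct String {
    char character;
    struct String *next;
} String;

typedef struct StringInformation {
    String *str;
    int letters_count;
    int figures_count;
    int space_count;
    int endline_count;
    int others_count;
    int characters_count;
} StringInformation;

/* Hypothetical helper: recounts every category for the list in SI->str. */
void CountComposition(StringInformation *SI)
{
    SI->letters_count = SI->figures_count = SI->space_count = 0;
    SI->endline_count = SI->others_count = SI->characters_count = 0;
    for (const String *p = SI->str; p != NULL; p = p->next) {
        /* ctype functions need a value in the unsigned char range */
        unsigned char c = (unsigned char)p->character;
        if (isalpha(c))
            SI->letters_count++;
        else if (isdigit(c))
            SI->figures_count++;
        else if (c == ' ')
            SI->space_count++;
        else if (c == '\n')
            SI->endline_count++;
        else
            SI->others_count++;
        SI->characters_count++;
    }
}
```

Because each node holds one char, the totals are byte counts: in the default C locale the bytes of a two-byte or four-byte GB18030 character would each land in others_count.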

Define the string index structure:

typedef struct _Index               // Define the string index structure
{
    int sub_index;                  // Position (or index) of the substring in its parent string
    struct _Index* next_index;      // Pointer to the next string index structure of the substring
} Index;                            // String index type
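The index list is what a "find all occurrences" search would return. The operation descriptions in this write-up call SubStringIndexAll( S.str , subString , &subIndex ) and use its return value as a count, so the signature below is inferred from those calls. The body, the 0-based positions, and the MatchesAt helper are assumptions for illustration.

```c
#include <stdlib.h>

typedef struct String {
    char character;
    struct String *next;
} String;

typedef struct Index {
    int sub_index;
    struct Index *next_index;
} Index;

/* Returns 1 if sub occurs at the very start of s, else 0. */
static int MatchesAt(const String *s, const String *sub)
{
    while (sub != NULL) {
        if (s == NULL || s->character != sub->character)
            return 0;
        s = s->next;
        sub = sub->next;
    }
    return 1;
}

/* Collects the (assumed 0-based) start of every occurrence of subS in S.
   Returns the number of occurrences stored in *subIndex. */
int SubStringIndexAll(const String *S, const String *subS, Index **subIndex)
{
    Index **tail = subIndex;
    int pos = 0, count = 0;
    *subIndex = NULL;
    if (subS == NULL)
        return 0;                       /* an empty pattern matches nothing here */
    for (; S != NULL; S = S->next, pos++) {
        if (!MatchesAt(S, subS))
            continue;
        Index *node = malloc(sizeof *node);
        if (node == NULL)
            break;                      /* keep what was found so far */
        node->sub_index = pos;
        node->next_index = NULL;
        *tail = node;
        tail = &node->next_index;
        count++;
    }
    return count;
}

/* Frees the index list and leaves *subIndex as NULL. */
void DestroyIndex(Index **subIndex)
{
    while (*subIndex != NULL) {
        Index *next = (*subIndex)->next_index;
        free(*subIndex);
        *subIndex = next;
    }
}
```

Overlapping matches are all reported, since the scan advances one node at a time.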

The sub-functions to be called are:

int StringToUnsignedInt( char* s , int max ) ;                               // Convert string to unsigned integer
void InitalizeStringInformation( StringInformation* SI , int initalizeAll ) ; // Initialize string information
void CreatStringFromArray( String** S , char* cString ) ;                    // Establish string from character array
int CreatStringFromFile( String** S , char* filePath ) ;                     // Establish string from file
int StringLength( String* S ) ;                                              // Find string length
int StringCompare( String* S1 , String* S2 ) ;                               // Compare strings
void StringCopy( String** targetString , String* sourceString ) ;            // Copy string
void SubString( String* S , int index , int subLength , String** subS ) ;    // Extract substring
int SubStringIndex( String* S , String* subS ) ;                             // Find the first occurrence position of the substring in the parent string
int StringInsert( String** S , int index , String* insertedString ) ;        // Insert string
int SubStringDelete( String** S , int index , int subLength ) ;              // Delete substring
void PrintString( String* S ) ;                                              // Output string
void PrintIndex( Index* subIndex ) ;                                         // Output all indices of the substring
void PrintStringInformation( StringInformation* SI ) ;                       // Output string composition information
void SaveToFile( String* S , char* filePath ) ;                              // Save string to file
void DestroyString( String** S ) ;                                           // Destroy string
void DestroyIndex( Index** subIndex ) ;                                      // Destroy substring index

(1) Read a (text) file to establish a new string
This function establishes a string either from a character array or from a file. To establish it from a file, first define StringInformation S with tempString = S.str, read the path of the txt file with gets( filePath ), and then call CreatStringFromFile( &S.str , filePath ) to load the contents of the file at filePath into string S.
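A sketch of how loading a file into the linked string might look. The name CreatStringFromFile comes from the prototype list, but the body and the return convention (1 on success, 0 on failure) are assumptions. ReadLine is a hypothetical replacement for gets(), which cannot limit how much it writes and was removed from the language in C11.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

typedef struct String {
    char character;
    struct String *next;
} String;

/* Hypothetical helper: reads one line from stdin into buf without overflow
   and strips the trailing newline. Returns NULL at end of input. */
char *ReadLine(char *buf, size_t size)
{
    if (fgets(buf, (int)size, stdin) == NULL)
        return NULL;
    buf[strcspn(buf, "\n")] = '\0';
    return buf;
}

/* Loads the whole file into a linked string, one byte per node.
   Assumed convention: returns 1 on success, 0 if the file cannot be opened
   or memory runs out (the caller then owns whatever part was built). */
int CreatStringFromFile(String **S, const char *filePath)
{
    FILE *fp = fopen(filePath, "rb");
    String **tail = S;
    int c, ok = 1;

    *S = NULL;
    if (fp == NULL)
        return 0;
    while ((c = fgetc(fp)) != EOF) {
        String *node = malloc(sizeof *node);
        if (node == NULL) {
            ok = 0;
            break;
        }
        node->character = (char)c;
        node->next = NULL;
        *tail = node;
        tail = &node->next;
    }
    fclose(fp);
    return ok;
}
```

Opening in binary mode keeps the bytes of GB18030 text exactly as stored, at the cost of seeing platform line endings unchanged.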

(2) View the content and composition information of the source (text) file
First call PrintString( Source.str ) to output the string, then call PrintStringInformation( &Source ) to output its composition information, and finally destroy the string with DestroyString( &Source.str ).

(3) View the current string
First clear the screen with system( "cls" ), then call PrintString( S.str ) to output string S.

(4) View the composition information of the current string
Call PrintStringInformation( &S ) to output the composition information of string S.

(5) Search for a string of specified length starting from a specified position
First read the starting position of the substring with gets( input ) and convert it with position = StringToUnsignedInt( input , -1 ), which turns input into an unsigned integer. Then read the length of the substring the same way and convert it with subLength = StringToUnsignedInt( input , -1 ). Finally, call SubString( S.str , position , subLength , &subString ) to extract the substring.
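The write-up does not say what the second argument of StringToUnsignedInt means or what is returned for bad input. The sketch below assumes that a negative max means "no upper limit" and that -1 signals an invalid or out-of-range number; both are guesses.

```c
#include <limits.h>
#include <stddef.h>

/* Converts a string of decimal digits to a non-negative int.
   Assumptions: max < 0 disables the upper bound; -1 is returned for empty
   input, non-digit characters, overflow, or a value above max. */
int StringToUnsignedInt(const char *s, int max)
{
    int value = 0;
    if (s == NULL || *s == '\0')
        return -1;
    for (; *s != '\0'; s++) {
        if (*s < '0' || *s > '9')
            return -1;
        int digit = *s - '0';
        if (value > (INT_MAX - digit) / 10)   /* value * 10 + digit would overflow */
            return -1;
        value = value * 10 + digit;
    }
    if (max >= 0 && value > max)
        return -1;
    return value;
}
```

With this convention the caller can reject a bad position before ever touching the list.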

(6) Search for the position of a specified string
First read the string to search for with gets( cString ) and call CreatStringFromArray( &subString , cString ) to turn cString into the string subString. Then call SubStringIndexAll( S.str , subString , &subIndex ) to output every position at which subString appears in the parent string S, and destroy subString.

(7) Insert a string at a specified position in the current string
First read the insertion position with gets( input ), then convert it with position = StringToUnsignedInt( input , -1 ), which turns input into an unsigned integer.

Then read the string to insert with gets( cString ) and call CreatStringFromArray( &newString , cString ) to build newString from the character array. Call StringInsert( &S.str , position , newString ) to insert newString into the string S.str, and then destroy newString.
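Since newString is destroyed right after the call, StringInsert presumably copies the inserted characters rather than linking the caller's nodes in. A sketch under that assumption, with 0-based positions and a return value of 1 for success and 0 for failure (both also assumed); FreeList and CopyList are hypothetical helpers.

```c
#include <stdlib.h>

typedef struct String {
    char character;
    struct String *next;
} String;

static void FreeList(String *s)
{
    while (s != NULL) {
        String *next = s->next;
        free(s);
        s = next;
    }
}

/* Copies src and reports the copy's final node through *last.
   Returns NULL if src is empty or an allocation fails. */
static String *CopyList(const String *src, String **last)
{
    String *head = NULL, **tail = &head;
    *last = NULL;
    for (; src != NULL; src = src->next) {
        String *node = malloc(sizeof *node);
        if (node == NULL) {
            FreeList(head);
            *last = NULL;
            return NULL;
        }
        node->character = src->character;
        node->next = NULL;
        *tail = node;
        tail = &node->next;
        *last = node;
    }
    return head;
}

/* Inserts a copy of insertedString before position index (0-based).
   index 0 inserts at the front; index equal to the length appends. */
int StringInsert(String **S, int index, const String *insertedString)
{
    String **link = S;
    String *copy, *last;

    if (index < 0)
        return 0;
    for (int i = 0; i < index; i++) {
        if (*link == NULL)
            return 0;                 /* index lies past the end */
        link = &(*link)->next;
    }
    if (insertedString == NULL)
        return 1;                     /* nothing to insert */
    copy = CopyList(insertedString, &last);
    if (copy == NULL)
        return 0;
    last->next = *link;               /* hook the rest of S behind the copy */
    *link = copy;
    return 1;
}
```

Walking with a pointer to the link, rather than to the node, means inserting at the head needs no special case.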

(8) Delete a specified substring from the current string
First read the substring to be deleted with gets( cString ), then call CreatStringFromArray( &subString , cString ) to build the string subString. Call count = SubStringIndexAll( S.str , subString , &subIndex ) to output every position where subString appears in the parent string S.

Then read the position of the occurrence to delete with gets( input ) and convert it with position = StringToUnsignedInt( input , -1 ). Finally delete the substring by calling SubStringReplaceSingle( &S.str , position , subString , NULL ).
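The steps above read numbers with gets and convert them with StringToUnsignedInt, whose body and second argument are not explained in the post. As a hedged alternative for the conversion, the standard strtoul function can be wrapped as below; parse_unsigned is a hypothetical name and its rejection rules are my own choice, not the original behaviour:

```c
#include <assert.h>
#include <ctype.h>
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

/* Hypothetical replacement for the conversion step: parses a decimal
 * unsigned integer from `input`. Returns 1 and stores the value in
 * `*value` on success; returns 0 for empty input, a sign, trailing junk
 * (other than whitespace such as the newline fgets() keeps), or overflow. */
int parse_unsigned(const char *input, unsigned int *value)
{
    assert(input != NULL && value != NULL);
    while (isspace((unsigned char)*input))
        input++;
    if (!isdigit((unsigned char)*input))
        return 0;               /* rejects "", "-1", "+5", "abc" */
    char *end = NULL;
    errno = 0;
    unsigned long parsed = strtoul(input, &end, 10);
    if (errno == ERANGE || parsed > UINT_MAX)
        return 0;
    while (isspace((unsigned char)*end))
        end++;
    if (*end != '\0')
        return 0;
    *value = (unsigned int)parsed;
    return 1;
}
```

For reading the line, fgets( input , sizeof input , stdin ) is the safer choice: gets cannot limit how much it writes into the buffer and was removed from the C standard in C11.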

(9) Replace a specified substring in the current string
First read the substring to be replaced with gets( cString ), then call CreatStringFromArray( &subString , cString ) to build the string subString from cString. Call SubStringIndexAll( S.str , subString , &subIndex ) to output every position where subString appears in the parent string S.

Then read the position of the occurrence to replace with gets( input ) and convert it with position = StringToUnsignedInt( input , -1 ). Next read the replacement text with gets( cString ) and call CreatStringFromArray( &replaceString , cString ) to build the new string replaceString. Call SubStringReplaceSingle( &S.str , position , subString , replaceString ) to replace the substring, then destroy subString.
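Deletion in (8) and replacement in (9) use the same call, with NULL as the replacement meaning "delete". A sketch of that idea on plain C strings; replace_at is a hypothetical helper, and checking that the target really sits at the given position is an assumption about what SubStringReplaceSingle does:

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: if `target` occurs in `text` at 0-based byte offset
 * `position`, returns a newly allocated string with that one occurrence
 * replaced by `replacement` (NULL or "" deletes it). Returns NULL if the
 * occurrence is not found at `position` or allocation fails. */
char *replace_at(const char *text, size_t position,
                 const char *target, const char *replacement)
{
    assert(text != NULL && target != NULL);
    if (replacement == NULL)
        replacement = "";
    size_t text_len = strlen(text);
    size_t target_len = strlen(target);
    size_t repl_len = strlen(replacement);
    if (position > text_len || target_len > text_len - position)
        return NULL;
    if (memcmp(text + position, target, target_len) != 0)
        return NULL;
    char *result = malloc(text_len - target_len + repl_len + 1);
    if (result == NULL)
        return NULL;
    memcpy(result, text, position);                     /* head */
    memcpy(result + position, replacement, repl_len);   /* new text */
    memcpy(result + position + repl_len,
           text + position + target_len,
           text_len - position - target_len + 1);       /* tail + NUL */
    return result;
}
```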

(10) Save the modifications to the (text) file
Call the function SaveToFile( S.str , filePath ) to save the modified text.

(11) Save as a (text) file
First read the absolute path of the new file with gets( anotherPath ). Then call the function SaveToFile( S.str , anotherPath ) to save the text under another name.

(12) Exit the program
Call the function DestroyString( &S.str ) to destroy string S, then exit the program.

[ Last edited by zzz19760225 on 2016-6-27 at 06:19 ]
Floor 5 Posted 2016-06-26 18:28 ·  中国 海南 海口 电信
1. In the era of characters, the distinction between common and non-common usage ranges.
The content directly related to people's everyday life is the core range of common usage. How should the number of characters required within this range be summarized and refined? Is a standard needed, or rather a standard that negates the standard: a loose structure resembling the inertia of customary usage?
This range goes through several sub-ranges in people's mind consciousness from unconsciousness to consciousness to conscious neglect of consciousness itself, and goes through a cognitive process of ignorance, knowledge, knowability, unknownness, and unknowability. In the cognitive process, emotions such as one's life and death, gains and losses are summarized. From the personal perspective, one views the physical and mental world, the universe, and generalizes all concepts from the personal perspective. The cumulative overlapping of personal knowledge forms the comprehensive group knowledge. On the accumulation of comprehensive group knowledge, the calculation structure of the change between name and reality, and various optimization possibilities are carried out.
The text is screened in this general order.
Two dictionaries, one with stroke priority and the other with language pinyin arrangement,
Looking for the implicit law between the stroke dictionary and the people-centered arrangement order. (People, things, and events are all components of the invisible power of the universe. The law belongs to one of the existence modes of power. Power has geometric attributes and other contents. People, things, and events also have similar contents. On these basic contents, they are combined again to form characteristics, which become people, things, and events.) The function set of the law includes strokes and central sorting, forming three parts: name, usage, and principle.
Calculation instructions, array structure sorting, shadow information, mirror information, and information groups parallel in four-dimensional time and space. There is a trend selection between free information with infinite possibilities and limited effective era updates here. Under the engineering construction mode and user autonomous mode, it is necessary to study the shadow information on all or part of the content information of computer calculation and program content to compare and find problems, and solve problems in response to the found problems. This shadow information can be multi-angle. For example, a well-shaped eight sides plus two sides behind and in front of the character, totaling ten sides. Suppose the shadow is ten shadows formed in different directions around the object of the calculation process and program flow. Among them, the front shadow or surface phenomenon is mainly the first. There is no need to see the essence first, just look at the surface shadow. There is an edge between the shadow and the object itself, and the third mode of the user observer relative to both. The shadow information can be canceled, hidden, displayed in real time on the screen, selected to be displayed, or saved as text information. The interaction error and distortion problem between the object and the shadow. For example, the process of calculating a Chinese character "一" shows voltage problems or bad tracks in magnetic track storage, or other forces. Binary numbers have omissions and accidents. The input information, calculation process, and result are displayed on the shadow information, and the accident cannot be displayed, only the error exists by comparison. The accident has little impact on small and numerous similar repeated calculations, but has an impact on continuous calculations of large digital groups. 
This is similar to the verification in network information transmission, ensuring that digital information is stored within an error value range, hard disk data format, hardware design and performance quality?
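One way to read the "shadow information" idea in concrete terms is to keep a second copy of the data and detect accidents only by comparison, or to keep a short check value as network protocols do. The sketch below is my own illustration of that reading, with hypothetical function names; the additive checksum is deliberately simple and much weaker than the CRCs used in real transmission and storage formats:

```c
#include <assert.h>
#include <stddef.h>

/* Returns the index of the first byte where `data` and `shadow` differ,
 * or `length` if the two buffers are identical. */
size_t first_difference(const unsigned char *data,
                        const unsigned char *shadow, size_t length)
{
    assert(data != NULL && shadow != NULL);
    for (size_t i = 0; i < length; i++) {
        if (data[i] != shadow[i])
            return i;
    }
    return length;
}

/* Simple 8-bit additive checksum (sum of all bytes modulo 256).
 * Any change to a single byte changes the result, but some
 * multi-byte corruptions cancel out and go unnoticed. */
unsigned char checksum8(const unsigned char *data, size_t length)
{
    assert(data != NULL);
    unsigned int sum = 0;
    for (size_t i = 0; i < length; i++)
        sum = (sum + data[i]) & 0xFFu;
    return (unsigned char)sum;
}
```

The comparison finds where the object and its shadow disagree, while the checksum only says that something changed; neither can say which of the two copies is the correct one.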
One needs to be familiar with the conversion between binary and hexadecimal. This conversion is error-prone, so it is best to fix the result once after converting, which reduces repeated conversion, and so build a Chinese character binary dictionary. Viewing and input seem to be a problem? If there are differences, they can be compared. What is needed is a method of parallel arrangement (排比). This method follows a law, and this law can become a mathematical tool for the internal optimization of characters. It can improve efficiency and, combined with the ten-sided shadow information structure, forms a temporary definition that can be used to recognize its own information: correcting one's own errors and those of others.
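An entry of such a binary dictionary can be produced mechanically rather than converted by hand. A small sketch, with hypothetical function names and an output layout of my own choosing, that turns one byte into its hexadecimal and binary forms; the character "一" is encoded as the two bytes D2 BB in GB18030, so it would yield two such entries:

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Writes the 8 binary digits of `byte` (most significant first) plus a
 * terminating NUL into `out`, which must hold at least 9 chars. */
void byte_to_binary(unsigned char byte, char out[9])
{
    for (int bit = 7; bit >= 0; bit--)
        *out++ = ((byte >> bit) & 1) ? '1' : '0';
    *out = '\0';
}

/* Formats one dictionary line such as "D2 = 11010010" into `line`,
 * writing at most `size` bytes including the terminator. */
void format_entry(unsigned char byte, char *line, size_t size)
{
    char bits[9];
    byte_to_binary(byte, bits);
    snprintf(line, size, "%02X = %s", (unsigned)byte, bits);
}
```

Each hexadecimal digit corresponds to exactly four binary digits, which is why the two notations can be compared column by column.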
18030 Chinese-English and pinyin letters form pinyin verbs, and simple English nouns, mathematical letters form the running part below the hardware interface. This part of content is standardized for the possible Chinese character information environment. Use the same compilation in the explanation information to strive to display the explanation corresponding to the Chinese character, similar to the explanation display of methods and principles on Zhihu. In the existing English driver interface environment, Chinese characters cannot be displayed. Replace it with the 18030 standard compilation interface. On the basis of the 18030 binary hardware standard displaying pinyin verbs, nouns, existing English content, and mathematical formulas, transport the Chinese character explanation to help learning and beginners. This is what the Chinese character information environment needs. A standard attempt under the conditions of no operating system and no Chinese character font output driver. The Loongson chip array directly converts binary to the 18030 information instruction set. Can this instruction set be compiled? If compiling the instruction set for the hardware from a personal user perspective, what content needs to be understood? Just describe simple characters such as up, down, left, right, middle, one, zero, people, and things, and carry out similar extensions for these characters to the work of the 18030 batch translation tool, which can basically output the required information. The 18030 binary information list is a tool table as important as the 18030 itself information table. Through the text editing function of the existing computer system, the similar information conversion work can be better handled. Whether to include Chinese character dictionaries and Chinese-English dictionaries in this process, using modes similar to StarDict, info, man, and help, and another information display and explanation mode of emacs. 
Ordinary novices and newcomers need it, while seasoned old hands and veterans do not need such cumbersome content.
A batch of transitional tools and contents: 1. Current 18030, 2. Binary arrangement table, 3. Chinese character dictionary, 4. Chinese-English dictionary, 5. Simple Chinese information of Loongson instruction set, 6. Existing computer hardware and software information, 7. Existing computer system available text editor and functional Chinese tutorial. Does the instruction set involve circuits, another very large knowledge group world. The above is simply divided into seven categories. Among them, the existing ones that can be collected and sorted for future use are 1, 3, 4, 6, 7, five. 2. Binary can be realized by improving existing free software tools. If it can be realized, it will be attributed to free software after use. 5. The process of Loongson instruction set reading 18030 binary requires a possible very large knowledge group. The possibility of disability is basically 99%, but it can also be used as a toy for alternative children's games. The first five can be downloaded as txt text on the network for easy editing, or collected manually (this is not a patent, why there is no txt, or find pdf to convert to txt, which may have information conversion errors. It is better to have materials with relatively high precision.) How to make 3 and 4 into instruction-type help information text or automatic information display text mode. Do you need to understand the multiple information display modes of linux? The content of 7 is relatively easy to choose. In the character mode, vi, gvim and emacs under the desktop are relatively transparent. The problem of 6 is that the information is hard to find, and some unknown existing information needs to be supplemented. Maybe find the Intel website. Can the virtual machine simulate the hardware to prevent possible risks of breaking. (If there is system photo storage, parallel systems can be used for parallel operation to recover problems.)
Hardware mirror tool: displaying mirror and shadow. The display produces a shadow in the way that sunlight shining on an object casts one, and then one looks at the shadow. In C, one could use gcc to write an application that displays the Chinese character "一" on an existing operating system, through the software that talks to the hardware.
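A minimal sketch of such a program's core, buildable with gcc. It only hands the character's bytes to the operating system and leaves the actual drawing to the terminal and its font; the function names are hypothetical, and which byte sequence displays correctly depends on the terminal's encoding, which the caller has to know:

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Byte sequences for the character "一" (U+4E00). Which one a terminal
 * renders correctly depends on its encoding; that choice is an assumption
 * the caller has to make. */
const char *yi_bytes(int terminal_is_utf8)
{
    return terminal_is_utf8 ? "\xE4\xB8\x80"   /* UTF-8 */
                            : "\xD2\xBB";       /* GB18030 / GBK */
}

/* Writes the character followed by a newline; returns 1 on success. */
int print_yi(int terminal_is_utf8)
{
    return puts(yi_bytes(terminal_is_utf8)) >= 0;
}
```

Calling print_yi( 1 ) from main on a UTF-8 terminal should show the character; on a system without any Chinese font or driver, the same bytes would appear as garbage, which is the situation the post is concerned with.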
Program, content selection in the information grid, arrangement of grids in the program line, arrangement at the line level, overall structure, target content. Simplify this process, remove repeated labor, and the structure should be simple. Reduce the impact of repeated similar labor on the continuity of thinking. The rows and columns of sublime. If an emacs-org with structural building blocks, tree structure, mind map type is added, and then a three-dimensional graphic description that is relatively simple to start, the three are in the same function or software. Similar to the three types of the graphic area, description area, and numerical arrangement area of engineering drawings, and these three types can be expanded and extended. Of course, it is best to be able to combine, splice, and interact with standard definitions. Three-dimensional dynamic graphics, it seems that three-dimensional animation graphics are like this. Maybe find a three-dimensional dynamic programming language editor.
Content of the central axis, define the timeline, and accumulate the order of commands before and after. Explore the cooling effect of heat dissipation and reduce the workload of calculation accumulation. Restrict the overall system workload within a certain amount. A low-power consumption and low-temperature computer without a fan, similar to a calculator, with low working ability.
Hardware application software, system, software with programming editor interface functions, system. Directly similar to the system and application software that the hardware directly runs in DOS.
Programming array at the automatic recognition level, mainly special symbols such as semicolons, parentheses, square brackets, etc. These automatic arrangement functions can be modified and keep the modified state, or can be re-automatically arranged by selecting the area.
Fun: It can be in color modification, error flashing modification for automatic error checking. Different operations can introduce some novice scenarios. For example, after writing twenty characters, a character animal walks by. After writing one hundred characters, a character dragon swims by. After a section of code is completed, a character sun appears. When an error is checked, a section of hail falls. The disordered display of beauty's beauty and slim figure information, the performance of child-like animated characters, housewives' food production and education of children, the active distance beauty and prevention of participation in the couple relationship, the naughty and mischievous image of the elderly's character children, the mosquito bite and hungry character trash can image of the wanderer, and the emotional image of eye fatigue, loss of hope to re-open eyes and slightly narrow and stabilize after experiencing the risk of others' death and own death, etc.
18030 has the road integrated in front. Searching and sorting out the previous content step by step and then integrating it is a relatively independent and complete 18030. Use a digital statistic to represent the existence state of this information body, how many binary digits, the rows and columns displayed, the occupied capacity, and the calculation method of the capacity needs to be unified. This unique and accurate can be used as a row-by-row index, for example, a file comparison software, which can directly view the additional displayed value without a special software, similar to the effect under DOS.
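One statistic of this kind is how many 1-byte, 2-byte and 4-byte sequences a GB18030 text contains. The sketch below classifies sequences by the byte ranges of the encoding (single byte 00-7F; lead byte 81-FE followed by 40-7E or 80-FE for two bytes; lead 81-FE, 30-39, 81-FE, 30-39 for four bytes). The function names are hypothetical, and the check is structural only: it does not verify that a code point is actually assigned in the standard.

```c
#include <assert.h>
#include <stddef.h>

/* Returns the length in bytes (1, 2 or 4) of the GB18030 sequence that
 * starts at `bytes`, given `remaining` bytes available, or 0 if the bytes
 * do not start a well-formed sequence. */
size_t gb18030_sequence_length(const unsigned char *bytes, size_t remaining)
{
    if (remaining == 0)
        return 0;
    if (bytes[0] <= 0x7F)
        return 1;
    if (bytes[0] < 0x81 || bytes[0] > 0xFE || remaining < 2)
        return 0;
    unsigned char b2 = bytes[1];
    if ((b2 >= 0x40 && b2 <= 0x7E) || (b2 >= 0x80 && b2 <= 0xFE))
        return 2;
    if (b2 >= 0x30 && b2 <= 0x39) {
        if (remaining < 4)
            return 0;
        if (bytes[2] >= 0x81 && bytes[2] <= 0xFE &&
            bytes[3] >= 0x30 && bytes[3] <= 0x39)
            return 4;
    }
    return 0;
}

/* Counts sequences by length: counts[0] = 1-byte, counts[1] = 2-byte,
 * counts[2] = 4-byte. Returns 1 if the whole buffer is well formed,
 * 0 if a malformed or truncated sequence was met. */
int gb18030_count(const unsigned char *text, size_t length, size_t counts[3])
{
    assert(text != NULL && counts != NULL);
    counts[0] = counts[1] = counts[2] = 0;
    size_t offset = 0;
    while (offset < length) {
        size_t n = gb18030_sequence_length(text + offset, length - offset);
        if (n == 0)
            return 0;
        counts[n == 1 ? 0 : (n == 2 ? 1 : 2)]++;
        offset += n;
    }
    return 1;
}
```

These ranges give 128 one-byte, 126 × 190 = 23940 two-byte and 126 × 10 × 126 × 10 = 1587600 four-byte positions, which matches the total of 1611668 code points quoted in the opening post.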
Font files of the DOS forum?

[ Last edited by zzz19760225 on 2016-9-28 at 00:26 ]
Floor 6 Posted 2016-06-26 18:31 ·  中国 海南 海口 电信
Partition table based on two bytes?

[ Last edited by zzz19760225 on 2016-7-13 at 17:26 ]
Floor 7 Posted 2016-06-26 18:32 ·  中国 海南 海口 电信
1
Graphical Chinese Characters
Simple centrally symmetric Chinese characters,
Simple symmetric graphical effect Chinese characters,
Simple Chinese characters with partial deviation in the character shape unit,
Deviated upward,
Deviated downward,
Deviated leftward,
Deviated rightward,

Pictographic Chinese Characters
Expressive and descriptive language with human as the content,
Expressive and descriptive language with objects as the content,
Expressive and descriptive language with moods, thoughts, and imaginations as the content,

Symbolic Chinese Characters
Incorporation and application of radicals, mathematical symbols, English, Latin and other world symbols and network symbols,

Coded Chinese Characters
Logical relationships between one point and other points. Through mathematics, the Zhouyi, traditional Chinese medicine, "material force" (used here instead of the translated term "physics"), chemistry and so on, this logical relationship corresponds to the content of all things in time and space. On a small scale it is force; on a large scale it is momentum; in life it is Tai Chi. It is discovery and application by enthusiasts, the combined use of individual forces in military formations, the blocking of purposeful remarks by organized hooligan-style commentary (流氓言论) on the network (which exploits the evil part of human nature with the operating experience of the underworld; this matter stands at the position of "the way is one foot high and the devil is one foot high", it cannot be rushed, and many of its factors lie within people), and the content of daily free life in limited steps (the gradual extension of the many-sided point factors of freedom, the transformation of individuals into the whole, which is also similar to the idea that man can conquer nature).

Defined Chinese Characters
Designate this as that.

For the selection of functional content,
User selection,
Hardware selection,
Selection for system effect enhancement, realizing line connection content at one point, similar to the library in C language, the dependence of Linux and Windows,
Dependence is established on the original factual basis of hardware, not on the redundant information set by software usage inertia and human for interests.
On the basis of reducing or omitting dependence, increase independent small and large software. Through the enrichment of independent software, decompose each function of the system as a whole, so that it does not have a monopoly effect, but only a transparent brand effect serving the people, similar to the content of forcing the United States to implement democratic ideas, such as free immigration.

[ Last edited by zzz19760225 on 2016-9-8 at 07:21 ]
Floor 8 Posted 2016-06-26 18:33 ·  中国 海南 海口 电信
http://www.ltaaa.com/essence-view-id-1750.html
Why Chinese is One of the Advanced Languages in the World
2013-07-01 Surrender is a Habit
Article Introduction
First of all, we need to admit that the title is just a gimmick. I always see people arguing whether Chinese is advanced or backward. Although I am not a professional, I will use some concepts in communication and storage to talk about why Chinese is a very advanced language. It should be noted that although the following arguments are based on actual experimental data, the calculations are very rough, and the scale of the experiments is not large. In other words, although Chinese has an advantage here, if we change a batch of experimental participants, it may be the other way around. There are really no particularly comprehensive tests at present. So you can just take a look at the following numbers and don't take them too seriously. In fact, the efficiency of the world's major languages has reached a bottleneck of the current human brain, and the differences are not large overall. My main purpose is to fight against reverse nationalists.
Text Content



I. Criteria for Evaluating Language Level

Spoken language, when analyzed today, is actually a communication protocol. That is to say, language transforms a person's thoughts into sound waves of different frequencies and waveforms through the organs of speech, and then another individual's auditory organs and the related brain tissue transform them back into thoughts. A communication protocol is a rule that stipulates how to transform thoughts or information into signals that are easy to transmit. In computers, communication protocols are basically evaluated by two criteria: transmission efficiency and noise resistance. Transmission efficiency means how much information can be transmitted under the protocol per unit of time. Noise resistance means how much noise the communication method can tolerate while still ensuring that most of the information is transmitted correctly.

Transmission efficiency has two aspects: coding efficiency and transmission speed. Coding efficiency means how short a signal the protocol needs to express a given piece of information. Transmission speed means how fast a given stretch of signal can be transmitted.

To evaluate whether a spoken language is advanced, we need to analyze the above questions.

Characters are a way of data storage. The requirements for the storage format are different from those of the communication protocol. The storage format requires small storage space and fast reading and writing speed. Compared with the reading speed, the writing speed is secondary. This is because on average, one writing corresponds to many readings, and the time people spend reading characters is generally far longer than writing. Especially in modern society, handwriting is getting less and less, and computer input, printing, and typesetting have greatly accelerated the speed of recording characters. However, the reading speed has not been improved much. Therefore, in modern society, the reading speed occupies a more important position in defining the level of characters.

It should be noted that in the following discussion, "syllable" all adopts the definition of Western linguistics. Roughly speaking, a group of consecutive vowels and the consonants before and after them together form a syllable. For example, To, Bliss, Strength are monosyllabic, and Chinese Ba, Chuang are also monosyllabic. Although the pronunciation length of monosyllabic words is not completely the same, at least they are comparable.

II. Classification of Languages

There are roughly two types of languages in the world, one is called analytic language (or isolated language), and the other is called synthetic language (which is further divided into agglutinative language, inflectional language and other subtypes). Simply put, synthetic language can express different meanings by changing the morphology of words. Analytic language simply expresses different meanings by the relationship between words.

Let's take a simple example:

Chinese (analytic language), glossed word for word: I yesterday tell him.

English (synthetic language): I told him yesterday.

In English, you can first see the changes of told and him. Among them, changing tell to told means the action happened in the past, and changing he to him means the object. In Chinese, yesterday is used to directly indicate the time. If the time is not indicated, words like "already", "past" and so on are needed to indicate the past, but the morphology of the word is not changed. Chinese also reflects who is the subject and who is the object through their respective positions.

In fact, English is very close to analytic language among synthetic languages. For example, the future tense in English has no difference in word form from the general tense. In many other languages, different tenses are expressed by different word forms. For example, English has clear regulations on the position of each word. In many other synthetic languages, such as Latin, the position of words can be not fixed. In other words, such a sentence can be said: Him yesterday told I. Then, through the nominative case word form and objective case word form in the word form, it is judged who told whom.
This is very bad, because there is always a difference in order when reading. The reading order we hope is: first read the part that our brain needs to process first. Analytic language has this advantage by nature. Many synthetic languages have also established similar rules in the process of gradual development, such as French, one of the descendants of Latin.

Ancient classical Chinese was also a kind of synthetic language. But due to the limitation of Chinese characters, the word form changes of ancient Chinese only exist in spoken language. For example, the causative use in classical Chinese, such as "King Wen used a hundred miles of land to make the princes his ministers" in "The Book of Songs", the word "minister" (to make submissive) needed to add the sound "s-" before the pronunciation of the Chinese character to mark it in ancient times. This is a typical word form change of synthetic language.

Some people think that modern Chinese still has a certain synthetic language color. They think that adding particles such as "le", "de" is actually the deformation of words. This view can only be a matter of opinion.

From the overall development trend, the spoken languages all over the world are developing from synthetic language to analytic language. Although it cannot be said that analytic language is necessarily better than synthetic language. But this trend shows that a certain characteristic of analytic language is in line with the development of history. This advantage is data compression.

III. Data Compression: The Superiority of Analytic Language

When computers entered a new era, people began to study how to store video files on them. The initial plan was extremely simple: store every frame in full. But this is undoubtedly inefficient, because there is too much redundant information. For example, in a night scene, many places on the screen are black. Why record the color of each point repeatedly? So the next idea was to stop storing the complete information of each frame and instead store only the differences between each frame and the previous one. All parts that have the same color in both images are skipped.
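The idea of storing only what changed between two frames can be shown with a toy sketch. The structure and function names below are hypothetical, frames are treated as flat arrays of one-byte pixels, and real video codecs are far more elaborate than this:

```c
#include <assert.h>
#include <stddef.h>

/* One changed pixel: where it is and its new value. */
struct pixel_change {
    size_t index;
    unsigned char value;
};

/* Records every pixel of `current` that differs from `previous` into
 * `changes` (at most `capacity` entries) and returns the number of
 * differing pixels. Unchanged pixels are skipped entirely. */
size_t encode_delta(const unsigned char *previous,
                    const unsigned char *current, size_t pixel_count,
                    struct pixel_change *changes, size_t capacity)
{
    assert(previous != NULL && current != NULL);
    size_t count = 0;
    for (size_t i = 0; i < pixel_count; i++) {
        if (current[i] != previous[i]) {
            if (count < capacity) {
                changes[count].index = i;
                changes[count].value = current[i];
            }
            count++;
        }
    }
    return count;
}

/* Rebuilds the next frame in place by applying the recorded changes
 * to a copy of the previous frame. */
void apply_delta(unsigned char *frame,
                 const struct pixel_change *changes, size_t change_count)
{
    for (size_t i = 0; i < change_count; i++)
        frame[changes[i].index] = changes[i].value;
}
```

For a mostly black night scene the list of changes stays short, which is exactly the saving the paragraph above describes.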

Analytic language happens to have such an effect. For example, when a person is talking about what happened yesterday in Chinese, he only needs to mention "the following things all happened yesterday" at the beginning, and then there is no need to mention the tense again. In English, however, you need to repeatedly use the tense to indicate that this thing happened in the past. When communicating, you need to always consider time, nominative/objective case, quantity, active/passive. This has an impact on communication. Of course, after proficiently using this language, the time to consider these will be greatly reduced, but even proficient synthetic language users will still make mistakes in this regard. With the help of context, these changes can all be omitted. The time deformation can be completed by adding a time adverbial in the first sentence, and the sentences describing the same time period afterwards no longer need to consider the description time. This is the data compression ability of analytic language.

Of course, this does not mean that analytic language is necessarily better than synthetic language. In the absence of context, a synthetic language can convey more information in a relatively simple form. But as the amount of communication in human society increases, the context of one communication becomes rich, and many information communication methods that are very efficient under the conditions of words and single sentences also seem to be more and more redundant. Therefore, the world's major languages have all moved closer to analytic language.

In addition, some people think that analytic language is inferior to synthetic language. They say "synthetic language only uses a few letters to form a prefix to express a specific meaning, but Chinese can't do it". This is not seeing that in the actual communication in today's world, the context is always very rich. This way of deformation is very valuable under a single sentence, but it has no meaning in actual communication.

Regarding the expressive efficiency of each language, that is, its information density, the University of Lyon in France conducted an experiment in 2011.
http://www.time.com/time/health/article/0,8599,2091477,00.html
http://ohll.ish-lyon.cnrs.fr/fulltext/pellegrino/Pellegrino_2011_Language.pdf

In this experiment, the researchers found 59 users of different languages, including English, French, German, Japanese, Chinese, Italian and Spanish. They provided 20 passages of text, which were all translated into their respective languages. Then these people were asked to read at normal speed respectively. The researchers recorded the whole process.

Then the researchers counted all the syllables, counted the meanings (basic units of expression) conveyed in the reading, and drew their conclusions. Of course, this experiment cannot show the differences between the languages completely and accurately, but it can still be used for qualitative analysis.

Among them, the information density of Chinese is 0.94, ranking first. The information density of English is 0.91, ranking second. It is also worth noting that French (0.74), German (0.79), Italian (0.72), and Spanish (0.63), which are farther from analytic language, have lower information density than English. The information density of Japanese is 0.49, and some people think this is because Japanese expresses meaning in a different way from the other languages.

The information density here is obtained by calculating the number of meanings contained in each syllable. The meanings were counted from the original English text, and after translation the number of meanings may increase or decrease. To keep this translation distortion from making the figures inconsistent, the texts were also translated into Vietnamese, and each language's value was divided by the number of meanings per syllable in the corresponding Vietnamese text, which gives a fairer figure.

This makes me wonder: if modern Chinese, heavily padded as it is, still has such a high information density, how high was that of classical Chinese? Unfortunately there is no such research. However, weighted by frequency of use, the average word in modern Chinese is about 1.5 characters long, while in classical Chinese many words are monosyllabic. As a compromise, I guess that classical Chinese reaching 1.25 times the information density of modern Chinese should be no problem. This was a terrifying number in ancient times, because Latin is not simpler than the main modern European languages. Given how limited the space for recording text was in ancient times, Chinese was obviously at an advantage for written records. This may be the reason why the written records of ancient China are so rich.

Some people object that sometimes a very long run of syllables has a very simple meaning, while a very short word has a very complex meaning. This situation certainly exists. I need to explain that the advantages and disadvantages of all complex things are a statistical concept. "High" does not mean high under every condition, but high in most cases. If a language uses on average one more syllable than another language in plain communication, then obviously its expressive efficiency is lower.

Others say that syllables cannot serve as the unit for measuring information density, because syllables differ greatly in duration. This is a good point. I discuss the pronunciation length of English and Chinese in the next section: the average English monosyllable takes longer to pronounce than a Chinese one. So in terms of the information density of spoken language, Chinese has an even greater advantage.

It is worth noting that because Chinese characters are decoupled from pronunciation, comprehension and reading aloud cannot proceed simultaneously, so reading in Chinese is slower. In the University of Lyon experiment, Chinese readers read 5.18 syllables per second, significantly slower than readers of other languages (English 6.19, French 7.18, German 5.97, Italian 6.99, Japanese 7.84, Spanish 7.82). So in reading, Chinese transmits information relatively inefficiently. The converted information transmission efficiency in reading is: English 1.08, French 0.99, German 0.90, Italian 0.98, Japanese 0.74, Chinese 0.94, Spanish 0.98.

Of course, the University of Lyon study has also been criticized. Some consider it too crude and not self-consistent in places. For example, according to its data the expressive efficiency of Japanese is clearly low, yet the authors also say that every language conveys almost the same amount of information per minute.

IV. Advantage of Transmission: Tone

The transmission of spoken language information is completed through sound. Sound is generally divided into four parts: timbre, pitch, duration, and intensity. For a syllable of language, timbre refers to the combination of consonants and vowels in it. The other three are easy to understand.

Generally, modern languages do not regulate intensity very much, because it cannot be standardized across speakers and is affected by the speaker's distance, mood, physical state, and so on. Note that "stress" in English is actually a change in pitch.

The remaining three elements are all widely used in languages.

Take Chinese as an example. Timbre is easy to understand, which is the pronunciation excluding tone. And tone itself is divided into two items: pitch and duration.

The four tones of Mandarin Chinese are four different pitch change modes. The first tone is to maintain pitch. The second tone is a change from low to high pitch. The third tone is a change from high to low and then to high pitch. The fourth tone is a change from high to low pitch. Although we said in Chinese class that neutral tone has no tone, in essence, neutral tone is to shorten the duration. Neutral tone is generally a fourth tone or first tone with shortened duration (sometimes the pitch is very low). However, neutral tone is very short and difficult to hear clearly, so it is only used for specific purposes in Mandarin (modal particles, plural markers, etc.).

Stress in English is actually a difference in pitch, so strictly speaking English has tone, though only two tones: high and low. But English uses tone only on some syllables. Apart from marking questions, emphasis and similar situations, English words do not normally change tone, and apart from a small number of words (such as "record") English does not use tone to distinguish meanings. English does use duration (as in "sheep" and "ship"). Since English generally distinguishes meanings only by timbre and duration, it is usually considered a language without tone.

First, from the perspective of coding efficiency, we can say that Chinese is one of the most efficient languages in the world. Chinese is one of the few languages that use pitch to distinguish different words. From a coding point of view, then, Chinese pronunciation has one more dimension for expressing meaning than ordinary languages. That is, with a single syllable, Chinese can in theory express up to 4 times as many distinct items of information as a typical foreign language.
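The "4 times" figure above can be checked with a small sketch. It assumes that every base syllable (a consonant and vowel combination) can in principle take every one of the four tones, which is an upper bound rather than a description of real Mandarin. The base count of 400 and the function names are hypothetical, chosen only for illustration.

```python
import math

def syllable_inventory(base_syllables: int, tones: int) -> int:
    """Upper bound on distinct syllables if every base syllable can carry every tone."""
    return base_syllables * tones

def extra_bits_from_tone(tones: int) -> float:
    """Extra bits one syllable can carry because of tone (log2 of the tone count)."""
    return math.log2(tones)

BASE = 400  # hypothetical number of toneless syllables, for illustration only

with_tone = syllable_inventory(BASE, 4)
without_tone = syllable_inventory(BASE, 1)
print(with_tone // without_tone)   # multiplier contributed by four tones
print(extra_bits_from_tone(4))     # the same gain expressed in bits per syllable
```

Whatever the base count is, four tones multiply the theoretical inventory by four, which is the same as adding two bits per syllable.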

Why do foreigners find Chinese so hard to learn, while Chinese people find English not so hard? Because Chinese speakers have already mastered pitch changes in pronunciation and only need to learn two fixed pitches. For many foreigners, one of the hardest parts of learning Chinese is tone: they are used to only two fixed pitches and have never heard or practiced changing pitch continuously within a syllable.

Some people point out that tone itself slows pronunciation. This effect certainly exists. The second tone, for example, requires the pitch to rise gradually, so the speaker has to produce two pitches, which is naturally more work than a single pitch. But the advantage of tone is that it yields several distinct pronunciations from the same combination of consonants and vowels, so there is no need to add consonants to distinguish similar-sounding words (such as English "life" and "light"). English syllables of the consonant + vowel-group type are indeed pronounced slightly faster than Chinese syllables. But other types, such as consonant + consonant + vowel-group + consonant ("blight") or consonant + vowel-group + consonant, are often slower than Chinese syllables. And because English lacks tone, it has far fewer syllables of the consonant + vowel-group type than Chinese. On average, a single English syllable should take longer to pronounce than a Chinese one.

For the conclusion that "the pronunciation of a single syllable in English should be slower than that in Chinese", some people may not be very convinced. Let's take a very simple example. The basic mathematical ability of ordinary Chinese people is often higher than that of Europeans and Americans. This is not simply a matter of education. More importantly, this is the result of the naming of numbers in Chinese.

All numbers in Chinese are monosyllabic. Among them, 6 and 9 have two vowels (or according to international academic standards, three vowels), and the pronunciation is slightly slower; 0 has a retroflex nasal sound, and the pronunciation may be slightly slower. There are also a few with retroflex sounds, which may be slightly slower.

Every English digit except 7 is monosyllabic. Together they cover the "consonant-vowel" structure (such as 4), the "consonant-consonant-vowel" structure (such as 3), the "consonant-vowel-consonant" structure (such as 5), the "consonant-vowel-consonant-consonant" structure (such as 6), the vowel-consonant structure (such as 8), and the diphthong structure (such as 0; note that when reading out numbers 0 is usually pronounced "oh", not "zero", which takes too much effort). These digits thus roughly span the syllable structures of English monosyllabic words. The extreme consonant-consonant-vowel-consonant-consonant structure is missing, but it is as rare as the single-vowel structure.

So on average, how much slower are English numbers than Chinese numbers?

First, note that short-term memory for numbers is really memory for how the numbers sound. In other words, how many digits you can remember is determined by the total time it takes to pronounce them. Research (see http://deepblue.lib.umich.edu/bitstream/handle/2027.42/26140/0000216.pdf?sequence=1) shows that Chinese speakers have had a longer short-term digit span than Americans since childhood. According to this paper, the average digit span is 7.2 digits for American college students and 9.2 digits for Chinese college students, so the American span is 78.26% of the Chinese span. In other words, English digits take on average 27.78% longer to pronounce than Chinese digits. After removing the contribution of the two-syllable digit 7 to the average length (10%), an English monosyllable takes on average about 17.78% longer to pronounce than a Chinese one.
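The arithmetic in this paragraph can be reproduced in a few lines. This is only a sketch of the author's estimate: it takes the 7.2 and 9.2 digit spans from the text, assumes (as the text does) that memory span is inversely proportional to pronunciation length, and subtracts the flat 10% the author attributes to the two-syllable digit 7. The variable names are made up for this example.

```python
US_SPAN = 7.2   # average digit span of American college students (from the text)
CN_SPAN = 9.2   # average digit span of Chinese college students (from the text)
SEVEN_CORRECTION = 10.0  # percentage points the author removes for two-syllable "seven"

# American span as a percentage of the Chinese span
span_percent = US_SPAN / CN_SPAN * 100

# If span is inversely proportional to pronunciation length, English digits are
# longer than Chinese digits by this percentage
length_excess_percent = (CN_SPAN / US_SPAN - 1) * 100

# The author's rough correction for the digit 7
monosyllable_excess_percent = length_excess_percent - SEVEN_CORRECTION

print(round(span_percent, 2))                 # 78.26
print(round(length_excess_percent, 2))        # 27.78
print(round(monosyllable_excess_percent, 2))  # 17.78
```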

Furthermore, because Chinese digits are all monosyllabic, the whole number system can be built in a strictly logical way. In Chinese, ninety-six is "nine ten six". In English it is "ninety" (a special word, distinct from both "nine" and "ten") plus "six". In French it is "four twenties sixteen". A structure as concise and logical as the Chinese one is extremely rare in the world. The nine-by-nine multiplication table is built on it. For children in other countries, reciting this table can be several times harder than for Chinese children. This linguistic advantage improves the basic mathematical ability of Chinese people.
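The regular construction described above ("ninety-six is nine ten six") is simple enough to write down as a rule. The sketch below covers only 1 to 99 and uses the usual spoken forms, where 10 to 19 drop the leading "one"; the function name is invented for this example.

```python
DIGITS = ["零", "一", "二", "三", "四", "五", "六", "七", "八", "九"]
TEN = "十"

def chinese_number(n: int) -> str:
    """Build the Chinese name of an integer from 1 to 99 out of single-syllable digits."""
    if not 1 <= n <= 99:
        raise ValueError("this sketch only covers 1 to 99")
    tens, ones = divmod(n, 10)
    parts = []
    if tens > 1:
        parts.append(DIGITS[tens])   # "nine" in ninety-six
    if tens >= 1:
        parts.append(TEN)            # "ten"
    if ones:
        parts.append(DIGITS[ones])   # "six"
    return "".join(parts)

for n in (6, 10, 16, 20, 96):
    name = chinese_number(n)
    # every character is one syllable, so len(name) is the syllable count
    print(n, name, len(name))
```

No special words such as "eleven", "twelve" or "ninety" are needed, and no number below 100 takes more than three syllables.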

By the way, some people note that ancient Chinese had more tones and ask whether Chinese has regressed. Of course it has not. Some tones are affected by the speaker's mood and other conditions, which makes them less reliable to identify. This will be explained in detail in the section on noise resistance.

V. Advantage in Information Theory

From the perspective of information theory, coding is a subtle art. For example, when computers transmit information they are really transmitting 0s and 1s. What if the messages we transmit do not all occur equally often? The answer is to give more frequent messages shorter codes, which improves overall efficiency.

Suppose we only need to transmit four kinds of message. The obvious approach represents them as 00, 01, 10 and 11. Each message takes two bits, so 100 messages take 200 bits. But if one message has a probability of 91% and the other three 3% each, we can use another code: 1, 01, 001, 000. On average, sending 100 messages with this code takes 91*1 + 3*2 + 3*3 + 3*3 = 115 bits, clearly more efficient than before.
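The comparison between the two codes can be sketched as follows, using the probabilities and code words from the example above. The helper names are invented here. The check that no code word is a prefix of another is included because that property is what lets a receiver split the bit stream without separators.

```python
# Expected occurrences of each message per 100 messages (91%, 3%, 3%, 3%)
COUNTS_PER_100 = [91, 3, 3, 3]

FIXED_CODE = ["00", "01", "10", "11"]
VARIABLE_CODE = ["1", "01", "001", "000"]

def bits_per_100(counts, codes):
    """Total bits needed to send 100 messages with the given code words."""
    return sum(count * len(code) for count, code in zip(counts, codes))

def is_prefix_free(codes):
    """True if no code word is a prefix of a different code word."""
    return not any(a != b and b.startswith(a) for a in codes for b in codes)

print(bits_per_100(COUNTS_PER_100, FIXED_CODE))     # 200
print(bits_per_100(COUNTS_PER_100, VARIABLE_CODE))  # 115
print(is_prefix_free(VARIABLE_CODE))                # True
```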

The world's major languages have evolved for thousands of years, and the length distribution of their vocabulary is relatively in line with the requirements of information theory. You will find that the more common words in each language are generally shorter. In English, I, you, he, she, we are all monosyllabic words.

The monosyllabic words in English are very rich. In fact, there are more than those in Chinese. This is because English can add consonants before and after a consonant-vowel combination to realize various different monosyllabic pronunciations. After permutation and combination, the monosyllabic word library in English is a relatively large word library. But this has a cost, that is, low noise resistance. This problem will be explained in the next section.

By the Western definition of a syllable, English uses monosyllabic words more often than Chinese does. According to statistical analysis, the average word length for Chinese users is about 1.5 characters (see the chapter "Eye Movements in Reading Chinese and English Text" in the book "Reading Chinese Script: A Cognitive Analysis"), and the average number of syllables per word for adult English users is about 1.4 (see http://www.waisman.wisc.edu/phonology/pubs/PUB30.pdf; this is also supported by another study, which analyzed 10 hours of speech recordings and found an average of 1.38 syllables per word).

Therefore, counted in syllables, the average Chinese word is 7.14% longer than the average English word. However, an English syllable is actually considerably longer than a Chinese one (even if the 17.78% obtained in the previous section is halved, that still leaves 8.89%), and the information content per syllable in Chinese is roughly 3.30% higher than in English (see the data in the third section). So from the perspective of information theory, Chinese transmits information somewhat more efficiently than English. (Yes, I know that a proper information-theoretic calculation needs more data than averages. But I really can't find any, so I can only make a rough estimate.)
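One way to put these figures together is sketched below. The text itself does not spell out a combined calculation, so the step of multiplying syllables per word by relative syllable duration is an assumption made here for illustration. The inputs are the 1.5 and 1.4 averages and the halved 17.78% from the text, and all names are hypothetical.

```python
CN_SYLLABLES_PER_WORD = 1.5
EN_SYLLABLES_PER_WORD = 1.4
EN_SYLLABLE_EXTRA = 0.1778 / 2   # the author's halved estimate, 8.89%

# How much longer the average Chinese word is, counted in syllables
syllable_excess_percent = (CN_SYLLABLES_PER_WORD / EN_SYLLABLES_PER_WORD - 1) * 100

# Assumption: time per word = syllables per word * relative syllable duration,
# with one Chinese syllable taken as the unit of time
cn_word_time = CN_SYLLABLES_PER_WORD * 1.0
en_word_time = EN_SYLLABLES_PER_WORD * (1.0 + EN_SYLLABLE_EXTRA)

time_ratio = en_word_time / cn_word_time

print(round(syllable_excess_percent, 2))  # Chinese words are longer in syllables
print(round(time_ratio, 3))               # above 1 means the English word takes longer
```

Under these assumptions the two effects nearly cancel, leaving the average English word slightly longer in time, which matches the cautious wording of the estimate above.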

This is also verified by another experiment (see http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111932/)

This paper recruited 8 Chinese speakers and 8 English speakers and paired them up. In each pair, one person described a picture (the describer) and the other drew it from the description (the drawer). The accuracy of the drawings was then compared, and the describers' language was observed.

The final result is that the Chinese describer obviously started describing earlier than the English describer. The time difference between the describer starting to describe and the drawer starting to draw is lower for Chinese users than for English users. The drawing time is lower for Chinese drawers than for English drawers. The overall time is significantly lower for the Chinese group than for the English group. So all of the Chinese group are faster than the English group. And the more complex the drawing, the faster the Chinese group is.

Drawing results (144 drawings): the Chinese group made an average of 8.1 mistakes each time, of which the describer made 3.7 and the drawer 4.4. The English group made an average of 13.25 mistakes each time, of which the describer made 8.5 and the drawer 4.75. The gap between the two groups is still within statistical error, so it is not significant. (Hmm? Who just said that Chinese is not as accurate as English?)

Afterwards, to account for differences in drawing skill, each drawer copied the original pictures directly. Here the Chinese group was slower than the English group.

During the drawing, the Chinese group used significantly fewer syllables than the English users. The speech rates of the two groups (syllables per second) were close (this conclusion is consistent with the previous analysis of speech speed).

Although this is a preliminary experiment, it can still be seen that Chinese has advantages in communication: using fewer words, shorter time, and roughly the same (if not higher) description accuracy. This experiment also shows... The artistic level of our people still needs to be improved...

As for the other main European languages, according to the data in the third section, their syllable rates are higher than that of English and their information density is lower. I think it is unlikely that those languages surpass Chinese in this regard.

VI. Word Formation Logic

Because English is relatively rich in monosyllabic words but its syllables take longer on average to pronounce, the proportion of monosyllabic words is higher in English than in Chinese and the proportion of polysyllabic words is lower. Many logically related words have lost any connection in pronunciation because they have to be squeezed into the limited stock of monosyllables. As a result, common English words often lack word-formation logic.

For example, in Chinese we can easily say 公鸡 (rooster), 母鸡 (hen), 小鸡 (chick) and 鸡蛋 (egg), all built on 鸡 (chicken). In English these become "cock", "hen", "chick" and "egg", with no connection at all. If English wanted to link its common words in this way, it would have to add syllables to them, the average word length would rise, and, from the perspective of information theory, transmission efficiency would fall. So English can only compromise on word-formation logic.

In contrast, Chinese is much more relaxed. Even at the current information efficiency, Chinese can still ensure that most word constructions have logic. Therefore, compared with English vocabulary, Chinese vocabulary is relatively easy to remember. This has a result that the commonly used vocabulary of Chinese is far more than that of English.

The vocabulary and new word recognition ability of English users can be seen in the following paper from 1995: http://jlr.sagepub.com/content/27/2/201.full.pdf

Among them, the average self-reported vocabulary of college students is 16141. After multiple-choice tests, it is found that they can identify 71% of the vocabulary on average. That is to say, the average college student's vocabulary is estimated to be 11460 words. The average self-reported vocabulary of the elderly is 21252. After multiple-choice tests, it is found that they can identify 80% of them on average. That is to say, the vocabulary of the elderly is roughly 17002. The probability of correctly understanding an unknown word is 30% for college students and 39% for the elderly.
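The vocabulary estimates in this paragraph are simply the self-reported count multiplied by the share of words correctly identified in the multiple-choice test. A minimal sketch with the figures from the text (the function name is made up for this example):

```python
def estimated_vocabulary(self_reported: int, recognition_rate: float) -> int:
    """Scale a self-reported vocabulary by the fraction of words actually identified."""
    return round(self_reported * recognition_rate)

college = estimated_vocabulary(16141, 0.71)
elderly = estimated_vocabulary(21252, 0.80)

print(college)  # 11460
print(elderly)  # 17002
```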

No relevant research in China has been found. However, here is the "Draft of the Common Words List of Modern Chinese" of the Commercial Press: http://wenku.baidu.com/view/51636fec551810a6f5248676.html

You can go in and see your own vocabulary. I feel that college students can reach 50,000 without any problem. And this is definitely not the entire vocabulary of a person...

It may contain words you haven't seen. See how many of them you can't guess correctly... I think that, apart from a very few, there is really no difficulty...

Digital Memory
This section is based on the following paper from the University of Michigan: http://deepblue.lib.umich.edu/bitstream/handle/2027.42/26140/0000216.pdf?sequence=1

This article summarizes three statistical results.

The first one: Kindergarten, first grade, second grade. Chinese children can generally remember about two more digits than American and Japanese children.

The second one: 6-7-year-old children recited digits forward, backward, and in groups, mainly to refute the hypothesis that "Asian children are born to be better at mathematics than Americans". It found that Chinese children remember digits worse than American children when reciting backward.

The third one: Comparison between Chinese and American college students. It is found that students from both countries can remember numbers with the same pronunciation length. On average, Chinese students are two digits higher than American students (9.2 vs. 7.2).

Foreigners Learning Chinese

Finally, something lighter. Here are some experiences of foreigners learning Chinese.
http://benross.net/wordpress/journey-across-the-great-hump-of-china-debunking-the-myth-that-chinese-is-the-world%E2%80%99s-most-difficult-language/2009/10/29/
http://www.fluentin3months.com/chinese/

Disadvantages:
First experience: at the beginning, they can't hear the tones.
This is completely understandable. They have never lived in this environment and are not sensitive to tone. Chinese children don't have this problem.
Second experience: Chinese characters are really hard to remember.
But their conclusion is that once you have learned a few hundred, the rest become easier. This seems to be widely agreed.

Advantages:
First experience: no tenses and no inflections.
So much so that one American said that overall this is easier than learning Spanish. In Spanish, the verb "to be" has the following forms in the present tense alone: estar, estoy, estás, está, estamos, estáis, están, ser, soy, eres, es, somos, sois, son.
A word you have already learned becomes unrecognizable once its form changes.
By the way, last week I saw a blog post specifically about which tense to use in which part of a paper... Chinese has never had this problem.

Second experience, easy to infer the meaning of new words.
This came up when an American was chatting online with a Chinese person. The Chinese person asked what he was doing. The American said he was preparing for the GRE and memorizing vocabulary. The Chinese person was shocked and asked why an American still needed to memorize words. Indeed, think back: how many times have you looked up a new Chinese word since entering college? Did you need to memorize Chinese words for the postgraduate entrance examination?
苯 Benzene
甲苯 (methyl + 苯) Toluene
二甲苯 (two + methyl + 苯) Xylene
Of course, these also have more systematic English names: benzene, methylbenzene, dimethylbenzene. But Americans find them too long, so they are used less.
No wonder Americans need to check words after entering college...
If you say 腹膜炎 (literally "abdomen-membrane-inflammation") to a Chinese person, practically everyone knows where the problem is. Say "peritonitis" to an American, and generally the most he can guess is that it is some kind of disease, not where it is.
Another example: anyone can see at a glance what 音节 ("sound-segment", syllable) means, even if scholars differ on the exact definition. You don't need to memorize it at all. The English word "syllable" gives no hint of its meaning from its form. It traces back to the Greek "syllabē", which comes from the Greek verb "syllambanein", meaning to gather together. It's really hard on English users.
But to be fair, Britain was long a backward country in Europe, so English has a great many loanwords (French, German, Latin), and the logical structure of its words is really not very good. This is not necessarily a problem common to all phonetic scripts.
This reminds me of the anti-intellectualism in the United States. This may be related to the fact that the people often can't understand professional terms.

[ Last edited by zzz19760225 on 2017-5-23 at 07:17 ]
Floor 9 Posted 2016-06-26 18:33 ·  中国 海南 海口 电信
Thousand Character Classic

Thousand Character Classic ~ Chapters 1-4 ~ (Loli Version) http://music.163.com/#/m/song?id=429459238&userid=282225839

Thousand Character Classic ~ Chapters 2-4 ~ (Loli Version) http://music.163.com/#/m/song?id=442030065&userid=282225839

Thousand Character Classic ~ Chapters 3-4 ~ (Loli Version) http://music.163.com/#/m/song?id=476845279&userid=282225839

Thousand Character Classic ~ Chapters 4-4 ~ (Loli Version) http://music.163.com/#/m/song?id=476845424&userid=282225839

http://www.bangnishouji.com/guoxue/201305/1336.html

Weipan Calligraphy Thousand Character Classic http://vdisk.weibo.com/s/ugCAvMHFBZkTq?category_id=27&parents_ref=ugCAvMHFCJU6g

1

Heaven and earth are dark yellow, the universe is vast and primeval.

The sun and moon wax and wane, the stars are arrayed in the sky.

Cold comes and heat goes, autumn harvests and winter stores.

Intercalary months make the year, musical pipes regulate the yang.

Clouds rise to cause rain, dew condenses to form frost.

Gold is produced in Lishui, jade comes from Kungang.

The sword is called Jù Què, the pearl is called Night Light.

Fruits are precious plums and nai, vegetables are important mustard and ginger.

The sea is salty and the river is fresh, fish dive and birds fly.

Dragon masters and fire emperors, bird officials and human emperors.

Writing was first invented, then clothes were worn.

Throne was yielded and country was let, there were Yu of Yu and Tang of Tao.

People were pitied and the guilty were punished, Zhou Fa and Yin Tang.

Sitting in court to ask about Tao, governing with hands down.

Loving and nurturing the common people, ministers submit to Rong Qiang.

Distant and near are one body, all subjects return to the king.

Phoenixes sing in bamboo, white foals graze in the field.

Influence reaches grass and trees, benefits extend to all directions.

2

This body and hair, four greats and five constants.

Respectfully considering upbringing, how dare to destroy and injure.

Women admire chastity and integrity, men imitate talent and goodness.

If you know a fault, you must correct it, if you gain ability, you must not forget.

Do not talk about others' faults, do not rely on your own strengths.

Letters must be reliable, vessels should be hard to measure.

Mo laments silk being dyed, Shi praises the lamb.

Noble conduct is for the worthy, able to think and become sage.

Virtue is built and name is established, appearance is upright and bearing is correct.

Echoes are transmitted in empty valleys, listening is practiced in empty halls.

Disaster comes from evil accumulating, blessing comes from good being rewarded.

A foot of jade is not precious, an inch of time is to be competed for.

To support father and serve the sovereign, it is said to be strict and respectful.

Filial piety should do your best, loyalty should give your life.

Approaching deep water and treading on thin ice, getting up early to warm and cool.

Like orchid, so fragrant, like pine, so prosperous.

River flows continuously, deep pool reflects.

Posture is like thinking, speech is stable and determined.

Starting well is really beautiful, ending carefully is appropriate.

Glory and career are the foundation, reputation is boundless.

If you are excellent in study, you can become an official, take up a post and be in politics.

Keep Gan Tang when you are there, sing more when you leave.

Music differs for noble and humble, ritual distinguishes high and low rank.

Upper is harmonious and lower is amicable, husband sings and wife follows.

Outside receive tutor's instruction, inside serve mother's example.

All aunts and uncles treat nephews and nieces like their own children.

Brothers should care for each other, of the same breath and connected branches.

Making friends with kindred spirits, cutting, grinding, admonishing and regulating.

Benevolence, compassion, mercy and pity, never leave in haste.

Integrity, righteousness, honesty and humility, never lose in hardship.

Temperament is calm and emotion is easy, mind moves and spirit is tired.

Keep true and will be satisfied, follow things and will be moved.

Adhere to noble conduct, good titles will come by themselves.

3

Capitals are in Huaxia, east and west two capitals.

Back to Mang and face Luo, float Wei and rely on Jing.

Palaces are winding and lush, towers and pavilions are astonishing.

Pictures depict animals and birds, paintings color fairies and spirits.

Side rooms are opened on the side, armor tents face the pillars.

Laying out banquets and setting up seats, playing the se and blowing the sheng.

Ascending the steps and entering the steps, hats turn like stars.

Right connects to Guangnei, left reaches Chengming.

Already gathered ancient books and classics, also gathered talented people.

Du Gao and Zhong Li, lacquer books and wall classics.

Government houses contain generals and ministers, roads are lined with pagoda-tree (huai) ministers.

Households are sealed with eight counties, families are provided with a thousand soldiers.

High hats accompany the carriage, driving wheels and shaking tassels.

Inherited wealth is luxurious and rich, carriages are fat and light.

Record merits and great achievements, carve inscriptions on steles.

Pan Xi and Yi Yin, assist the time as A Heng.

Occupy Qufu, who can manage without Dan?

Duke Huan unites and harmonizes, helps the weak and supports the fallen.

Qi Hui helps Han Hui, Yue moves Wu Ding.

Talented people work hard, many officials are really peaceful.

Jin and Chu alternate as overlords, Zhao and Wei are trapped in horizontal.

Borrow the way to destroy Guo, meet at Jian Tu to make an alliance.

He Zun follows simple laws, Han Bi is troubled by complex laws.

Qi, Jian, Po, Mu, use the army most finely.

Spread prestige in the desert, gain reputation in painting and calligraphy.

Nine provinces have Yu's traces, hundred counties were unified by Qin.

Mountains respect Mount Tai (Dai), sacrifice masters Yun Ting.

Yanmen is a purple pass, Jitian is a red city.

Kun Pool and Jie Stone, Ju Ye and Dong Ting.

Vast and far, distant and remote, cliffs and peaks are distant and dark.

4

The foundation of governance is in agriculture, focus on this farming.

Begin to plant in the south field, I plant millet and millet.

When the tax is ripe, pay the new, encourage rewards and punish promotions.

Mencius advocates simplicity, Shi Yu adheres to integrity.

Almost in the middle way, hardworking, modest, careful and strict.

Listen to the sound and observe the reason, examine the appearance and distinguish the color.

Leave a good plan, encourage him to be upright.

Examine oneself and be criticized, honor increases and resist the extreme.

Near to shame, fortunate to reach the forest and highland.

Two Shu see the opportunity, who forces to resign.

Living alone and living in leisure, silent and lonely.

Seek ancient times and explore theories, dispel worries and be carefree.

Happy plays relieve fatigue, sorrows decline and joys invite.

Ditch lotus is bright, garden bushes sprout.

Loquat is late green, sycamore withers early.

Old roots are withered, fallen leaves are fluttering.

Flying kun alone moves, soars to the red sky.

Diligent in reading and playing in the market, eyes on bags and boxes.

Yi You is fearful, ears are on the wall.

Prepare meals and eat, suitable for mouth and full of stomach.

Satiated with cooking and slaughtering, hungry with bran and chaff.

Relatives and old friends, old and young have different grains.

Concubines and maids spin, serve towels and curtains.

Silk fans are round and clean, silver candles are bright.

Sleep during the day and sleep at night, blue bamboo and ivory bed.

String songs and wine banquets, pick up cups and raise cups.

Raise hands and stamp feet, happy and healthy.

Heirs and successors, sacrifices are made in spring and autumn.

Kowtow and bow twice, trembling, afraid and panicked.

Letters and documents are brief and concise, answer carefully and in detail.

Body dirty thinks of bathing, holding heat wishes for cool.

Donkeys, mules, calves and bulls, startle and jump over.

Execute and kill thieves and bandits, capture rebels and runaways.

Bu shoots and Liao plays balls, Ji Qin and Ruan whistles.

Tian writes and Lun makes paper, Jun is clever and Ren fishes.

Relieve troubles and benefit the people, all are excellent.

Mao Shi is beautiful, good at frowning and beautiful smiling.

Years' arrows are always urging, sun and moon are bright.

Spindle hangs and rotates, dark moon rings and shines.

Pointing at firewood to cultivate blessing, always enjoy auspicious and good.

Step by step and lead, look up and down in the hall.

Tie the belt and be solemn, wander and look around.

Lone, ignorant and lack of knowledge, foolish and ignorant are ridiculed.

Words are auxiliary, yan, zai, hu, ye.

Four in a group, twenty-five is one hundred, ten hundreds. From the beginning to the end, from 000 to 999, digital quantity quality upgrades, first left then right or first right then left? Or progress from single numbers to three numbers, not starting with three numbers.

After typing, you have to press Enter or Space. Could that Enter or Space be dropped, so that the character comes out as soon as the key is pressed?

Differences in historical versions, including avoiding dynasty consciousness and changes in emperor names

125*8=1000
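One possible reading of the notes above (groups of four characters, twenty-five groups to a hundred, ten hundreds, codes running from 000 to 999) is a simple positional index over the 1000 characters. The sketch below only illustrates that arithmetic. It is an assumption about what the notes intend, and the function name is invented.

```python
GROUP_SIZE = 4         # characters per line
GROUPS_PER_HUNDRED = 25
HUNDREDS = 10

def locate(index: int):
    """Map a code 000..999 to (hundred block, group within the block, position in group)."""
    if not 0 <= index <= 999:
        raise ValueError("index must be between 0 and 999")
    hundred, within = divmod(index, GROUP_SIZE * GROUPS_PER_HUNDRED)
    group, position = divmod(within, GROUP_SIZE)
    return hundred, group, position

print(GROUP_SIZE * GROUPS_PER_HUNDRED * HUNDREDS)  # 1000 characters in total
print(125 * 8)                                     # the same total as 125 couplets of 8
print(locate(0))
print(locate(999))
```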

[ Last edited by zzz19760225 on 2017-6-23 at 03:23 ]
Floor 10 Posted 2016-06-26 18:35 ·  中国 海南 海口 电信
Netizens Make Homemade "Chinese Keyboard" with All Keys Fully Localized into Chinese
April 13, 2010 08:33
cnBeta Visitor Interaction: Google Japan once released a "Japanese keyboard" covering all Japanese syllabaries, Chinese characters, and emoticons on April Fools' Day this year. The layout, like an ancient movable type printing tray, was quite amusing. Today, a netizen has created a "Chinese keyboard". The difference is that it doesn't display thousands of Chinese characters side by side, but instead fully localizes the letters and symbols on the current English keyboard into Chinese. Let's take a look.

"Chinese keyboard". Photo/Netizen "Unsold Books"

Here is the author's introduction:

General principles:
Use only one character
Try to use characters representing verbs
For keys that appear twice, such as Enter, they can be represented by 2 kinds of characters respectively

During the process of making the full Chinese keyboard, a long time was spent thinking about the mapping relationships of some keys. Here are the results of the thinking:

Esc=Exit/Escape/Flee/Withdraw/Leave/Abandon/Retreat/Strategic Retreat
F1~F12=Zi Chou Yin Mao Chen Si Wu Wei Shen You Xu Hai
PrintScreen=Print Screen/Capture/Photograph/Print/Shoot
SystemRequire=System Request/Request
ScrollLock=Scroll Lock/Roll/Scroll
Pause/Break=Pause/Interruption/Stop/Discontinue/Break
1~0=Jia Yi Bing Ding Wu Ji Geng Xin Ren Zero/One Two Three Four Five Six Seven Eight Nine Zero
26 letters=A Bo Ce De E Fu Ge Ha Yi Jie Ke Le Mo Ne O Pi Qiu Re Si Te Wu Yu Wa Xi Ya Ze
`=Dot/Punctuation mark
~=Wave/Ripple
!=Surprise
@=At/In
#=Well
$=Gold/Money
%=Xuan/Lv/Chuan
^=Person
&=And/With
*=Rice/Star
(=Parenthesis
)=Parenthesis
-=Negative/Subtract
+=Positive/
_=Draw/Stroke
==Equal
{=Hold
}=Hold
|=Vertical/Stroke
\=Press/Stroke
:=Colon
;=Semicolon
"=Support
‘=Quote
>=Ice/Cloth
<=Wave/Slice
?=Question/Doubt
/=Stroke/Pull
Backspace=Backspace/Backspace/Back/Return/Backward
Tab=Tab/Table/Jump
CapsLock=Caps Lock/Capital Lock/Big/Rise/Change Giant
Shift=Shift/Upper gear/Change/Raise/Shift/Change/Move
Ctrl=Control/Control/Stabilize/Control
Alt=Alt/Shift/Change/Alter/Change/Excision/Neuter
Windows=Windows/Window
Space=Space/Interval/Space/Empty
Right Click/Context Menu=Right Click Menu/Context Menu/Right/Single
Enter=Enter/Line Feed/Enter/Proceed/Good/Good/Good/Good/Correct/Press
Insert=Insert/Insert/Enter
Home=Home/Start/Beginning/First/Home/Home/Family/Home Town/Benchmark
PageUp=Previous Page/Flip Up/Flip
Delete=Delete/Delete/Wipe/Humane Destruction
End=End/Finish/End/End/End/End of play
PageDown=Next Page/Flip Down/Pounce/Cover
↑=Up
↓=Down
←=Left
→=Right
NumLock=Num Lock/Digital Lock/Number Lock
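
As a rough illustration of the mapping idea in the list above, the two most regular rows (F1~F12 to the twelve Earthly Branches, and the digit keys to the Heavenly Stems plus "zero") can be built as a lookup table. The characters below are the standard ones for the pinyin given in the list; the function name `build_labels` and the table layout are assumptions made for this sketch.

```python
EARTHLY_BRANCHES = "子丑寅卯辰巳午未申酉戌亥"  # Zi Chou Yin Mao ... Hai
HEAVENLY_STEMS = "甲乙丙丁戊己庚辛壬"          # Jia Yi Bing ... Ren


def build_labels() -> dict:
    """Return a key-name -> single-character label table."""
    # F1..F12 take the twelve Earthly Branches in order.
    labels = {f"F{i}": ch for i, ch in enumerate(EARTHLY_BRANCHES, start=1)}
    # Digit keys 1..9 take the first nine Heavenly Stems.
    for digit, ch in zip("123456789", HEAVENLY_STEMS):
        labels[digit] = ch
    # The list above uses "zero" for the 0 key.
    labels["0"] = "零"
    return labels


labels = build_labels()
print(labels["F1"], labels["F12"], labels["1"], labels["0"])
```

The remaining keys each offer several candidate characters, so they would need a choice by hand before they could be added to such a table.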
Floor 11 Posted 2016-06-26 18:37 ·  中国 海南 海口 电信
0 No The chaotic and inconceivable existence is really ignorant
---------------------------------------------------
Three-circle concept Points, lines, surfaces and solids are in
1 And Holographic quantum
2 Or Dialectics
3 Non Contradiction theory
---------------------------------------------------
Orientation Change Yi - one movement - one thought - one realm - one life world universe
4 Front
5 Back
6 Left
7 Right
8 Up
9 Down
--------------------------------------------------
10 Text composition Text of information flow and dynamic combination
11 Horizontal
12 Vertical
13 Left-falling stroke (撇)
14 Right-falling stroke (捺)
15 Dot
16 Line
17 Small
18 Big
--------------------------------------------------
19 Big Human concept group The human and object integrated character concept
20 Heart
21 Body
22 Match Match object The classification, distinction and selection of object and item all things physics, the demand of matching object for human concept is called matching object. Distinguish from the external object concept outside heart and body, the external object is classified into all things concept
23 Life Life One realm of three realms
24 World World Second realm
25 Universe Universe Third realm
26
27
28
29
---------------------------------------------------
30 Object All things physics
31 Degree Mass The definition of unique existence Distance, principle, etiquette?
32 Quantity Quantity
33 Five The dynamic transformation under the relative definition of five elements abstraction and basic factors, ordinary, average, usual, extraordinary, abnormal
34
35
36
37
38
39
---------------------------------------------------
40 Event One of the life learning axes of the exploration, research and study of ultimate concepts, assuming that all events are combined into the concept space-time of these
41 Change Change or not Exist or not Know or not Believe or not Do or not
42
43
44
45
46
47
48
49



--------------------------------------------------------------------------------------------------------------------
Nothing and something: something extends into dynamic, knowable information, and that information expresses the concepts of human, event and object. Together these form a triangular cycle structure.
Humans stand outside the triangular cycle structure, improving it and using it.
Content that changes as a whole, with text information as its carrier, is called culture.
Dao, road, walk: three basic character concepts for humans.
The word ranges to cover: operation words used during boot (chip, motherboard, memory, hard disk, USB flash drive); operation words of the system; operation words of the computer's user interface; operation words of application software; operation words for five-senses information, chiefly text operations; and the words of consumers' usage habits.


Idiom: 有名无实 (having the name without the substance),

[ Last edited by zzz19760225 on 2017-9-8 at 15:34 ]
Floor 12 Posted 2016-06-26 18:38 ·  中国 海南 海口 电信
Three ranges of Chinese: the content or scope of Chinese-character information standards related to technical production, the Chinese related to settings at the consumer operation layer, and the Chinese related to everyday use at the consumer operation layer.

There exists a Chinese range, demand interest-related Chinese. It can be separated from life and survival, and can be understood through the concept of climbing and breaking through the choices of life and death, gains and losses. The demand interest category belongs to the standard content list accumulation of directions such as life, preference, and choice.

If the Rubik's Cube's 9*3=27 cells are taken as a hypothetical structural quantity, the 26 letters of the foreign alphabet (this should say English, since foreign scripts are not only English) fill 26 of them, and one is still missing.

Single-character action words and multi-character action words.

The verb patterns of pipe joints. Direct: joint, reducer (large to small and small to large); multi-way: tee, four-way, five-way, etc., with tee as the main, four-way supplementing the deficiency; bend: elbow, reducer; storing small or equal amounts into the large warehouse is store, decomposing large amounts of components gradually waiting for storage; one-to-one equal amount and gathering small into large, taking the divided and decomposed parts. All space-time concepts are decomposed and digested by the action words of food, clothing, housing, transportation, and entertainment. The action behavior concepts triggered by this food, clothing, housing, transportation, and entertainment digestion of the world, the specialized words extracted are action words.

Use action words and thought action words, physical and metaphysical.

Five action word ranges of heaven, earth, man, universe, and also five action word ranges of mind, body, object, principle, and reason.

1 and
2 or
3 non
---------------------------------------------------
4 front
5 back
6 left
7 right
8 up
9 down
--------------------------------------------------
10
11 horizontal
12 vertical
13 left-falling stroke (撇)
14 right-falling stroke (捺, nà)
15 dot
16 line
17 small
18 large
--------------------------------------------------
19 large
20 mind
21 body
22 match
23 life
24 world
25 universe
26
27
28
29
---------------------------------------------------
30 object
31 degree
32 quantity
33 five
34
35
36
37
38
39
---------------------------------------------------
40 matter
41 change
42
43
44
45
46
47
48
49

move, show, write, if, then, order, command, store, take, advance, retreat, front, back, left, right, up, down, time, space, universe, world, country, family, environment, positive, negative, different, and, or, non, heaven, earth, man, matter, mind, think, body, form, fail, righteousness, meaning, benefit, profit, loss, harm, life, death, gain, loss, way, road, principle, virtue, kindness, benevolence, love, good, kind, rule, compass, vertical, horizontal, longitude, latitude, large, small, many, few, far, near, high, low, old, young, gather, disperse, ring, period, section, go, come, yesterday, today, tomorrow,
zero, one, two, three, four, five, six, seven, eight, nine, ten, hundred, thousand, ten thousand, add, subtract, multiply, divide,

The conceptual space of the meaning of words can form a human-shaped world, which has a human form because of human creation and intention-led participation. Just as the group energy of money also has the nature of a human form, believing in money is actually believing in the power of humans. There should be a very small door at the end of extreme privatization, which can be opened to crawl into the selfless group concept world.


System:

Power on: start, rise (qi, qi)
Power off: end, stop (zhong, zhi)
Delayed power on: delay start
Delayed power off: delay end
System state: machine state
Chip: chip (xin)
Mainboard
Memory
Display: display state
Sound: sound state
Network: network state


Folders and files


Entering files and folders: open (kai)
Exiting files and folders: close (guan)
Creating files and folders: create, establish, generate (jian, li, sheng)
Deleting files and folders: delete, reverse, remove (shan, dao, chu)
Modify: modify, change (xiu, gai)
Save: store, save (cun, chu?) (except save)

Statistics command ---- count
Calculation command ---- calculate

Text

Input: write, input (xie, shu)
Insert: insert, insert (can, cha)
Search: search, search (cha, sou) (cha can mean search or insert)
Replace: replace, replace (ti, dai)

Chinese specialized range commands
|--------- Chinese general commands (simple commands)
|--------- Chinese ? (complex commands)
|--------- Chinese user commands (personal commands)
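
To make the idea of single-character commands concrete, here is a minimal sketch of a command table for the file operations listed above. Everything in it is an assumption for illustration: the class name `TinyShell`, the in-memory "files", and the characters 建, 删, 改 and 开, which are guesses from the pinyin jian, shan, gai and kai.

```python
class TinyShell:
    """In-memory stand-in for a file system, driven by one-character verbs."""

    def __init__(self):
        self.files = {}
        # Hypothetical verb table; the characters are guessed from the pinyin.
        self.commands = {
            "建": self.create,  # jian: create
            "删": self.delete,  # shan: delete
            "改": self.modify,  # gai: modify
            "开": self.open,    # kai: open
        }

    def create(self, name):
        self.files.setdefault(name, "")
        return f"created {name}"

    def delete(self, name):
        if self.files.pop(name, None) is None:
            return f"no such file: {name}"
        return f"deleted {name}"

    def modify(self, name, *words):
        if name not in self.files:
            return f"no such file: {name}"
        self.files[name] = " ".join(words)
        return f"modified {name}"

    def open(self, name):
        return self.files.get(name, f"no such file: {name}")

    def run(self, line):
        """Dispatch one command line of the form '<verb> <arguments...>'."""
        parts = line.split()
        if not parts:
            return "empty command"
        handler = self.commands.get(parts[0])
        if handler is None:
            return f"unknown command: {parts[0]}"
        return handler(*parts[1:])


shell = TinyShell()
print(shell.run("建 笔记"))
print(shell.run("改 笔记 你好 世界"))
print(shell.run("开 笔记"))
```

The three tiers in the tree above (general, complex, personal) could then be separate tables consulted in order, with the personal table letting a user add aliases of their own.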

Describe the whole process from computer boot to shutdown, including the functions users habitually rely on, and abstract the nouns involved. Comparing these nouns with one another, and with the totemic origins, evolution, pictorial rules, accumulated historical meaning, and customary meaning of Chinese characters, should turn up single characters and nouns suitable for general computer use. These would form the content structure for pinyin compilation, command structure, search dialogue, and return and error information.

[ Last edited by zzz19760225 on 2017-9-22 at 20:06 ]
Floor 13 Posted 2016-06-26 18:39 ·  中国 海南 海口 电信
Output "0" character, using horizontal and vertical coordinates (depth plus color? or hidden font information), character unit box 11*11 size, decimal
      1  2  3  4  5  6  7  8  9 10 11
 1    0  0  0  0  0  0  0  0  0  0  0   00000000000
 2    0  0  0  0  0  0  0  0  0  0  0   00000000000
 3    0  0  0  1  1  1  0  0  0  0  0   00011100000   34 35 36
 4    0  0  1  0  0  0  1  0  0  0  0   00100010000   43 47
 5    0  0  1  0  0  0  1  0  0  0  0   00100010000   53 57
 6    0  0  1  0  0  0  1  0  0  0  0   00100010000   63 67
 7    0  0  1  0  0  0  1  0  0  0  0   00100010000   73 77
 8    0  0  1  0  0  0  1  0  0  0  0   00100010000   83 87
 9    0  0  0  1  1  1  0  0  0  0  0   00011100000   94 95 96
10    0  0  0  0  0  0  0  0  0  0  0   00000000000
11    0  0  0  0  0  0  0  0  0  0  0   00000000000

All lit cells (row digit followed by column digit): 34 35 36 43 47 53 57 63 67 73 77 83 87 94 95 96

If the horizontal and vertical coordinates each run over 1~1000, how can a row number be told apart from a column number?
1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000
3 45 3 567 3 897 3 900 3 920 3 999
67 39 67 300 67 456 67 555 67 680 67 790

3*4,5,6
4*3,7
5*3,7
6*3,7
7*3,7
8*3,7
9*4,5,6

3*4,5,6;4*3,7;5*3,7;6*3,7;7*3,7;8*3,7;9*4,5,6 。
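
The "row*columns" notation above can be produced from the bitmap, and turned back into it, with a few lines of code. This is a minimal sketch; the names `GLYPH_ZERO`, `encode` and `decode` are made up here, and the 11*11 cell size is taken from the table in this post.

```python
GLYPH_ZERO = [
    "00000000000",
    "00000000000",
    "00011100000",
    "00100010000",
    "00100010000",
    "00100010000",
    "00100010000",
    "00100010000",
    "00011100000",
    "00000000000",
    "00000000000",
]


def encode(glyph):
    """Write each non-empty row as 'row*col,col,...', joined with ';'."""
    parts = []
    for row, bits in enumerate(glyph, start=1):
        cols = [str(col) for col, bit in enumerate(bits, start=1) if bit == "1"]
        if cols:
            parts.append(f"{row}*{','.join(cols)}")
    return ";".join(parts)


def decode(text, size=11):
    """Rebuild the bitmap rows from the 'row*col,col;...' notation."""
    grid = [["0"] * size for _ in range(size)]
    for part in text.split(";"):
        row, cols = part.split("*")
        for col in cols.split(","):
            grid[int(row) - 1][int(col) - 1] = "1"
    return ["".join(cells) for cells in grid]


print(encode(GLYPH_ZERO))
```

Because the row and its columns are separated by explicit `*`, `,` and `;` marks, the same notation still reads unambiguously when the coordinates run up to 1000, which plain digit pairs such as 34 or 96 would not.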

Combine the lists, and first look at the blank space of the possible character range 1~1000 * 1~1000.

[ Last edited by zzz19760225 on 2017-7-23 at 07:14 ]
Floor 14 Posted 2016-06-26 18:40 ·  中国 海南 海口 电信
中蟒 (ChinesePython) is a Chinese programming language based on version 2.1.3 of Python, an interpreted language. Besides reserved words and variable names being in Chinese, many operations on built-in data types can also be performed in Chinese.

http://www.chinesepython.org/


Implementations of other Chinese programming languages

E Language:
It is a visual programming tool for Windows that feels like a Chinese translation of Visual Basic, and it seems to run only in the M$ environment. It has a good solution for entering Chinese instructions, and probably supports only Simplified Chinese. Because E Language seems to focus on graphical-interface applications, it is hard to compare with 中蟒, which currently has no Chinese visual interface editor (there are many English ones, such as Tkinter, wxWindows, pyGTK, pyQT, pyKDE). In terms of the language itself, though, 中蟒 should be better. The following E Language program segment is compared with its 中蟒 equivalent:
E Language:
Loop count start (3, container1)
    Paintboard1.scroll write line (container1)
Loop count end()
Container loop start (100, 180, 50, container1)
    Paintboard1.scroll write line (container1)
Container loop end()
Loop count start (5, container1)
    If true (container1 == 2)
        Go to loop end()
    If true (container1 == 4)
        Break out of loop()
    Paintboard1.scroll write line (container1)
Loop count end()

Result (E Language):
1
2
3
100
150
1
4

中蟒:
Take operator from range(3):
    Write operator
Take operator from range(100, 180, 50):
    Write operator
Take operator from range(5):
    If operator == 2:
        Continue
    If operator == 4:
        Interrupt
    Write operator

Result (中蟒):
0
1
2
100
150
0
1
3
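
For readers who want to check the Python-based column of the comparison, the same three loops can be written in standard Python 3. This is plain Python, not the Chinese-keyword syntax itself; the function name `run_loops` and the list used to collect the output are additions for this sketch.

```python
def run_loops():
    """Collect what the three loops would write, in order."""
    written = []
    for i in range(3):             # 0, 1, 2
        written.append(i)
    for i in range(100, 180, 50):  # 100, 150
        written.append(i)
    for i in range(5):
        if i == 2:
            continue               # skip 2
        if i == 4:
            break                  # stop before writing 4
        written.append(i)
    return written


print(run_loops())  # [0, 1, 2, 100, 150, 0, 1, 3]
```

Python's `range` counts from 0, which is why this column starts at 0.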
In addition, 中蟒 runs on multiple operating platforms and is relatively easy to use for system administration, networking, scientific computing, automation, and so on. Its open source code is of course also a big advantage. (I don't mean that because 中蟒's source code is open you shouldn't ask too much of it, but that because the source code is open, its quality is better assured!)
Chinese Programming:
Its development began before 1993, and it looks like Forth. However, it has comprehensive Chinese support, and it seems that even the underlying concepts of the compiler are based on Chinese thinking. Because no real technical documentation is available, I can't say how it works. It can be used on Windows and supports all graphical interfaces. In theory it could work on other operating platforms, but no such version seems to have been made.
Fever Building Blocks:
It belongs to the Visual family of programming tools. It is completely visual programming. There are various components, and the process of the program is pieced together by dragging and dropping. The statement on its website is "building" the program.
Bagua Programming Qianlong Edition:
Its full name is Beginners' All-purpose Graphical User Interface Code. It is also a fully visual programming tool. The process of writing the program is like moving different artifacts around.
Taiwan Chinese Forth Society:
Forth is a programming language with a very complete structure. Its virtual machine has a complete CPU architecture of its own, and programs are written around the concept of a stack. It feels a bit like a macro language: you define some words (subroutines), then define new words in terms of those, and so on. Compared with 中蟒, Forth offers great flexibility for customizing a Chinese programming language. However, it is very difficult to learn. I tinkered with it for a long time without understanding it, so there are no examples to compare. In fact, 中蟒 has many high-level data types built in, so comparing it with bare Forth is not fair. On the other hand, the training Forth gives in thinking about the basic operations of a computer system is something 中蟒 cannot match.
Yifu Intelligent Technology:
In 1990, Mr. Ding Chen Hansun (C. H. Ting) initiated a streamlined Forth in the Silicon Valley Forth Interest Group (SVFIG). The goal was a small Forth operating system that is simple and easy to learn, so it was given the prefix 'e' and became "eForth". Because eForth is light and small (only about 10 KB), it is especially suitable as an OS for embedded systems. Over the following ten years it was ported to about 20 CPUs, ranging from the 8-bit 6502 and 8051 to the 32-bit PowerPC, MIPS, ARM, etc.
Chinese Logo:
It is a Logo language controlled entirely in Chinese. By modern standards it is a bit dated. In fact, 中蟒 plus Tk would be enough to implement Chinese Logo completely. With voice input added, it could make very good preschool education software.
Perl Language:
It uses Perl's source code filtering function. It converts Chinese programs into English. The author's main purpose seems to be to study writing ancient poems with computer programs. It is not very practical but is really fun.
Chinese Pascal:
I have heard of it on the Internet, but I haven't found actual information. Welcome to provide information.
Chinese Cobol:
The Tianlong Chinese computer, which Mr. Zhu Bangfu developed in cooperation with Acer, included a Chinese business language. I never had the chance to see it.
Chinese Basic:
The Chinese card for Apple computers designed by Mr. Zhu Bangfu also included a Chinese Basic interpreter. I have never used it.
Bing Zheng Zheng:
Chinese C++! I remember seeing it on a certain BBS, but I can't find it again. Its approach is to filter the source code first, converting all Chinese variable names and keywords into ASCII, and then compile. However, debugging is very troublesome, because it is hard to remember how the converted names correspond to the originals.
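
The "filter the source first, then compile" approach described for the Perl and C++ entries can be sketched in a few lines of Python using the standard `tokenize` module. This is not the code of any of the projects listed; the keyword table below is hypothetical, chosen only to make the example run.

```python
import io
import tokenize

# Hypothetical keyword table; these Chinese spellings are illustrative only.
KEYWORDS = {
    "取": "for",
    "自": "in",
    "如果": "if",
    "继续": "continue",
    "中断": "break",
    "写": "print",
    "范围": "range",
}


def translate(source: str) -> str:
    """Replace Chinese name tokens with their Python equivalents."""
    tokens = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        text = tok.string
        if tok.type == tokenize.NAME:
            # Names not in the table (such as variables) pass through as-is.
            text = KEYWORDS.get(text, text)
        tokens.append((tok.type, text))
    return tokenize.untokenize(tokens)


CHINESE_SOURCE = (
    "取 算子 自 范围(5):\n"
    "    如果 算子 == 2:\n"
    "        继续\n"
    "    如果 算子 == 4:\n"
    "        中断\n"
    "    写(算子)\n"
)

python_source = translate(CHINESE_SOURCE)
exec(python_source)  # prints 0, 1 and 3 on separate lines
```

Working on tokens rather than raw text means Chinese words inside string literals are left alone. The debugging problem mentioned above shows up here too: any error message refers to the translated source, not to what the programmer typed.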

[ Last edited by zzz19760225 on 2017-6-3 at 13:19 ]
Floor 15 Posted 2016-06-26 18:41 ·  中国 海南 海口 电信
Pinyin, stroke count, pictographic form, phonetic form,
We need to go through all 18030 characters one by one,
Relate pinyin to stroke counts, not to serve human usage needs but to find a pattern. Because humans are doing the seeking, the pattern carries human content. Yet since what is being sought is the pattern itself, which is a usable form of power, it excludes direct human presence and is dehumanized, much like a dialectical contradiction.
GB18030 has complex content with many concept clusters, but these contents have unique features and commonalities at different levels. On the basis of the commonalities, they can be organically recombined.
There will be inflexible knowledge extending linearly, various coincidences of random assumptions, and miraculous wonders, which will be a very interesting world.
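
As a first step toward relating pinyin to stroke counts, the characters can be grouped by syllable and listed with tone and stroke count side by side. The sketch below uses a tiny hand-entered sample; the table `SAMPLE` and the function name `strokes_by_syllable` are made up for illustration, and a real pass over the whole character set would need a complete data source for readings and stroke counts.

```python
from collections import defaultdict

# Hand-entered sample: (character, syllable, tone, total strokes).
SAMPLE = [
    ("妈", "ma", 1, 6),
    ("麻", "ma", 2, 11),
    ("马", "ma", 3, 3),
    ("骂", "ma", 4, 9),
    ("大", "da", 4, 3),
    ("小", "xiao", 3, 3),
]


def strokes_by_syllable(rows):
    """Group characters by toneless syllable, sorted by tone then strokes."""
    table = defaultdict(list)
    for char, syllable, tone, strokes in rows:
        table[syllable].append((tone, strokes, char))
    return {syllable: sorted(entries) for syllable, entries in table.items()}


for syllable, entries in strokes_by_syllable(SAMPLE).items():
    print(syllable, entries)
```

With the full table in this shape, any pattern between sound and stroke count could be looked for column by column.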

The process control of Loongson (Longxin) and binary conversion; the connection between binary process control and a Chinese-character compiler; and, within that compiler, the realization and accumulation of application functions for the needs of groups, individuals, and processes.
An entity takes part in the binaries of Loongson computers, GCC, and VxWorks.
Outside VxWorks, a compiler uses VxWorks to set up an independently running exchange between binary and Chinese characters, so as to realize Chinese-character instructions.
On top of the Chinese-character instruction compiler, classify the long-term changes in demand in different directions.

{
It even affects our translation. For Chinese learners, the first thing to learn is the unavoidable ambiguity of this language. Chinese has no alphabet, and its words are not composed of letters. Instead, its words are composed of smaller words. For example, the Chinese word describing size (big and small) is composed of the word "big" and the word "small", and the Chinese word describing length (long and short) is composed of the word "long" and "short". Chinese people use dictionaries to arrange thousands of Chinese characters, which must be classified into about two hundred so-called radicals or families, all classified by related relationships. In each category of related words, dozens of Chinese characters are sorted in order according to the total number of strokes they require, from the smallest with one stroke to the largest with up to seventeen strokes.

The pronunciation and tone of words add complexity to this language. The influence of tone is that one syllable may have four meanings. A typical example is "ma". In the first tone, ma means mother. In the second tone, ma means numb. In the third tone, ma means horse. And in the fourth tone, ma falls sharply, meaning scold. Chinese people must speak loudly so that the difference in tone can be heard. Another reason for ambiguity is that the number of syllables used in Chinese is so small. English uses ten thousand different syllables, while Chinese has only four hundred syllables. Therefore, many words have the same pronunciation. Puns and misunderstandings are everywhere.
}




Announcement of National Standards of the People's Republic of China
http://www.gov.cn/fuwu/bzxxcx/bzh.htm

[ Last edited by zzz19760225 on 2017-8-31 at 15:47 ]