mirror of
https://github.com/9fans/plan9port.git
synced 2025-01-15 11:20:03 +00:00
50 lines
1.5 KiB
Text
50 lines
1.5 KiB
Text
.tr -\(hy
|
|
.TL
|
|
Hello World
|
|
.br
|
|
or
|
|
.br
|
|
Καλημέρα κόσμε
|
|
.br
|
|
or
|
|
.br
|
|
こんにちは 世界
|
|
.AU
|
|
Rob Pike
|
|
Ken Thompson
|
|
.AI
|
|
.MH
|
|
.AB
|
|
Plan 9 from Bell Labs has recently been converted from ASCII
|
|
to an ASCII-compatible variant of Unicode, a 16-bit character set.
|
|
In this paper we explain the reasons for the change,
|
|
describe the character set and representation we chose,
|
|
and present the programming models and software changes
|
|
that support the new text format.
|
|
Although we stopped short of full internationalization\(emfor
|
|
example, system error messages are in Unixese, not Japanese\(emwe
|
|
believe Plan 9 is the first system to treat the representation
|
|
of all major languages on a uniform, equal footing throughout all its
|
|
software.
|
|
.AE
|
|
.SH
|
|
Introduction
|
|
.PP
|
|
The world is multilingual but most computer systems
|
|
are based on English and ASCII or worse.
|
|
The pending release of Plan 9 [Pike90], a new distributed operating
|
|
system from Bell Laboratories, seemed a good occasion
|
|
to correct this chauvinism.
|
|
It is easier to make such deep changes when building new systems than
|
|
by retrofitting old ones.
|
|
.PP
|
|
The ANSI C standard [ANSIC] contains some guidance on the matter of
|
|
`wide' and `multi-byte' characters but falls far short of
|
|
solving the myriad associated problems.
|
|
We could find no literature on how to convert a
|
|
.I system
|
|
to larger character sets, although some individual
|
|
.I programs
|
|
have been converted.
|
|
This paper reports what we discovered as we
|
|
explored the problem of representing multilingual
|