CJKV Information Processing, 2nd Edition
- Length: 912 pages
- Edition: 2
- Language: English
- Publisher: O'Reilly Media
- Publication Date: 2009-01-05
- ISBN-10: 0596514476
- ISBN-13: 9780596514471
- Sales Rank: #1822091
CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing
First published a decade ago, CJKV Information Processing quickly became the unsurpassed source of information on processing text in Chinese, Japanese, Korean, and Vietnamese. It has now been thoroughly updated to provide web and application developers with the latest techniques and tools for disseminating information directly to audiences in East Asia. This second edition reflects the considerable impact that Unicode, XML, OpenType, and newer operating systems such as Windows XP, Vista, Mac OS X, and Linux have had on East Asian text processing in recent years.
Written by its original author, Ken Lunde, a Senior Computer Scientist in CJKV Type Development at Adobe Systems, this book will help you:
- Learn about CJKV writing systems and scripts, and their transliteration methods
- Explore trends and developments in character sets and encodings, particularly Unicode
- Examine the world of typography, specifically how CJKV text is laid out on a page
- Learn information-processing techniques, such as code conversion algorithms and how to apply them using different programming languages
- Process CJKV text using different platforms, text editors, and word processors
- Become more informed about CJKV dictionaries, dictionary software, and machine translation software and services
- Manage CJKV content and presentation when publishing in print or for the Web
Internationalizing and localizing applications is paramount in today’s global market — especially for audiences in East Asia, the fastest-growing segment of the computing world. CJKV Information Processing will help you understand how to develop web and other applications effectively in a field that many find difficult to master.
CJKV Information Processing covers all major writing systems for Vietnamese (including Quốc ngữ, chữ Nôm, and chữ Hán), Japanese (kana and kanji), Korean (hangul and hanja), and Chinese (hanzi), plus the various means of integrating multiple character sets and systems for transliterating these languages into the Latin alphabet. Author Ken Lunde explains what’s involved in taking input in the various languages and goes into great detail about output, including some detailed coverage of professional-quality computer typesetting with Chinese, Japanese, Korean, and Vietnamese (CJKV) characters.
But CJKV Information Processing doesn’t restrict itself to input and output issues. There’s extensive coverage of the special issues that arise when you attempt to work with multibyte characters inside programs, especially Java programs, since that language is particularly adroit at internationalization tasks. You’ll find ready-to-use algorithms for detecting and converting characters among the various sets.
Almost half of the book is consumed by exhaustive character tables listing every CJKV character set ever defined by a standards body, software vendor, or other organization. Comprehensive is the operative word here: Lunde even gives space to 145 hanzi characters defined by Hong Kong’s Department of the Judiciary. You’ll find a full suite of keyboard mapping tables, too. With the same thoroughness and clarity that made his Understanding Japanese Information Processing such a hit among members of the Pacific Rim crowd, Ken Lunde provides an unparalleled guide to computing with the CJKV character sets. –David Wall
Table of Contents
Chapter 1. CJKV Information Processing Overview
Chapter 2. Writing Systems and Scripts
Chapter 3. Character Set Standards
Chapter 4. Encoding Methods
Chapter 5. Input Methods
Chapter 6. Font Formats, Glyph Sets, and Font Tools
Chapter 7. Typography
Chapter 8. Output Methods
Chapter 9. Information Processing Techniques
Chapter 10. OSes, Text Editors, and Word Processors
Chapter 11. Dictionaries and Dictionary Software
Chapter 12. Web and Print Publishing