Chapter 1. Unicode 0.1 preview 3

Mathias Gaunard

Distributed under the Boost Software License, Version 1.0. (See accompanying file LICENSE_1_0.txt or copy at http://www.boost.org/LICENSE_1_0.txt)

Table of Contents

Preface
Motivation
Introduction to Unicode
Notion of character
Character set
Encodings
Combining character sequences
Grapheme clusters
Normalization
Other operations
Character properties
Linking the library
The concepts behind this library
Overview
Range operations
Composition and Normalization
String searching algorithms
User's Guide
Organization
Examples
convert
characters
compose
search
source_input
Iterator/Range reference
Concepts
Header <boost/iterator/any_iterator.hpp>
Header <boost/iterator/consumer_concept.hpp>
Header <boost/iterator/consumer_iterator.hpp>
Header <boost/iterator/consumer_iterator_fwd.hpp>
Header <boost/iterator/join_iterator.hpp>
Header <boost/iterator/pipe_concept.hpp>
Header <boost/iterator/pipe_iterator.hpp>
Header <boost/iterator/pipe_iterator_fwd.hpp>
Header <boost/range/any_range.hpp>
Unicode reference
Header <boost/cuchar.hpp>
Header <boost/unicode/cat.hpp>
Header <boost/unicode/combining.hpp>
Header <boost/unicode/compose.hpp>
Header <boost/unicode/compose_fwd.hpp>
Header <boost/unicode/graphemes.hpp>
Header <boost/unicode/hangul.hpp>
Header <boost/unicode/pipe_def.hpp>
Header <boost/unicode/search.hpp>
Header <boost/unicode/static_utf.hpp>
Header <boost/unicode/static_utf_codecs.hpp>
Header <boost/unicode/string_cp.hpp>
Header <boost/unicode/surrogates.hpp>
Header <boost/unicode/ucd/block_types.hpp>
Header <boost/unicode/ucd/properties.hpp>
Header <boost/unicode/ucd/properties_types.hpp>
Header <boost/unicode/utf.hpp>
Header <boost/unicode/utf_codecs.hpp>
Appendices
Appendix A: Unicode in source files
Appendix B: Rationale
Appendix C: Future Work
Appendix D: Acknowledgements

Unicode is the industry standard to consistently represent and manipulate text across most of the world's writing systems.

Description

This library aims to provide the foundational tools needed to represent and process natural-language text accurately in C++, in a portable and robust manner, by implementing parts of the Unicode Standard. Its goal is to make internationalized applications practical to write.

This library is environment-independent. It deliberately does not build on either the standard C++ locale facilities or the standard string facilities, both of which were judged ill-suited to Unicode.
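As an illustration of why the standard string facilities are a poor fit, the following minimal sketch uses only the C++ standard library and is not part of this library's API. The helper names count_code_points, sample_text and check_sample are hypothetical, and the code assumes well-formed UTF-8 input. It shows that std::string::size() reports code units (bytes) rather than code points.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Counts Unicode code points in a UTF-8 encoded string by skipping
// continuation bytes (those of the form 10xxxxxx).
// Assumption: the input is well-formed UTF-8; no validation is performed.
inline std::size_t count_code_points(const std::string& utf8)
{
    std::size_t count = 0;
    for (unsigned char byte : utf8) {
        if ((byte & 0xC0) != 0x80) {
            ++count;  // lead byte or ASCII byte: starts a new code point
        }
    }
    return count;
}

// "héllo" spelled out as UTF-8 bytes: U+00E9 is encoded as 0xC3 0xA9.
inline std::string sample_text()
{
    return std::string("h\xC3\xA9") + "llo";
}

// Usage: the byte length and the code point count disagree.
inline void check_sample()
{
    const std::string text = sample_text();
    assert(text.size() == 6);               // six UTF-8 code units
    assert(count_code_points(text) == 5);   // five code points
}
```

Even the code point count is not what a user would call the number of characters once combining character sequences and grapheme clusters are involved, so a plain byte string interface cannot answer such questions on its own.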

The current version is locale-agnostic, but a subsystem for tailored locale behaviour may be added in the future.

Warning

Boost.Unicode is a library in development and is not part of Boost.

How to use this manual

Icons mark certain passages to indicate their relevance. Each icon precedes the text it applies to:

Table 1.1. Icons

Icon    Name    Meaning
note    Note    Information provided is auxiliary but will give the reader a deeper insight into a specific topic. May be skipped.
alert   Alert   Information provided is of utmost importance.
tip     Tip     A potentially useful and helpful piece of information.


Last revised: August 27, 2009 at 23:25:41 GMT

