项目作者: christianparpart

项目描述 :
Modern C++17 unicode library
高级语言: C++
项目地址: git://github.com/christianparpart/libunicode.git
创建时间: 2020-05-16T15:13:30Z
项目社区:https://github.com/christianparpart/libunicode

开源协议:Apache License 2.0

下载


C++20
CI Build

Modern C++20 Unicode Library

The goal of this library is to bring painless unicode support to C++ with simple and easy to understand APIs.

The API naming conventions are chosen to look familiar to those using the C++ standard libary.

Feature Overview

  • API for accessing UCD properties
  • UTF8 <-> UTF32 conversion
  • wcwidth equivalent (int unicode::width(char32_t))
  • grapheme segmentation (UTS algorithm)
  • symbol/emoji segmentation (UTS algorithm)
  • script segmentation UTS 24
  • unit tests for most parts (wcwidth / segmentation)
  • generic text run segmentation (top level segmentation API suitable for text shaping implementations)
  • word segmentation (UTS algorithm)
  • CLI tool: uc-inspect for inspecting input files by code point properties, grapheme cluster, word, script, …

Unicode Technical Specifications

  • UTS 11 - character width
  • UTS 24 - script property
  • UTS 29 - text segmentation (grapheme cluster, word boundary)
  • UTS 51 - Emoji

Integrate with your CMake project

  1. git submodule add --name libunicode https://github.com/contour-terminal/libunicode 3rdparty/libunicode
  1. add_subdirectory(3rdparty/libunicode)
  2. add_executable(your_tool your_tool.cpp)
  3. target_link_libraries(your_tool PRIVATE unicode::unicode)

Contributing

Users of this library

Disclaimer

This library is -in terms of features- by no means competive to the ICU library, but it attempts to
provide a clean and intuitive modern C++ API for those that do not want to fight legacy-style C APIs.

I hope that over time we can add more and more features to this library to conform to the Unicode
specification eventually at some point and I welcome everyone to contribute to it by forking the
library, creating pull requests, or even just constructive feedback.

License

  1. libunicode - a modern C++20 unicode library
  2. -------------------------------------------
  3. Licensed under the Apache License, Version 2.0 (the "License");
  4. you may not use this file except in compliance with the License.
  5. Unless required by applicable law or agreed to in writing, software
  6. distributed under the License is distributed on an "AS IS" BASIS,
  7. WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  8. See the License for the specific language governing permissions and
  9. limitations under the License.