unicode_data's Introduction

Parses data from the UCD, UCA, and CLDR.

UCD:

  • UNICODE: HashMap<u32, Codepoint> - UnicodeData.txt, information about Unicode code points
  • BLOCKS: HashMap<String, CodepointsBlock> - Blocks.txt, code point blocks (ranges)
  • QC_NFD, QC_NFKD, QC_NFC, QC_NFKC: Vec<char> - DerivedNormalizationProps.txt, normalization quick checks
  • COMPOSITION_EXCLUSIONS: Vec<u32> - CompositionExclusions.txt, composition exclusions
  • NORMALIZATION_TESTS: Vec<NormalizationTest> - NormalizationTest.txt, normalization tests
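
As an illustration of what building a map like UNICODE involves, here is a minimal sketch of parsing UnicodeData.txt records. The `Codepoint` struct below is a reduced, hypothetical stand-in (the crate's real type is not shown in this README), and the function names are assumptions. The sketch reads only a few of the semicolon-separated fields and does not expand the `<..., First>` / `<..., Last>` range records.

```rust
use std::collections::HashMap;

/// Hypothetical, reduced stand-in for the crate's `Codepoint` type.
#[derive(Debug, Clone, PartialEq)]
pub struct Codepoint {
    pub code: u32,
    pub name: String,
    pub general_category: String,
    /// Canonical combining class (field 3).
    pub ccc: u8,
    /// Compatibility tag such as "<noBreak>", if the mapping has one.
    pub decomposition_tag: Option<String>,
    /// Decomposition mapping (field 5), without the tag.
    pub decomposition: Vec<u32>,
}

/// Parses one UnicodeData.txt record; returns None for malformed lines.
pub fn parse_unicode_data_line(line: &str) -> Option<Codepoint> {
    let fields: Vec<&str> = line.split(';').collect();
    if fields.len() < 6 {
        return None;
    }
    let code = u32::from_str_radix(fields[0], 16).ok()?;
    let ccc = fields[3].parse::<u8>().ok()?;

    let mut decomposition_tag = None;
    let mut decomposition = Vec::new();
    for token in fields[5].split_whitespace() {
        if token.starts_with('<') {
            // A leading <tag> marks a compatibility decomposition.
            decomposition_tag = Some(token.to_string());
        } else {
            decomposition.push(u32::from_str_radix(token, 16).ok()?);
        }
    }

    Some(Codepoint {
        code,
        name: fields[1].to_string(),
        general_category: fields[2].to_string(),
        ccc,
        decomposition_tag,
        decomposition,
    })
}

/// Collects all parseable records into a map keyed by code point.
pub fn parse_unicode_data(text: &str) -> HashMap<u32, Codepoint> {
    text.lines()
        .filter_map(parse_unicode_data_line)
        .map(|cp| (cp.code, cp))
        .collect()
}
```

Keying the map by `u32` rather than `char` lets it hold surrogate code points, which UnicodeData.txt lists but `char` cannot represent.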

Normalization:

  • NFD, NFKD: HashMap<u32, Vec<Codepoint>> - full NFD / NFKD decompositions
  • NFC, NFKC: HashMap<u32, Vec<Codepoint>> - NFC / NFKC precompositions
  • COMPOSITION_PAIRS: HashMap<u32, HashMap<u32, Codepoint>> - composable pairs of code points
  • COMBINES_BACKWARDS: HashMap<u32, HashMap<u32, Codepoint>> - code points that combine with preceding ones
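
The following sketch shows one way tables shaped like the decomposition and composition-pair maps above could be derived from raw decomposition mappings. It is an assumption-laden illustration, not the crate's code: the function names are hypothetical, plain `u32` values stand in for `Codepoint`, the input map is assumed to hold canonical mappings only, and the sketch skips canonical reordering, algorithmic Hangul handling, and the exclusion of singletons and non-starter decompositions.

```rust
use std::collections::HashMap;

/// Recursively expands `code` through the decomposition mappings,
/// appending the fully decomposed sequence to `out`.
pub fn fully_decompose(code: u32, mappings: &HashMap<u32, Vec<u32>>, out: &mut Vec<u32>) {
    match mappings.get(&code) {
        Some(parts) => {
            for &part in parts {
                fully_decompose(part, mappings, out);
            }
        }
        // No mapping: the code point decomposes to itself.
        None => out.push(code),
    }
}

/// Builds a nested map `first -> second -> composite` from every
/// two-element mapping whose composite is not listed in `exclusions`.
pub fn build_composition_pairs(
    mappings: &HashMap<u32, Vec<u32>>,
    exclusions: &[u32],
) -> HashMap<u32, HashMap<u32, u32>> {
    let mut pairs: HashMap<u32, HashMap<u32, u32>> = HashMap::new();
    for (&composite, parts) in mappings {
        if exclusions.contains(&composite) {
            continue;
        }
        if let [first, second] = parts.as_slice() {
            pairs.entry(*first).or_default().insert(*second, composite);
        }
    }
    pairs
}
```

The nested map makes composition a two-step lookup: first by the starter, then by the following code point, so a missing starter is rejected without scanning any pairs.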

UCA:

  • DUCET: Vec<WeightsEntry> - allkeys.txt, the Default Unicode Collation Element Table (DUCET)
  • DUCET_FILTERED_TRIE: HashMap<u32, TrieNode> - DUCET as a trie containing only NFD code points
  • COLLATION_TEST_DUCET_NON_IGNORABLE, COLLATION_TEST_DUCET_SHIFTED: Vec<CollationTest> - CollationTest, collation tests
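
To make the allkeys.txt entry concrete, here is a minimal sketch of parsing one line of that file into a record. The `WeightsEntry` and `Weights` structs below are hypothetical stand-ins for the crate's types, and `parse_allkeys_line` is an assumed name. The sketch treats `@` directives and comment lines as non-entries, and it accepts any number of weight levels per collation element.

```rust
/// Hypothetical stand-in for one collation element, e.g. [.1CAD.0020.0008].
#[derive(Debug, Clone, PartialEq)]
pub struct Weights {
    /// True when the element starts with '*' (variable weighting).
    pub variable: bool,
    /// Primary, secondary, tertiary (and any further) weights.
    pub levels: Vec<u16>,
}

/// Hypothetical stand-in for the crate's `WeightsEntry` type.
#[derive(Debug, Clone, PartialEq)]
pub struct WeightsEntry {
    /// One code point, or several for a contraction.
    pub codes: Vec<u32>,
    pub weights: Vec<Weights>,
}

/// Parses one allkeys.txt line; returns None for comments,
/// '@' directives, blank lines, and malformed input.
pub fn parse_allkeys_line(line: &str) -> Option<WeightsEntry> {
    // Drop the trailing comment, if any.
    let line = line.split('#').next()?.trim();
    if line.is_empty() || line.starts_with('@') {
        return None;
    }
    let (codes_part, weights_part) = line.split_once(';')?;

    let codes = codes_part
        .split_whitespace()
        .map(|c| u32::from_str_radix(c, 16).ok())
        .collect::<Option<Vec<u32>>>()?;
    if codes.is_empty() {
        return None;
    }

    let mut weights = Vec::new();
    for element in weights_part.split('[').skip(1) {
        let element = element.split(']').next()?.trim();
        let (variable, body) = match element.strip_prefix('*') {
            Some(rest) => (true, rest),
            None => (false, element.strip_prefix('.')?),
        };
        let levels = body
            .split('.')
            .map(|w| u16::from_str_radix(w, 16).ok())
            .collect::<Option<Vec<u16>>>()?;
        weights.push(Weights { variable, levels });
    }

    Some(WeightsEntry { codes, weights })
}
```

A line such as `0041  ; [.1CAD.0020.0008] # LATIN CAPITAL LETTER A` would yield one code point and one non-variable element; the weight values here are illustrative, since they differ between Unicode versions.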

CLDR:

  • CLDR_UND: Vec<WeightsEntry> - allkeys_CLDR.txt, the CLDR version of DUCET
  • CLDR_FILTERED_TRIE: HashMap<u32, TrieNode> - same structure as DUCET_FILTERED_TRIE
  • COLLATION_TEST_CLDR_NON_IGNORABLE, COLLATION_TEST_CLDR_SHIFTED: Vec<CollationTest> - CollationTest, collation tests
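
Both filtered tries listed in this README have the shape HashMap<u32, TrieNode>. The sketch below shows how such a trie could store weights for single code points and contractions and find the longest contiguous match. The `TrieNode` layout, the flattened `Vec<u16>` payload, and the function names are assumptions for illustration only; the crate's actual node type may differ, and discontiguous contraction matching from the UCA is not covered.

```rust
use std::collections::HashMap;

/// Hypothetical trie node: optional weights for the sequence ending
/// here, plus children keyed by the next code point.
#[derive(Debug, Default)]
pub struct TrieNode {
    pub weights: Option<Vec<u16>>,
    pub children: HashMap<u32, TrieNode>,
}

/// Inserts a code point sequence with its (flattened) weights.
pub fn insert(root: &mut HashMap<u32, TrieNode>, codes: &[u32], weights: Vec<u16>) {
    if let Some((&first, rest)) = codes.split_first() {
        let mut node = root.entry(first).or_default();
        for &code in rest {
            node = node.children.entry(code).or_default();
        }
        node.weights = Some(weights);
    }
}

/// Returns the length of the longest prefix of `input` that has weights,
/// together with those weights, or None if no prefix matches.
pub fn longest_match<'a>(
    root: &'a HashMap<u32, TrieNode>,
    input: &[u32],
) -> Option<(usize, &'a [u16])> {
    let mut best = None;
    let mut level = root;
    for (i, code) in input.iter().enumerate() {
        match level.get(code) {
            Some(node) => {
                if let Some(w) = &node.weights {
                    best = Some((i + 1, w.as_slice()));
                }
                level = &node.children;
            }
            None => break,
        }
    }
    best
}
```

Keeping the top level as a plain HashMap keyed by the first code point matches the signature given above and lets a lookup reject unknown starters with a single hash probe.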
