
honpei_project's Introduction

HonPei_project

tasks

  • not done yet

  • done

  • Automate search for Zhonga ZH-RU Dictionary. This cannot be done by simple URL manipulation, since each character has its own ID, which is present in the URL and makes automatic searching of individual characters impossible. For words that consist of more than one character this is not a problem though.

  • Create a script that can convert Traditional Chinese Characters in the clipboard to Simplified Characters and then send the simplified version to the clipboard again.

  • Automate search for Mini Taiwanese Mandarin Dictionary. This resource is especially useful for searching individual characters, and the problem is the same as described for Zhonga ZH-RU: each character has its own ID, which is present in the URL.

  • Automate the pronunciation field in Anki cards. The idea is to add pronunciation effortlessly, minimizing user input as much as possible. So far we have thought about creating a Python script that identifies the TW, downloads its pronunciation from Forvo, and automatically puts it in the right place. The challenge is making it work in bulk, so we don't have to go card by card manually.

  • Create a script that converts a .srt subtitle file in Simplified Chinese to Traditional Chinese.

  • Create a script that takes a Chinese dictionary file (like the ones for GoldenDict, Yomichan, etc.), identifies the Pinyin in the entries and replaces it with the corresponding Zhuyin (the phonetic system used in Taiwan). Romanization is bad.

  • Create a script to quickly and efficiently use this OCR engine for Mandarin Chinese. Choose a keybind, press it, and select an area with Chinese text with your mouse; the OCR will run and the recognized text will be sent to your clipboard. ImageMagick might be useful.

  • Analyze this tool's code and change its built-in resources for Mandarin Chinese ones.

Dictionaries can NOT be imported dynamically. They are built into the app itself. If needed, one can build the dictionary database and compile the app oneself.
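For the .srt conversion task listed above (Simplified to Traditional), a minimal sketch might look like the following. It is only an illustration: the `S2T` table is a tiny hypothetical mapping invented for the example, and `convert_text` and `convert_srt` are made-up names. A real script would need a full conversion dictionary or library, because character-by-character mapping is lossy (one Simplified character can correspond to several Traditional ones, e.g. 发 to 發 or 髮).

```python
import re

# Tiny illustrative Simplified -> Traditional table (an assumption for this
# sketch); a real script would use a full conversion dictionary or library.
S2T = {"说": "說", "话": "話", "这": "這", "个": "個", "们": "們"}

# SRT cue timing line, e.g. "00:00:01,000 --> 00:00:03,500"
TIMING = re.compile(r"^\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}")


def convert_text(text, table=S2T):
    """Replace each character found in the table; leave the rest unchanged."""
    return "".join(table.get(ch, ch) for ch in text)


def convert_srt(srt, table=S2T):
    """Convert only subtitle text lines, keeping cue numbers and timings."""
    out = []
    for line in srt.splitlines():
        stripped = line.strip()
        if stripped.isdigit() or TIMING.match(stripped):
            out.append(line)  # cue index or timing line: keep as-is
        else:
            out.append(convert_text(line, table))
    return "\n".join(out)
```

To use it on a file, read the subtitle text as UTF-8, pass it to `convert_srt`, and write the result to a new .srt file. The same structure would work in the other direction (Traditional to Simplified) by swapping in a reversed table.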

  • Word segmentation can be achieved by using a trained version of mecab and a modification of gd-mecab. It's also worth noting that the scripts used to create the CSV can also be used to create a Cantonese version of the same program, by taking the data from cc-canto. DONE
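For the Pinyin-to-Zhuyin task in the list above, one possible starting point is sketched below. It assumes the dictionary writes Pinyin with tone numbers (such as `ni3 hao3`), which may not hold for every dictionary format. The `SYLLABLES` table covers only a handful of syllables chosen for illustration, and the function names are hypothetical; a complete converter would need every Mandarin syllable or a proper initial/final decomposition.

```python
import re

# Small illustrative syllable table (an assumption for this sketch); a real
# converter needs a full syllable table or an initial/final algorithm.
SYLLABLES = {
    "ni": "ㄋㄧ",
    "hao": "ㄏㄠ",
    "zhong": "ㄓㄨㄥ",
    "wen": "ㄨㄣ",
    "ma": "ㄇㄚ",
}

# Zhuyin tone marks: tone 1 is unmarked, tones 2-4 follow the syllable,
# and the neutral tone (written 5 here) is a dot placed before it.
TONES = {"1": "", "2": "ˊ", "3": "ˇ", "4": "ˋ"}

# A run of Latin letters followed by a tone digit, e.g. "hao3"
NUMBERED = re.compile(r"([a-z]+)([1-5])", re.IGNORECASE)


def syllable_to_zhuyin(match):
    """Convert one matched numbered-Pinyin syllable, if it is in the table."""
    base, tone = match.group(1).lower(), match.group(2)
    zhuyin = SYLLABLES.get(base)
    if zhuyin is None:
        return match.group(0)  # unknown syllable: leave the Pinyin untouched
    if tone == "5":
        return "˙" + zhuyin
    return zhuyin + TONES[tone]


def pinyin_to_zhuyin(text):
    """Convert numbered Pinyin syllables (e.g. 'ni3 hao3') found in text."""
    return NUMBERED.sub(syllable_to_zhuyin, text)
```

Leaving unknown syllables untouched is a deliberate choice in this sketch: it makes gaps in the table visible in the output instead of silently dropping text from dictionary entries.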

Resources

Cantonese Yomichan Dictionary

It was taken from this repo.

extract helper

https://github.com/KonstantinDjairo/baidu-pan-downloader

https://github.com/KonstantinDjairo/chinglish

honpei_project's People

Contributors

konstantindjairo, bogozarnyj
