hebrew's Introduction

Hi there 👋

I'm Avi, a developer with a strong background in custom full-stack software design and implementation, usually on the web. Frequently in Python, ideally in Kotlin, and sometimes in PHP.

📫 You can reach me via the contact information listed on my personal site.


My Personal Projects
  • FormHole - A FastAPI project that supports the storage and management of objects of unknown schema.
  • Gist blog - Convert your GitHub gists into a rich, discoverable personal blog.
  • Hebrew - A Python package with methods to handle the complexities of Hebrew text.
  • rich-tools - A Python package with helpful functions for use alongside the rich Python library.
  • Web Parser API - An API for parsing online documents. (E.g. HTML, JSON, etc.)
My Favorite Books
My Favorite Libraries

Libraries I'm proficient with

Libraries I'm excited about

hebrew's People

Contributors

avi-perl, dickermoshe, taber, yoavcaspi

hebrew's Issues

Gematria Support

I would like to add Gematria support to this package.
Where would be the best place for it?
From my point of view, it should live in the Hebrew class.
What do you think?
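
For discussion, here is a minimal sketch of what a gematria helper could look like. It is a standalone function, not part of the package's API. The name `gematria`, the value table, and the choice to give final letters the same value as their regular forms (the standard scheme) are assumptions made for illustration. Niqqud, taamim, and any non-Hebrew characters count as zero.

```python
# Values for U+05D0 (alef) through U+05EA (tav), in code point order.
# Final forms (kaf, mem, nun, pe, tsadi) sit next to their regular forms
# in Unicode and share their values in the standard scheme.
_VALUES_IN_CODE_POINT_ORDER = [
    1, 2, 3, 4, 5, 6, 7, 8, 9,      # alef .. tet
    10, 20, 20, 30, 40, 40,         # yod, final kaf, kaf, lamed, final mem, mem
    50, 50, 60, 70, 80, 80,         # final nun, nun, samekh, ayin, final pe, pe
    90, 90, 100, 200, 300, 400,     # final tsadi, tsadi, qof, resh, shin, tav
]

GEMATRIA_VALUES = {
    chr(0x05D0 + offset): value
    for offset, value in enumerate(_VALUES_IN_CODE_POINT_ORDER)
}


def gematria(text: str) -> int:
    """Sum the standard gematria values of the Hebrew letters in text.

    Characters without an entry in the table (points, accents, spaces,
    Latin letters) contribute 0.
    """
    return sum(GEMATRIA_VALUES.get(ch, 0) for ch in text)


# alef (1) + dalet (4) + final mem (40)
adam_value = gematria("\u05d0\u05d3\u05dd")
```

Whether this belongs on the Hebrew class as a method or property, as suggested above, is a design decision for the maintainers.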

Parse the taamim

Hey,

Is there a way to parse the taamim? For example:

Hebrew(אָדָ֛ם).taam = 'Tevir'

Thanks,

Jordan
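
One possible approach, sketched with only the standard library: Unicode names the cantillation marks in the Hebrew block `HEBREW ACCENT <NAME>`, so the names can be read off with `unicodedata`. The function name `taamim_names` is hypothetical, and the `.taam` attribute in the request above is not something the package is known to provide. The names returned are Unicode's own, which do not always match the traditional names used in a given community.

```python
import unicodedata

_ACCENT_PREFIX = "HEBREW ACCENT "


def taamim_names(text: str) -> list[str]:
    """Return the Unicode names of the cantillation marks found in text.

    Each name has the "HEBREW ACCENT " prefix removed and is title-cased,
    e.g. U+059B becomes "Tevir". Marks are returned in order of appearance.
    """
    names = []
    for ch in text:
        # name(ch, "") avoids a ValueError for unnamed code points.
        name = unicodedata.name(ch, "")
        if name.startswith(_ACCENT_PREFIX):
            names.append(name[len(_ACCENT_PREFIX):].title())
    return names


# The word from the example above: alef, qamats, dalet, qamats, tevir, final mem.
word = "\u05d0\u05b8\u05d3\u05b8\u059b\u05dd"
found = taamim_names(word)
```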

The Method `Hebrew.no_niqqud` is misleading

Hi there!

Thank you for this awesome library. It is very useful! I'm currently using it to create a machine-learning model for automatic niqqud.

I looked at the method Hebrew.no_niqqud, and its current form is very misleading. This method strips all of the niqqud characters from the word, but this alone doesn't make the word without niqqud.

Here is an example:
Look at the word אֹהֶל (tent). If I want to write this word without niqqud, I need to add the letter ו to the word: אוהל.
Currently, the method Hebrew.no_niqqud turns the word into אהל, which is incorrect.

My suggestion:
Rename the method Hebrew.no_niqqud to Hebrew.strip_niqqud. This is a much more accurate name for the method that is less misleading.
After that, create a new method named Hebrew.ctiv_male (full spelling) that removes niqqud smartly, adding ו and י (vav and yud) wherever needed.

Let me know what you think about the idea!
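
To make the distinction concrete, here is a small standalone sketch of mechanical stripping. It is not the package's implementation. The function name `strip_points` is hypothetical, and it treats every character that Unicode names `HEBREW POINT ...` as niqqud, which is an assumption and may be broader or narrower than the package's own definition.

```python
import unicodedata


def strip_points(text: str) -> str:
    """Drop every character whose Unicode name starts with "HEBREW POINT ".

    This only deletes marks. It never inserts vav or yud, so the result is
    the bare consonantal skeleton rather than full (male) spelling.
    """
    return "".join(
        ch for ch in text
        if not unicodedata.name(ch, "").startswith("HEBREW POINT ")
    )


# alef, holam, he, segol, lamed: the pointed word from the example above.
pointed = "\u05d0\u05b9\u05d4\u05b6\u05dc"
stripped = strip_points(pointed)   # alef, he, lamed
full_spelling = "\u05d0\u05d5\u05d4\u05dc"  # alef, vav, he, lamed
```

The stripped result is three letters long, while the full spelling has four. A method that produces the latter needs linguistic rules on top of stripping, which is outside the scope of this sketch.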

Hebrew.no_taamim fails with "AttributeError: 'str' object has no attribute 'string'"

Trying this:

hebrew_text = "\u05d1\u05b8\u05bc\u05a3\u05d8\u05b7\u05d7 \u05d1\u05b8\u05bc\u05ad\u05d4\u05bc \u05dc\u05b5\u05a3\u05d1 \u05d1\u05b7\u05bc\u05e2\u05b0\u05dc\u05b8\u0591\u05d4\u05bc \u05d5\u05b0\u059d\u05e9\u05b8\u05c1\u05dc\u05b8\u0597\u05dc \u05dc\u05b9\u05a3\u05d0 \u05d9\u05b6\u05d7\u05b0\u05e1\u05b8\u05bd\u05e8\u05c3"
print (hebrew_text)
hebrew_text2 = Hebrew.no_taamim(hebrew_text, False, False)
print (hebrew_text2)

Error is this:


בָּ֣טַח בָּ֭הּ לֵ֣ב בַּעְלָ֑הּ וְ֝שָׁלָ֗ל לֹ֣א יֶחְסָֽר׃
Traceback (most recent call last):
  File "C:\GitHub\GetBible\CommonTransliterate.py", line 74, in <module>
    hebrew_text2 = Hebrew.no_taamim(hebrew_text, False, False)
  File "C:\GitHub\GetBible\hebrew\hebrew_obj.py", line 169, in no_taamim
    string = self.no_maqaf().string if remove_maqaf else self.string
AttributeError: 'str' object has no attribute 'string'

Same with:
hebrew_text2 = Hebrew.no_taamim(hebrew_text)
hebrew_text2 = Hebrew.no_taamim(hebrew_text, True, True)

Are you planning on adding a transliteration function? I'm working on that now.

Thanks,
Neal
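
The traceback above shows `no_taamim` reading `self.string`, which suggests it is an instance method. Calling it on the class with a plain `str` binds that string to `self`, and a `str` has no `.string` attribute. The sketch below reproduces the same error with a hypothetical stand-in class (`HebrewText`, not the package's `Hebrew` class) and shows that wrapping the text in an instance first avoids it. Whether the real class behaves identically is an assumption based only on the traceback.

```python
import unicodedata


class HebrewText:
    """Hypothetical stand-in for a string wrapper; NOT the package's class."""

    def __init__(self, string: str):
        self.string = string

    def no_taamim(self) -> "HebrewText":
        # Reads self.string, like the line shown in the traceback.
        kept = "".join(
            ch for ch in self.string
            if not unicodedata.name(ch, "").startswith("HEBREW ACCENT ")
        )
        return HebrewText(kept)


# Third word of the verse in the report: lamed, tsere, munah, bet.
text = "\u05dc\u05b5\u05a3\u05d1"

# Calling through the class passes the str itself as `self`.
error_message = None
try:
    HebrewText.no_taamim(text)
except AttributeError as exc:
    error_message = str(exc)

# Constructing an instance first gives the method a real `self`.
cleaned = HebrewText(text).no_taamim().string
```

By analogy, the call in the report would presumably need to be written as `Hebrew(hebrew_text).no_taamim()`.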
