
emoji's Issues

Use https://github.com/iamcal/emoji-data as source for short names

I think we should use https://github.com/iamcal/emoji-data as the source of the emoji data. This will add support for several older code points (predating the now-standard Unicode format), and it will also make sure that the short names (:smile:) are the same as what other vendors use.

It will also add support for :skin-tone-2: and so on, which are now incorrectly called :emoji_modifier_fitzpatrick_type-1-2:.

If you think this is a good idea, I'd be happy to submit a PR trying it out. I'm thinking of adding iamcal/emoji-data as a submodule, and writing a script to generate core_data.py from it.

emoji in table

Hi,

Thanks for your repo. I need to use it with terminaltables.

As you can see in the images below, the emoji break the table borders, even when I add more space. Any help?

screen shot 2017-04-15 at 8 53 57 pm

screen shot 2017-04-15 at 8 54 12 pm

Some flags are absent

I didn't find these flags ':flag_for_Northern_Ireland:', ':flag_for_Wales:' and ':flag_for_England:'.
Is it possible to add them?

emoji.decode() is fundamentally broken, not needed, and should be removed.

@carpedm20 emoji.EMOJI_UNICODE gives us an easy way to look up Unicode code points by emoji name, and emoji.UNICODE_EMOJI gives us an easy way to look up emoji names by code point. However, multiple aliases point to the same code point, so we can't do reverse alias lookups. See the sample code below. About 400 aliases are dropped.

emoji.decode() really isn't that useful and I vote we just remove it. Any objections?

>>> import emoji
>>> len(emoji.EMOJI_UNICODE)
1282
>>> len(emoji.UNICODE_EMOJI)
1282
>>> len(emoji.EMOJI_ALIAS_UNICODE)
1694
>>> len(emoji.UNICODE_EMOJI_ALIAS)
1279
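
The shrinking count can be reproduced with a toy table. This is only a minimal sketch: the three-entry dict below is a hypothetical stand-in, not the library's data.

```python
# Hypothetical alias table: two names point at the same code point.
alias_to_char = {
    ":thumbsup:": "\U0001F44D",
    ":+1:": "\U0001F44D",
    ":smile:": "\U0001F604",
}

# Inverting a dict keeps only one name per character.
char_to_alias = {char: name for name, char in alias_to_char.items()}

# Aliases that no longer round-trip through the inverted table.
lost = sorted(name for name, char in alias_to_char.items()
              if char_to_alias[char] != name)

print(len(alias_to_char), len(char_to_alias), lost)
```

Any reverse lookup built this way has to pick a single canonical name per character, which is why the reverse dict is smaller than the forward one.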

Windows Version

Hmm, I tried to install this on Python for Windows but nothing works. Do you have a tutorial or other code?

square output?

It fails to display in my terminal (UTF-8). Also, is :smile: not supported 😒?

>>> import emoji
>>> print(emoji.emojize('Python is :thumbs_up_sign:'))
Python is 👍
>>> print(emoji.emojize('Python is :thumbsup:', use_aliases=True))
Python is 👍
>>> print(emoji.emojize('Python is :smile:'))
Python is :smile:

package name conflict on PyPI

This package installs into a top-level package named emoji. This conflicts with the top-level package of an older project named django-emoji, which is also imported as emoji.

It is impossible to use both packages within a single project, and pip does not have the tooling to rename packages on install.

PyPI has a policy of unique package names, which this project violates: http://legacy.python.org/dev/peps/pep-0423/ "make sure your project name is unique, i.e. avoid duplicates:"

Unable to detect flag emojis

import emoji

def emoji_lis(string):
    _entities = []
    for pos, c in enumerate(string):
        if c in emoji.UNICODE_EMOJI:
            print("Matched!!", c, c.encode('ascii', "backslashreplace"))
            _entities.append({
                "location": pos,
                "emoji": c
            })
        else:
            print(c, c.encode('ascii', "backslashreplace"))
    return _entities

#emoji_lis("مدیحہ🇵🇰")
emoji_lis("🇵🇰 👧🏿")

Output:
[{u'emoji': u'\U0001f467', u'location': 3},
{u'emoji': u'\U0001f3ff', u'location': 4}]
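
A flag is a pair of regional indicator code points (U+1F1E6 to U+1F1FF), so a per-character membership test cannot match it as a single entry. Below is a minimal sketch of pairing them up before any lookup. It assumes Python 3, where each code point is one string element, and `find_flags` is a hypothetical helper, not part of the emoji package.

```python
def is_regional_indicator(ch):
    """True for REGIONAL INDICATOR SYMBOL LETTER A..Z."""
    return 0x1F1E6 <= ord(ch) <= 0x1F1FF

def find_flags(text):
    """Return (index, flag) for each pair of adjacent regional indicators."""
    found = []
    i = 0
    while i < len(text):
        if (i + 1 < len(text)
                and is_regional_indicator(text[i])
                and is_regional_indicator(text[i + 1])):
            found.append((i, text[i:i + 2]))
            i += 2  # consume both halves of the flag
        else:
            i += 1
    return found

# Same test string as above: Pakistan flag, space, girl with dark skin tone.
flags = find_flags("\U0001F1F5\U0001F1F0 \U0001F467\U0001F3FF")
print(flags)
```

The same idea (try the longest sequence first, then fall back to single characters) would apply to other multi-code-point emoji.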

help

I keep getting an Invalid Syntax error on 'import emoji'.

Release v0.3.4

  • Merge #7.
  • Close out https://github.com/carpedm20/emoji/milestones/v0.3.4.
  • Update changelog with mention of aliases and restoration of default functionality.
  • Make sure aliases are properly documented after the recent changes.
  • Create a v0.3-maint branch as a safety measure and keep it in git.
  • Tag a release in GitHub.
  • Upload to PyPI.

outdated emoji list

I wanted to print the :first_place_medal: (🥇) emoji in my own Telegram bot, but emojis from Unicode v6+ are not supported. Any help?

Markdown

Hi,

Sorry if this is a silly question. How can I use it on a website? Is it possible to use it with the Markdown2 Python module?

Example code in README.rst is faulty

I tried the example and only the aliases are converted on my system.

>>> print(emoji.emojize('Python is :thumbsup: :thumbs_up_sign:', use_aliases=True))

This results in:
Python is 👍 :thumbs_up_sign:

I have a Windows machine, could this be the reason why it's not fully working?
I am using the pypi version 0.4.5

Getting black and white Emojis

Here is my code.
import emoji

print(emoji.emojize("Hello 🌎", use_aliases=True))

Here is the output.

emoji black-white

I tried other emoji but still got no colors, on both Windows and Linux.

National flag emojis should not contain space character

All the national flag emoji character combinations in your regexp and lookup tables contain a space between the two "regional indicator" letters. This means they won't actually match the national flag sequences:

In [7]: emoji.get_emoji_regexp().match("🇨🇦")

In [8]: emoji.get_emoji_regexp().match("🇨 🇦")
Out[8]: <_sre.SRE_Match object; span=(0, 3), match='🇨 🇦'>

the emoji didn't work....

As you can see, I ran the example, but it didn't work:

# -*- coding: <encoding unicode > -*-

import emoji
print(emoji.emojize('Python is :thumbs_up_sign:'))

The result is:

Python 3.4.1 (v3.4.1:c0e311e010fc, May 18 2014, 10:38:22) [MSC v.1600 32 bit (Intel)] on win32
Type "copyright", "credits" or "license()" for more information.
>>> ================================ RESTART ================================
>>> 
Python is :thumbs_up_sign:
>>> 

get nothing or exception

Hi!

I just tried this plugin, and all I get is nothing or an exception:

Traceback (most recent call last):
  File "C:\add\emoji\test.py", line 4, in <module>
    print(emoji.emojize('Water! :water_wave:'))
  File "C:\Users\math\python\lib\encodings\cp850.py", line 12, in encode
    return codecs.charmap_encode(input,errors,encoding_map)
UnicodeEncodeError: 'charmap' codec can't encode characters in position 7-8: character maps to <undefined>

Does anyone have an idea where it could be coming from?

I'm using Python 2 on Windows 8.

Mathieu
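
The traceback points at the console codec (cp850) rather than at emojize(): that code page has no mapping for the emoji. Below is a minimal sketch that reproduces the failure and shows one possible fallback. It assumes Python 3 and that escaped output is acceptable; it is not a fix in the library itself.

```python
text = "Water! \U0001F30A"  # U+1F30A WATER WAVE, the character behind :water_wave:

# Encoding to the legacy console code page fails for the emoji.
try:
    text.encode("cp850")
    encodable = True
except UnicodeEncodeError:
    encodable = False

# Fallback: escape anything the console codec cannot represent.
safe = text.encode("cp850", errors="backslashreplace").decode("cp850")
print(encodable, safe)
```

Switching the terminal to a UTF-8 capable encoding is the other route, but whether that is possible depends on the console in use.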

I'm getting many question marks

I think that some of the emojis I try to convert using the emoji library are being converted to question marks.

Could this be caused by the new emojis that come with iOS 10?

Add emoji "annotations"

The annotations on the page you scrape would be useful for a program that wants to classify emoji, as well as for bots that might want to, for example, choose a random "grin" face to spice up their text.

Give user a better option to enable emoji aliases

GitHub and others support a number of emoji aliases that are not officially part of the Unicode set. Many of the aliases are easier to remember than their official counterparts, but they are only supported by some platforms. They should be considered specific to this library: different platforms might have different aliases that point to different codes, so this library really only supports one set of aliases.

Replace is_alias in emojize() with use_aliases to toggle character sets.

Ignore Fitzpatrick Modifiers and other Flags

I want to extract all emojis from a long list of strings and then count the number of occurrences of each emoji. The emoji module does a great job, but it extracts emojis together with their flags (which is what would normally be expected). Thus, distinct = set(list(emojis)) will treat the same emoji with different flags/colors as distinct. How can I ignore these modifiers?

This is my code:

def extract_emojis(str):
  return list(c for c in str if c in emoji.UNICODE_EMOJI)

I already tried:

def extract_emojis(str):
  return list(emoji.emojize(c,use_aliases=True) for c in str if c in emoji.UNICODE_EMOJI)

but it does not work. For example, I get:

❤,8654
😍,4774
🏻,3603
🏼,2839
✨,2696
☀,2439
😂,1904
😊,1862
👌,1690
💕,1677
🎄,1587
😎,1559
πŸ›,1459
✌,1434

The numbers are the occurrence counts. In this case, I believe the first 4 emojis all refer to variants of the "heart" emoji.
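
One way to ignore the modifiers is to filter them out before counting. Below is a minimal sketch, assuming Python 3 and that "modifiers" means the five Fitzpatrick skin tone characters (U+1F3FB to U+1F3FF) plus the variation selector U+FE0F. The `known` set is a hypothetical stand-in for the library's lookup table.

```python
from collections import Counter

# Fitzpatrick skin tone modifiers and the emoji variation selector.
MODIFIERS = {chr(cp) for cp in range(0x1F3FB, 0x1F400)} | {"\uFE0F"}

def count_base_emoji(strings, emoji_chars):
    """Count characters found in emoji_chars, skipping modifiers."""
    counts = Counter()
    for s in strings:
        counts.update(c for c in s if c in emoji_chars and c not in MODIFIERS)
    return counts

# Hypothetical stand-in for the library's table of known emoji characters.
known = {"\U0001F44D", "\u2764", "\U0001F3FB", "\U0001F3FC"}
samples = ["\U0001F44D\U0001F3FB \u2764\uFE0F", "\U0001F44D\U0001F3FC"]
counts = count_base_emoji(samples, known)
print(counts.most_common())
```

With this filter, a thumbs up with a light skin tone and one with a medium-light skin tone are both counted under the plain thumbs up.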

:hash: was printing only :hash: not emoji

I tried these calls:
emoji.emojize("Настройки #️⃣", use_aliases=True)
emoji.emojize("Настройки #️⃣")
In each case the result is the same.

convert back to '\uxxx' code

import io,sys
import emoji

sys.stdout = io.TextIOWrapper(sys.stdout.buffer,encoding='utf-8')
s = 'hello\u2665'
emojiCode = emoji.demojize(s)
print(emojiCode) #hello:black_heart_suit: 
print(emoji.emojize(emojiCode)) #hello鈾?

How can I make the '\u2665' code convert to just '♥'?

Some alias typos

Some aliases contain a hyphen where an underscore is expected. For example, :upside-down_face: should be :upside_down_face: (🙃). I observed this bug when exporting Slack messages and emojifying them.

The following hack fixed it for now, though it may override some legit aliases:

def fix_emoji():
    """Fix emoji's aliases as they have some typos."""
    from emoji import unicode_codes
    for key, val in list(unicode_codes.EMOJI_UNICODE.items()):
        unicode_codes.EMOJI_UNICODE[key.replace('-', '_')] = val
    for key, val in list(unicode_codes.EMOJI_ALIAS_UNICODE.items()):
        unicode_codes.EMOJI_ALIAS_UNICODE[key.replace('-', '_')] = val

    unicode_codes.UNICODE_EMOJI = {v: k for k, v in unicode_codes.EMOJI_UNICODE.items()}
    unicode_codes.UNICODE_EMOJI_ALIAS = {v: k for k, v in unicode_codes.EMOJI_ALIAS_UNICODE.items()}


fix_emoji()

regexp does not detect some emoji

Python 2.7.9 (default, Mar  1 2015, 12:57:24)
[GCC 4.9.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from emoji import get_emoji_regexp
>>> s='test sting 👍'
>>> RX = get_emoji_regexp()
>>> res = RX.match(s)
>>> print res
None

also 🚀 and 🔥 📧 and others
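
One thing worth ruling out first: match() only tests the start of the string, while search() scans the whole string. Below is a minimal sketch of the difference, using a hypothetical one-emoji pattern as a stand-in for get_emoji_regexp(). It assumes Python 3 and does not say anything about what the library's actual pattern contains.

```python
import re

# Hypothetical stand-in pattern; the real one is built from the emoji table.
RX = re.compile("\U0001F44D")
s = "test string \U0001F44D"

at_start = RX.match(s)    # anchored at position 0, so the leading text blocks it
anywhere = RX.search(s)   # scans forward for the first hit
print(at_start, anywhere.span())
```

If search() also returns None for a given emoji, the character is more likely missing from the pattern itself.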

demojize() not correctly decoding flag emojis

Noticed this problem while cleaning a twitter corpus:

@MENTION: I've been wanting to post this since I saw the HT. :smiling_face_with_open_mouth_&_smiling_eyes: off to work I go. Good Morning 🇺🇸 Good Night 🇵🇭…

My code calls emoji.demojize(text) on data['text'] taken directly from Twython. Not sure what's going on here -- am I doing something wrong?

Thanks!

The dataset in unicode_codes.py contains helm_symbol not in the Unicode standard

In an attempt to answer #18, I compared the contents in the EMOJI_UNICODE dict with that in http://www.unicode.org/emoji/charts/full-emoji-list.html (by using utils/get-codes-from-unicode-consortium.py). It seems that all characters are present, but there is one extra character: :helm_symbol:. More info about that character here: http://www.fileformat.info/info/unicode/char/2388/index.htm

Why is U+2388 included as an emoji? Perhaps that character should be in a separate dict of characters that are often treated as emojis?

MyPy typeshed stub file

I've created a library stub for the mypy static type checker. Here is the pull request for it: python/typeshed#1506
Can you please comment on this pull request on whether it could be merged from your point of view as the author and maintainer of the library?

:one: alias doesn't work

emojize(":one:") doesn't print the emoji. I found that the correct alias is ":keycap_digit_one:". By the way, the emoji that appears is different from the Apple one. I found that the Apple emoji is composed of two characters: the first is the digit and the second is chr(8419). So for me the best solution was to replace emojize(":one:") with "1"+chr(8419). This also works for the other digits (the second character is always the same).
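
The workaround can be wrapped in a small helper. This is a minimal sketch, assuming Python 3. `keycap` is a hypothetical function, not part of the emoji package, and the optional U+FE0F variation selector is an addition that requests emoji-style rendering on platforms that honor it.

```python
KEYCAP = "\u20E3"   # COMBINING ENCLOSING KEYCAP, i.e. chr(8419)
VS16 = "\uFE0F"     # VARIATION SELECTOR-16, requests emoji presentation

def keycap(symbol, emoji_style=True):
    """Build a keycap sequence for a digit, '#' or '*'."""
    if len(symbol) != 1 or symbol not in "0123456789#*":
        raise ValueError("no keycap sequence for %r" % symbol)
    return symbol + (VS16 if emoji_style else "") + KEYCAP

one = keycap("1")
plain_one = keycap("1", emoji_style=False)  # same as "1" + chr(8419)
print(one, plain_one)
```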

Duplicate aliases

@carpedm20 the following aliases appear multiple times in emoji.EMOJI_ALIAS_UNICODE, which means that only the last one encountered ends up in the dictionary and the rest are thrown away. Could you pick one for each and update the dictionary?

{
    ':bee:': u'\U0001F41D',
    ':bee:': u'\U0001F41D',
    ':satellite:': u'\U0001F4E1',
    ':satellite:': u'\U0001F6F0',
    ':snowman:': u'\U00002603',
    ':snowman:': u'\U000026C4',
    ':umbrella:': u'\U00002602',
    ':umbrella:': u'\U00002614'
}
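
Conflicts like these are easy to miss in a dict literal, because Python silently keeps the last value for a repeated key. Below is a minimal sketch of one way to detect them, assuming the table is first available as a list of (alias, character) pairs. The `pairs` list is a hypothetical excerpt built from the entries above.

```python
from collections import defaultdict

# (alias, character) pairs as they might appear in the source table.
pairs = [
    (":satellite:", "\U0001F4E1"),
    (":satellite:", "\U0001F6F0"),
    (":snowman:", "\u2603"),
    (":snowman:", "\u26C4"),
    (":bee:", "\U0001F41D"),
]

by_alias = defaultdict(set)
for alias, char in pairs:
    by_alias[alias].add(char)

# Aliases claimed by more than one distinct character.
conflicts = sorted(a for a, chars in by_alias.items() if len(chars) > 1)
print(conflicts)

# Building a dict from the pairs keeps only the last value per alias.
print(dict(pairs)[":satellite:"] == "\U0001F6F0")
```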

Release v0.3.5

@carpedm20 I think this packaging issue might be affecting a lot of people so unless you have anything else to add I vote we do this as a patch and then do #14 (and anything else that pops up before it is implemented) in the next release.

  • Update changelog
  • Pull onto v0.3-maint branch
  • Tag a release in GitHub
  • Push to PyPI

Upgrade fails on mac, python 3.4.2

Executed command: pip install -U emoji
Error occurred: UnicodeDecodeError: 'ascii' codec can't decode byte 0xf0 in position 738: ordinal not in range(128)

The upgrade was run from the PyCharm IDE. Maybe an invisible character is the cause, like this issue had? But it could just be PyCharm-related instead.

Collecting emoji
  Using cached emoji-0.2.tar.gz
    Complete output from command python setup.py egg_info:
    Traceback (most recent call last):
      File "<string>", line 20, in <module>
      File "/private/var/folders/zh/ntn59b4954l6tldmt4hn8bgc0000gn/T/pycharm-packaging1.tmp/emoji/setup.py", line 17, in <module>
        readme_content = f.read().strip()
      File "/Users/luckydonald/virtualenv3.4.3/bin/../lib/python3.4/encodings/ascii.py", line 26, in decode
        return codecs.ascii_decode(input, self.errors)[0]
    UnicodeDecodeError: 'ascii' codec can't decode byte 0xf0 in position 738: ordinal not in range(128)

    ----------------------------------------

    Command "python setup.py egg_info" failed with error code 1 in /private/var/folders/zh/ntn59b4954l6tldmt4hn8bgc0000gn/T/pycharm-packaging1.tmp/emoji
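
Byte 0xf0 is the lead byte of a 4-byte UTF-8 sequence, which suggests the README contains an emoji and is being read with the locale's ASCII default. Below is a minimal sketch of reading it with an explicit encoding. `read_text` is a hypothetical helper and the temporary file only stands in for README.rst; this is not the project's actual setup.py.

```python
import io
import os
import tempfile

def read_text(path):
    """Read a file as UTF-8 regardless of the locale's default encoding."""
    with io.open(path, encoding="utf-8") as f:
        return f.read().strip()

# Demonstration with a temporary stand-in for README.rst.
with tempfile.TemporaryDirectory() as tmp:
    readme = os.path.join(tmp, "README.rst")
    with io.open(readme, "w", encoding="utf-8") as f:
        f.write("Emoji \U0001F44D\n")
    readme_content = read_text(readme)

print(len(readme_content))
```

io.open accepts the encoding argument on both Python 2 and Python 3, which is why it is used here instead of the built-in open.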

Don't import * in __init__

Blocked until #19 is merged

core.py does not implement an __all__ but this package is small enough that we should probably just explicitly import what we need to the top level.

Not able to run on my terminal!

Does it support the Windows command prompt?
If yes, what should I set as the output encoding?
I get the following error when I run print emojize("emoji is 👍 "):
UnicodeEncodeError: 'charmap' codec can't encode characters in position 9-10: character maps to <undefined>

GitHub is rendering the thumbsup alias in the text above, so I'm adding a clip too:

image
