ibm / max-toxic-comment-classifier Goto Github PK

Detect 6 types of toxicity in user comments.

Home Page: https://developer.ibm.com/exchanges/models/all/max-toxic-comment-classifier/

License: Apache License 2.0

Dockerfile 4.23% Python 95.77%

docker-image machine-learning machine-learning-models ibm pytorch nlp comments natural-language-processing natural-language-understanding

max-toxic-comment-classifier's Issues

Config version mismatch

MAX-Toxic-Comment-Classifier/config.py

Line 11 in 25c7aea

API_VERSION = '1.0'

Should be 1.0.1 - please remember to bump the version in config.py before cutting a new release!

Move sample assets to "samples/" directory

Following new convention to separate model assets from sample assets (e.g. images, text, ...)

Simplify /model/predict results data structure

Is it possible to simplify the result data structure?

For example

{
  "text": [
    "Mary had a little lamb"
  ]
}

returns

{
  "status": "ok",
  "predictions": [
    [
      {
        "toxic": 0.0014839612413197756,
        "severe_toxic": 0.00009525173663860187,
        "obscene": 0.0003549971734173596,
        "threat": 0.00007228714093798772,
        "insult": 0.00022114443709142506,
        "identity_hate": 0.00010927908442681655
      }
    ]
  ]
}

Note that the prediction returns a list of lists of dicts, whereas it should be sufficient to return a list of dicts.

{
  "status": "ok",
  "predictions": [
      {
        "toxic": 0.0014839612413197756,
        "severe_toxic": 0.00009525173663860187,
        "obscene": 0.0003549971734173596,
        "threat": 0.00007228714093798772,
        "insult": 0.00022114443709142506,
        "identity_hate": 0.00010927908442681655
      }
   ]
}

Not sure which input would return a multi-dimensional response. The simplified version can accommodate multiple inputs:

{
  "text": [
    "Mary had a little lamb",
    "Little lamb, little lamb,"
  ]
}

would return

{
  "status": "ok",
  "predictions": [
      {
        "toxic": 0.0014839619398117065,
        "severe_toxic": 0.0000952516493271105,
        "obscene": 0.00035499755176715553,
        "threat": 0.00007228707545436919,
        "insult": 0.00022114443709142506,
        "identity_hate": 0.00010927919356618077
      },
      {
        "toxic": 0.1212717592716217,
        "severe_toxic": 0.00012042735033901408,
        "obscene": 0.0017211131053045392,
        "threat": 0.00017663949984125793,
        "insult": 0.0008766597020439804,
        "identity_hate": 0.00014419591752812266
      }
  ]
}

with predictions[0] returning the eval for the first string, .... second string.

What am I missing?

Dockerfile: Upgrade to max-base v1.1.3

https://github.com/IBM/MAX-Toxic-Comment-Classifier/blob/master/Dockerfile#L1 to take advantage of the CORS bug fix.

How can I download the model file?

Similar to https://github.com/IBM/MAX-Object-Detector/blob/master/demo.ipynb, which illustrates how to invoke the prediction endpoint and parse the results. (~ "notebook-based" swagger spec for a data scientist audience)

README: syntax of JSON response is inconsistent

Compare the example JSON response in the markdown and the response in the screen captures. They are different.

/model/predict returns unexpected result

No big deal, but

{
 "text": [""]
}

it's odd that nothing can be interpreted as something

     {
        "toxic": 0.0015665121609345078,
        "severe_toxic": 0.00011611604713834822,
        "obscene": 0.0004790136299561709,
        "threat": 0.00007217624806798995,
        "insult": 0.0003883792378474027,
        "identity_hate": 0.0003294776252005249
      }

and similarly a meaningless combination of characters

{
 "text": ["khdcuc4wcc5jlcnw"]
}

and numbers

{
 "text": ["123"]
}

Note sure if there's something that can be done to eliminate noise that's a result of garbage input.

ibm / max-toxic-comment-classifier Goto Github PK

max-toxic-comment-classifier's Issues

Config version mismatch

Move sample assets to "samples/" directory

Simplify /model/predict results data structure

Dockerfile: Upgrade to max-base v1.1.3

How can I download the model file?

Add a basic demo notebook

README: syntax of JSON response is inconsistent

/model/predict returns unexpected result

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent