vmware / data-annotator-for-machine-learning

Data annotator for machine learning allows you to centrally create, manage and administer annotation projects for machine learning

License: Apache License 2.0

Python 4.02% JavaScript 27.40% HTML 27.08% SCSS 3.34% TypeScript 38.04% Dockerfile 0.09% Shell 0.03% CSS 0.01%
annotation annotation-tool deep-learning image-annotation labeling-tool named-entity-recognition tabular text-annotation

data-annotator-for-machine-learning's People

Contributors

kai-qiao, lijulia, rekeyz, yangrong16, zfenfen, zrekey

data-annotator-for-machine-learning's Issues

[BUG] [AL] Request can access but show 401 error

The AL service handles the request, but the client reports a 401 status code error.

[ ACTIVE-LEARNING ] [ ERROR ] QUERY-INSTANCE: Error: Request failed with status code 401
at createError (E:\vmware\Loop\GitHub\master\annotation-service\node_modules\axios\lib\core\createError.js:16:15)
at settle (E:\vmware\Loop\GitHub\master\annotation-service\node_modules\axios\lib\core\settle.js:17:12)
at IncomingMessage.handleStreamEnd (E:\vmware\Loop\GitHub\master\annotation-service\node_modules\axios\lib\adapters\http.js:260:11)
at IncomingMessage.emit (node:events:406:35)
at IncomingMessage.emit (node:domain:475:12)
at endReadableNT (node:internal/streams/readable:1348:12)
at processTicksAndRejections (node:internal/process/task_queues:83:21) {
config: {
url: '/al/model/query',
method: 'post',
data: '{"projectName":"non-ascii-code-testing","user":"[email protected]"}',
headers: {
Accept: 'application/json, text/plain, */*',
Authorization: 'eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJleHAiOjE2MzEwMTAwMzUsImVtYWlsIjoicG9jLW9zQHBvYy1vcy5jb20iLCJpYXQiOjE2MzEwMDgyMzV9.ovxbt-VGkmOJmuA9EO8A2nof-mduwVxW4-I0O6932fw',
'Content-Type': 'application/json;charset=utf-8',
'User-Agent': 'axios/0.21.1',
'Content-Length': 67
},

[BUG] [APP] Add validation to the upload zip/tgz file

  1. If the user sets the file format to image, the uploaded zip file must contain at least one image file; if it contains none, show an upload-failed alert.
  2. If the user sets the file format to txt, the uploaded zip/tgz file must contain at least one txt file; if it contains none, show an upload-failed alert.
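
The check described above could look roughly like the following sketch. It assumes the archive's entry names have already been listed; the function name, the `UploadFormat` type, and the extension lists are hypothetical and not taken from the project.

```typescript
// Hypothetical names throughout: UploadFormat, EXTENSIONS and
// archiveHasRequiredFile are illustrative, not part of the project.
type UploadFormat = "image" | "txt";

// Assumed extension lists; the real set of accepted image types may differ.
const EXTENSIONS: Record<UploadFormat, string[]> = {
  image: [".jpg", ".jpeg", ".png", ".gif", ".bmp"],
  txt: [".txt"],
};

// Returns true when at least one archive entry has an extension that
// matches the selected file format.
function archiveHasRequiredFile(entryNames: string[], format: UploadFormat): boolean {
  const allowed = EXTENSIONS[format];
  return entryNames.some((name) => {
    const lower = name.toLowerCase();
    // Directory entries end with a slash in zip/tgz listings; skip them.
    if (lower.charAt(lower.length - 1) === "/") return false;
    return allowed.some((ext) => lower.slice(-ext.length) === ext);
  });
}
```

When the function returns false, the caller would show the upload-failed alert instead of starting the upload.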

[BUG] - [API] - Fix security alert

glob-parent:
Remediation
Upgrade glob-parent to version 5.1.2 or later. For example:

"dependencies": {
  "glob-parent": ">=5.1.2"
}
or…
"devDependencies": {
  "glob-parent": ">=5.1.2"
}

normalize-url:
Remediation
Upgrade normalize-url to version 4.5.1 or later. For example:

"dependencies": {
  "normalize-url": ">=4.5.1"
}
or…
"devDependencies": {
  "normalize-url": ">=4.5.1"
}

[BUG] [APP] The default assigned tickets # should be editable at any time

  1. When there is only one annotator, the front-end assigns this annotator the total ticket number that the front-end itself calculated.
  2. When the back-end calculates a different total ticket number after it finishes inserting all valid data into the db, the front-end should use this new number and let the owner edit the originally assigned ticket number accordingly.

Allow every project type's total valid ticket number to be evenly distributed during project creation, and allow the number of assigned tickets to be modified in the edit modal

Project Creation Page

  • When assigning annotators, show the owner the default ticket number that is evenly distributed to each annotator.

  • Make sure the total distributed number is <= this project's total valid rows.
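
A minimal sketch of an even distribution that respects the "<= total valid rows" constraint is shown here. The function name is hypothetical, and rounding down (leaving any remainder unassigned) is an assumption; the issue does not say how a remainder should be handled.

```typescript
// Hypothetical helper: splits the project's valid rows evenly among
// annotators. Rounding down guarantees the distributed total never
// exceeds totalValidRows; any remainder stays unassigned (an assumption).
function distributeTickets(
  totalValidRows: number,
  annotators: string[]
): { [annotator: string]: number } {
  const assigned: { [annotator: string]: number } = {};
  if (annotators.length === 0) return assigned;
  const perAnnotator = Math.floor(Math.max(0, totalValidRows) / annotators.length);
  annotators.forEach((annotator) => {
    assigned[annotator] = perAnnotator;
  });
  return assigned;
}
```

For 10 valid rows and 3 annotators, each annotator gets 3 tickets by default and 1 row is left for the owner to assign manually.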

Annotate Page

  • For the details displayed in the annotation progress, this session's number should be the latest evenly distributed number, in case an annotator has been added to or removed from the project.

  • The progress bar should be based on the annotator's own session.

The Annotate Tab

  • In the datagrid column Reviewed/Complete/Total, Total should be the current annotator's own distributed ticket number, and Complete is the annotator's own annotated ticket number, which may exceed Total.

Projects/Admin tab

  • On the Projects/Admin tab page, rename the datagrid column Annotator/Complete/Total to Annotator/Complete/Assigned.
  • Remove the column Generate Updated Date and add a new column that shows the current project's total valid ticket number.

Edit Modal

  • Display the current # assigned tickets and # complete tickets of every annotator.
  • Disable the complete tickets input and make sure the assigned tickets input is editable.
  • Make sure the total distributed number is <= this project's total valid rows.
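
The edit-modal constraint could be checked with something like the sketch below. The interface, the function name, and the message strings are hypothetical. Completed counts are deliberately not compared against assigned counts, since the issue notes that an annotator's completed number may exceed the assigned one.

```typescript
// Hypothetical row shape for the edit modal.
interface AnnotatorAssignment {
  annotator: string;
  assigned: number;  // editable in the modal
  completed: number; // read-only in the modal
}

// Returns an error message when the edited values are not acceptable,
// or null when they can be saved.
function validateAssignments(
  rows: AnnotatorAssignment[],
  totalValidRows: number
): string | null {
  // NaN also fails this check because Math.floor(NaN) !== NaN.
  const invalid = rows.some(
    (row) => row.assigned < 0 || Math.floor(row.assigned) !== row.assigned
  );
  if (invalid) return "Assigned tickets must be non-negative whole numbers.";

  const sum = rows.reduce((acc, row) => acc + row.assigned, 0);
  if (sum > totalValidRows) {
    return "Total assigned (" + sum + ") exceeds valid rows (" + totalValidRows + ").";
  }
  return null;
}
```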

[APP]-Decouple AWS from UI

  1. Add a variable enableAWSS3 in env.ts to make this decoupling configurable.
  2. Send all uploaded files to the backend in binary.
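
One reading of these two items is sketched here: a boolean flag selects whether the UI talks to S3 directly or hands the file to the backend. Only the name enableAWSS3 comes from the issue; the `Env` shape, `UploadTarget`, and `chooseUploadTarget` are assumptions for illustration.

```typescript
// Hypothetical shape of the environment config; only enableAWSS3 is
// named in the issue.
interface Env {
  enableAWSS3: boolean;
}

type UploadTarget = "s3" | "backend";

// When the flag is off, the UI never touches AWS and the file is sent
// to the backend instead.
function chooseUploadTarget(env: Env): UploadTarget {
  return env.enableAWSS3 ? "s3" : "backend";
}
```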

[BUG] [APP] Clear the total rows and clear one sonar bug

  1. When the selected label column has more than 50 labels, a modal pops up to alert the user; after the user clicks OK, the total rows number on the page should be cleared.
  2. In un-zip.service.ts, fix the Sonar bug: Introduce a new variable or use its initial value before reassigning "previewHeadDatas".
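
The Sonar message in item 2 is typically raised when a parameter is overwritten before its incoming value is read. The actual code in un-zip.service.ts is not shown in the issue, so the following is only an illustrative sketch of the usual fix, with a hypothetical function name and signature: collect results in a new local variable instead of reassigning the parameter.

```typescript
// Illustrative only; not the project's real function.
//
// Flagged pattern (parameter overwritten before its value is used):
//   function collect(previewHeadDatas: string[][], rows: string[][]) {
//     previewHeadDatas = [];
//     ...
//   }
//
// Usual fix: drop the parameter and build the result in a fresh local.
function collectPreviewHeads(rows: string[][], maxRows: number): string[][] {
  const previewHeadDatas: string[][] = [];
  rows.forEach((row, index) => {
    if (index < maxRows) previewHeadDatas.push(row);
  });
  return previewHeadDatas;
}
```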

[APP]-Some enhancement changes for the markdown UI

  1. Remove the “Text” and “Visual” buttons at the right panel.
  2. Instead, add a project level display dropdown with Text (default) & Render Markdown values at the left panel below the “project” dropdown if the selected project supports markdown format.
  3. If the annotator selects Render Markdown, save the preference in the session so that cases are shown rendered as markdown for the rest of the session.
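
Saving the display preference for the session could be sketched as follows. The storage key, the per-project keying, and all function names are assumptions; `KeyValueStore` is a small subset of the Web Storage API so that the browser's `sessionStorage` or a test double can be passed in.

```typescript
type DisplayMode = "Text" | "Render Markdown";

// Minimal subset of the Web Storage API (getItem/setItem).
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Hypothetical key prefix; the project may store this differently.
const DISPLAY_KEY = "projectDisplayMode";

function saveDisplayMode(store: KeyValueStore, project: string, mode: DisplayMode): void {
  store.setItem(DISPLAY_KEY + ":" + project, mode);
}

// Falls back to the default "Text" when nothing is saved or when the
// project does not support markdown.
function loadDisplayMode(
  store: KeyValueStore,
  project: string,
  supportsMarkdown: boolean
): DisplayMode {
  if (!supportsMarkdown) return "Text";
  const saved = store.getItem(DISPLAY_KEY + ":" + project);
  return saved === "Render Markdown" ? "Render Markdown" : "Text";
}
```

In the browser, `sessionStorage` satisfies `KeyValueStore`, so the preference would disappear when the tab's session ends.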

[APP] - Sliders and Submit Button for Numerical Label

Numerical annotation is completed by entering a number within the given range in the text-box and pressing enter. To make this interface more congruent and intuitive it would be appropriate to add a submit button (already present for categorical annotation) and a slider with a text box to indicate the numerical annotation.

Numerical Annotation:
image

Categorical Annotation:
image

Slider Example:
image

<div class="slidecontainer">
  <input type="range" min="1" max="100" value="50" class="slider" id="myRange">
</div>

w3 Schools Sliders in HTML
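
To keep a slider and a text box consistent with each other, both inputs would need the same range handling. The sketch below shows one way to do that; the names `NumericRange`, `parseNumericLabel`, and `clampToRange` are hypothetical, and rejecting out-of-range text input (rather than clamping it) is an assumption.

```typescript
interface NumericRange {
  min: number;
  max: number;
}

// Parses the text-box value; returns null when the input is empty, not
// a number, or outside the allowed range, so Submit can stay disabled.
function parseNumericLabel(input: string, range: NumericRange): number | null {
  const trimmed = input.trim();
  if (trimmed === "") return null;
  const value = Number(trimmed);
  if (isNaN(value) || !isFinite(value)) return null;
  if (value < range.min || value > range.max) return null;
  return value;
}

// Clamps a value so the slider thumb always stays inside the range.
function clampToRange(value: number, range: NumericRange): number {
  return Math.min(range.max, Math.max(range.min, value));
}
```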

Multiple Numerical Labels

For certain use cases it might be desirable to annotate multiple metrics numerically. For example:

Query: Annotate "Good Job"
Annotation: Appreciation: 4/5, Encouraging: 3/5, Dismissive: 0/5.

Query: Annotate "Brilliant Work"
Annotation: Appreciation: 5/5, Encouraging: 4/5, Dismissive: 0/5.

So it would increase the utility to allow for adding multiple numeric labels, and consolidate functions with those used for class labels.
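
One possible data shape for multiple numeric labels is sketched below, using the 0 to 5 scale from the examples above. The type and function names are hypothetical; the sketch only shows how each metric could be validated against its own range.

```typescript
// Hypothetical definition of one numeric label and its allowed range.
interface NumericLabelDef {
  name: string;
  min: number;
  max: number;
}

// One annotation maps each label name to its numeric score.
type NumericAnnotation = { [labelName: string]: number };

// Returns the names of labels that are missing or out of range.
function findInvalidLabels(
  defs: NumericLabelDef[],
  annotation: NumericAnnotation
): string[] {
  return defs
    .filter((def) => {
      const value = annotation[def.name];
      return value === undefined || isNaN(value) || value < def.min || value > def.max;
    })
    .map((def) => def.name);
}

const labelDefs: NumericLabelDef[] = [
  { name: "Appreciation", min: 0, max: 5 },
  { name: "Encouraging", min: 0, max: 5 },
  { name: "Dismissive", min: 0, max: 5 },
];

// The "Good Job" example: every score is within range, so no label is
// reported as invalid.
const goodJob: NumericAnnotation = { Appreciation: 4, Encouraging: 3, Dismissive: 0 };
const goodJobProblems = findInvalidLabels(labelDefs, goodJob);
```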

Only ASCII Values in Dataset

When a dataset is uploaded for text classification annotation, InstaML automatically filters out samples (rows) that contain values outside the ASCII range. A better way to resolve this might be to replace the Unicode characters with their corresponding ASCII characters, or with blank values.
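
The suggested alternative could be sketched as below: known characters are mapped to an ASCII equivalent, and everything else is replaced by a fallback (blank by default). The lookup table is a deliberately small, hypothetical sample, and the function name is an assumption; this is not how the tool currently behaves.

```typescript
// Hypothetical, intentionally small lookup table of ASCII equivalents.
const ASCII_EQUIVALENTS: { [ch: string]: string } = {
  "\u2018": "'",   // left single quotation mark
  "\u2019": "'",   // right single quotation mark
  "\u201C": '"',   // left double quotation mark
  "\u201D": '"',   // right double quotation mark
  "\u2013": "-",   // en dash
  "\u2026": "...", // horizontal ellipsis
  "\u00A0": " ",   // no-break space
};

// Replaces each non-ASCII UTF-16 code unit with its mapped equivalent,
// or with the fallback when no mapping exists. Characters outside the
// Basic Multilingual Plane are two code units, so they produce the
// fallback twice.
function toAscii(text: string, fallback: string = ""): string {
  return text.replace(/[^\x00-\x7F]/g, (ch) => {
    const mapped = ASCII_EQUIVALENTS[ch];
    return mapped === undefined ? fallback : mapped;
  });
}
```

With this approach a row such as "caf\u00E9" would be kept as "caf" instead of being dropped from the dataset.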

Usability related - Annotate Web Page design suggestion.

On the current annotate project web page, it is not immediately obvious how to get started, because the start button is hidden and one must know to scroll to the end. Perhaps display start at a fixed location, in a separate column?
Screen Shot 2021-08-19 at 12 23 46 AM
Screen Shot 2021-08-19 at 12 23 28 AM

[BUG]-[APP]-Log review query should query with conditions

  1. Currently the query always uses the first annotator as the default, but if all of the first annotator's completed tickets have already been reviewed, the reviewer can never enter the review page.
  2. Before querying, iterate over the user list to find a user who still has unreviewed tickets.
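
The selection step in item 2 could be sketched like this. The `AnnotatorProgress` shape and the function name are hypothetical; the sketch assumes that per-annotator completed and reviewed counts are available before the review query is issued.

```typescript
// Hypothetical per-annotator progress record.
interface AnnotatorProgress {
  annotator: string;
  completed: number; // tickets the annotator has finished
  reviewed: number;  // of those, tickets already reviewed
}

// Returns the first annotator who still has completed but unreviewed
// tickets, or null when there is nothing left to review.
function pickDefaultReviewee(users: AnnotatorProgress[]): string | null {
  const pending = users.filter((user) => user.completed > user.reviewed);
  const first = pending[0];
  return first === undefined ? null : first.annotator;
}
```

Returning null lets the caller show an "everything is reviewed" state instead of querying with an annotator who has no tickets left.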

[APP]-Some usability updates

  1. Do not allow the annotator to see the flagged tickets
  2. On the review page, default the reviewee if there is only one annotator
  3. Change the string "Modify" to "Return to annotator" on the Projects tab
  4. Change the review page title from “Annotate Details” to “Review Details”
  5. test
