Code Monkey home page Code Monkey logo

taxonomic's Introduction

Table of Contents

Introduction

Our project aims to automate taxonomic data capture from scientific reports, something which is currently performed manually. This information can then be uploaded to searchable databases where it can be accessed by the public. Automating this process will save our client time, effort and money which can be better spent elsewhere.

Diagram

Product Value

Our project has high academic and economic value. The project will help our clients improve their data collection efficiency, which can save a lot of time for them. Also, the project output software is working on automatic data collection. Therefore, they can reduce labor consumption and save a lot of money. In addition, this project is working for an open source academic website, the success of our project will help many people improve their data searching speed on this web site.

Client Expectation

The client expects us to achieve some sort of functionality in terms of analysing documents in pdf form and returning scientific names present. At this point the client's expectations are not entirely clear and we intend to work with them to develop a concrete idea of what we are supposed to achieve with our project over the course of this week.

Road Map and System Diagram

Diagram

Diagram

User Interface Prototype

Diagram

User Story Map

Diagram

Milestones

  • Functional webclient interface
  • Enable uploading pdfs and viewing them as plain text in sub-window
  • Enable customising extractable tags
  • Implement automated IBRA7 region calculation
  • Successfully use GoldenGate to tag pdfs
  • Analyse and maximise the success rate of GoldenGate’s tagging
  • Analyse and maximise the success rate of tag extraction
  • Propose XML schema which will allow trivial data extraction

Schedule

Week 6:

  • Finalise higher level documentation (schedule, roles, milestones, audit ppt)
  • Prepare for audit

Teaching break week 1:

  • Thoroughly investigate GoldenGate
  • Inquire about getting a web server set up with the ANU
  • Create lower level documentation, ie. pseudocode, overall program structure, algorithms and testing plan
  • Begin to design UI.

Teaching break week 2:

  • Complete development on basic interface, xml tag extraction and IBRA classification
  • Complete basic UI
  • Begin working on integrating GoldenGate with the server
  • Prepare for audit

Week 7-8:

  • Continue Work on implementing GoldenGate interpreting of PDF files server-side (backend)
  • Create tools which allow the user to test the validity of output and fix it if necessary (frontend)
  • Try to implement audit feedback

Week 9:

  • Create project poster
  • Meet with client and get feedback regarding program interface
  • Work to improve efficiency and accuracy of data classification and extraction

Week 10:

  • Overall system testing
  • Finalise documentation
  • Prepare for audit

Week 11:

  • Implement audit feedback
  • Prepare handover resources

Progress

Diagram

Risk Management

As the project is being implemented as part of a secure system, it is important that it does not present any new vulnerabilities to that system. This can be achieved by being considerate of the environment in which our project will be deployed and using appropriate programming techniques.

Team Member Roles

Team Member Uni ID Role
Jing Li u6531952 Project Manager, Developer(PDF type convert & Database)
Biwei Cao u5926643 Developer(data taxonomy & Database), Documentation (Agenda)
Jiaqi Zhang u6089193 Developer(OCR & Testing)
Joshua Trevor u6405233 Developer(data taxonomy), Spokesperson
Yanlong LI u5890571 Developer(OCR & full steak web)
Yuan Yao u5945391 Developer, Documentation

Communication Tools

  1. Email
  2. Facebook Messenger
  3. Slack

Development Environment

  • Language:
    • Java installation is necessary for GoldenGate
    • Other languages not determined yet, most likely a combination of python, javascript and php will be used
  • Testing:
    • Unit test during development by black/white box
    • A/B test for the final stage

Development Tools

  1. PyCharm(Python IDE)
  2. WebStorm(JavaScript)
  3. DreamWeaver(HTML, CSS, JavaScript)
  4. Tesseract(OCR)
  5. Imagemagic(PDF to image)

Decision Make Procedures

Diagram

Meeting Agenda

Other Resources

taxonomic's People

Contributors

tarasom123 avatar claire0212 avatar jingli201802 avatar joshuatrevor avatar superfeone avatar afuchs1 avatar tyraeldlee avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.