bert_hr's Introduction

  • Install Java, which GROBID needs: apt-get install default-jre

  • Install GROBID by following the instructions at "https://grobid.readthedocs.io/en/latest/Install-Grobid/" and "https://grobid.readthedocs.io/en/latest/Grobid-service/"

    1. To install: cd grobid && ./gradlew clean install
    2. Activate the virtual environment in ~/bert_hr/
    3. To run: ./gradlew run (this starts the GROBID server that grobid_parser_pdf.py sends requests to)
  • Run the script "~/bert_hr/grobid/grobid_parser_pdf.py" with all PDF files to be processed stored in the "../data_pdf/" directory. Results are stored in the "../data_xml/" directory.

  • Run “~/bert_hr/grobid/xml_parser.py” to process the data in “../data_xml/” and save it in the “../data_input/” directory for BERT prediction.

  • Run ‘~/bert_hr/flask_app/berthr.py’ for predictions. It takes the CSV files from “bert_hr/data_input/” and saves the results in the “bert_hr/data_output/” directory.

  • The "flask_app/templates/" directory contains the required UI files.

  • The "data" directory contains the checkpoints of the custom-trained BERT model (note: the files can be found in the shared Drive folder).
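
The PDF-to-XML and XML-to-CSV steps above can be sketched roughly as follows. This is not the code in grobid_parser_pdf.py or xml_parser.py, only a minimal standard-library illustration of the same flow. It assumes the GROBID server is listening on its default local port (8070), that output files are named after the PDF stem, and that the CSV has a single "text" column; the function names and the column name are hypothetical.

```python
import csv
import uuid
import urllib.request
import xml.etree.ElementTree as ET
from pathlib import Path

# Assumption: GROBID is running locally on its default port (8070).
GROBID_URL = "http://localhost:8070/api/processFulltextDocument"
TEI_NS = "{http://www.tei-c.org/ns/1.0}"  # XML namespace of GROBID's TEI output


def build_multipart(pdf_name, pdf_bytes):
    """Encode one PDF as multipart/form-data with a single 'input' field."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="input"; filename="{pdf_name}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + pdf_bytes + tail, f"multipart/form-data; boundary={boundary}"


def pdf_to_tei(pdf_path, url=GROBID_URL):
    """POST one PDF to the GROBID service and return the TEI XML bytes."""
    body, content_type = build_multipart(pdf_path.name, pdf_path.read_bytes())
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.read()


def extract_paragraphs(tei_xml):
    """Return the whitespace-normalised text of every <p> inside the TEI <body>."""
    root = ET.fromstring(tei_xml)
    body = root.find(f".//{TEI_NS}body")
    if body is None:
        return []
    paragraphs = []
    for p in body.iter(f"{TEI_NS}p"):
        text = " ".join("".join(p.itertext()).split())
        if text:
            paragraphs.append(text)
    return paragraphs


def run_pipeline(pdf_dir, xml_dir, csv_dir):
    """Convert every PDF in pdf_dir to TEI XML, then to a one-column CSV."""
    xml_dir.mkdir(parents=True, exist_ok=True)
    csv_dir.mkdir(parents=True, exist_ok=True)
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        tei_xml = pdf_to_tei(pdf_path)
        (xml_dir / f"{pdf_path.stem}.xml").write_bytes(tei_xml)
        csv_path = csv_dir / f"{pdf_path.stem}.csv"
        with open(csv_path, "w", newline="", encoding="utf-8") as handle:
            writer = csv.writer(handle)
            writer.writerow(["text"])  # hypothetical column name
            writer.writerows([p] for p in extract_paragraphs(tei_xml))
```

With the GROBID server running, calling `run_pipeline(Path("../data_pdf"), Path("../data_xml"), Path("../data_input"))` would mirror the directory layout described above. The CSV layout that berthr.py actually expects may differ, so the column written here should be adjusted to match it.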

bert_hr's People

Contributors

amitgayar
