Code Monkey home page Code Monkey logo

uithelper_qas's Introduction

Question Answering System for Regulations of University of Information Technology

General

The purpose of this project is to develop a Question Answering System with Reading Comprehension ability on Vietnamese, whose tools and resources are lacked, and applied to answering question related to rules and regulations of University of Information System.

This system adapts traditional Information Retrieval techniques (mostly based on Extended Boolean Model) and Deep Learning algorithms (BERT that achieves state-of-the-art performance on 11 different NLP tasks in English) and transfer learning on Vietnamese that posed attractive potential on Vietnamese Question Answering researches.

The Information Retrieval aprroaches are very common, but deep learning approaches are almost never used in Vietnamese QA System. In this project, a naive transfer learning technique is used, where we translate the SQuAD dataset from English to Vietnamese and remove bad translation (link) that poses an additional 10% boost in F1 accuracy, resulted in an F1 accuracy of 66% in the original task (QA on Wikipedia) and 56% in the UIT regulations task.

More information about this project is stored in ./Report/Summary.pdf or ./Report/Thesis.pdf

Structures

  • QASystem and Ultilities contain source codes, base model as well as fine-tuning models and dataset used in this project. Guide on how to setup and re-produce the result is also provided.
  • Report contains documents about this thesis as well as slides and related files.
  • Dataset contains the dataset that is used in this project.

Information

  • By Nguyễn Việt Nam - 14520560
  • Advisors: Dr. Ngô Đức Thành & M Sc Nguyễn Vinh Tiệp
  • Advanced Education Program 2014 - VNU-UIT

If any problem occurs, please contact me via my email address [email protected] or [email protected]

uithelper_qas's People

Contributors

namnv1113 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.