jddc-baseline-seq2seq's Introduction

JD Dialog Challenge

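About this baseline

#This is the open-source baseline code that JD.com provides to participants in the JD Dialog Challenge. It uses a Seq2Seq model and is intended for reference only.
#working_dir: (1) model files saved after training (2) vocabulary files
#data: (1) original training corpus, path: data/chat.txt (2) cleaned training corpus: train.enc, train.dec, test.enc, test.dec (3) model test data: the test questions test.txt and the output result.txt, path: data/test/
#seq2seq.ini: parameter configuration file
#dataProcessing.py: data cleaning script
#Baseline code files: data_utils.py, seq2seq_model.py, execute.py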

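About the data

#The training corpus currently shipped with this baseline contains 10,000 sessions (path: data/chat.txt). It consists of real chat logs between JD.com customer service agents and customers, anonymized before release.
#train.enc, train.dec, test.enc, and test.dec serve as the model's training corpus and are all produced by cleaning chat.txt. Each line of an enc file is a QAQAQ sequence from one session, and each line of a dec file is an A from the same session. Q denotes a customer utterance and A denotes a customer service agent/bot reply. See the data processing file dataProcessing.py for the implementation.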
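The enc/dec layout described above can be sketched as follows. This is only an illustration, not the code in dataProcessing.py: the function name `build_pairs`, the `(speaker, text)` tuple representation, the five-turn context window, and the `<s>` turn separator are all assumptions, and the sketch assumes each session has already been merged into strictly alternating Q/A turns.

```python
from typing import List, Tuple

# Hypothetical representation: (speaker, text), speaker is "Q" or "A".
Turn = Tuple[str, str]


def build_pairs(session: List[Turn],
                context_turns: int = 5,
                sep: str = " <s> ") -> List[Tuple[str, str]]:
    """Return (enc_line, dec_line) pairs for one session.

    For every A turn that directly follows a Q turn, the enc line is the
    preceding context (at most `context_turns` turns, ending with that Q)
    and the dec line is the A turn itself.
    """
    pairs = []
    for i, (speaker, text) in enumerate(session):
        # Only agent replies that answer a customer turn become targets.
        if speaker != "A" or i == 0 or session[i - 1][0] != "Q":
            continue
        context = session[max(0, i - context_turns):i]
        enc_line = sep.join(turn_text for _, turn_text in context)
        pairs.append((enc_line, text))
    return pairs
```

With an alternating session of six turns, the third pair produced has a full QAQAQ context on the enc side and the final A on the dec side; earlier pairs have shorter contexts (Q, then QAQ).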

requirements

#Python 3.5
#TensorFlow 1.0.0

Train Model

#Edit the seq2seq.ini file and set mode = train
python execute.py
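Switching the mode by hand is the documented route. For scripted runs, a minimal sketch with Python's standard `configparser` might look like the following. The helper name `set_mode` is hypothetical, and because the section that holds the `mode` key is not stated here, the sketch searches every section instead of assuming a name.

```python
import configparser


def set_mode(ini_path: str, mode: str) -> str:
    """Set the first `mode` key found in the INI file; return its section name."""
    # interpolation=None keeps any '%' characters in other values untouched.
    parser = configparser.ConfigParser(interpolation=None)
    if not parser.read(ini_path, encoding="utf-8"):
        raise FileNotFoundError(ini_path)
    for section in parser.sections():
        if parser.has_option(section, "mode"):
            parser.set(section, "mode", mode)
            with open(ini_path, "w", encoding="utf-8") as fh:
                parser.write(fh)
            return section
    raise KeyError("no 'mode' key found in " + ini_path)
```

Note that `configparser` drops comments when it rewrites a file, so editing seq2seq.ini manually remains the safer choice if the file carries explanatory comments.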

Test Model

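#Edit the seq2seq.ini file and set mode = test
#Input: input file format (QAQAQ), path (working_dir/test/test.txt). Note that the official competition will use 100 questions; this baseline provides only 50.
#Output: output file format (A), path (working_dir/test/result.txt)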
python execute.py
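After a test run it can be useful to confirm that every question received an answer. The sketch below assumes one QAQAQ context per line in test.txt and one answer per line in result.txt; that line-per-item layout and the helper names `count_lines` and `check_result_file` are assumptions, not part of the baseline.

```python
def count_lines(path: str) -> int:
    """Count lines in a UTF-8 text file (a trailing newline adds no extra line)."""
    with open(path, encoding="utf-8") as fh:
        return len(fh.read().splitlines())


def check_result_file(test_path: str, result_path: str) -> int:
    """Raise ValueError if the question and answer counts differ; else return the count."""
    n_questions = count_lines(test_path)
    n_answers = count_lines(result_path)
    if n_questions != n_answers:
        raise ValueError(
            "expected %d answers, found %d" % (n_questions, n_answers))
    return n_questions
```

Called with the two paths listed above, the function returns the number of questions when the files line up and raises otherwise.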

Notes

#This baseline will continue to be updated.

jddc-baseline-seq2seq's People

Contributors

simonjyang
