Code Monkey home page Code Monkey logo

led's People

Contributors

drogozhang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

tuidan

led's Issues

how to train model with customed datasets

Thanks for your contribution and great work!
I have some questions,
How to train model with customed datasets? I am going to train the whole model with Chinese retrieval datasets.
All I need to do :

  1. transform my training data to customized formatted data
  2. initialize the backbone bert-like model with Chinese bert-like model such as bert-base-chinese
  3. train step by step

Am I right? And what the format for the customized training data?

Question about the loss function in your paper

In your paper, the equation (3) and (8) for loss function read,
๐‘ โˆˆ {๐‘+ } โˆฉ N, or ๐‘ โˆˆ {๐‘+ } โˆฉ N^mix
I don't know why you use intersection instead of union.
Each batch in your loss function consists of positive passages and negative passages.
So I think it confuses me when you use 'โˆฉ' to represent intersection here.

Question about training warm-up lexical model with BM25 negatives.

I saw in your python code "train_splade_retriever.py",

target = torch.arange(ga_emb_query.shape[0], device=ga_emb_query.device, dtype=torch.long) * num_docs

I don't know for sure why you didn't pass the relevance label of passage-query pair explicitly.
It seems that you still utilize In-Batch-Negative sampling methods instead of BM25 negative as in your paper. So I want to make sure it is right or not, and how could I train this model with BM25 negatives.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.