Code Monkey home page Code Monkey logo

is-bert's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

is-bert's Issues

Train with fine-tuned embeddings?

Hi,

Thanks very much for releasing your code!

Was wondering if it is worth training IS-BERT with a fine-tuned hugging face model rather than with an out-of-the-box hugging face model?

So, if I want to train IS-BERT on reddit comments, is it better to:

a.) Fine-tune 'bert-base-uncased' on reddit comments and then use embeddings from fine-tuned model to train IS-BERT

or

b.) Directly train IS-BERT on reddit comments using 'bert-base-uncased' embeddings without first fine-tuning 'bert-base-uncased' on reddit comments?

why use MI instead of InfoNCE as loss function?

Hi, since you treat each sentence and its local context representations as positive examples, and treat all the local context representations from other sentences as negative examples, like what we usually do in contrastive learning, why do you choose MI as loss function instead of conventional CL loss like InfoNCE? Is MI better than InfoNCE in this scenario? Thanks!

Get embeddings from trained model

Hi,

First of all thank you for sharing your code!

I am wondering if there is a way to get an embedding for a new sentence after i trained the model with my data. model.encode(sentence) seems to be doing this. But I tested it using a sentence from the sentences I trained the model with and the embeddings differ. Am I doing something wrong?

Thank you in advance for your help!

discriminator

Is the discriminator Tw in this paper dot product operation?

‘measures’

hello,what these measures such as 'GAN','JSD' and so on mean?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.