Comments (8)
I don't have a super strong opinion against making this a parameter, but several indicators make me feel it's not the right approach.
- The widgets don't send parameters, and I don't think it's very user-friendly to make users choose (how is a random user supposed to know which one to use?).
- It seems to me that models are trained with a particular similarity function, so using the other one doesn't make sense. Allowing a choice seems like letting users shoot themselves in the foot by picking the wrong function.
Do you think we could take this upstream to fix it properly and store the information in some configuration? That seems like the cleanest way to me right now.
What do you think?
Again, we could just add the parameter
from api-inference-community.
Yes, since this information is really model-dependent, having it in https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1/blob/main/config.json (or config_sentence_transformers.json) makes more sense than adding a parameter for users to input.
Opened up an issue for this: UKPLab/sentence-transformers#1643
I'll leave this issue open for now until there's a resolution on that ST issue, unless you think otherwise!
For just computing sentence similarity there isn't a setting in ST; they have you use the util.cos_sim or util.dot_score methods, as shown here: https://www.sbert.net/docs/usage/semantic_textual_similarity.html The dot-score method isn't documented on that page, though; it's really only mentioned on the MS-MARCO page.
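The practical difference between those two scoring functions is just whether the embeddings are L2-normalized before the dot product. A minimal pure-Python sketch (no sentence-transformers needed; the vectors are made up for illustration):

```python
import math

def dot_score(a, b):
    # Raw dot product: sensitive to embedding magnitude.
    return sum(x * y for x, y in zip(a, b))

def cos_sim(a, b):
    # Cosine similarity: dot product of L2-normalized vectors.
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot_score(a, b) / (norm_a * norm_b)

# Two vectors pointing in the same direction but with different magnitudes:
u = [1.0, 2.0, 2.0]      # norm 3
v = [2.0, 4.0, 4.0]      # norm 6, same direction

print(cos_sim(u, v))     # 1.0  (direction identical)
print(dot_score(u, v))   # 18.0 (scales with magnitude)
```

This is why a model trained for one scoring function can rank results differently under the other: cos_sim ignores vector length, while dot_score rewards it.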
> the default operator is correct but sometimes you would want to override
Yup, pretty much! In their other methods, like paraphrase mining, the score function is set to cos_sim by default and users can override it. I don't personally know how common using the dot product is, but it looks like there are some use cases: https://www.reddit.com/r/MachineLearning/comments/pd6wjh/comment/haobugl
Yes, this sounds like a good idea. It would require an additional parameter, which I think is OK. Feel free to open a PR if you want :) cc @Narsil
Isn't there a setting or a config within sentence-transformers that could be used to know which similarity to use? It feels like this shouldn't be passed by users, as they are likely not to know which operator to use. Or are you implying that the default operator is correct but sometimes you would want to override it?
I also remember there was such a configuration, but while writing this pipeline, we were told that cos_sim had basically won. (FYI)
That sounds good to me! config_sentence_transformers.json seems like an appropriate place for it, but at the moment ST isn't actually using that file for anything other than version numbers for ST, transformers, and PyTorch, e.g. https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/blob/main/config_sentence_transformers.json I'll open up an issue on ST for storing the similarity function in that file, and for having the ST util methods use it, to get input from Nils.
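If that config file did carry the similarity function, the pipeline could dispatch on it instead of exposing a user-facing parameter. A sketch under that assumption; the similarity_fn_name key and the cosine default are hypothetical here, not an existing ST convention:

```python
import json
import math
import tempfile

def cos_sim(a, b):
    # Cosine similarity: dot product of L2-normalized vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def dot_score(a, b):
    # Raw dot product, no normalization.
    return sum(x * y for x, y in zip(a, b))

SCORE_FNS = {"cosine": cos_sim, "dot": dot_score}

def load_score_fn(config_path):
    # Fall back to cosine, since that is the de-facto default in ST.
    with open(config_path) as f:
        config = json.load(f)
    return SCORE_FNS[config.get("similarity_fn_name", "cosine")]

# Simulate a model repo whose config opts into dot-product scoring.
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    json.dump({"similarity_fn_name": "dot"}, f)
    path = f.name

score = load_score_fn(path)
print(score([1.0, 2.0], [3.0, 4.0]))  # 11.0 (raw dot product)
```

With this shape, models whose config omits the key keep today's cosine behavior, and dot-product models opt in per repo rather than per request.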