Trying to use MonoT5 3B on some custom reranking tasks, the gist of the code is <d

MonoT5 .predict gives NaN

maxmatical commented on June 13, 2024

MonoT5 .predict gives NaN


Comments (2)

rahmanidashti commented on June 13, 2024

Hi, have you found the solution for this?


soyoung97 commented on June 13, 2024

Hi, it's not exactly the same issue, but I've seen similar NaN scores on the reranking task.
For example, I'm using the following code:

from beir.reranking import Rerank
from beir.reranking.models import MonoT5
from beir.retrieval.evaluation import EvaluateRetrieval

def do_evaluation(queries, qrels, corpus, results=None):
    k_values = [1, 5, 10, 20, 50, 100]
    retriever = EvaluateRetrieval()
    # `mode` is not defined in this snippet; presumably the model name or path
    cross_encoder_model = MonoT5(mode, token_false='▁false', token_true='▁true')
    print(f"Loading cross-encoder model from: {cross_encoder_model.model.config._name_or_path}")
    reranker = Rerank(cross_encoder_model, batch_size=256)
    results = reranker.rerank(corpus, queries, results, top_k=100)  # outputs NaN scores to results
    results = remove_nan(results)  # manually assign scores due to the bug
    ndcg, _map, recall, precision = retriever.evaluate(qrels, results, k_values)
    return ndcg, _map, recall, precision
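
Before applying any workaround, it can help to confirm which queries actually came back with NaN scores. The following is a minimal sketch, not part of BEIR: `find_nan_scores` is a hypothetical helper name, and it assumes `results` has the usual nested shape `{query_id: {doc_id: score}}` with float scores.

```python
import math

def find_nan_scores(results):
    """Map each query id to the list of doc ids whose score is NaN.

    Hypothetical helper (not a BEIR function). Assumes `results` is
    {query_id: {doc_id: float_score}}.
    """
    bad = {}
    for query_id, doc_scores in results.items():
        nan_docs = [doc_id for doc_id, score in doc_scores.items() if math.isnan(score)]
        if nan_docs:
            bad[query_id] = nan_docs
    return bad

# Toy data standing in for reranker output
example = {"q1": {"d1": float("nan"), "d2": 0.3}, "q2": {"d3": 1.2}}
print(find_nan_scores(example))  # {'q1': ['d1']}
```

An empty dictionary means no NaN scores were found, so the scores can be evaluated as they are.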

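It may be too late, but the order of the documents within each query's results is preserved, so the following function can be used as a quick workaround. It overwrites every score with a value derived from the document's position: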

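def remove_nan(results):
    # Replace every score with a rank-based value; relies on dict order being the ranking order
    new_res = {}
    for query_key in results.keys():
        out = {}
        for i, corpus_key in enumerate(results[query_key].keys()):
            out[corpus_key] = 100 - i  # 100 matches top_k=100: first document gets 100, then 99, ...
        new_res[query_key] = out
    return new_res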

With this code, the evaluation outputs correct scores for nDCG, recall, and so on.
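
The workaround above hardcodes 100 and rewrites every query, including ones whose scores are fine. A slightly more defensive variant could look like the sketch below. It is an assumption-laden illustration rather than a BEIR API: `rank_scores_where_nan` is a hypothetical name, it only touches queries that contain at least one NaN, and it uses the number of documents in place of the fixed 100. Like the original, it relies on the dictionary order being the intended ranking, so the substituted scores encode position only.

```python
import math

def rank_scores_where_nan(results):
    """Replace scores with rank-based values, but only for queries containing NaN.

    Hypothetical helper (not a BEIR function). Assumes `results` is
    {query_id: {doc_id: float_score}} and that each inner dict is already
    in the intended ranking order.
    """
    fixed = {}
    for query_id, doc_scores in results.items():
        if any(math.isnan(score) for score in doc_scores.values()):
            n = len(doc_scores)
            # First document gets the highest score n, the last one gets 1
            fixed[query_id] = {doc_id: float(n - i) for i, doc_id in enumerate(doc_scores)}
        else:
            # Scores are usable as they are; copy them unchanged
            fixed[query_id] = dict(doc_scores)
    return fixed
```

It would be called in place of `remove_nan(results)`, right before `retriever.evaluate(...)`.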
