Light

haojiepan1 / e-commerce-search-recall Goto Github PK

View Code? Open in Web Editor NEW

This project forked from muyuuuu/e-commerce-search-recall

0.0 0.0 0.0 1.76 MB

天池阿里灵杰问天引擎电商搜索算法赛 pytorch baseline，非官方。附从 0.05 到 0.26 分的 trick。

Python 100.00%

e-commerce-search-recall's Introduction

电商搜索召回

一个毫无 NLP 经验的人的比赛（挖坑填坑）之旅。

实现 DSSM baseline，直接优化距离结果很差，得分 0.057
实现 CoSENT，余弦距离得分 0.159
实现 SimCSE，得分 0.227

tools 里面是精度转换和结果文件检查。

Trick

在 model.py 中使用 first-last-avg 融合大概从 0.22 提升到 0.245 左右。

Details

def forward(self, input_ids, attention_mask, token_type_ids):
    out = self.extractor(input_ids,
                         attention_mask=attention_mask,
                         token_type_ids=token_type_ids,
                         output_hidden_states=True)

    first = out.hidden_states[1].transpose(1, 2)
    last = out.hidden_states[-1].transpose(1, 2)
    first_avg = torch.avg_pool1d(
        first, kernel_size=last.shape[-1]).squeeze(-1)  # [batch, 768]
    last_avg = torch.avg_pool1d(last, kernel_size=last.shape[-1]).squeeze(
        -1)  # [batch, 768]
    avg = torch.cat((first_avg.unsqueeze(1), last_avg.unsqueeze(1)),
                    dim=1)  # [batch, 2, 768]
    out = torch.avg_pool1d(avg.transpose(1, 2), kernel_size=2).squeeze(-1)
    x = self.fc(out)
    x = F.normalize(x, p=2, dim=-1)
    return x

在 unilm 文件夹下，进行 UniLM 预训练，大概 0.265 左右，损失在 1.3x 左右。预训练模型下载: YunwenTechnology/Unilm

参考

致谢

本仓库中的工作得到西安电子科技大学高性能计算校级公共平台的支持. Supported by High-performance Computing Platform of XiDian University.

e-commerce-search-recall's People

Contributors

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.