Code Monkey home page Code Monkey logo

ctr_algorithm's People

Contributors

prayforhanluo avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

ctr_algorithm's Issues

关于DIN模型的疑惑

大佬您好,最近看您的DIN代码,有一些地方不太明白,希望得到您的解答!
1、mask = (behaviors_x > 0).float().unsqueeze(-1) 这里的msak的具体作用是啥啊?为什么需要这个mask呢?
这里的 注意力输入部分,原始的好像没有queries - user_behavior吧?为啥有这一项呢
2、attn_input = torch.cat([queries, user_behavior,
queries - user_behavior,
queries * user_behavior], dim = -1)
3、 output = user_behavior.mul(attns.mul(mask)) # batch * seq_len * embed_dim
这个里面为啥还有mask呢?

模型中编码问题

DeepFM中的编码是采用的one-hot编码方式,作者在此处用了lable编码方式。这种编码方式很少用,且局限性很强。这种编码方式确实在embedding的时候非常方便。但是这种编码方式最后产生的实际效果是不是可信度不强呢

关于DIEN的PyTorch实现

你好,我在阅读DIEN的PyTorch代码时发现一个地方可能存在错误
在Interest Extractor Layer中,GRU的隐藏层应该保留前T-1个用于计算辅助loss
但是DIEN代码的152行是
gru_embed=pad_interests[:,1:]
这样是不是取了后T-1个hidden state?
我理解的应该是
gru_embed=pad_interests[:,:-1]

关于DIN的疑问

大佬您好,最近看您的DIN代码,有一些地方不太明白,希望得到您的解答!
1、mask = (behaviors_x > 0).float().unsqueeze(-1) 这里的msak的具体作用是啥啊?为什么需要这个mask呢?
这里的 注意力输入部分,原始的好像没有queries - user_behavior吧?为啥有这一项呢
2、attn_input = torch.cat([queries, user_behavior,
queries - user_behavior,
queries * user_behavior], dim = -1)
3、 output = user_behavior.mul(attns.mul(mask)) # batch * seq_len * embed_dim
这个里面为啥还有mask呢?

模型中offset的问题

大佬,请问一下模型中对输入加上一个offset 有什么作用呢,为啥offset还要向右偏移了一位呢?初次学习,还请大佬指教,感谢!

模型中关于输入的feature_fields的疑惑

大佬,您好,您写的这个项目是我看到最优美的推荐代码,对于小白的我,非常受益,感谢您把您的工作分享出来,供大家学习。在阅读代码的时候,我一直有一个小小的疑惑,就是,fields = data_x.max().values,这句话的意思是获取到每列特征中最大的索引,比如第一列是0,1两个不同的特征,而这句代码直接就取到了1,然而实际就有两个不同的特征,这样做embedding的时候,每一列的特征都少一个,所以torch.nn.Embedding(sum(feature_fields)+1,1)是不是应该为torch.nn.Embedding(sum(feature_fields+1),1)?还有为什么要在sum(feature_fields)后面加个1呢?还请大佬明示,感谢!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.