Code Monkey home page Code Monkey logo

Comments (12)

guoday avatar guoday commented on September 4, 2024

@guoday 多谢多谢

使用训练集的所有数据作为种子人群

from tencent2020_rank1st.

guotong1988 avatar guotong1988 commented on September 4, 2024

弱问下 种子人群 的概念?

from tencent2020_rank1st.

guoday avatar guoday commented on September 4, 2024

弱问下 种子人群 的概念?

种子人群就是用来统计广告的性别年龄分布,然后赋值给另一个人群的。比如测试集就不能作为种子人群,因为没有性别年龄信息

from tencent2020_rank1st.

guotong1988 avatar guotong1988 commented on September 4, 2024

感谢回复,
直接赋值给另一人群,不是相当于训练和预测阶段这个特征一样吗?没有泛化的感觉?

from tencent2020_rank1st.

guoday avatar guoday commented on September 4, 2024

感谢回复,

直接赋值给另一人群,不是相当于训练和预测阶段这个特征一样吗?没有泛化的感觉?

不一样,训练的时候分五份,1234→5,1235→4,1245→3,1345→2,2345→1,因此只有80%的训练集统计,给另外的20%。预测的时候取100%的训练集给测试集

from tencent2020_rank1st.

guotong1988 avatar guotong1988 commented on September 4, 2024

这个做法提升多少?感谢!

from tencent2020_rank1st.

guoday avatar guoday commented on September 4, 2024

这个做法提升多少?感谢!

提升特别大,大概4个千分点,也是获胜的关键之一。你可以看我们ppt上分情况,有提到这个fusion layer

from tencent2020_rank1st.

guotong1988 avatar guotong1988 commented on September 4, 2024

模型本身居然捕获不到这些分布信息

from tencent2020_rank1st.

guoday avatar guoday commented on September 4, 2024

模型本身居然捕获不到这些分布信息

主要广告比较稀疏,通过统计的方法会更好一些。如果广告特别稠密,比如只有1万个id,那么这方法就没啥用了

from tencent2020_rank1st.

guotong1988 avatar guotong1988 commented on September 4, 2024

哦哦,刚想问其他分类任务是否能这样把label分布加进去

from tencent2020_rank1st.

juzstu avatar juzstu commented on September 4, 2024

想问下,这个和target encoding有啥区别?

from tencent2020_rank1st.

guoday avatar guoday commented on September 4, 2024

想问下,这个和target encoding有啥区别?

没区别

from tencent2020_rank1st.

Related Issues (17)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.