Code Monkey home page Code Monkey logo

chineserumordataset's Introduction

中文谣言和虚假新闻数据集

Chinese_Rumor_Datasethttps://github.com/thunlp/Chinese_Rumor_Dataset.git

📎 该数据为从新浪微博不实信息举报平台抓取的中文谣言数据,分为两个部分。其中当前目录下的数据集仅包含谣言原微博,不包含转发/评论信息;而CED_Dataset中是包含转发/评论信息的中文谣言数据集。
有详细的readme简介。质量较高。

DoubleCheckhttps://github.com/Enderfga/DoubleCheck.git

📎 论文:Long-Text Chinese Rumor Detection Dataset 中提出的数据集LTCR。
LTCR 数据集为准确检测错误信息提供了宝贵的资源,特别是在与 COVID-19 相关的复杂假新闻的背景下。该数据集分别包含 1,729 条真实新闻和 500 条假新闻。真实新闻和虚假新闻的平均长度分别约为 230 和 152 个字符。
详见论文。

COVID19-Health-Rumorhttps://github.com/Kelaxon/COVID19-Health-Rumor.git

📎 论文:Know it to Defeat it: Exploring Health Rumor Characteristics and Debunking Efforts on Chinese Social Media during COVID-19 Crisis 中涉及到的数据集。
该数据集包含 COVID-19 早期在**互联网上流传的健康谣言,以及新浪微博(**最大的微博网站)上旨在反驳或揭穿这些谣言的帖子。与阴谋论不同,健康谣言是关于医疗保健和医学的,不涉及主要参与者(例如美国军方)。
详见论文以及readme。

CHECKEDhttps://github.com/cyang03/CHECKED.git

📎 论文:CHECKED: Chinese COVID-19 Fake News Dataset 提出的数据集。
包括真假新闻,json格式与csv格式存储。
详见论文以及readme。

CrossFakehttps://github.com/YingtongDou/CrossFake.git

📎 论文:Cross-lingual COVID-19 Fake News Detection 提到的数据集。
包含中、英文的真、假新闻。详见数据集。
详见论文以及readme。

CHEFhttps://github.com/THU-BPM/CHEF.git

📎 论文:CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking 提出的数据集。
详见论文以及readme。

Combating the Infodemic:https://www.mdpi.com/2227-9032/9/9/1094

📎 论文:Combating the Infodemic: A Chinese Infodemic Dataset for Misinformation Identification 提出的数据集,通过收集 COVID-19 爆发期间广泛传播的**信息流行病来构建**信息流行病数据集“infodemic 2019”。每条记录都被标记为真实、错误或可疑。
详见论文。

chineserumordataset's People

Contributors

yeren66 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.