Code Monkey home page Code Monkey logo

datamining's Introduction

DataMining

- Exploratory Data Analysis and Data Mining for COVID-19 pandemic using Pyspark

- Short Description & Introduction

The analysis was conducted for the purposes of the course in Data Mining during my postgraduate program in Data Science & Machine Learning. The dataset that we use for our analysis originates from the non-profit organization Our World in Data. The dataset was pulled from the following GitHub link. In this analysis we are mainly interested in extracting valuable information about the spreading and the effects of the COVID-19 pandemic for the period 05/01/2021 - 30/06/2021. We describe and analyze how the pandemic affected and spread among different countries, continents and in the whole World in general.

  • We produce visualizations to answer questions and enhance the findings of our analysis for the aforementioned period. For example, we would like to know How the number of the new COVID cases evolved in Europe?

Or Which countries have had the most total Covid cases until the end of June 2021?

Coutries with highest total Covid cases until 30/06/2021

Country Total Covid cases
United States 33.777.444.
India 30.411.634
Brazil 18.570.296
France 5817787
Russia 5449594

Despite the fact the uploaded notebook is writting in Greek an English version will also be available soon! :)

datamining's People

Contributors

chrisnick92 avatar

Stargazers

Spyros Rigas avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.