DAMG_FINAL_PROJECT

IMDb Movies Project using Talend, MSSQL Server, Alteryx, Power BI, MySQL, Tableau

Tools Used:

  • Data Profiling: Alteryx
  • Data Cleaning: Talend
  • ETL Tool: Talend
  • SQL Servers: MySQL, MSSQL
  • Data Visualization: Power BI and Tableau

IMDb Movies Analysis Using Talend

Tools & Technologies Used: Talend, ER Studio, Alteryx, Microsoft SQL Server, MySQL, Tableau, Azure Data Studio, Power BI

  • Integrated data from multiple sources: MySQL (IMDb tables), TSV (revenue data), and JSON files (movie titles and actor name changes).
  • Profiled and analyzed the data in Alteryx, producing reports and insights, and documented the source-to-target mapping in an Excel mapping document.
  • Developed a data model centered on an SCD Type 2 Movie Titles dimension table, so that title changes are tracked historically.
  • Designed and implemented ETL mappings in Talend, using metadata-based connections, contexts, and environments to streamline data processing workflows.
  • Created interactive dashboards in Power BI and Tableau, and checked that SQL script outputs were consistent with the visualized data.
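The SCD Type 2 behavior mentioned above can be illustrated with a minimal Python sketch. This is not the project's Talend job: the `TitleVersion` fields, the `apply_title_change` helper, and the sample title ids are all hypothetical names chosen for illustration. The idea is only that a title change expires the current row and appends a new current row, so earlier titles stay queryable.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class TitleVersion:
    """One row of a hypothetical SCD Type 2 movie-title dimension."""
    surrogate_key: int
    title_id: str                         # natural (business) key
    primary_title: str
    effective_from: date
    effective_to: Optional[date] = None   # None means "still current"
    is_current: bool = True


def apply_title_change(dim, title_id, new_title, change_date):
    """Expire the current row for title_id and append a new current row."""
    current = next(
        (row for row in dim if row.title_id == title_id and row.is_current),
        None,
    )
    if current is not None:
        if current.primary_title == new_title:
            return dim  # nothing changed, so history is left untouched
        current.effective_to = change_date
        current.is_current = False
    # Surrogate keys are simply the next integer in this sketch.
    next_key = max((row.surrogate_key for row in dim), default=0) + 1
    dim.append(TitleVersion(next_key, title_id, new_title, change_date))
    return dim
```

Calling `apply_title_change` twice for the same `title_id` with different titles leaves two rows: the first closed off with an `effective_to` date, the second flagged as current.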

1. Alteryx:

Alteryx Workflow: Understanding data

image

Findings:

  • Rank: The movie's daily rank ranged from 1 to 55 during its box office run; the column also contains “-” placeholder values.
  • Gross: Daily gross earnings ranged from a minimum of $357 to a maximum of about $28.27 million.
  • Per Theater: Earnings per theater ranged from $60 to $8,181.
  • Total Gross: Cumulative gross earnings rose over the run, reaching approximately $760.51 million.
  • Days: The dataset covers 336 days from the movie's release.
  • %LW and %YD: Both columns contain null values.
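The kind of column profiling summarized above (minimum, maximum, and null counts, with “-” treated as a missing value) can be sketched in plain Python. The profiling in this project was done in Alteryx; the `profile_column` helper and the tiny sample below are hypothetical and made up for illustration, not the actual revenue file.

```python
import csv
import io


def profile_column(rows, column, null_tokens=("", "-")):
    """Return count, null count, min and max for one numeric column."""
    values, nulls = [], 0
    for row in rows:
        raw = (row.get(column) or "").strip()
        if raw in null_tokens:
            nulls += 1
            continue
        # Strip currency, thousands and percent formatting, e.g. "$1,000".
        cleaned = raw.replace("$", "").replace(",", "").replace("%", "")
        values.append(float(cleaned))
    return {
        "count": len(values),
        "nulls": nulls,
        "min": min(values) if values else None,
        "max": max(values) if values else None,
    }


# Made-up three-row sample in TSV form (assumed column names).
SAMPLE_TSV = "Rank\tGross\t%YD\n1\t$1,000\t\n-\t$357\t-10.5%\n55\t$2,500\t\n"
rows = list(csv.DictReader(io.StringIO(SAMPLE_TSV), delimiter="\t"))
rank_profile = profile_column(rows, "Rank")
gross_profile = profile_column(rows, "Gross")
```

Treating “-” as a null token keeps the rank range numeric while still reporting how many rows lacked a rank.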

Insights and Observations

  • Strong Initial Performance: "Avatar" had a powerful opening, indicated by the high initial daily and per-theater gross.
  • Longevity in Theaters: The movie remained in theaters for a significant duration (336 days), highlighting its lasting appeal.
  • Consistent Top Rankings: The movie consistently ranked well during its theatrical run despite fluctuations.
  • Revenue Stability: After the initial spike, the total gross showed stability, indicating a steady influx of viewers over an extended period.

2. Navicat: Designing the Data Model (Dimensional Model):

image

3. Talend Workflow Screenshots

Staging: image image image

Dimensions: image image image image image

Bridge Tables:

  • Movie-Genre Bridge Table: image
  • Movie-Region Bridge Table: image
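A bridge table resolves a many-to-many relationship, such as one movie having several genres. As a rough sketch of that idea (not the Talend job shown in the screenshots), the snippet below explodes a comma-separated genre string into one `(movie_key, genre_key)` row per genre. The function name, the key assignment scheme, and treating `\N` as a null marker are assumptions made for illustration.

```python
def build_movie_genre_bridge(titles, genre_keys):
    """Explode comma-separated genres into (movie_key, genre_key) pairs.

    titles     : iterable of (movie_key, genres_string) tuples
    genre_keys : dict mapping genre name -> surrogate key, updated in place
    """
    bridge = []
    for movie_key, genres in titles:
        for name in genres.split(","):
            name = name.strip()
            if not name or name == r"\N":   # skip empty or null markers
                continue
            # Assign a new surrogate key the first time a genre is seen.
            genre_key = genre_keys.setdefault(name, len(genre_keys) + 1)
            bridge.append((movie_key, genre_key))
    return bridge
```

The `genre_keys` dictionary plays the role of the genre dimension here, so the same genre always maps to the same key across movies.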

Fact Tables:

  • BoxOfcFact: image
  • FactTitle Principal: image
  • Genre Fact: image

Visualization Using Power BI (https://app.powerbi.com/groups/4245cd51-53a4-4aac-984f-18f6bde6a73e/reports/07948f86-f53d-4286-b8c0-efee8aaf52e1/ReportSection185e58af7ba5a1c2e3ef?experience=power-bi): image image image image

Contributors

madanjatin18
