Code Monkey home page Code Monkey logo

project-group-group21's Introduction

Group 21 - Analysis of Airbnb Data Across 3 Major Cities

  • Your title can change over time.

Milestones

Details for Milestone are available on Canvas (left sidebar, Course Project).

Describe your topic/interest in about 150-200 words

Air bnb data that summarizes price, neighborhood, review, and availability information for Air bnb rental places in Mexico, Montreal, and Tokyo. We are interested in exploring the correlations between the various data offered in this dataset to figure out our research questions, such as exploring the cheapest Air bnb rental spot, the correlation between price and rating, the average pricing of each city, and price versus size correlation. In terms of our interest in the topic, we are hoping that by visualizing the above mentioned data, we are able to better plan future trips around booking through airbnb in order to obtain the highest accommodation value when visiting Mexico, Tokyo, and Montreal in the future With the raw data being downloaded in separate cities, we believe that it is also possible to conduct analysis on each individual cities, which may offer further insights for our, or our analysis users', travel plan to Mexico, Montreal, and Tokyo.

Describe your dataset in about 150-200 words

Who:

The company that provided this data is Inside Airbnb which is a website that contains various airbnb databases. Inside Airbnb is an organization that provides various data sets from multiple cities around the world such as Belgium, Athens, Boston etc. The data sets provided range from listings, neighbourhoods, and reviews and is widely available to the public.

What:

Our data consists of various information relating to airbnb bookings in Mexico city, Tokyo, and Montreal, such as price, availability, room type and number of reviews. Aside from the three cities we conduct our analysis on, the Inside Airbnb website also offers the same set of data for a large number of other cities that Airbnb operates in.

When:

The presented data was collected from the years 2015 to 2022.

Why:

We believe the purpose of our data set is to provide information to potential users of airbnb in order to gain deeper insights as to optimal locations to utilize airbnb.

How:

We selected our data from an array of csv files from insideairbnb.com. We then extracted our three data files with information from three cities onto our local repositories before pushing it to the group repository.

Team Members

  • Aurora Gardiner: I am an Accountant.
  • Harry Wu: I am an Accountant, too!
  • Steven To: I am not an Accountant.

Images

{You should use this area to add a screenshot of an interesting plot, or of your dashboard}

References

Airbnb Data Source. Inside Airbnb. (n.d.). Retrieved February 14, 2023, from http://insideairbnb.com/get-the-data/

project-group-group21's People

Contributors

aurorag1 avatar github-classroom[bot] avatar harry-hao-wu avatar stevent9 avatar

project-group-group21's Issues

Let's Fix the Import Errors!

Hey Aurora and Steven,

As I was reviewing your analysis files, I noticed that you were not able to import your functions from the .py file properly. I have attached a screenshot of how I did it in my analysis 1 file and I can walk you through the codes:

  1. I first imported os, so that I can define the file path for importing my function. (this was done at the very top, where I imported pandas, seaborn, etc. the code is simply import os)
  2. I then assigned cpath(a variable) a value of os.get.cwd(). This basically stores your current directory path (aka where all our analysis files are located) into the variable cpath.
  3. I then changed the directory in the next line, basically adding "\code" to the end of my current working directory path, which guides the computer into the code folder, where we stored all our function files.
  4. Now that we are at the right directory, we can import the load and process function as I wrote on my third line of code.
  5. I then change the directory to head back to where all the analysis files are, which we can use the variable cpath to define the path.
  6. The load and process function should now work using whatever parameters you have set for yourselves.

Let me know if you need me to explain this in person!

image

MS4 Feedback

Feedback for Group 21

  1. Notes from [Islam] on Feedback on the analysis:

Analysis 1:

  • Good job.
  • add titles to your visualizations
  • increase text size in your visualizations
  • fix histogram

Analysis 2:

  • Good job.

Analysis 3:

  • Good job.
  • fix Minimum_nights VS. Price (overplotted)
  • add titles to your visualizations
  • fix scale of small visualizations by changing xlim

Contracted Grade

  1. What is your contracted grade?
  • A
  1. Are you on track to satisfy all the requirements of your contracted grade?
  • Yes, work isn't done but you're on the right track.

Feedback session 2

Analysis 1

  • Add title and observations for plots
  • Change count and price plot and explain (histogram)
  • Overlay the distributions, explain the y axis and x axis and show what the difference is
  • Refer to the plots that answer the question
  • #2
    Analysis 2
  • Add value on top of the bar chart on room type
  • Add a legend on the geographic plot and figure out how to overlay different colour plots
  • Resolve warning message
  • Summary of research questions can be longer
    Analysis 3
  • edit research question (make deeper)
  • Edit minimum nights vs price: Use box plot
  • Add titles and comments to plots
  • Sort the number of listings and count plot
  • Combine method chaining
  • Cut off (limit y axis) and combine for price distribution plot

Feedback Session 1

  1. What is your contracted grade?
  • A
  1. Are there are any group dynamics issues that we need to be aware of ?
  • None
  1. Notes from Firas on Feedback on the analysis:
  • Refactor your code instead of using f-strings and loops, to do it using pandas groupby, or apply function isin()
  • Show me a plot of the price as a histogram, and then limit to the top 95% (you may need to play around with this number)
  • Room type vs price is over plotted (switch to a histogram) (analysis2)
  • Create an annotated line at the lower and upper end of your price to say that's what you're considering (analysis1,2,3)
  • Sort your histogram by number of listings (analysis2)
  • Boxplot isn't the right plot there (analysis2)
  • Lots more work left for analysis3, unable to give you useful feedback at the moment

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.