Assignment Task :

Part 1
Tasks 1-5
focus on housing data from the 1990 California census, with each row representing a census block group, which is the smallest geographical unit for which sample data is provided by the U.S. Census Bureau. A census block group usually has a population of about 600 to 3,000 people. The data is sourced from Kaggle1 and is provided in the housing.csv file on iLearn. It consists of the following fields:
• longitude - a measure of how far west a house is; a higher value is farther west
• latitude - a measure of how far north a house is; a higher value is farther north
• housing_median_age - median age of a house within a block in years; a lower number is a newer building
• total_rooms - total number of rooms within a block
• total_bedrooms - total number of bedrooms within a block
• population - total number of people residing within a block
• households - total number of households, a group of people residing within a home unit, for a block
• median_income - median income for households within a block of houses (measured in tens of thousands of US Dollars)
• median_house_value - median house value for households within a block (measured in US Dollars)
• ocean_proximity - location of the block with respect to the ocean
 

Task 1
Display the structure of the housing data and calculate descriptive statistics for the numerical columns (mean, median, standard deviation, maximum and minimum).
• How many rows and columns are in this dataset?
• Create a frequency table showing the unique values of the ocean proximity variable and the number of times these values occur in the dataset.
• Create a pie chart showing the breakdown of the categories in the ocean proximity variable. Display the percentage of each category on the pie chart. Interpret the plot.
 

Task 2
• How many values are missing from the total bedrooms variable? Print out the first 5 rows in the housing data with missing values for this variable.
• Create a violin plot showing how the distribution of median house value differs across the location of the block with respect to the ocean. Interpret the plot.
Task 3 
Add a new column (age_category) to the housing data that classifies each block into one of five categories depending upon the median age of a house:
 0-10 years – Cat 1
 10-20 years – Cat 2
20-30 years – Cat 3
30-40 years – Cat 4
> 40 years – Cat 5
• Print out the median age of a house and the age category for the first 10 rows in the dataset
• Create a grouped bar chart showing the breakdown of the age_category variable for each category in the ocean proximity variable (i.e. put ocean proximity on the x-axis and age_category in the bars). Interpret the plot.
 

Task 4 
Investigate the relationship between total bedrooms (x-axis) and median house value (yaxis) by creating a scatter plot with a linear regression line through it and a joint plot. Comment on what the plots say about this relationship.
• Filter the housing data so it only contains census blocks with a median income above $60,000 USD and a median house age less than 5 years. Print the first 5 rows of this filtered dataset. How many census blocks are in this filtered dataset?
• Randomly sample 500 rows from the housing dataset and store them in a variable. Print out the first 10 rows of this dataset.
• Create a scatter plot showing the relationship between total rooms (x-axis) and median house value (y-axis) for the randomly sampled data, colouring points on the chart based on a census block’s median income. Interpret the plot

 

This Business Assignment has been solved by our Business Experts at UniLearnO. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.

Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

Eureka! You've stumped our genius minds (for now)! This exciting new question has our experts buzzing with curiosity. We can't wait to craft a fresh solution just for you!

  • Uploaded By : Grace
  • Posted on : December 14th, 2018

Whatsapp Tap to ChatGet instant assistance