Assignment Task:

In this homework assignment, you will apply cluster analysis and a decision tree induction algorithm to a weather forecast problem. It is a binary classification problem: predict whether or not a location will get rain the next day.
Information about the dataset (Weather Forecast Training.csv):
• Location: The location name of the weather station
• MinTemp: The minimum temperature in degrees Celsius
• MaxTemp: The maximum temperature in degrees Celsius
• Rainfall: The amount of rainfall recorded for the day in mm
• Evaporation: The so-called Class A pan evaporation (mm) in the 24 hours to 9am
• Sunshine: The number of hours of bright sunshine in the day.
• WindGustDir: The direction of the strongest wind gust in the 24 hours to midnight
• WindGustSpeed: The speed (km/h) of the strongest wind gust in the 24 hours to midnight
• WindDir: Direction of the wind
• WindSpeed: Wind speed (km/h) averaged over 10 minutes
• Humidity: Humidity (percent)
• Pressure: Atmospheric pressure (hPa) reduced to mean sea level
• Cloud: Fraction of sky obscured by cloud. This is measured in “oktas”, a unit of eighths: the value records how many eighths of the sky are obscured by cloud. A measure of 0 indicates a completely clear sky, whilst 8 indicates that it is completely overcast.
• Temp: Temperature (degrees C)
• RainTodayBoolean: 1 if precipitation (mm) in the 24 hours to 9am exceeds 1mm, otherwise 0
• RainTomorrow: The target variable. Did it rain the next day?

Organize your report using the following template (with the section breakdown and grading rubrics):

Section 1: Data preparation  
• Discuss the potential data quality issues you identify in the dataset, the data preprocessing techniques you apply to address them, and the Exploratory Data Analysis (EDA) you perform.
Specifically, discuss the techniques you use to prepare the dataset for the machine learning algorithms in the next section. Where appropriate, enhance your EDA with effective data visualization.
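As a rough illustration of the kind of preprocessing this section asks about, the sketch below counts missing values, flags an out-of-range Cloud reading (oktas must lie between 0 and 8), and imputes the gaps. It is only a sketch under stated assumptions: the sample rows are made up, the missing-value markers (`""` and `"NA"`) are assumed, and median/mode imputation is just one possible choice, not something the assignment prescribes. It uses only the Python standard library so that it runs without extra dependencies.

```python
import csv
import io
from collections import Counter
from statistics import median

# Tiny made-up sample using a few of the documented column names (not real data).
SAMPLE_CSV = """Location,MinTemp,Humidity,Cloud,WindDir,RainTomorrow
Sydney,12.5,70,3,NW,0
Sydney,,85,9,NW,1
Perth,18.0,,NA,,0
Perth,15.5,40,1,SE,0
"""

NUMERIC = ["MinTemp", "Humidity", "Cloud"]
CATEGORICAL = ["WindDir"]
MISSING = {"", "NA"}  # assumed missing-value markers


def load_rows(text):
    """Parse CSV text into dicts, mapping missing markers to None."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        rows.append({k: (None if v in MISSING else v) for k, v in raw.items()})
    return rows


def missing_counts(rows):
    """Count missing values per column."""
    return {col: sum(r[col] is None for r in rows) for col in rows[0]}


def clean(rows):
    """Convert numeric columns, blank out invalid Cloud values, then impute."""
    for r in rows:
        for col in NUMERIC:
            if r[col] is not None:
                r[col] = float(r[col])
        # Cloud is measured in oktas, so valid values lie in 0..8.
        if r["Cloud"] is not None and not 0 <= r["Cloud"] <= 8:
            r["Cloud"] = None
    # Numeric columns: fill gaps with the column median.
    for col in NUMERIC:
        fill = median(r[col] for r in rows if r[col] is not None)
        for r in rows:
            if r[col] is None:
                r[col] = fill
    # Categorical columns: fill gaps with the most frequent value.
    for col in CATEGORICAL:
        seen = Counter(r[col] for r in rows if r[col] is not None)
        fill = seen.most_common(1)[0][0]
        for r in rows:
            if r[col] is None:
                r[col] = fill
    return rows


rows = load_rows(SAMPLE_CSV)
before = missing_counts(rows)
rows = clean(rows)
after = missing_counts(rows)
print("missing before:", before)
print("missing after: ", after)
```

On the real file, the same checks would be run against every column listed above, and the report should justify whichever imputation or row-dropping strategy is chosen.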
 

 


  • Uploaded By : Grace
  • Posted on : February 19th, 2018
