Subject Code : GV900
Assignment Task

 

Tasks

 1. Load the “world” dataset (world.csv), and store it as an object named world.data.

2. The data set contains a dummy variable (i.e., a nominal variable with two categories) named oecd that classifies countries into two groups, OECD member countries and nonmember countries. One way to describe and summarize the information contained in a nominal variable is to describe the distribution numerically. As we learned during the past weeks, we describe the distribution of a nominal variable numerically by creating a frequency table. Create a frequency table of this variable and store it into a data frame object ft.oecd. The table has to have three columns: values (initially called “Var1”), frequency (called “Freq”), and percentage (should be called “Percentage”). Change the column name of the first column to “OECD Member?”.

3. According to the frequency table you created above, (A) how many countries in the data set are OECD members? (B) How many countries in the data set are not? (C) What percentage of countries are OECD members? (D) What percentage of countries are nonmembers? Give me four answers (four numbers) as a comment. Note: for this task, you 1 don’t need an R command. Just read the table and tell me the numbers. Don’t forget to comment them out.

4. Another way to describe and summarize a nominal variable is to draw a frequency distribution graph. For nominal variables, we draw a bar chart. Using the functions available in the ggplot2 package (e.g., geom bar), draw a bar chart of the dummy variable that measures OECD membership. • Hint 1: Don’t forget to load the package using the library function. It’s usually a good idea to do so at the beginning of your R code file. • Hint 2: Don’t forget to change the axis labels using the xlab and ylab options. The appropriate label for the X axis would be “OECD membership”, whereas the label for the Y axis could be “Number of countries”.

5. List three countries that are coded as OECD member states. List three countries that are non-democratic according to the democracy dummy variable. Note: Again, you don’t need a command for this one; I only need six country names.

6. The data set contains a numerical variable (interval-level variable) named gdp 10 thou that records a country’s per capita GDP in 10,000 US dollars. Note that this variable measures per capita GDP in 10,000 dollars, not in dollars. This means that, when this variable takes a value of 4, for example, then that country’s per cpaita GDP is 40,000 dollars, not 4 dollars. Describe this variable numerically by calculating the following statistics: • Range (minimum and maximum), median, mean, 1st and 3rd quartile values (Hint: this can be done at once with one command) • Standard deviation (Hint: you need to take care of missing values using the na.rm option) Note: You need to provide R commands, not just numerical answers for this one.

7. It appears that the mean and the median of this per capita GDP variable are far apart: the mean is 6,018 dollars whereas the median is 1,897 dollars. Given that the mean is much higher than the median, the distribution of this variable is very skewed (i.e., not symmetric). In which way does the skew go? Answer this question by choosing between two options: (A) negatively skewed (skewed to the left) or (B) positively skewed (skewed to the right). Note: Give me your answer in words, not in R commands.

 

 

This GV900 - Data Analytics Assignment has been solved by our Data Analytics experts at UniLearnO. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.
Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

Eureka! You've stumped our genius minds (for now)! This exciting new question has our experts buzzing with curiosity. We can't wait to craft a fresh solution just for you!

  • Uploaded By : Brett
  • Posted on : April 19th, 2019

Whatsapp Tap to ChatGet instant assistance