diff_months: 7

Health Insurance Costs of Customers for a Real Health Insurance Company - Statistics Assignment Help

Notify Me
Added on: 2022-01-10 06:13:07
Order Code:
Question Task Id: 386978
  • Country :


Assignment Task


There are three questions to be answered. All calculations need to be in the corresponding sheet. 

Taks1: It is a hypothesis testing. You need to answer to below questions:
Is the difference significant?
b) Is this durg helping patients or worsening it?

Task2: Linear regression: This is a dataset to predict the heath insurance costs of customers for a real health isurance company. Target variable is "charges" For "sex", and "smoker" features, replace classes with 0 or 1. For instance "female" could be 0, "male" could be 1. Choice of 0 and 1 is oprional. You will use these to build the model Use multiple linear regressionto predict charges. Set 20% of the data to be "test" set. Calculate RMSE for test set and training set
a) Calculate RMSE?
Find out which feature can be dropped, either using correlaton between features, or use t-stat ot p-value. Build a new model with one less feature
b) which feature better to be dropped?
Compare first model and second model
c) which model is better
d) why

Task3:There are two colums coming rom a classification problem. First column is the predicted value, second is the real value value of 1 stands for positive/yes and 0 means negative or no. Try to build a confusion matrix by counting how many instances of TN, TP, FP, FN we have
a) Find TP, TN, FP, and FN in order
b) Calculate accuracy, precision, recall, and f1-score
c) Is the dataset balanced or imbalanced?
d) Which performance metrics will you choose if we don’t have any information about what is the dataset about?
Let's assume 0 stands for not-rain and 1 stands for rainy days. A business of a large chain ice cream store is using this model to close some of the branches if it is predicted to rain A night before, based on model prediction, employees receive an email if they need to show up tomorrow at work or not The company is already well-known and doesn't need to acquire more customers. Their focus for now is to save costs and not open the store, when there is no customer due to the rain
e) With this background, which performance metrics will you choose?


This Statistics Assignment has been solved by our Statistics Experts at TVAssignmentHelp. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.
Be it a used or new solution, the quality of the work submitted by our assignment Experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

  • Uploaded By : Roman
  • Posted on : January 10th, 2020
  • Downloads : 0

Notify Me

Review Question

Please enter your email

Can't find what you're looking for?

Choose a Plan


80 USD
  • All in Gold, plus:
  • 30-minute live one-to-one session with an expert
    • Understanding Marking Rubric
    • Understanding task requirements
    • Structuring & Formatting
    • Referencing & Citing


30 50 USD
  • Get the Full Used Solution
    (Solution is already submitted and 100% plagiarised.
    Can only be used for reference purposes)
Save 33%


20 USD
  • Journals
  • Peer-Reviewed Articles
  • Books
  • Various other Data Sources – ProQuest, Informit, Scopus, Academic Search Complete, EBSCO, Exerpta Medica Database, and more