From the Introduction to Modern Statistics (IMS) textbook, complete the following exercises:
Also complete the additional question given below:
Question #1: The data below are a random sample of \(n=5000\) claims made against the US Transportation Security Administration (TSA). To provide some background, a claim is a formal request for compensation due to damages. People make claims against the TSA when their personal property is stolen or damaged, or when they are injured by TSA processes. If the claim is approved or settled, the TSA pays an amount of money, the close amount, to the individual who made the claim.
tsa = read.csv("https://remiller1450.github.io/data/tsa_small.csv")
tsa
data set to
only include claims on the following items: "Cell Phones"
and "Computer - Laptop"
. Then, create a new binary variable
“Denied” that records whether a claim was denied or not using the
mutate()
and ifelse()
functions. Store your
results in a new data frame and use them throughout the remainder of
this question.R
to find a 95% confidence
interval estimate of odds ratio described in part B.Status
to model the
outcome variable Close_Amount
. Find an interpret a 95%
confidence interval estimate using the coefficient of the re-coded
variable StatusSettled
. Can you be confident that settled
claims provide payments that differ from approved claims?Close_Amount
using the predictors Status
and
Claim_Amount
. Find an interpret a 95% confidence interval
estimate using the coefficient of the re-coded variable
StatusSettled
. Does this interval lead you to a different
conclusion about the payouts of settled claims vs. approved claims when
compared with your results from Part F? If so, explain what is
different. If not, briefly explain why.