All Case Study 2 questions use the heart.csv data provided with this case study. Along with heart.csv, I have provided a heart_data_dictionary.csv file that describes each column. You may find it helpful to reference this data dictionary as you answer the lab questions.
[...] `chest_pain`, `chol`, and `max_hr` columns. How many rows and columns are in the resulting DataFrame?
[...] `.fillna()` docs.
[...] `risk` that is equal to $ \frac{\text{age}}{\text{res_bp} + \text{chol} + \text{max_hr}} $. What is the mean of this `risk` column?
[...] `rest_ecg` column so that:
[...] `rest_ecg` categories?
[...] `risk` value (the `risk` column was created in problem 2 of the "Manipulating data" section) for each age and sex. Which gender and age group has the highest average `risk` value?
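
As a rough illustration of the `risk` formula and the per-group average described above, the sketch below uses a small invented DataFrame in place of heart.csv. The data values and the `"F"`/`"M"` coding for `sex` are assumptions made up for this example, and the column names are taken from the formula as written (including `res_bp`), so check them against heart_data_dictionary.csv before adapting this to the real data.

```python
import pandas as pd

# Toy stand-in for heart.csv. All values here are invented for illustration;
# column names follow the formula in the text and may differ in the real file.
heart = pd.DataFrame(
    {
        "age": [40, 40, 60, 60],
        "sex": ["F", "M", "F", "M"],
        "res_bp": [100, 120, 150, 130],
        "chol": [200, 280, 250, 320],
        "max_hr": [100, 100, 100, 150],
    }
)

# risk = age / (res_bp + chol + max_hr), computed for every row at once.
heart["risk"] = heart["age"] / (heart["res_bp"] + heart["chol"] + heart["max_hr"])

# Overall mean of the new column.
mean_risk = heart["risk"].mean()

# Average risk for each (age, sex) combination.
risk_by_group = heart.groupby(["age", "sex"])["risk"].mean()

# Index label (age, sex) of the group with the highest average risk.
highest_group = risk_by_group.idxmax()

print(heart.shape)      # (number of rows, number of columns)
print(mean_risk)
print(risk_by_group)
print(highest_group)
```

Grouping on both columns at once gives one average per age and sex pair, and `idxmax()` then returns the pair with the largest average. On the real data the same steps should apply once the DataFrame is loaded from heart.csv, though the exact column names may need adjusting.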