A dataset for predicting the diagnosis of 303 patients tested for heart disease.
Format
A data frame containing 303 rows and 14 columns, with the following variables:
- diagnosis
True value of binary criterion: TRUE = Heart disease, FALSE = No heart disease
- age
Age (in years)
- sex
Sex, 1 = male, 0 = female
- cp
Chest pain type: ta = typical angina, aa = atypical angina, np = non-anginal pain, a = asymptomatic
- trestbps
Resting blood pressure (in mm Hg on admission to the hospital)
- chol
Serum cholesterol in mg/dl
- fbs
Fasting blood sugar > 120 mg/dl: 1 = true, 0 = false
- restecg
Resting electrocardiographic results. "normal" = normal, "abnormal" = having ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV), "hypertrophy" = showing probable or definite left ventricular hypertrophy by Estes' criteria.
- thalach
Maximum heart rate achieved
- exang
Exercise induced angina: 1 = yes, 0 = no
- oldpeak
ST depression induced by exercise relative to rest
- slope
The slope of the peak exercise ST segment.
- ca
Number of major vessels (0-3) colored by fluoroscopy
- thal
"normal" = normal, "fd" = fixed defect, "rd" = reversible defect
Details
Note that this is a simplified version of the 303 cases from the Cleveland Clinic Foundation (V.A. Medical Center, Long Beach and Cleveland Clinic Foundation; Principal investigator: Robert Detrano, MD, PhD).
The original dataset contains three further subsets (from Budapest, Hungary; Long Beach, CA; and Zurich, Switzerland), a total of 76 raw attributes, and some missing values.
The original criterion variable num is integer-valued, ranging from 0 (no presence) to 4 (maximum).
To obtain the binary criterion diagnosis, values from 1 to 4 have been collapsed to TRUE, and 0 corresponds to FALSE.
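The collapsing step can be illustrated with a minimal sketch. It assumes that every non-zero value of num (1 through 4) counts as heart disease and that 0 counts as no heart disease. The function name collapse_num and the sample values are hypothetical, chosen only for illustration; they are not taken from the dataset or from any package.

```python
def collapse_num(num):
    """Collapse the original 0-4 criterion `num` into a binary diagnosis.

    Assumption: 0 means no heart disease (False); 1 to 4 mean heart disease (True).
    """
    if num not in range(5):
        raise ValueError(f"num must be an integer from 0 to 4, got {num!r}")
    return num > 0


# Made-up criterion values for illustration (not rows of the dataset).
sample_num = [0, 1, 2, 3, 4, 0]
sample_diagnosis = [collapse_num(n) for n in sample_num]
print(sample_diagnosis)
```

Rejecting values outside 0 to 4 keeps miscoded or missing entries from being silently mapped to a diagnosis.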
See also
heart.cost dataset for cost information.
Other datasets: blood, breastcancer, car, contraceptive, creditapproval, fertility, forestfires, heart.cost, heart.test, heart.train, iris.v, mushrooms, sonar, titanic, voting, wine