This dataset describes a sample of 100 volunteers providing a semen sample that was analyzed according to the WHO 2010 criteria.

fertility

Format

A data frame containing 100 rows and 10 columns.

season

Season in which the analysis was performed. (winter, spring, summer, fall)

age

Age at the time of analysis

child.dis

Childish diseases (ie , chicken pox, measles, mumps, polio) (yes(1), no(0))

trauma

Accident or serious trauma (yes(1), no(0))

surgery

Surgical intervention (yes(1), no(0))

fevers

High fevers in the last year (less than three months ago(-1), more than three months ago (0), no. (1))

alcohol

Frequency of alcohol consumption (several times a day, every day, several times a week, once a week, hardly ever or never)

smoking

Smoking habit (never(-1), occasional (0)) daily (1))

sitting

Number of hours spent sitting per day

diagnosis

Criterion: Diagnosis normal (TRUE) vs. altered (FALSE) (88.0% vs.\ 22.0%).

Source

https://archive.ics.uci.edu/ml/datasets/Fertility

Original contributors:

David Gil Lucentia Research Group Department of Computer Technology University of Alicante

Jose Luis Girela Department of Biotechnology University of Alicante

Details

Sperm concentration are related to socio-demographic data, environmental factors, health status, and life habits.

We made the following enhancements to the original data for improved usability:

  • The criterion was redefined from a factor variable with two levels (N = Normal, O = Altered) into a logical variable (TRUE vs. FALSE).

Other than that, the data remains consistent with the original dataset.