Data describing poisonous vs. non-poisonous mushrooms.
mushrooms
A data frame containing 8,124 rows and 23 columns.
See http://archive.ics.uci.edu/ml/machine-learning-databases/mushroom/agaricus-lepiota.names for column descriptions.
Criterion: Is the mushroom poisonous? Values: TRUE (poisonous) vs. FALSE (edible) (48.2% vs. 51.8%).
cap-shape, character (bell=b, conical=c, convex=x, flat=f, knobbed=k, sunken=s)
cap-surface, character (fibrous=f, grooves=g, scaly=y, smooth=s)
cap-color, character (brown=n, buff=b, cinnamon=c, gray=g, green=r, pink=p, purple=u, red=e, white=w, yellow=y)
bruises, logical (TRUE/FALSE): Are there bruises?
odor, character (almond=a, anise=l, creosote=c, fishy=y, foul=f, musty=m, none=n, pungent=p, spicy=s)
gill-attachment, character (attached=a, descending=d, free=f, notched=n)
gill-spacing, character (close=c, crowded=w, distant=d)
gill-size, character (broad=b, narrow=n)
gill-color, character (black=k, brown=n, buff=b, chocolate=h, gray=g, green=r, orange=o, pink=p, purple=u, red=e, white=w, yellow=y)
stalk-shape, character (enlarging=e, tapering=t)
stalk-root, character (bulbous=b, club=c, cup=u, equal=e, rhizomorphs=z, rooted=r)
stalk-surface-above-ring, character (fibrous=f, scaly=y, silky=k, smooth=s)
stalk-surface-below-ring, character (fibrous=f, scaly=y, silky=k, smooth=s)
stalk-color-above-ring, character (brown=n, buff=b, cinnamon=c, gray=g, orange=o, pink=p, red=e, white=w, yellow=y)
stalk-color-below-ring, character (brown=n, buff=b, cinnamon=c, gray=g, orange=o, pink=p, red=e, white=w, yellow=y)
veil-type, character (partial=p, universal=u)
veil-color, character (brown=n, orange=o, white=w, yellow=y)
ring-number, character (none=n, one=o, two=t)
ring-type, character (cobwebby=c, evanescent=e, flaring=f, large=l, none=n, pendant=p, sheathing=s, zone=z)
spore-print-color, character (black=k, brown=n, buff=b, chocolate=h, green=r, orange=o, purple=u, white=w, yellow=y)
population, character (abundant=a, clustered=c, numerous=n, scattered=s, several=v, solitary=y)
habitat, character (grasses=g, leaves=l, meadows=m, paths=p, urban=u, waste=w, woods=d)
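The single-letter codes listed above can be expanded into readable labels with a named lookup vector. The following is a minimal sketch in base R. The helper `decode_levels` and the toy vector `cap_shape` are hypothetical names used for illustration only and are not part of the dataset; only the cap-shape codes are shown.

```r
# Lookup table built from the cap-shape codes listed above.
cap_shape_labels <- c(b = "bell", c = "conical", x = "convex",
                      f = "flat", k = "knobbed", s = "sunken")

# Hypothetical helper: map single-letter codes to labels.
# Codes missing from the lookup (and NA) come back as NA.
decode_levels <- function(codes, lookup) {
  unname(lookup[as.character(codes)])
}

# Toy input for illustration only (not taken from the data).
cap_shape <- c("x", "b", "f", NA, "s")
decoded <- decode_levels(cap_shape, cap_shape_labels)
print(decoded)
```

The same pattern should carry over to any other character column by swapping in a lookup vector built from that column's codes.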
This dataset includes descriptions of hypothetical samples corresponding to 23 species of gilled mushrooms
in the Agaricus and Lepiota family. Each species is classified as poisonous (TRUE) or not poisonous (FALSE).
The Guide clearly states that there is no simple rule for determining the edibility of a mushroom;
no rule like "leaflets three, let it be" for poison oak and poison ivy.
We made the following enhancements to the original data for improved usability:
Any missing values, denoted as "?" in the dataset, were transformed into NAs.
Binary factor variables containing only "t" and "f" values were converted to logical TRUE/FALSE
vectors.
The binary factor criterion variable containing only "p" and "e" values was converted to a logical TRUE/FALSE
vector (TRUE = poisonous).
Other than that, the data remains consistent with the original dataset.
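The three transformations described above can be sketched in base R as follows. This is an illustration under stated assumptions, not the code actually used to prepare the data: the toy data frame `raw`, its values, and its column names are hypothetical, and all columns are assumed to be read as character strings.

```r
# Toy stand-in for the raw data (hypothetical values and column names).
raw <- data.frame(
  class      = c("p", "e", "e", "p"),
  bruises    = c("t", "f", "t", "f"),
  stalk_root = c("e", "?", "b", "?"),
  stringsAsFactors = FALSE
)

clean <- raw

# 1. Turn "?" placeholders into NA.
clean[clean == "?"] <- NA

# 2. Convert columns that hold only "t"/"f" values to logical.
is_tf <- vapply(clean, function(x) all(x %in% c("t", "f")), logical(1))
clean[is_tf] <- lapply(clean[is_tf], function(x) x == "t")

# 3. Convert the "p"/"e" criterion to logical: TRUE = poisonous.
clean$poisonous <- clean$class == "p"
clean$class <- NULL

str(clean)
```

In this sketch a column that mixes "t"/"f" with missing values would be left as character, which matches the "only t and f" wording above.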
Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G.H. Lincoff (Pres.), New York: A.A. Knopf.
Other datasets: blood, breastcancer, car, contraceptive, creditapproval, fertility, forestfires, heart.cost, heart.test, heart.train, heartdisease, iris.v, sonar, titanic, voting, wine