** Iris Plants Database **


Sources
Past Usage.
Relevant Information.
Attribute Information.
Summary Statistics.


Back to Syllabus Page

Sources:


Back to Top of the Page

Past Usage:

Publications: too many to mention!!! Here are a few.

  1. Fisher, R.A. "The Use of Multiple Measurements in Taxonomic Problems" Annual Eugenics, 7, Part II, 179-188 (1936); also in Contributions to Mathematical Statistics (John Wiley, NY, 1950).
  2. Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene Analysis. John Wiley & Sons. P. 218.
  3. Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System Structure and Classification Rule for Recognition in Partially Exposed Environments," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-2, No. 1, 67-71. Results: very low misclassification rates (0% for the setosa class)
  4. Gates, G.W. (1972) "The Reduced Nearest Neighbor Rule," IEEE Transactions on Information Theory, May 1972, 431-433. Results: very low misclassification rates again
  5. See also: 1988 MLC Proceedings, 54-64. Cheeseman et al's AUTOCLASS II conceptual clustering system finds 3 classes in the data.


Back to Top of the Page

Relevant Information:


Back to Top of the Page

Attribute Information:


Back to Top of the Page

Summary Statistics:

 
Min
Max
Mean
SD
Class Correlation
sepal length
4.3
7.9
5.84
0.83
0.7826
sepal width
2.0
4.4
3.05
0.43
-0.4194
petal length
1.0
6.9
3.76
1.76
0.9490 (high!)
petal width
0.1
2.5
1.20
0.76
0.9565 (high!)

Class Distribution: 33.3% for each of 3 classes.


Back to Top of the Page
Back to Syllabus Page