Skip to content

Commit 397a9a2

Browse files
committed
adding more description to the nursery notebook
1 parent 2eea6b3 commit 397a9a2

File tree

2 files changed

+326
-950
lines changed

2 files changed

+326
-950
lines changed

data/nursery.arff

+22-23
Original file line numberDiff line numberDiff line change
@@ -1,32 +1,32 @@
11
% 1. Title: Nursery Database
2-
%
2+
%
33
% 2. Sources:
44
% (a) Creator: Vladislav Rajkovic et al. (13 experts)
55
% (b) Donors: Marko Bohanec (marko.bohanec@ijs.si)
66
% Blaz Zupan (blaz.zupan@ijs.si)
77
% (c) Date: June, 1997
8-
%
8+
%
99
% 3. Past Usage:
10-
%
10+
%
1111
% The hierarchical decision model, from which this dataset is
12-
% derived, was first presented in
13-
%
12+
% derived, was first presented in
13+
%
1414
% M. Olave, V. Rajkovic, M. Bohanec: An application for admission in
1515
% public school systems. In (I. Th. M. Snellen and W. B. H. J. van de
1616
% Donk and J.-P. Baquiast, editors) Expert Systems in Public
1717
% Administration, pages 145-160. Elsevier Science Publishers (North
1818
% Holland)}, 1989.
19-
%
19+
%
2020
% Within machine-learning, this dataset was used for the evaluation
2121
% of HINT (Hierarchy INduction Tool), which was proved to be able to
2222
% completely reconstruct the original hierarchical model. This,
2323
% together with a comparison with C4.5, is presented in
24-
%
24+
%
2525
% B. Zupan, M. Bohanec, I. Bratko, J. Demsar: Machine learning by
2626
% function decomposition. ICML-97, Nashville, TN. 1997 (to appear)
27-
%
27+
%
2828
% 4. Relevant Information Paragraph:
29-
%
29+
%
3030
% Nursery Database was derived from a hierarchical decision model
3131
% originally developed to rank applications for nursery schools. It
3232
% was used during several years in 1980's when there was excessive
@@ -38,10 +38,10 @@
3838
% The model was developed within expert system shell for decision
3939
% making DEX (M. Bohanec, V. Rajkovic: Expert system for decision
4040
% making. Sistemica 1(1), pp. 145-157, 1990.).
41-
%
41+
%
4242
% The hierarchical model ranks nursery-school applications according
4343
% to the following concept structure:
44-
%
44+
%
4545
% NURSERY Evaluation of applications for nursery schools
4646
% . EMPLOY Employment of parents and child's nursery
4747
% . . parents Parents' occupation
@@ -55,30 +55,30 @@
5555
% . SOC_HEALTH Social and health picture of the family
5656
% . . social Social conditions
5757
% . . health Health conditions
58-
%
58+
%
5959
% Input attributes are printed in lowercase. Besides the target
6060
% concept (NURSERY) the model includes four intermediate concepts:
6161
% EMPLOY, STRUCT_FINAN, STRUCTURE, SOC_HEALTH. Every concept is in
6262
% the original model related to its lower level descendants by a set
63-
% of examples (for these examples sets see
63+
% of examples (for these examples sets see
6464
% http://www-ai.ijs.si/BlazZupan/nursery.html).
65-
%
65+
%
6666
% The Nursery Database contains examples with the structural
6767
% information removed, i.e., directly relates NURSERY to the eight input
6868
% attributes: parents, has_nurs, form, children, housing, finance,
6969
% social, health.
70-
%
70+
%
7171
% Because of known underlying concept structure, this database may be
7272
% particularly useful for testing constructive induction and
7373
% structure discovery methods.
74-
%
74+
%
7575
% 5. Number of Instances: 12960
7676
% (instances completely cover the attribute space)
77-
%
77+
%
7878
% 6. Number of Attributes: 8
79-
%
79+
%
8080
% 7. Attribute Values:
81-
%
81+
%
8282
% parents usual, pretentious, great_pret
8383
% has_nurs proper, less_proper, improper, critical, very_crit
8484
% form complete, completed, incomplete, foster
@@ -87,11 +87,11 @@
8787
% finance convenient, inconv
8888
% social non-prob, slightly_prob, problematic
8989
% health recommended, priority, not_recom
90-
%
90+
%
9191
% 8. Missing Attribute Values: none
92-
%
92+
%
9393
% 9. Class Distribution (number of instances per class)
94-
%
94+
%
9595
% class N N[%]
9696
% ------------------------------
9797
% not_recom 4320 (33.333 %)
@@ -13078,4 +13078,3 @@ great_pret,very_crit,foster,more,critical,inconv,slightly_prob,not_recom,not_rec
1307813078
great_pret,very_crit,foster,more,critical,inconv,problematic,recommended,spec_prior
1307913079
great_pret,very_crit,foster,more,critical,inconv,problematic,priority,spec_prior
1308013080
great_pret,very_crit,foster,more,critical,inconv,problematic,not_recom,not_recom
13081-

0 commit comments

Comments
 (0)