source: orange/Orange/doc/widgets/Visualize/AttributeStatistics.htm @ 9671:a7b056375472

Revision 9671:a7b056375472, 2.5 KB checked in by anze <anze.staric@…>, 2 years ago (diff)

Moved orange to Orange (part 2)

Line 
1<html>
2<head>
3<title>Attribute Statistics</title>
4<link rel=stylesheet href="../../../style.css" type="text/css" media=screen>
5<link rel=stylesheet href="style-print.css" type="text/css" media=print></link>
6</head>
7
8<body>
9
10<h1>Attribute Statistics</h1>
11
12<img class="screenshot" src="../icons/AttributeStatistics.png">
13<p>Shows basic distribution of attribute values.</p>
14
15<h2>Channels</h2>
16
17<h3>Inputs</h3>
18
19<DL class=attributes>
20<DT>Examples (ExampleTable)</DT>
21<DD>Input data set.</DD>
22</dl>
23
24<h3>Outputs</h3>
25
26<DL class=attributes>
27<DT>(None)</DT>
28<DD></DD>
29</dl>
30
31<h2>Description</h2>
32
33<p>Attribute Statistics shows distributions of attribute values. It is a good practice to check any new data with this widget, to quickly discover any anomalies, such as duplicated values (e.g. gray and grey), outliers, and similar.</p>
34
35<table>
36<tr><td valign="top">
37<img class="screenshot" src="AttributeStatistics-Cont.png" align="left">
38</td>
39<td valign="top">
40<P>For continuous attributes, the widget shows the minimal and maximal value. In case of Iris' attribute "petal length" (figure on the left), these are 1.00 and 6.90. In between are the 25'th percentile, the median and the 75%, which are 1.60, 4.35 and 5.10, respectively. The mean and standard deviation are printed in red (3.76 and 1.76) and also represented with the vertical line. At the bottom left corner there is also information on the sample size (there are 150 examples in the Iris data set, without any missing values) and the number of distinct values that this attribute takes.</P>
41</td></tr>
42
43<tr><td valign="top">
44<img class="screenshot" src="AttributeStatistics-Disc.png" align="left">
45</td>
46<td valign="top">
47<P>For discrete attributes, the bars represent the number of examples with each particular attribute value. The picture shows the number of different animal types in the Zoo data set: there are 41 mammals, 13 fish and so forth.</P>
48</td>
49</tr></table>
50
51<P>For both kinds of attributes, the graph can be saved by clicking the Save Graph button.</P>
52
53<h2>Examples</h2>
54
55<P>Attribute Statistics is most commonly used immediately after the <a href="../Data/File.htm">File</a> widget to observe statistical properties of the data set. It is also useful for finding the properties of a specific data set, for instance a group of examples manually defined in another widget, such as scatter plot or examples belonging to some cluster or a classification tree node, as shown in the schema below.</P>
56
57<img class="schema" src="AttributeStatistics-Schema.png">
58
59
60</body>
61</html>
Note: See TracBrowser for help on using the repository browser.