source: orange/docs/widgets/rst/visualize/attributestatistics.rst @ 11050:e3c4699ca155

Revision 11050:e3c4699ca155, 2.0 KB checked in by Miha Stajdohar <miha.stajdohar@…>, 16 months ago (diff)

Widget docs From HTML to Sphinx.

Line 
1.. _Attribute Statistics:
2
3Attribute Statistics
4====================
5
6.. image:: ../icons/AttributeStatistics.png
7
8Shows basic distribution of attribute values.
9
10Signals
11-------
12
13Inputs:
14   - Examples (ExampleTable)
15      Input data set.
16
17
18Outputs:
19   - (None)
20
21
22Description
23-----------
24
25Attribute Statistics shows distributions of attribute values. It is a good practice to check any new data with this widget, to quickly discover any anomalies, such as duplicated values (e.g. gray and grey), outliers, and similar.
26
27.. image:: images/AttributeStatistics-Cont.png
28
29For continuous attributes, the widget shows the minimal and maximal value. In case of Iris' attribute "petal length" (figure on the left), these are 1.00 and 6.90. In between are the 25'th percentile, the median and the 75%, which are 1.60, 4.35 and 5.10, respectively. The mean and standard deviation are printed in red (3.76 and 1.76) and also represented with the vertical line. At the bottom left corner there is also information on the sample size (there are 150 examples in the Iris data set, without any missing values) and the number of distinct values that this attribute takes.
30
31.. image:: images/AttributeStatistics-Disc.png
32
33For discrete attributes, the bars represent the number of examples with each particular attribute value. The picture shows the number of different animal types in the Zoo data set: there are 41 mammals, 13 fish and so forth.
34
35
36For both kinds of attributes, the graph can be saved by clicking the Save Graph button.
37
38Examples
39--------
40
41Attribute Statistics is most commonly used immediately after the `File <../Data/File.htm>`_ widget to observe statistical properties of the data set. It is also useful for finding the properties of a specific data set, for instance a group of examples manually defined in another widget, such as scatter plot or examples belonging to some cluster or a classification tree node, as shown in the schema below.
42
43.. image:: images/AttributeStatistics-Schema.png
Note: See TracBrowser for help on using the repository browser.