source: orange/docs/widgets/rst/data/file.rst @ 11797:840029d005bb

Revision 11797:840029d005bb, 3.0 KB checked in by blaz <blaz.zupan@…>, 4 months ago (diff)

Enabled custom documentation enumeration (stamper CSS style).

Line 
1.. _File:
2
3File
4====
5
6.. image:: ../../../../Orange/OrangeWidgets/Data/icons/File.svg
7   :alt: File widget icon
8   :class: widget-category-data widget-icon
9
10Reads attribute-value data from an input file.
11   
12Signals
13-------
14
15Inputs:
16   - None
17
18Outputs:
19   - Data
20         Attribute-valued data set read from the input file.
21
22Description
23-----------
24
25File widget reads the input data file (data table with data instances)
26and sends the data set to its output channel. It maintains
27a history of most recently opened files. For convenience, the history also includes
28a directory with the sample data sets that come pre-installed with Orange.
29
30The widget reads data from simple tab-delimited or comma-separated files,
31as well as files in
32`Weka's arrf format <http://www.cs.waikato.ac.nz/~ml/weka/arff.html>`_.
33
34.. image:: images/File-stamped.png
35   :alt: File widget with loaded Iris data set
36   :align: right
37
38.. rst-class:: stamp-list
39
40   1. Browse for a data file.
41   #. Browse through previously opened data files, or load any of the sample data
42      files.
43   #. Reloads currently selected data file.
44   #. Information on loaded data set (data set size, number and types of
45      data features).
46   #. Opens a sub-window with advanced settings.
47   #. Adds a report on data set info (size, features).
48
49.. container:: clearer
50
51    .. image :: images/spacer.png
52
53Advanced Options
54----------------
55
56.. image:: images/File-Advanced-stamped.png
57   :alt: Advanced options of File widget
58   :align: right
59
60.. rst-class:: stamp-list
61
62   1. Symbol for don't care data entry.
63   #. Symbol for don't know data entry.
64   #. Settings for treatment of feature names in the feature space of Orange.
65
66.. container:: clearer
67
68    .. image :: images/spacer.png
69
70Tab-delimited data file can include user defined symbols for undefined values. The symbols for
71"don't care" and "don't know" values can be specified in the corresponding edit lines.
72The default values for "don't know" and "don't care" depend upon format. Most users will
73use tab-delimited files: keep the field empty or put a question mark in there and that's
74it. Most algorithms do not differ between don't know and don't care values, so consider
75them both to mean undefined.
76
77Orange will usually treat the attributes with the same name
78but appearing in different files as the same attribute, so a classifier which uses the
79attribute "petal length" from the first will use the attribute of the same name from
80the second. In cases when attributes from different files just accidentally bear different
81names, one can instruct Orange to either always construct new attribute or construct them when
82they differ in their domains. Use the options on dealing with new attributes
83with great care (if at all).
84
85Example
86-------
87
88Most Orange workflows would probably start with the File widget. In the schema below,
89the widget is used to read the data that is sent to both :ref:`Data Table` widget and
90to widget that displays :ref:`Attribute Statistics`.
91
92.. image:: images/File_schema.png
93   :alt: Example schema with File widget
Note: See TracBrowser for help on using the repository browser.