source: orange/Orange/doc/widgets/Classify/RandomForest.htm @ 9671:a7b056375472

Revision 9671:a7b056375472, 3.7 KB checked in by anze <anze.staric@…>, 2 years ago (diff)

Moved orange to Orange (part 2)

Line 
1<html>
2<head>
3<title>Random Forest</title>
4<link rel=stylesheet href="../../../style.css" type="text/css" media=screen>
5<link rel=stylesheet href="../../../style-print.css" type="text/css" media=print></link>
6</head>
7
8<body>
9
10<h1>Random Forest</h1>
11
12<img class="screenshot" src="../icons/RandomForest.png">
13<p>Random forest learner</p>
14
15<h2>Channels</h2>
16
17<h3>Inputs</h3>
18
19<DL class=attributes>
20<DT>Examples (ExampleTable)</DT>
21<DD>A table with training examples</DD>
22</dl>
23
24<h3>Outputs</h3>
25<DL class=attributes>
26<DT>Learner</DT>
27<DD>The random forest learning algorithm with settings as specified in the dialog</DD>
28
29<DT>Random Forest Classifier</DT>
30<DD>Trained random forest</DD>
31
32<DT>Choosen Tree</DT>
33<DD>One of the classification trees from the random forest classifer</DD>
34
35</dl>
36
37
38<h2>Description</h2>
39
40<p>Random forest is a classification technique that proposed by <a href="#Breiman2001">Leo Brieman (2001)</a>, given the set of class-labeled data, builds a set of classification trees. Each tree is developed from a bootstrap sample from the training data. When developing individual trees, an arbitrary subset of attributes is drawn (hence the term "random") from which the best attribute for the split is selected. The classification is based on the majority vote from individually developed tree classifiers in the forest.</p>
41
42<p>Random forest widget provides for a GUI to Orange's own implementation of random forest (<a href="/doc/modules/orngEnsemble.htm">orngEnsemble</a> module). The widget output the learner, and, given the training data on its input, the random forest. Additional output channel is provided for a selected classification tree (from the forest) for the purpose of visualization or further analysis.</p>
43
44<table><tr>
45<td valign="top">
46<img class="screenshot" src="RandomForest.png" border=0 alt="Random Forest widget"></td>
47<td valign="top">
48<p>In the widget, the first field is used to specify the name of the learner or classifier. Next block of parameters tells the algorithm how many classification trees will be included in the forest (<span class="option">Number of trees in forest</span>), and how many attributes will be arbitrarily drawn for consideration at each node. If the later is not specified (option <span class="option">Consider exactly ...</span> left unchecked), this number is equal to square root of number of attributes in the data set. Original Brieman's proposal is to grow the trees without any pre-prunning, but since this later often works quite well the user can set the depth to which the trees will be grown (<span class="option">Maximal depth of individual trees</span>). As another pre-pruning option, the stopping condition in terms of minimal number of instances in the node before splitting can be set. Finally, if the training data is given to the widget, the <span class="option">Index of the tree on the output</span> can be specified, instructing the widget to send the requested classifier.
49</td></tr></table>
50</p>
51
52<h2>Examples</h2>
53
54<p>Snapshot below shows a standard comparison schema of a random forest and a tree learner (in this case, C4.5) on a specific data set.</p>
55
56<img class="screenshot" src="RandomForest-Test.png" alt="Random forest evaluation" border=0></td>
57
58<p>A simple use of this widget where we wanted to explore how do the actual trees in the forest look like is presented in the following snapshot. In our case, the 5-th tree from the forest was rendered in the Classification Tree Graph widget.</p>
59
60<img class="screenshot" src="RandomForest-TreeGraph.png" alt="Visualization of a tree from random forest"border=0></td>
61
62
63<h2>References</h2>
64
65Breiman L (2001) Random Forests. Machine Learning 45 (1), 5-32. [<a href="http://www.springerlink.com/content/u0p06167n6173512/fulltext.pdf">PDF</a>]
66
67</body>
68</html>
Note: See TracBrowser for help on using the repository browser.