Changes between Version 64 and Version 65 of GSoC/Ideas


Ignore:
Timestamp:
03/09/12 02:01:36 (3 years ago)
Author:
mitar
Comment:

Legend:

Unmodified
Added
Removed
Modified
  • GSoC/Ideas

    v64 v65  
    1010 
    1111=== Text mining add-on for Orange === 
    12 Current [https://bitbucket.org/biolab/orange-addon-text Orange add-on for text mining] is outdated and incomplete. Source code needs rafactoring in order to be compliant with [http://orange.biolab.si/trac/wiki/Orange25 Orange 2.5 development guidelines]. Additionally, current text mining add-on lacks of documentation in reST format (including tutorial for beginners), unit tests and installation supported by PyPI (http://pypi.python.org/pypi). 
     12Current [https://bitbucket.org/biolab/orange-addon-text Orange add-on for text mining] is outdated and incomplete. Source code needs rafactoring in order to be compliant with [http://orange.biolab.si/trac/wiki/Orange25 Orange 2.5 development guidelines]. Additionally, current text mining add-on lacks of documentation in reST format (including tutorial for beginners), unit tests and installation supported by [http://pypi.python.org/pypi PyPI]. 
    1313 
    1414Project should also include a comparison between already implemented basic text (pre)processing techniques (lemmatization, steaming, document distance, feature sub selection, phrase detection) in current version of add-on and latest state-of-the-art techniques. If necessary additional algorithms (for example: multinomial Naive Bayes) should be (re)implemented. It would be very nice if text mining add-on functionalities would we also available from widgets in OrangeCanvas.  
     
    102102Note that this is not about porting Orange from Python 2.X to Py3K: this is trivial and can be done in one evening (we tried it). The work ranges from running 2to3 to redesigning some architectural parts, so the student will have to be in constant contact with the core group. 
    103103 
    104 Required skills: good knowledge of Orange and Python 
     104Required skills: Good knowledge of Orange and Python. 
    105105 
    106106Level from 1 (beginner) to 5 (professional): 4 
    107107 
    108108Possible mentors: Janez 
    109  
    110109 
    111110=== Widgets for statistics === 
     
    129128 
    130129Possible mentors: Gregor, Tomaz, Crt 
     130 
     131=== Animations in Orange === 
     132 
     133Data visualization plays a very important role in understanding relationships from the data. Unfortunately, it is usually limited to two dimensions (e.g. scatter plot), additional information about the data can be presented by different colors, sizes and shapes of the points. There can be, however, additional variables in data (e.g. time) which can have a strong influence on the scatter plot. Like in [http://www.gapminder.org/world/#$majorMode=chart$is;shi=t;ly=2003;lb=f;il=t;fs=11;al=30;stl=t;st=t;nsl=t;se=t$wst;tts=C$ts;sp=5.59290322580644;ti=2010$zpv;v=0$inc_x;mmid=XCOORDS;iid=phAwcNAVuyj1jiMAkmq1iMg;by=ind$inc_y;mmid=YCOORDS;iid=phAwcNAVuyj2tPLxKvvnNPA;by=ind$inc_s;uniValue=8.21; Gapminder] time can be used as an "animation" variable. One could play animations and see how the scatter plot changes during the time (or any other continuous variable from the data). 
     134 
     135Useful skills: Python. Widgets programming. 
     136 
     137Level from 1 (beginner) to 5 (professional): 4 
     138 
     139Possible mentors: ? 
    131140 
    132141== Ideas selected for GSoC 2011 == 
     
    157166 
    158167Possible mentors: Blaž 
    159  
    160 === Animations in Orange === 
    161  
    162 Data visualization plays a very important role in understanding relationships from the data. Unfortunately, it is usually limited to two dimensions (e.g. scatter plot), additional information about the data can be presented by different colors, sizes and shapes of the points. There can be, however, additional variables in data (e.g. time) which can have a strong influence on the scatter plot. Like in [http://www.gapminder.org/world/#$majorMode=chart$is;shi=t;ly=2003;lb=f;il=t;fs=11;al=30;stl=t;st=t;nsl=t;se=t$wst;tts=C$ts;sp=5.59290322580644;ti=2010$zpv;v=0$inc_x;mmid=XCOORDS;iid=phAwcNAVuyj1jiMAkmq1iMg;by=ind$inc_y;mmid=YCOORDS;iid=phAwcNAVuyj2tPLxKvvnNPA;by=ind$inc_s;uniValue=8.21; Gapminder] time can be used as an "animation" variable. One could play animations and see how the scatter plot changes during the time (or any other continuous variable from the data). 
    163  
    164  
    165 Useful skills: Python. Widgets programming. 
    166  
    167 Level from 1 (beginner) to 5 (professional): 4 
    168  
    169 Possible mentors: ?