Changes between Version 68 and Version 69 of GSoC/Ideas


Timestamp: 03/27/12 10:05:58 (2 years ago)
Author: blaz

  • GSoC/Ideas

    v68 v69  
    11= Google Summer of Code Ideas = 
    22 
    3 Here is a list of ideas for projects we thought up for what would be interesting and useful to do in a course of Google Summer of Code program for [// Orange]. Of course you can propose also some other (your) idea(s). But of course connected with Orange, data mining, machine learning, artificial intelligence in general, bioinformatics and other fields we are interested in (or you can get us interested in). 
     3Here is a list of ideas for projects that might be interesting and useful to carry out during the Google Summer of Code program for [// Orange]. Your own ideas for data analytics and visualization methods that would complement Orange are most welcome, too! 
    44 
    55You can find more information about our participation in Google Summer of Code [wiki:GSoC here]. 
     
    2020Possible mentors: Črt 
    2121 
     22=== Data input from mldata.org === 
     23 
     24[http://mldata.org/ is an excellent machine learning data repository]. It would be great if orange would have  
     25 
     26=== Widgets for statistics === 
     27 
     28Orange is rather weak in basic statistics, from various statistical tests to linear regression, dimensionality reduction and so forth. It would be great to have some widgets for this. The code for computation of all this is already available in other libraries which we can call from Python, so what we actually need is a good integration within the canvas.  
     29 
     30Level from 1 (beginner) to 5 (professional): 3.5 
     31 
     32Possible mentors: Janez 
     33 
    2234=== Support for parallel computation for scripting/backend === 
    2335 
     
    3446=== Test scripts, example scripts and documentation === 
    3547 
    36 Orange comes with substantial documentation for scripting which, in places, could be considerably improved. Also, Orange 2.5 with its new class hierarchy and functions is coming, and code snippets and the corresponding documentation will both require revision. The project would embark on the design of new use cases (snippets of code that demonstrate various aspects of Orange), a review of the present set of snippets, and the integration of code snippets within the documentation. 
     48Orange comes with substantial documentation for scripting which, in places, could be considerably improved. Also, Orange 2.5 with its new class hierarchy and functions is just about to be released, and some code snippets and the corresponding documentation will require revision (note that the [http://orange.biolab.si/doc/reference/ Reference Guide] has already been rewritten). The project would embark on the design of new use cases (snippets of code that demonstrate various aspects of Orange), a review of the present set of snippets, and the integration of code snippets within the documentation. Writing an Orange Cookbook or an Orange User's Guide would be most welcome. 
    3749 
    38 Snippets in documentation also serve as regression scripts upon which Orange is tested daily. Another purpose of this project is to increase the number and coverage of regression scripts. 
     50Snippets in documentation also serve as regression scripts upon which Orange is tested daily. Another purpose of this project might be to increase the number and coverage of unit tests. 
    3951 
    4052This could also be a good project if you would like to learn more about Orange, data mining and machine learning itself. 
    4153 
    42 Useful skills: Proficiency in English (probably native speaker). Language/writing skills. Python. 
     54Useful skills: Proficiency in English (probably native speaker) if the target is documentation writing. Language/writing skills. Good knowledge of Python if the target is writing unit tests. 
    4355 
    4456Level from 1 (beginner) to 5 (professional): 3 
     
    7082Possible mentors: Marko 
    7183 
    72 === Neural network === 
     84=== Neural Networks === 
    7385 
    7486Orange implements many algorithms for classification, but currently  
     
    108120Possible mentors: Janez 
    109121 
    110 === Widgets for statistics === 
    111  
    112 Orange is rather weak in basic statistics, from various statistical tests to linear regression, dimensionality reduction and so forth. It would be great to have some widgets for this. The code for computation of all this is already available in other libraries which we can call from Python, so what we actually need is a good integration within the canvas.  
    113  
    114 Level from 1 (beginner) to 5 (professional): 3.5 
    115  
    116 Possible mentors: Janez 
    117  
    118122=== biox library (NGS, next-generation sequencing) === 
    119123 
     
    138142 
    139143Possible mentors: Blaž 
    140  
    141 == Ideas selected for GSoC 2011 == 
    142  
    143 === Replacing PyQwt with pure PyQt visualizations  === 
    144  
    145 Many visualizations in Orange widgets currently use PyQwt. It seems a good idea to migrate to pure Qt implementation, for several reasons: 
    146  * PyQwt development seems stalled. The current version on the site is for Python 2.6 and Qt 4.5, and although Python 3.X is said to be supported, we have been unable to build it. While Orange itself is basically ported to Python 3.X, PyQwt is a show stopper. We have also had this same problem with previous Python version upgrades. 
    147  * Qwt is not very aesthetically pleasing: it is a very good tool for plotting the data, but not for publishing pictures in glossy journals and web sites. ;) The new Qt graphics classes would do a much better job. 
    148  * We are not using much of Qwt, we need only some basic stuff, which should be easy to reimplement. 
    149 Given all this, it would make little sense for the Orange team to take over the maintenance of Qwt-to-Python interface. 
    150  
    151 Fortunately, most widgets do not interact with PyQwt directly but instead use a middle layer, OWGraph, which is a part of Orange. The "toughest" part will be to reimplement Qwt's classes for drawing curves, which need to be in C++ (with a sip interface to Python). 
    152  
    153 Useful skills: Python, C++, [http://www.riverbankcomputing.co.uk/software/sip/intro sip], [http://doc.qt.nokia.com/latest/ Qt].  
    154  
    155 Level from 1 (beginner) to 5 (professional): 4.5 
    156  
    157 Possible mentors: Miha, Janez