Changes between Version 47 and Version 48 of GSoC/Ideas


Ignore:
Timestamp:
03/26/11 19:55:43 (3 years ago)
Author:
blaz
Comment:

Legend:

Unmodified
Added
Removed
Modified
  • GSoC/Ideas

    v47 v48  
    5757== Matrix factorization techniques for data mining == 
    5858 
    59 Matrix factorization is a fundamental building block for many of current data mining approaches. Factorization techniques, like non-negative and probabilistic sparse matrix factorizations are today widely used in various applications of data mining. The aim of this project is to develop a scripting library for Orange that include various matrix factorization techniques, document the code, provide examples that demonstrate various types of applications, include the examples in the documentation. The entire development would be for scripting, that is, the project would not involve any widget programming. We would, though, like to have the student sketch how should the widgets that use this library look like, which methods developed in the scripting library should they access, and which (if any) are useful visualizations to be implemented. 
     59Matrix factorization is a fundamental building block for many of current data mining approaches. Factorization techniques, like non-negative and probabilistic sparse matrix factorizations are today widely used in various applications of data mining. The aim of this project is to develop a scripting library for Orange that includes various matrix factorization techniques, and in addition provides documentation of the code, working examples that demonstrate various types of applications. Selected examples are to be included in the documentation. The entire development is therefore oriented for creation of the scripting library, that is, the project would not involve any widget programming. We would, though, like to have the student sketch how should the widgets that use this library look like, which methods developed in the scripting library should they access, and which (if any) are useful visualizations to be implemented. 
    6060 
    6161Useful skills: Python. Matrix operations (possibly in numpy). Good background in math, linear algebra and optimization.