wiki:GSoC/Application

Orange Application

Orange application for Google Summer of Code 2013 as an organization.

Organization id

orange

Organization Name

Orange – Data Mining Fruitful & Fun

Description

Orange is an open source component-based data mining and machine learning software suite, featuring friendly yet powerful and flexible visual programming front-end for explorative data analysis and visualization, and Python bindings and libraries for scripting. It includes comprehensive set of components for data preprocessing, feature scoring and filtering, modeling, model evaluation, and exploration techniques. It is maintained and developed at the Bioinformatics Laboratory of the Faculty of Computer and Information Science, University of Ljubljana, Slovenia.

We are collaborating with many educational and research institutions around the world: Baylor College of Medicine, Houston, MRC Laboratory of Molecular Biology, Cambridge, Universita di Pavia, Pavia, Graduate School of Medicine, Kyoto University, to name a few, so Orange is based on cutting edge technology combined with experience based on practical problems we are researching and solving. At the same time our graphical interface and easy scripting with Python makes Orange invaluable tool for educating data mining and machine learning techniques as both basic and complex concepts can be easily shown and visualized.

Organization home page url

 http://orange.biolab.si/

Main Organization License

GNU General Public License (GPL)

Veteran/New

Veteran

If you chose "veteran" in the dropdown above, please summarize your involvement and the successes and challenges of your participation. Please also list your pass/fail rate for each year.

We did participate in GSoC in 2006 under Python Software Foundation but our project sadly did not get any student at the end. In 2011 we successfully applied and participated as a new (smaller) project and had a great experience with the program, learned a lot, and also participated in Google Summer of Code mentor summit. In 2011, we got allocated 2 slots, asked for increase to 3 because we really had very good applications and got 3 slots at the end – which proved great because all students were really good and active and successfully contributed to the project. So we had 3 of 3 really successful students. In 2012 we got 5 students, one sadly dropped immediately because of visa issues, but remaining 4 successfully completed their projects, enriching Orange with long wanted features and goodies, but one student failed to submit his evaluation.

If you chose "new" in the dropdown above, have you applied in the past? If so, for what year(s)?

N/A.

Why is your organization applying to participate in Google Summer of Code 2013? What do you hope to gain by participating?

While we are already collaborating globally with other institutions and while Orange is open source from the very beginning we have never really managed to build a real open source community around the project. Last year we have decided that we should work more actively towards this and this year we are continuing working on that. We see GSoC as a great motivation and initiative to open our development and start building global community of independent contributors around Orange. It is also an opportunity for us to gain experience how to organize necessary environment for community participation and also reach to possible interested participants. And of course, to slowly show ourselves on the map of open source projects and communities.

On the other hand, we have much experience with working with students as we are also an educational institution at university level so we believe we can give much to possible students, ranging from knowledge and experience in data mining and machine learning to programming, algorithms, software development in general. We also hope that maybe we can find some students which would be interested in this fields also after the GSoC, possibly also into research.

And of course, we have some smaller but still interesting ideas we have not had the time to get to, but would make Orange even more powerful and easier to use at the same time. We also hope to integrate some social dimensions into Orange itself, inviting users to share and participate in knowledge exchange and in this way improve collaboration in general in the field of data mining and machine learning.

We really like the idea of encouraging students to work on open source and promote open source values. Also, we hope very much to be able to open our project to a wider and global community with Google's help and would like to thank you in advance for such opportunity, if you decide favorably and accept us into the program.

What is the URL for your Ideas list?

http://orange.biolab.si/trac/wiki/GSoC/Ideas

What is the main development mailing list for your organization?

We use forum, wiki and ticketing system for our development:  http://orange.biolab.si/forum/, http://orange.biolab.si/trac/

What is the main IRC channel for your organization?

We have a public Skype chat for development where we hang around: http://orange.biolab.si/trac/wiki/Skype

What criteria did you use to select your mentors for this year's program? Please be as specific as possible.

For mentors we have envisioned professors and teaching and research assistants who are already working at our laboratory on Orange itself and who have much experience with guiding students through their work and also teaching them diverse knowledge along the way. This way, we will have experienced mentors knowing both the project itself in-deep and also knowing how to tutor students to help them achieve the most.

If need be and students will propose interesting contribution combined also with research we will try to find them a mentor who would be also an expert from the field the student is interested in.

What is your plan for dealing with disappearing students?

In the case of a student disappearance, the mentor will contact him/her several times by e-mail, chat and finally by phone, if possible. If the mentor does not get any response, the admin of the project will try to contact the student. If the student does not respond after a maximum of two weeks we will contact Google and cancel the student project.

But to prevent that from even happening, we will keep contact with the student at all times, requiring regular updates and commits from him/her so his/her activities will be visible and transparent all the time and a possible lack of them noticed fast.

What is your plan for dealing with disappearing mentors?

Our mentors will be from lines of the staff at the laboratory so that should never happen. Nevertheless, somebody else from the laboratory will assume the role in this unlikely event.

What steps will you take to encourage students to interact with your project's community before and during the program?

As we have quite some global partner organizations with which we work closely on many projects, we will invite them to help us spread the word about us participating in GSoC and to invite their students and students around their organizations to apply. We will also work locally to attract local students. And we will also try to promote the program and our participation in it through global data mining and machine learning web pages and portals.

What will you do to encourage that your accepted students stick with the project after Google Summer of Code concludes?

After we attract them, we will encourage them to find interesting projects to work on. As we have a lot of interesting practical problems we are working on all the time as well, they might get interested and start working on those – even after GSoC and possibly in form of research projects.

We hope we will become an active and wide community where many participants from all around the globe will like to contribute and exchange knowledge and ideas and learn this way.

Are you a new organization who has a Googler or other organization to vouch for you? If so, please list their name(s) here.

N/A.

Are you an established or larger organization who would like to vouch for a new organization applying this year? If so, please list their name(s) here.

N/A.