wiki:GSoC/Application

Version 10 (modified by mitar, 4 years ago) (diff)

Orange Application

Orange application for Google Summer of Code as an organization.

Organization Name

Orange

Description

Orange is an open source component-based data mining and machine learning software suite, featuring friendly yet powerful and flexible visual programming front-end for explorative data analysis and visualization, and Python bindings and libraries for scripting. It includes comprehensive set of components for data preprocessing, feature scoring and filtering, modelling, model evaluation, and exploration techniques. It is maintained and developed at the Bioinformatics Laboratory of the Faculty of Computer and Information Science, University of Ljubljana, Slovenia.

We are collaborating with many educational and research institutions around the world: Baylor College of Medicine, Houston, MRC Laboratory of Molecular Biology, Cambridge, Universita di Pavia, Pavia, Graduate School of Medicine, Kyoto University, to name a few, so Orange is based on cutting edge technology combined with experience based on practical problems we are researching and solving. At the same time our graphical interface and easy scripting with Python makes Orange invaluable tool for educating data mining and machine learning techniques as both basic and complex concepts can be easily shown and visualized.

Home page

 http://orange.biolab.si/

Main Organization License

GNU General Public License (GPL)

Why is your organization applying to participate in GSoC 2011? What do you hope to gain by participating?

While we are already collaborating globally with other institutions and while Orange is open source from the very beginning we have never really managed to build a real open source community around the project. This year we have decided that we should work more actively towards this and we see GSoC as a great motivation and initiative to open our development and start building global community of independent contributors around Orange. It is also an opportunity for us to gain experience how to organize necessary environment for community participation and also reach to possible interested participants. And of course, to slowly show ourselves on the map of open source projects and communities.

On the other hand, we have much experience with working with students as we are also an educational institution at university level so we believe we can give much to possible students, ranging from knowledge and experience in data mining and machine learning to programming, algorithms, software development in general. We also hope that maybe we can find some students which would be interested in this fields also after the GSoC, possibly also into research.

And of course, we have some smaller but still interesting ideas we have not had the time to get to, but would make Orange even more powerful and easier to use at the same time. We also hope to integrate some social dimensions into Orange itself, inviting users to share and participate in knowledge exchange and in this way improve collaboration in general in the field of data mining and machine learning.

If accepted, would this be your first year participating in GSoC?

Yes.

Did your organization participate in past GSoCs? If so, please summarize your involvement and the successes and challenges of your participation.

We did participate in GSoC in 2006 under Python Software Foundation but our project sadly did not get any student at the end. This year we have decided to try again as Google has expressed interest in new projects.

If your organization participated in past GSoCs, please let us know the ratio of students passing to students allocated, e.g. 2006: 3/6 for 3 out of 6 students passed in 2006.

N/A.

What is the URL for your ideas page?

http://orange.biolab.si/trac/wiki/GSoC/Ideas

What is the main development mailing list for your organization? This question will be shown to students who would like to get more information about applying to your organization for GSoC 2011. If your organization uses more than one list, please make sure to include a description of the list so students know which to use.

We use forum, wiki and ticketing system for our development. Forum (for general question and discussions) is accessible at:

 http://orange.biolab.si/forum/

and our wiki and ticketing system (we use tickets for discussing development ideas, to keep everything in one place) at:

http://orange.biolab.si/trac/

Until now most of development was mostly internal in our laboratory so not all information about the development is available in those tools, but we are trying to improve as we are opening Orange to a global community.

What is the main IRC channel for your organization?

We have a public Skype chat for development where we hang around:

 http://orange.biolab.si/skype

Does your organization have an application template you would like to see students use? If so, please provide it now. Please note that it is a very good idea to ask students to provide you with their contact information as part of your template. Their contact details will not be shared with you automatically via the GSoC 2011 site.

  • Name and surname
  • E-mail address
  • Blog/homepage (if you have one)
  • Skype handle (preferably) or something else (IRC nick and network, MSN, Jabber)
  • Name of school/university, country, year, your average grade
  • Your general interests (in study and elsewhere)
  • Age
  • Which languages do you actively speak (and write)?
  • Project/proposal title
  • Project/proposal description
  • Project/proposal schedule. How long will the project take? When can you begin work?
  • Availability. How many hours per week can you spend working on this? What other obligations do you have this summer?
  • Deliverables. It is very important to list quantifiable results here, for example:
    • Improve X modules in ways Y and Z.
    • Write 3 new documentation pages for the new interfaces.
    • Improve test coverage by writing X more unit/regression tests.
    • Improve performance in FOO by X%.
  • How do you plan to continue with your project/proposal and within the Orange community after GSoC? Are you interested in doing some research in this field?
  • Benefits to the Free Software Community, who would gain from your project?
  • Biography and Free Software experiences. Who are you? What makes you the best person to work on this project/proposal? What free and/or open source projects have you participated in? Please describe your contributions with references.

What criteria did you use to select the individuals who will act as mentors for your organization? Please be as specific as possible.

For mentors we have envisioned professors and teaching and research assistants who are already working at our laboratory on Orange itself and who have much experience with guiding students through their work and also teaching them diverse knowledge along the way. This way, we will have experienced mentors knowing both the project itself in-deep and also knowing how to tutor students to help them achieve the most.

If need be and students will propose interesting contribution combined also with research we will try to find them a mentor who would be also an expert from the field the student is interested in.

What is your plan for dealing with disappearing students?

In the case of a student disappearance, the mentor will contact him/her several times by e-mail, chat and finally by phone, if possible. If the mentor does not get any response, the admin of the project will try to contact the student. If the student does not respond after a maximum of two weeks we will contact Google and cancel the student project.

But to prevent that even from happening, we will keep contact with student all the time, requiring from him regular updates and commits so that his/her activities will be visible and transparent all the time and possible lack of them also noticed fast.

What is your plan for dealing with disappearing mentors?

Our mentors will be from lines of the staff at the laboratory so that should never happen. Nevertheless, somebody else from the laboratory will assume the role in this unlikely event.

What steps will you take to encourage students to interact with your project's community before, during and after the program?

As we have quite some global partner organizations with which we work closely on many projects, we will invite them to help us spread the word about us participating in GSoC and to invite their students and students around their organizations to apply. We will also work locally to attract local students. And we will also try to promote the program and our participation in it through global data mining and machine learning web pages and portals.

After we attract them, we will encourage them to find interesting projects to work on. As we have a lot of interesting practical problems we are working on all the time as well, they might get interested and start working on those – even after GSoC and possibly in form of research projects.

We hope we will become an active and wide community where many participants from all around the globe will like to contribute and exchange knowledge and ideas and learn this way.

If you are a small or new organization applying to GSoC, please list a larger, established GSoC organization or a Googler that can vouch for you here.

Python Software Foundation does probably not remember us anymore from 2006 when we have participated together in GSoC. You could probably check out with our alumni Ziga Mahkovec, who currently works at Google, that we are a valid faculty/university and that we do great stuff at our laboratory ( http://www.biolab.si/) with Orange being probably the largest open source project in Slovenia.

You can also check our statistics at Ohloh:

 https://www.ohloh.net/p/orange

If you are a large organization who is vouching for a small organization applying to GSoC for their first time this year, please list their name and why you think they'd be good candidates for GSoC here:

N/A.

Anything else you'd like to tell us?

We really like the idea of encouraging students to work on open source and promote open source values. Also, we hope very much to be able to open our project to a wider and global community with Google's help and would like to thank you in advance for such opportunity, if you decide favorably and accept us into the program.