CS534 Final Project


Due: Friday June 6th, 11:59PM (no late submission)

The goal of this class project is to give you an opportunity to work with a "realworld" machine learning problem and explore various aspects that are involved in machine learning application and/or research. This project is intended to be more open-ended than your previous experimental assignments and give you more hands-on exprience with real-world machine learning problems and learning algorithms.  You  are encouraged to combine machine learning with problems in your own research area. Your class project must be about new things you have done this term, you can't use results you have developed prior to this class.

What you need to do

  1. Find a partner to work on this project. (Two person teams are encouraged, though you may work alone. No three person teams please.)

  2. Choose your application domain and learning problem within it.  Turn in a project proposal (1 page maximum) by May 12th. Each team is required to meet with the instructor at least once to discuss the project before turning in the proposal. You can come to the office hour or email me to schedule a different time for meeting.

  3. As a guideline, you will need to go through the following questions and make your decisions on each one.

  1. Perform the work, run the experiments!

  2. Turn in a final report (no longer than 8 pages including references, figures and tables). Each team should turn in a single report and please email me your report before the deadline. Your report should precisely describe the following:

         The clarity and content of the report will have a primary impact on your grade. Again, the report should not exceed 8 pages (using fewer pages is ok), 11 point font,  including figures and tables.

Grading and determining when you have done enough

A project that does a solid job building the base learning system and carefully evaluating and describing it might get 75–80% credit. A project that includes additional pursuit of interesting extensions/alternatives or investigations into important issues (such as overfitting, noise tolerance, feature selection etc.), or achieves very impressive results might get 90–100% credit. Weight will also be given to the interestingness and novelty of the learning task considered.

Be creative! Exploring your own interesting ideas and comparing them with the baseline approaches will receive credit whether they beat the baseline or not.



Some Possible Learning Problems