
CS178: Machine Learning and Data Mining
CLOSED: 2014 OFFERING
Assignments and Exams:
Lecture: Mon/Wed/Fri 11am-12pm, ICS 174
Discussion: Monday 4-5pm, Eng Tower (ET) 204
Instructor: Prof. Alex Ihler (ihler@ics.uci.edu), Office Bren Hall 4066
Teaching Assistant: Moshe Lichman (mlichman@uci.edu)
Course Notes in development
Also, a possibly helpful LaTeX template I use for homeworks and solutions. (Or, this link has another nice way to include Matlab code in LaTeX.)

Introduction to machine learning and data mining
How can a machine learn from experience, to become better at a given task? How can we automatically extract knowledge or make sense of massive quantities of data? These are the fundamental questions of machine learning. Machine learning and data mining algorithms use techniques from statistics, optimization, and computer science to create automated systems which can sift through large volumes of data at high speed to make predictions or decisions without human intervention.
Machine learning as a field is now incredibly pervasive, with applications from the web (search, advertisements, and suggestions) to national security, from analyzing biochemical interactions to traffic and emissions to astrophysics. Perhaps most famously, the $1M Netflix prize stirred up interest in learning algorithms in professionals, students, and hobbyists alike.
This class will familiarize you with a broad cross-section of models and algorithms for machine learning, and prepare you for research or industry application of machine learning techniques.

Background
We will assume basic familiarity with the concepts of probability and linear algebra. Some programming will be required; we will primarily use Matlab, but no prior experience with Matlab will be assumed. (Most or all code should be Octave-compatible, so you may use Octave if you prefer.)

Textbook and Reading
There is no required textbook for the class. However, useful books on the subject for supplementary reading include Murphy's "Machine Learning: A Probabilistic Perspective", Duda, Hart & Stork, "Pattern Classification", and Hastie, Tibshirani, and Friedman, "The Elements of Statistical Learning".

Piazza
I use Piazza to manage student discussions and questions. Our class link is: http://piazza.com/uci/winter2014/cs178.

Matlab
Often we will write code for the course using the Matlab environment. Matlab is accessible through NACS computers at several campus locations (e.g., MSTB-A, MSTB-B, and the ICS lab), and if you want a copy for yourself, student licenses are fairly inexpensive ($100). If you use Octave, please be careful to use Matlab-compatible syntax (not Octave extensions), since otherwise I or the TA may be unable to interpret your code. If you are not familiar with Matlab, there are a number of tutorials on the web:
You may want to start with one of the very short tutorials, then use the longer ones as a reference during the rest of the term.

Interesting stuff for students
Syllabus (subject to change)
Previous years' lectures (2012b, 2012a, 2011, 2010) are also available.

Course Project
