CPSC445/CPSC545/MBB334/MBB545/CBB545

Introduction to Data Mining

Outline of topics

Class: 2:30-3:45 pm on Tuesday and Thursday, in 200 AKW
Please visit the course wiki for a detailed list of reading materials.

Jan
16
MS
Introduction to Data Mining
18
MG
Overview of Data Mining in Bioinformatics
Slides: Introduction to Bioinformatics
23
MS
Data Processing and Intro to R
Reading: Fayyad et al. 1996
25
MS
Supervised Learning and Rattle
30
MS
Introduction to Classification and Decision Trees
Reading: Chapter 4 of Tan et al. 2006
Feb
1
MS
Decision Trees
Slides: Decision tree
Slides: Ensemble methods
6
MS
Multilinear and Logistic Regression I
Reading: Chapter 6 of Han et al. 2005 (for personal use in this course only)
Reading: Chapter 5 of Tan et al. 2005 (for personal use in this course only)
Slides: Multilinear and logistic regression
8
MS
Multilinear and Logistic Regression II
13
MS
Support Vector Machines (SVM)
Slides: SVM
Slides: SVM review
15
MS
Neural Networks, Naive Bayes
Slides: Perceptron Models
20
MG
Molecular Networks as Application of Mining
Slides: Introduction to Bioinformatics
Slides: Function Prediction & Networks
22
MS
Unsupervised Learning and Clustering
Slides: Clustering
Slides: KNN
Slides: Clustering review
Reading: Modern trends in data mining
27
MG
Predicting Networks Through Bayesian Inf #1
Slides: Predicting Networks through Bayesian Integration #1 - Theory
Mar
1
MG
Predicting Networks Through Bayesian Inf #2
Slides: Predicting Networks through Bayesian Integration #2 - Application
6
MG
Applications of Spectral methods (PCA/SVD) #1
Slides: Spectral Methods (PCA,SVD) #1 - Theory
8
MG
Applications of Spectral methods (PCA/SVD) #2
Slides: Spectral Methods (PCA,SVD) #2 - Application
27
Max Kuhn
Determining Method of Action in Drug Discovery Using Affymetrix Microarray Data (Dr. Max Kuhn, Pfizer Research Lab)
Slides: Determining Method of Action in Drug Discovery Using Affymetrix Microarray Data
Slides: About the caret Package
29
Michael Krauthammer
An Introduction to Text Mining with an Application to the Life Sciences. (Prof. Michael Krauthammer, Yale Medical School)
Slides: Text Mining in Biomedicine
Apr
3
JDU
A(n) (extremely) brief/crude introduction to minimum description length (MDL) principle
Slides: Intro. to MDL principle
5
EL
Kernel PCA
10
Students
Project Reports
12
Students
Project Reports
17
Students
Project Reports
19
Students
Project Reports
24
Students
Project Reports
26
Students
Project Reports