CS 435 Introduction to Data Mining     Fall 2008


This data mining course introduces concepts, algorithms, techniques, and applications of data mining. Topics include background of data mining, data preprocessing, classification, clustering, association-rules mining. This course is designed for CS senior undergraduate students.

Class Schedule: T TH 6:00 PM - 7:25 PM

Classroom:  Science II, 260

Instructor: Dr. Lei Yu  

TA: Yi (Andrea) Xu

Telephone:  (607) 777-6250


Email: lyu AT cs DOT binghamton DOT edu  

Email: yxu4 AT binghamton DOT edu

Office Location: G16, Engineering Building

Office Location: N1, Engineering Building

Office Hours: T TH 12:30PM - 1:30PM or by appointment

Office Hours: M W 10:00AM-11:00AM


  • CS 333 (Algorithms) or equivalent
  • MATH 327 (Probability with Statistical Methods) or equivalent


  • Background of knowledge discovery and data mining
  • Data preprocessing  (e.g., data cleaning, transformation, dimensionality reduction, instance selection)
  • Classification (e.g., decision trees, rule-based classifiers, Bayesian classifiers, instance-based classifiers)
  • Clustering (e.g., K-means, hierarchical clustering, density-based clustering)
  • Mining association rules (e.g., Apriori)


  • Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, April 2005.


There will be 4 written assignments.


There will be several quizzes and two exams in class.


Final grades will be based on class participation (10%), quiz (10%), homework (4 assignments, 20%), Exam I (30%), Exam II (30%).

Academic Integrity:

Discussion of general concepts and questions concerning the homework assignments among students is encouraged. However, each of you is expected to work on the homework solutions on your own. Sharing of any part of solutions is prohibited. If you are unclear about the policy, please consult with the instructor before you act. Suspected cases of academic misconduct will be pursued fully in accordance to the Student Academic Honesty Code of Binghamton University.

Late Policy:

Each assignment is due at the beginning of class on the due date. Any assignment received within the next 24 hours will be penalized by 20% of the full credit; any assignment received within the time between 24 hours and 48 hours pass the deadline is penalized by 50% of the full credit; No assignment will be accepted after 48 hours pass the deadline. Rare exceptions of this policy may be made at the discretion of the instructor under demonstrably circumstances.

Last updated on 09/02/2008