CMP4336 Introduction to Data MiningBahçeşehir UniversityDegree Programs MATHEMATICSGeneral Information For StudentsDiploma SupplementErasmus Policy StatementNational QualificationsBologna Commission
MATHEMATICS
Bachelor TR-NQF-HE: Level 6 QF-EHEA: First Cycle EQF-LLL: Level 6

Course Introduction and Application Information

Course Code Course Name Semester Theoretical Practical Credit ECTS
CMP4336 Introduction to Data Mining Fall 3 0 3 6
This catalog is for information purposes. Course status is determined by the relevant department at the beginning of semester.

Basic information

Language of instruction: English
Type of course: Non-Departmental Elective
Course Level: Bachelor’s Degree (First Cycle)
Mode of Delivery: Face to face
Course Coordinator : Dr. Öğr. Üyesi CEMAL OKAN ŞAKAR
Recommended Optional Program Components: None
Course Objectives: In this course, data mining algorithms and computational paradigms that are used to extract useful knowledge, extract patterns and regularities in databases, and perform prediction and forecasting will be discussed. Supervised and unsupervised learning approaches will be covered with a focus on pattern discovery and cluster analysis.

Learning Outcomes

The students who have succeeded in this course;
1. Be able to understand Data Gathering and Pre-processing
2. Become familiar with Frequent Item Set Detection
3. Be able to understand Association Rule Mining
4. Be able to understand Classifiers, and their benefits
5. Be able to use Clustering
6. Be able to understand Clustering Evaluation

Course Content

1.Introduction to Basic Concepts
2.Data Exploration
3.Classification
4.Clustering
5.Dimensionality Reduction
6.Frequent Item Set Mining
7.Association Rule Mining

Weekly Detailed Course Contents

Week Subject Related Preparation
1) Introduction to Basic Concepts None
2) Data Exploration: Summary Statistics, Visualization, OLAP and Multi-dimensional Data Analysis None
3) Data Pre-Processing, Transformation, Normalization, Standardization None
4) Classification and Regression: Model Selection and Generalization, Decision Trees, Performance Evaluation None
5) Classification: Bayesian Decision Theory, Parametric Classification, Naive Bayes Classifier, Instance-Based Classifiers
6) Classification None
6) Classification and Regression: Artificial Neural Networks, Support Vector Machines
7) Midterm I Review of all topics covered so far
8) Clustering: Partitioning and Hierarchical Algorithms None
9) Clustering: Density-Based Algorithms
10) Cluster Evaluation, Comparing Clusterings None
11) Midterm II none
12) Dimensionality Reduction none
13) Frequent Item Set Mining none
14) Association Rule Mining none

Sources

Course Notes / Textbooks: Introduction to Data Mining by Pang-Ning Tan, Michael Steinbach and Vipin Kumar
References: Data Mining: Concepts and Techniques, by Jiawei Han, Micheline Kamber and Jian Pei

Evaluation System

Semester Requirements Number of Activities Level of Contribution
Homework Assignments 2 % 20
Project 1 % 20
Midterms 2 % 20
Final 1 % 40
Total % 100
PERCENTAGE OF SEMESTER WORK % 40
PERCENTAGE OF FINAL WORK % 60
Total % 100

ECTS / Workload Table

Activities Number of Activities Workload
Course Hours 14 42
Study Hours Out of Class 16 32
Project 5 15
Homework Assignments 6 12
Midterms 8 28
Final 6 26
Total Workload 155

Contribution of Learning Outcomes to Programme Outcomes

No Effect 1 Lowest 2 Low 3 Average 4 High 5 Highest
           
Program Outcomes Level of Contribution
1) To have a grasp of basic mathematics, applied mathematics and theories and applications in Mathematics
2) To be able to understand and assess mathematical proofs and construct appropriate proofs of their own and also define and analyze problems and to find solutions based on scientific methods,
3) To be able to apply mathematics in real life with interdisciplinary approach and to discover their potentials,
4) To be able to acquire necessary information and to make modeling in any field that mathematics is used and to improve herself/himself, 4
5) To be able to tell theoretical and technical information easily to both experts in detail and non-experts in basic and comprehensible way,
6) To be familiar with computer programs used in the fields of mathematics and to be able to use at least one of them effectively at the European Computer Driving Licence Advanced Level,
7) To be able to behave in accordance with social, scientific and ethical values in each step of the projects involved and to be able to introduce and apply projects in terms of civic engagement,
8) To be able to evaluate all processes effectively and to have enough awareness about quality management by being conscious and having intellectual background in the universal sense, 4
9) By having a way of abstract thinking, to be able to connect concrete events and to transfer solutions, to be able to design experiments, collect data, and analyze results by scientific methods and to interfere,
10) To be able to continue lifelong learning by renewing the knowledge, the abilities and the competencies which have been developed during the program, and being conscious about lifelong learning,
11) To be able to adapt and transfer the knowledge gained in the areas of mathematics ; such as algebra, analysis, number theory, mathematical logic, geometry and topology to the level of secondary school,
12) To be able to conduct a research either as an individual or as a team member, and to be effective in each related step of the project, to take role in the decision process, to plan and manage the project by using time effectively.