Lecture “Data Warehousing and Data Mining Techniques”

Information
Classification: 
Master Informatik, Master Wirtschaftsinformatik
Credits: 
4 or 5 (depending on course of study and exam regulations)
Exam: 
Oral
Regular Dates: 
Tuesdays, 15:00-17:15, IZ 160
The first lecture day will be Tuesday the 25th of October
Contents
Contents: 

In this course, we examine the aspects regarding building maintaining and operating data warehouses as well as give an insight to the main knowledge discovery techniques. The course deals with basic issues like storage of the data, execution of the analytical queries and data mining procedures.

Course will be tought completely in English.

The general structure of the course is:

Typical dw use case scenarios
Basic architecture of dw
Data modelling on a conceptual, logical and physical level
Multidimensional E/R modelling
Cubes, dimensions, measures
Query processing, OLAP queries (OLAP vs OLTP), roll-up, drill down, slice, dice, pivot
MOLAP, ROLAP, HOLAP
SQL99 OLAP operators, MDX
Snowflake, star and starflake schemas for relational storage
Multimedia physical storage (linearization)
DW Indexing as search optimization mean: R-Trees, UB-Trees, Bitmap indexes
Other optimization procedures: data partitioning, star join optimization, materialized views
ETL
Association rule mining, sequence patterns, time series
Classification: Decision trees, naive Bayes classifications, SVM
Cluster analysis: K-means, hierarchical clustering, aglomerative clustering, outlier analysis

Materials

Note

Achieving at least 50% of the total homework points is advisable.

Please drop your solutions into the silver homework box located at the second floor of Informatikzentrum at the Information Systems Institute (in front of the elevators) until Tuesday, before the next lecture (the date is mentioned on each exercise sheet). You may answer in either German or English. You are encour-aged to work in teams of 2 students (not more than 2), and send your solution as a team. Please mention in your email the name of both students together with the corresponding inmatriculation numbers (“Matrikelnummer”).

ITIS students outside Braunschweig can send their solutions per mail at silviuatifis [dot] cs [dot] tu-bs [dot] de

 

Download

 

Date Topic Slides Exercises Video
25.10.11 Introduction Slides - Print Slides Exercise 1 Video1
01.11.11

- Architecture

- Data Modeling (Conceptual Model)

Slides - Print Slides  None Video2
08.11.11  Data Modeling (Logical & Physical Models) Slides - Print Slides Exercise 2 Video 3
15.11.11  Indexes Slides - Print Slides Exercise 3 Video 4
22.11.11 Optimization Slides - Print Slides None Video 5
29.11.11 OLAP Operations & Queries Slides - Print Slides None Video 6
06.12.11 Build the DW, ETL Slides - Print Slides Exercise 4 Video 7
13.12.11 Real-Time DW Slides - Print Slides None Video 8
20.12.11 Data Mining Overview, Association Rule Mining Slides - Print Slides None Video 9
10.01.12 Sequence Pattern Mining & Time Series Slides - Print Slides Exercise 5 Video 10
17.01.12 Classification Slides - Print Slides None Video 11
24.01.12 Clustering Slides - Print Slides None Video 12
31.01.12 AdaBoost Slides - Print Slides None Video 13
07.02.12 DWs in Practice - OLD Video  -  - Video 14

 

Data Mining Literature:

The following paper presents an overview of the main techniques we have discussed in the data mining part of the lecture, and should be used as a starting point for each algorithm. Please read the citations of each technique for more speciffic information:

X. Wu, V. Kumar, J. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. McLachlan, A. Ng, B. Liu, P. Yu, Z. Zhou, M. Steinbach, D. Hand, D. Steinberg. Top 10 Algorithms in Data Mining.

Journal Knowledge and Information Systems archive, Volume 14 Issue 1. [http]

.

 

 

AttachmentDateSize
File dwhc6.flv30/11/11 5:06 pm140.85 MB
File C6.pdf30/11/11 5:07 pm3.35 MB
File Print_C6.pdf30/11/11 5:07 pm2.34 MB
File C7.pdf08/12/11 10:43 am2.54 MB
File Print_C7.pdf08/12/11 10:43 am1.44 MB
File dwhc7.flv08/12/11 10:44 am159.01 MB
File DW_04_Uebung.pdf08/12/11 12:55 pm309.31 KB
File C8.pdf14/12/11 1:07 pm2.33 MB
File Print_C8.pdf14/12/11 1:07 pm1.24 MB
File dwhc8.flv14/12/11 1:08 pm124.12 MB
File C9.pdf21/12/11 10:26 am1.98 MB
File Print_C9.pdf21/12/11 10:26 am1.31 MB
File dwhc10.flv11/01/12 5:30 pm158.35 MB
File C10.pdf11/01/12 5:32 pm2.19 MB
File Print_C10.pdf11/01/12 5:32 pm1.45 MB
File DW_05_Uebung.pdf11/01/12 5:33 pm494.35 KB
File C11.pdf18/01/12 5:53 pm1.74 MB
File Print_C11.pdf18/01/12 5:53 pm1.22 MB
File dwhc11.flv18/01/12 5:54 pm153.52 MB
File dwhc12.flv25/01/12 4:21 pm127.81 MB
File C12.pdf25/01/12 4:22 pm1.91 MB
File C12_Print.pdf25/01/12 4:23 pm1.1 MB
File C13.pdf03/02/12 4:04 pm2 MB
File C13_Print.pdf03/02/12 4:04 pm1.03 MB
File dwhc13.flv03/02/12 4:05 pm106.17 MB