Lecture “Data Warehousing and Data Mining Techniques”

Information
Classification: 
Master Informatik, Master Wirtschaftsinformatik
Credits: 
4 or 5 (depending on course of study and exam regulations)
Regular Dates: 
Every Thursday, from 09:45 till 12:15, starting the 17th of April
Lecture takes place at Informatikzentrum, Mühlenpfordtstraße 23, room 161
Available exam dates: 28.8., 29.8., 8.9., 9.9.
Contents
Contents: 

In this course, we examine the aspects of building, maintaining, and operating data warehouses and give an insight into the main knowledge discovery techniques. The course deals with basic issues like the storage of data, execution of analytical queries and data mining procedures.

This course will be completely tought in English.

The general structure of the course is as follows:

  • Typical DW use case scenarios
  • Basic architecture of DW
  • Data modelling on conceptual, logical and physical levels
  • Multidimensional E/R modelling
  • Cubes, dimensions, measures
  • Query processing, OLAP queries (OLAP vs OLTP), roll-up, drill down, slice, dice, pivot
  • MOLAP, ROLAP, HOLAP
  • SQL99 OLAP operators, MDX
  • Snowflake, star and starflake schemas for relational storage
  • Multimedia physical storage (linearization)
  • DW Indexing as search optimization mean: R-Trees, UB-Trees, Bitmap indexes
  • Other optimization procedures: data partitioning, star join optimization, materialized views
  • ETL
  • Association rule mining, sequence patterns, time series
  • Classification: Decision trees, naive Bayes classifications, SVM
  • Cluster analysis: K-means, hierarchical clustering, agglomerative clustering, outlier analysis

Summary: 

 

Materials

Download

Date Topic Slides Exercises Video
17.04 Introduction Slides - Print Slides   Video
24.04 Architecture  Slides - Print Slides    Video
01.05 NO LECTURE      
08.05 Modeling Slides - Print Slides    Video
15.05 Indexes Slides - Print Slides    Video 
22.05 Optimization Slides - Print Slides   Video 
29.05 NO LECTURE      
05.06 OLAP Operations & Queries Slides - Print Slides   Video 
12.06 Build the DW, ETL Slides - Print Slides    Video 
GRefine 
19.06 Real-Time DW Slides - Print Slides    Video 
26.06 DM Overview & Association Rule Mining  Slides - Print Slides    Video (old version) 
03.07 Sequence patterns & Time series  Slides - Print Slides   Video 
10.07 Classification Slides - Print Slides   Video
17.07 Clustering Slides - Print Slides   Video 
24.07 Meta-Algorithms for Classification Slides - Print Slides   Video 

 

 

AttachmentDateSize
File dw_14_05.mp417/05/14 10:48 am267.87 MB
File DW-05.pdf17/05/14 11:10 am1.97 MB
File dw5.mp423/05/14 6:57 pm113.2 MB
File C4-print.pdf02/06/14 5:40 pm936.78 KB
File C5.pdf02/06/14 5:43 pm2.4 MB
File C5-print.pdf02/06/14 5:43 pm1.19 MB
File C6.pdf02/06/14 5:43 pm3.21 MB
File C6-print.pdf02/06/14 5:43 pm1.58 MB
File dw06.mp405/06/14 7:39 pm218.16 MB
File dw14_07.mp413/06/14 3:33 pm251.93 MB
File Google Refine 2.0 - Data Transformation (2 of 3) (video version 2).mp413/06/14 3:36 pm64.5 MB
File C7-print.pdf13/06/14 3:38 pm1.37 MB
File C7.pdf13/06/14 3:38 pm2.51 MB
File dw14_08.mp428/06/14 2:05 pm246.5 MB
File C8.pdf28/06/14 2:06 pm2.33 MB
File C8_print.pdf28/06/14 2:06 pm996.72 KB
File C9.pdf28/06/14 2:06 pm1.98 MB
File C9_print.pdf28/06/14 2:06 pm1.2 MB
File dw10.mp409/07/14 3:21 pm267.77 MB
File dw12.mp424/07/14 12:47 pm235.48 MB