A data warehouse for the MIS department consists of the following four dimension
Question
A data warehouse for the MIS department consists of the following four dimensions: student, course, instructor, and semester. Each dimension has five levels, including the all level. There are two measures: count and average grade. At the lowest level, the average grade is the actual grade of a student. You are asked to perform a data mining study on that warehouse.
a) Define three data mining problems on that warehouse, involving association, classification, and clustering functionalities respectively. Clearly state the importance of each problem.
b) What is the advantage of the data being organized as OLAP cubes compared to relational table organisation?
Explanation / Answer
a) The main data mining issues that arise on this warehouse are:
1) Mining methodology and user interaction issues:
The major issues here are pattern evaluation, handling incomplete and noisy data, presenting the mining results (for example, clusters), incorporating background knowledge (for example, for classification), and mining different kinds of knowledge from the database (for example, associations).
2) Performance issues:
The mining algorithms must be efficient and scalable with respect to the size of the database.
3) Diverse data type issues:
These concern handling relational as well as complex types of data, and mining information from heterogeneous databases and global information systems.
b) OLAP cubes give better query performance for analytical queries than relational tables, because the data is organized along its dimensions. This organization is optimized for data warehouses, data marts, and online analytical processing (OLAP) applications.
A relational database works well for queries on individual records, whereas an OLAP database is better suited to queries that aggregate over large groups of records.
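To make the difference concrete, here is a minimal Python sketch, not a real OLAP engine. It assumes a tiny made-up fact table (the student, course, instructor, and semester values are hypothetical) and models only two of the five levels per dimension: the lowest level and the all level. The cube precomputes count and average grade for every combination of dimension values, so a query is a single lookup, while the relational-style query scans every row each time.

```python
from itertools import product

# Hypothetical fact rows: (student, course, instructor, semester, grade)
FACTS = [
    ("s1", "MIS101", "i1", "2023F", 80),
    ("s2", "MIS101", "i1", "2023F", 90),
    ("s1", "MIS202", "i2", "2024S", 70),
    ("s3", "MIS202", "i2", "2024S", 60),
]
ALL = "*"  # stands for the "all" level of a dimension


def build_cube(facts):
    """Precompute (count, grade sum) for every combination of
    specific value or ALL across the four dimensions."""
    cube = {}
    for *dims, grade in facts:
        # Each dimension is either kept or rolled up to ALL: 2**4 cells per row.
        for mask in product((False, True), repeat=len(dims)):
            key = tuple(ALL if rolled else d for d, rolled in zip(dims, mask))
            count, total = cube.get(key, (0, 0))
            cube[key] = (count + 1, total + grade)
    return cube


def cell(cube, student=ALL, course=ALL, instructor=ALL, semester=ALL):
    """Return (count, average grade) for one cube cell, or (0, None) if empty."""
    count, total = cube.get((student, course, instructor, semester), (0, 0))
    return (count, total / count) if count else (0, None)


def scan(facts, course):
    """Relational-style equivalent: scan all rows and aggregate at query time."""
    grades = [g for _, c, _, _, g in facts if c == course]
    return (len(grades), sum(grades) / len(grades)) if grades else (0, None)


cube = build_cube(FACTS)
print(cell(cube, course="MIS101"))  # one dictionary lookup
print(scan(FACTS, "MIS101"))        # same answer, but reads every row
```

Both calls return the same count and average grade. The trade-off is that the cube costs extra storage and build time in exchange for fast aggregate queries, which is why it suits analysis over large groups of records rather than updates to individual ones.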