Semester : SEMESTER 8
Subject : Data Mining and Warehousing
Year : 2020
Term : SEPTEMBER
Branch : COMPUTER SCIENCE AND ENGINEERING
Scheme : 2015 Full Time
Course Code : CS 402
0404000642001 Pages: 5
Reg No.: Name:
APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
Eighth semester B.Tech degree examinations, September 2020
Course Code: CS402
Course Name: DATA MINING AND WAREHOUSING
Max. Marks: 100 Duration: 3 Hours
PART A
Answer all questions, each carries 4 marks. Marks
1 List out the four major features of a data warehouse as defined by William H. Inmon, the father of data warehousing. (4)
2 What is the purpose of data discretization in data mining? List out any four data discretization strategies. (4)
3 a) Draw a suitable figure that shows data mining as a process of knowledge discovery. (2)
b) List out any four methods to handle missing attribute values in a dataset. (2)
4 a) How is the entropy of a dataset calculated? (2)
b) What are the advantages of DBSCAN over the k-Means clustering algorithm? (2)
5 What is a confusion matrix? (4)
6 Describe the purpose of the kernel function in a nonlinear SVM with a suitable example. (4)
7 What is the significance of CF (Clustering Feature) in BIRCH Algorithm? (4)
8 The transaction details are given in the following table. What are the confidence and support of the association rule {Diapers} → {Coffee, Nuts}? (4)
T_id | Items bought
10 | Beer, Nuts, Diapers
20 | Beer, Coffee, Diapers, Nuts
30 | Beer, Diapers, Eggs
40 | Beer, Nuts, Eggs, Milk
50 | Nuts, Coffee, Diapers, Eggs, Milk
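The support and confidence asked for in question 8 can be checked with a short script. This is a minimal sketch, not part of the question paper: it assumes the usual definitions (support of a rule X → Y is the fraction of transactions containing every item of X and Y; confidence is that support divided by the support of X). The helper names `support` and `confidence` are illustrative, and the transactions are listed in table row order without their IDs.

```python
# Item sets copied from the table for question 8, one set per row, in row order.
transactions = [
    {"Beer", "Nuts", "Diapers"},
    {"Beer", "Coffee", "Diapers", "Nuts"},
    {"Beer", "Diapers", "Eggs"},
    {"Beer", "Nuts", "Eggs", "Milk"},
    {"Nuts", "Coffee", "Diapers", "Eggs", "Milk"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Support of (antecedent and consequent) divided by support of antecedent."""
    combined = set(antecedent) | set(consequent)
    return support(combined, transactions) / support(antecedent, transactions)


# Rule under test: {Diapers} -> {Coffee, Nuts}
rule_support = support({"Diapers", "Coffee", "Nuts"}, transactions)
rule_confidence = confidence({"Diapers"}, {"Coffee", "Nuts"}, transactions)

print(f"support    = {rule_support:.2f}")
print(f"confidence = {rule_confidence:.2f}")
```

Using sets for each transaction keeps the containment test (`itemset <= t`) a direct translation of the definition, so the same two helpers work for any other rule over this table.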
Page 1 of 5