Semester : SEMESTER 8
Subject : Data Mining and Warehousing
Year : 2019
Term : MAY
Branch : COMPUTER SCIENCE AND ENGINEERING
Scheme : 2015 Full Time
Course Code : CS 402
Reg No.:_ Name:
Max. Marks: 100
H1060 Pages: 3
APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
EIGHTH SEMESTER B.TECH DEGREE EXAMINATION, MAY 2019
Course Code: CS402
Course Name: DATA MINING AND WAREHOUSING
PART A
Answer all questions, each carries 4 marks.
How is data mining related to business intelligence?
Differentiate between OLTP and OLAP.
Why do we need data transformation? What are the different ways of data
transformation?
An airport security screening station wants to determine if passengers are
criminals or not. To do this, the faces of passengers are scanned and kept in a
database. Is this a classification or prediction task? Justify your answer.
Where do we use linear regression? Explain linear regression.
What is the significance of tree pruning in decision tree algorithms?
What are the two measures used for rule interestingness?
Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8),
compute the Manhattan distance between the two objects.
How does density-based clustering differ from other methods?
Differentiate between web content mining and web structure mining.
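For the Manhattan distance question above, the computation can be sketched as follows. This is a minimal illustration, not a model answer; the function name `manhattan_distance` is a hypothetical helper introduced here. The Manhattan (L1) distance is the sum of the absolute differences of corresponding attributes.

```python
def manhattan_distance(a, b):
    """Sum of absolute attribute-wise differences (L1 distance)."""
    if len(a) != len(b):
        raise ValueError("both objects must have the same number of attributes")
    return sum(abs(x - y) for x, y in zip(a, b))


# The two tuples given in the question.
obj1 = (22, 1, 42, 10)
obj2 = (20, 0, 36, 8)

# |22-20| + |1-0| + |42-36| + |10-8| = 2 + 1 + 6 + 2
distance = manhattan_distance(obj1, obj2)
print(distance)  # 11
```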
PART B
Answer any two full questions, each carries 9 marks.
Explain the various stages in the knowledge discovery process with a neat diagram.
Use the two methods below to normalize the following group of data:
1000, 2000, 3000, 5000, 9000
i) min-max normalization by setting min = 0 and max = 1
ii) z-score normalization
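The two normalizations asked for above can be sketched as follows. This is an illustrative sketch under stated assumptions, not a model answer: the helper names `min_max_normalize` and `z_score_normalize` are hypothetical, and the z-score uses the population standard deviation (`statistics.pstdev`). Using the sample standard deviation instead would give different z-scores.

```python
import statistics


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Map each value linearly from [min, max] of the data to [new_min, new_max]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]


def z_score_normalize(values):
    """Subtract the mean and divide by the population standard deviation (assumption)."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]


data = [1000, 2000, 3000, 5000, 9000]

min_max = min_max_normalize(data)
print(min_max)  # [0.0, 0.125, 0.25, 0.5, 1.0]

# Mean is 4000; population standard deviation is sqrt(8,000,000), about 2828.43.
z_scores = z_score_normalize(data)
print([round(z, 3) for z in z_scores])  # [-1.061, -0.707, -0.354, 0.354, 1.768]
```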
Suppose that a data warehouse for a university consists of four dimensions: date,
spectator, location, and game, and two measures: count and charge, where charge
is the fare that a spectator pays when watching a game on a given date.
Spectators may be students, adults, or seniors, with each category having its own
charge rate.
Duration: 3 Hours
Marks: (4) (4) (4) (4) (4) (4) (4) (4) (4) (4) (5) (4)