Semester : SEMESTER 6
Subject : Data Warehousing & Mining
Year : 2020
Term : SEPTEMBER
Branch : INFORMATION TECHNOLOGY
Scheme : 2015 Full Time
Course Code : IT 304
( 030001T304052001 Pages: 3
Reg No.: Name:
APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
Sixth semester B.Tech examinations (S), September 2020
Course Code: IT304
Course Name: Data Warehousing and Mining
Max. Marks: 100 Duration: 3 Hours
PART A
Answer any two full questions, each carries 15 marks. Marks
1 a) What are the major challenges of mining a huge amount of data in (5)
comparison with mining a small amount of data?
b) How is a data warehouse different from a database? How are they similar? (5)
c) What are the different types of applications where data mining can be (5)
directly applied?
2 a) Distinguish between OLTP & OLAP. (8)
b) Use the two methods below to normalize the following group of data: 200, (7)
300, 400, 600, 1000
i) min-max normalization by setting min=0 and max=1
ii) z-score normalization
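For reference, the two normalizations in question 2 b) can be sketched as below. This is a minimal illustration, not a model answer: the function names min_max and z_score are hypothetical, and the z-score uses the population standard deviation (dividing by n), which is an assumption; a sample standard deviation (dividing by n - 1) would give slightly different values.

```python
from statistics import mean, pstdev

# The data group given in question 2 b).
data = [200, 300, 400, 600, 1000]

def min_max(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so the smallest maps to new_min and the largest to new_max."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]

def z_score(values):
    """Centre values on their mean and scale by the population standard deviation (assumption)."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]

min_max_result = min_max(data)
z_result = z_score(data)

print(min_max_result)
print([round(z, 4) for z in z_result])
```

With min = 200 and max = 1000, each min-max value is (v - 200) / 800; with mean 500, each z-score is (v - 500) divided by the standard deviation.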
3 a) What is a multidimensional schema? (2)
b) Write short notes on Star, Snowflake and Data constellation schema. (9)
c) Suppose that a data warehouse consists of the four dimensions date, (4)
spectator, location and game, and the two measures count and charge,
where charge is the fare that a spectator pays when watching a game on a
given date. Spectators may be students, adults or seniors, with each
category having its own charge rate. Draw a star schema diagram for the
data warehouse.
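As an aid to question 3 c), one possible star schema can be sketched as table definitions: a central fact table holding the two measures, with one foreign key per dimension. Only the four dimensions and the measures count and charge come from the question; every table name, key name and dimension attribute below is a hypothetical choice, and the sketch does not replace the diagram the question asks for.

```python
import sqlite3

# In-memory database used only to check that the schema is well formed.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE date_dim (
    date_id INTEGER PRIMARY KEY,
    day INTEGER, month INTEGER, year INTEGER           -- assumed attributes
);
CREATE TABLE spectator_dim (
    spectator_id INTEGER PRIMARY KEY,
    name TEXT,                                         -- assumed attribute
    category TEXT CHECK (category IN ('student', 'adult', 'senior'))
);
CREATE TABLE location_dim (
    location_id INTEGER PRIMARY KEY,
    venue TEXT, city TEXT                              -- assumed attributes
);
CREATE TABLE game_dim (
    game_id INTEGER PRIMARY KEY,
    game_name TEXT                                     -- assumed attribute
);
-- Fact table: one foreign key per dimension plus the two measures.
CREATE TABLE sales_fact (
    date_id INTEGER REFERENCES date_dim(date_id),
    spectator_id INTEGER REFERENCES spectator_dim(spectator_id),
    location_id INTEGER REFERENCES location_dim(location_id),
    game_id INTEGER REFERENCES game_dim(game_id),
    "count" INTEGER,
    charge REAL
);
""")

tables = sorted(
    row[0] for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
foreign_keys = conn.execute("PRAGMA foreign_key_list(sales_fact)").fetchall()
print(tables)
print(len(foreign_keys))
```

The spectator category is kept as an attribute of the spectator dimension because the charge rate depends on it.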
PART B
Answer any two full questions, each carries 15 marks.
4 a) Use Naive Bayes algorithm to determine whether a red domestic SUV car is (9)
stolen or not using the following data: