Semester : SEMESTER 6
Subject : Data Warehousing & Mining
Year : 2020
Term : SEPTEMBER
Branch : INFORMATION TECHNOLOGY
Scheme : 2015 Full Time
Course Code : IT 304
Page:3
d)
a)
b)
0)
೩)
0)
०)
0)
8)
b)
0)
030001T304052001
೫ 1.0 2.0 3.0 4.0 5.0
Mention the issues regarding classification and prediction.
PART C
Answer any two full questions, each carries 20 marks.
Write note on CRM data mining models.
Draw the classification framework for data mining techniques in CRM and
explain in detail.
Explain the different stages of Customer life cycle with a neat diagram?
Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8)
i) Compute the Eucleidian distance between the two objects.
ii) Compute the Manhattan distance between the two objects.
Use K-means clustering algorithm to divide the following data into two
clusters and also compute the representative data points for the clusters
assuming the initial cluster centre as (2,1) and (2,3).
| 1 | 2 (ख
೫2 | 1 | | 1 | 3
Mention any four features of R programming.
Differentiate web content mining and web structure mining.
Write an algorithm for k-nearest neighbour classification given k and n, the
number of attributes describing each tuple.
How density based clustering varies from other methods?
List the advantages and disadvantages of K-means clustering.
Page 3 of 3
(3)
(7)
(6)
(7)
(4)
(8)
(4)
(4)
(8)
(8)
(4)