Semester : SEMESTER 8
Subject : Data Mining and Ware Housing
Year : 2019
Term : MAY
Branch : COMPUTER SCIENCE AND ENGINEERING
Scheme : 2015 Full Time
Course Code : CS 402
Page:3
A H1060 Pages: 3
Answer any two full questions, each carries 12 marks.
17 Consider the transaction database given below. Set minimum support count as 2
and minimum confidence threshold as 70%
12,14
12,13
11,12.,14
11,13
12,13
11,13
11,12,13,15
11,12,13
a) Find the frequent itemset using FP Growth Algorithm. (8)
b) Generate strong association rules. (4)
18 a) Explain BIRCH Clustering Method. (8)
b) What are the advantages of BIRCH compared to other clustering method. (4)
19 a) Explain k-means partition algorithm. What is the drawback of K-means? (6)
b) Term frequency matrix given in the table shows the frequency of terms per (6)
document. Calculate the TF-IDF value for the term T4 in document 3.
Page 3083