MC0088- DATA MINING

Dear students get fully solved assignments
Send your semester & Specialization name to our mail id :
  help.mbaassignments@gmail.com
or
call us at : 08263069601


ASSIGNMENT

DRIVE
FALL – 2016
PROGRAM
MCA(REVISED FALL 2007)
SEMESTER
6
SUBJECT CODE & NAME
MC0088- DATA MINING
BK ID
B1009
CREDITS
4
MARKS
60


Note: Answer all questions. Kindly note that answers for 10 marks questions should be approximately of 400 words. Each question is followed by evaluation scheme.



Question. 1. Differentiate between Data Mining and Data Warehousing.

Answer: Data warehousing is about the STORING of analytical data in a structure suitable for data mining.  This analytical data is extracted from the operational systems usually on a daily basis.

Data mining is a set of techniques used to search, retrieve and analyze data from a data warehouse (or other data storage mechanism) to answer


Question. 2. What are the key features of a Data Warehouse? Explain.

Answer: Data warehouse can be defined as ‘Structural Repository’ of historic data. It is developed in evolutionary process by integrating the data from non integrated systems like text files, excel sheets, databases(The same is shown in the diagram below.)





Question. 3. Differentiate between Data Integration and Transformation.

Answer: ETL is a type of data integration and involves an architecture that extracts, transforms and then loads data in a target database or file. Other forms of data integration include ELT (Extract, Load and Transform) or ELTL or EII. There is also manual data integration where a user exports a database table and imports it into another database. ETL is most commonly used as a name for a specific type of data integration tool.



Question. 4. Differentiate between database management systems (DBMS) and data mining.

Answer: A DBMS (Database Management System) is a complete system used for managing digital databases that allows storage of database content, creation/maintenance of data, search and other functionalities. On the other hand, Data Mining is a field in computer science, which deals with the extraction of previously unknown and interesting information from raw data. Usually, the data used as the input for the Data mining process is stored in databases. Users who are inclined toward statistics use Data Mining. They utilize



Question. 5. Differentiate between K-means and Hierarchical clustering.

Answer: There are a number of important differences between k-means and hierarchical clustering, ranging from how the algorithms are implemented to how you can interpret the results.

The k-means algorithm is parameterized by the value k, which is the number of clusters that you want to create. As the animation below illustrates, the algorithm begins by creating k centroids. It then iterates between an assign step (where



Question. 6. Differentiate between Web content mining and Web usage mining.

Answer: Web mining is a rapid growing research area. It consists of Web usage mining, Web structure mining, and Web content mining. Web usage mining refers to the discovery of user access patterns from Web usage logs. Web structure mining tries to discover useful knowledge from the structure of hyperlinks. Web content mining aims to extract/mine useful information or knowledge from web page contents. This tutorial focuses on Web Content Mining.

Dear students get fully solved assignments
Send your semester & Specialization name to our mail id :
  help.mbaassignments@gmail.com
or
call us at : 08263069601


No comments:

Post a Comment

Note: Only a member of this blog may post a comment.