Week 1. Introduction to data mining

Test 1

1、单选题:
‍Which one is not the description of Data mining?‎
选项:
A: Extraction of interesting patterns or knowledge
B: Explorations and analysis by automatic or semi-automatic means
C: Discover meaningful patterns from large quantities of data
D: Appropriate statistical analysis methods to analyze the data collected
答案: 【 Appropriate statistical analysis methods to analyze the data collected

2、单选题:
‎Which one describes the right process of knowledge discovery?​‎​
选项:
A: Selection-Preprocessing-Transformation-Data mining-Interpretation/Evaluation
B: Preprocessing-Transformation-Data mining- Selection- Interpretation/Evaluation
C: Data mining- Selection- Interpretation/Evaluation- Preprocessing-Transformation
D: Transformation-Data mining- election-Preprocessing- Interpretation/Evaluation
答案: 【 Selection-Preprocessing-Transformation-Data mining-Interpretation/Evaluation

3、单选题:
​Which one is not belong to the process of KDD?‎
选项:
A: Data mining
B: Data description
C: Data cleaning
D: Data selection
答案: 【 Data description

4、单选题:
‎Which one is not the right alternative name of data mining? ‌
选项:
A: Knowledge extraction
B: Data archeology
C: Data dredging
D: Data harvesting
答案: 【 Data harvesting

5、单选题:
‎Which one is not the nominal variables?​
选项:
A: Occupation
B: Education
C: Age
D: Color
答案: 【 Age

6、单选题:
​Which one is wrong about classification and regression? ‌
选项:
A: Regression analysis is a statistical methodology that is most often used for numeric prediction.
B: We can construct classification models (functions) without some training examples.
C: Classification predicts categorical (discrete, unordered) labels.
D: Regression models predict continuous-valued functions.
答案: 【 We can construct classification models (functions) without some training examples.

7、单选题:
‌Which one is wrong about clustering and outliers?‌
选项:
A: Clustering belongs to supervised learning.
B: Principles of clustering include maximizing intra-class similarity and minimizing interclass similarity.
C: Outlier analysis can be useful in fraud detection and rare events analysis.
D: Outlier means a data object that does not comply with the general behavior of the data.
答案: 【 Clustering belongs to supervised learning.

8、单选题:
‌About data process, which one is wrong?​
选项:
A: When making data discrimination, we compare the target class with one or a set of comparative cla

剩余75%内容付费后可查看

发表评论

电子邮件地址不会被公开。 必填项已用*标注