Loading...
2020
Some Remarks on Gries’s Criticism on a Tree-based Approach of Multifactorial Data
Some Remarks on Gries’s Criticism on a Tree-based Approach of Multifactorial Data
한국언어정보학회
신근영
논문정보
- Publisher
- 언어와 정보
- Issue Date
- 2020-02-29
- Keywords
- -
- Citation
- -
- Source
- -
- Journal Title
- -
- Volume
- 24
- Number
- 1
- Start Page
- 15
- End Page
- 28
- DOI
- ISSN
- 12267430
Abstract
Keun Young Shin. 2020. Some Remarks on Gries’s Criticism on a Tree-based Approach of Multifactoral Data. Language and Information 24.1, 15-28. A tree model that is typically used in data mining and machine learning has been recently adopted to analyze multifactorial data in corpus linguistics. However, Gries (2019) claims that tree-based approaches fail to deal with a data set in which not all input variables are necessary for predicting the target variable accurately. He demonstrates that a tree model reports a three-way interaction even when a two-way interaction can predict the target variable with 100% accuracy because there is another third significant input variable. In this article, I will argue that the result of a conditional inference tree applied to such a data set suggests that a researcher should analyze a two-way interaction effect. In other words, the result allows two different interpretations that are related to the structure of a data set and requires to test whether a two-way interaction is sufficient to explain the data. In doing so, I will show that the presence of the third input variable does not hinder the detection of a perfect two-way interaction that is displayed by a symmetric tree figure. (Chonnam National University)
- 전남대학교
- KCI
- 언어와 정보
저자 정보
| 이름 | 소속 |
|---|---|
| 신근영 | 영어영문학과 |