结构方程模型和IRT等级反应模型在人格量表项目筛选中的对比研究
A Comparison between the Structural Equation Model and IRT Graded Response Model on Item Selection of Personality Scale
-
摘要:为比较结构方程模型和IRT等级反应模型在人格量表项目筛选上的作用,以《中国大学生人格量表》的7229个实际测量数据为基础,针对因素二“爽直”分别以Lisrel8.70和Multilog7.03进行结构方程模型和等级反应模型的参数估计与拟合,比较两种方法的项目筛选结果.二者统计结果均认为项目5、6、7、8拟合度不佳,在结构方程模型上表现为因子负荷较低,整体拟合指数不理想; 在等级反应模型上表现为区分度参数和位置参数不理想,相关项目的特征曲线和信息曲线形态较差.但结构方程模型倾向于项目6、8更差,而等级反应模型则倾向于项目5、6更差.结构方程模型和IRT等级反应模型对人格量表项目的统计推断结果从总体上讲是一致的,但在个别项目上略有差异.二者各有优势,可以结合使用Abstract:Objective To compare the application of the structural equation model and the IRT graded response model in item selection of personality scale. Methods Lisrel 8.70 and Multilog 7.03 were applied respectively to estimate parameters and indexes of goodness-of-fit in the structural equation model and the IRT graded response model in terms of the second factor of "forthright" in the Chinese College Student Personality Scale, on the basis of 7229 measured cases. Results The results of the two models indicated that neither was good enough in terms of the goodness-of-fit on item 5, 6, 7, and 8. On the one hand, the typical feature was found to be a too low factor load and an unsatisfactory overall fit index in term of the structural equation model. In term of the IRT graded response model, the major faults were found to have a rather unsatisfactory differentiation parameter and a bad location parameter with too poor a shape of the characteristic curve and the information curve for the related items. Items 6 and 8 tend to be much worse in term of the structural equation model while Items 5 and 7 are found to be worse in term of the IRT graded response model. Conclusion The inference results of both models were consistent in general, although there were slight differences on a few specific items. These two models could be applied in combination since each one has its advantages.