-
Title
Generation and recognition of eye movement samples based on generative artificial intelligence
-
作者
谭雪青宋军张慢慢臧传丽
-
Author
TAN Xueqing;SONG Jun;ZHANG Manman;ZANG Chuanli
-
单位
天津师范大学心理学部河南理工大学机械与动力工程学院
-
Organization
Faculty of Psychology, Tianjin Normal University
School of Mechanical and Power Engineering, Henan Polytech⁃nic University
-
摘要
目的目的生成式和传统人工智能模型是信息时代的关键工具。在这些技术的助力下,眼动过程的样本生成与识别显得尤为关键,它已成为深入研究认知机制的重要手段。为了推动生成式人工智能在眼动技术领域的应用发展,解决眼动样本生成及因网络深度增加而导致的不透明性和不可解释性问题,并深入挖掘与幼儿语言发展相关的眼动数据,方法方法采集4~6岁幼儿理解不同焦点结构的眼动数据,采用生成式人工智能模型-变分自编码器(variationalautoen‑coder,VAE)和传统模型-多层感知器(multi-layerperceptron,MLP)识别眼动模式的发展差异并尝试生成新样本,基于灰色关联分析和混淆矩阵对生成式数据集进行解释。结果结果结果表明:(1)VAE生成的4岁组、5岁组和6岁组幼儿眼动数据集精度高于MINIST数据集(mixedNa‑tionalInstituteofStandardsandTechnologydatabase),且与MLP分析结果一致,具有准确性、多样性和一定的可解释性;(2)生成式眼动数据及混淆矩阵结果表明,在无焦点结构句式中,幼儿在4~5岁、5~6岁两个阶段理解水平均有提升,而宾语焦点结构和主语焦点结构的眼动特征在4~5岁变化较小,5~6岁变化较大,说明幼儿对焦点结构的理解在5岁是一个关键期,这符合幼儿焦点结构理解发展规律。结论结论提出的人工智能耦合分析方法,具备有效识别眼动特征发展模式的能力,并能据此生成可靠的新样本。这一方法不仅为生成式人工智能与眼动技术的融合开辟了新的途径,而且为复杂语言理解问题提供了全新的思考方向。
-
Abstract
Objectives Generative and traditional artificial intelligence models are pivotal tools in the infor‑mation age. Leveraging these technologies,the generation and identification of eye movement samples have emerged as critical components, facilitating deeper explorations into cognitive mechanisms. Therefore, this study aims to promote the development of generative artificial intelligence in the field of eye tracking tech‑nology, solve the problem of eye movement sample generation and the opacity and inexplicability caused by the increase in network depth, and deeply mine eye tracking data related to children’s language develop‑ment. Methods This study collected data on the eye movement process of 4~6 years old children’s under‑standing of different focus structures. Generative artificial intelligence model-variational autoencoder(VAE) and traditional models-multilayer perceptron(MLP) were used to identify the developmental differences in their eye movement patterns and attempt to generate new samples. Interpreting generative datasets based on grey relational analysis and confusion matrix. Results The results showed that: (1)the eye movement datasets generated by VAE for 4, 5, and 6 years old children had higher accuracy than the MINIST dataset(mixed National Institute of Standards and Technology), and were consistent with the MLP analysis results, with accuracy, diversity, and certain interpretability; (2)The results of generative eye movement data and con‑fusion matrix indicated that in unfocused structure, children’s understanding level improved at the ages of 4~5 and 5~6, while the eye movement characteristics of object-focus structure and subject-focus structure changed less at the ages of 4~5 and more at the ages of 5~6, indicating that children’s understanding of focus structure was a critical period at the age of 5, which was in line with the development law of children’s un‑derstanding of focus structure. Conclusions The artificial intelligence coupling analysis proposed in this ar‑ticle could identify the development patterns of eye movement features and generate reliable new samples, providing new ideas for the combination of generative artificial intelligence and eye movement technology.
-
关键词
生成式人工智能变分自编码器多层感知器眼动
-
KeyWords
generative artificial intelligence;variational autoencoder;multi-layer perceptron;eye movement
-
基金项目(Foundation)
国家自然科学基金资助项目(31800920);河南省教育科学规划重点课题资助项目(2025JKZD16)
-
DOI
-
引用格式
谭雪青,宋军,张慢慢,等.基于生成式人工智能的眼动样本生成及识别[J].河南理工大学学报(自然科学版),2025,44(1):145‑153.
-
Citation
TAN X Q, SONG J, ZHANG M M, et al. Generation and recognition of eye movement samples based on generative ar‑tificial intelligence[J]. Journal of Henan Polytechnic University (Natural Science),2025,44(1):145-153.