Is Geometry Enough? An Evaluation of Landmark-Based Gaze Estimation

IRIS

Appearance-based gaze estimation frequently relies on deep Convolutional Neural Networks (CNNs). These models are accurate, but computationally expensive and act as ‘‘black boxes’’, offering little interpretability. Geometric methods based on facial landmarks are lightweight alternatives. However, their performance limits and generalization capabilities remain underexplored in modern benchmarks. In this study, we conduct a comprehensive evaluation of the landmark-based gaze estimation. We introduce a standardized pipeline to extract and normalize landmarks from three large-scale datasets (Gaze360, ETH-XGaze, and GazeGene) and train lightweight regression models, specifically extreme gradient boosted trees and two neural architectures: a holistic multilayer perceptron (MLP) and a Siamese MLP designed to capture binocular geometry. We find that landmark-based models exhibit lower performance in the within-domain evaluation, likely due to noise introduced into the datasets by the landmark detector. Never-theless, in the cross-domain evaluation, the proposed MLP architectures show generalization capabilities comparable to those of the ResNet18 baselines. These findings suggest that sparse geometric features encode sufficient information for robust gaze estimation, paving the way for efficient, interpretable, and privacy-friendly edge applications. The source code and generated landmark-based datasets are available at: https://github.com/daniele-agostinelli/LandmarkGaze.git.

Is Geometry Enough? An Evaluation of Landmark-Based Gaze Estimation / Agostinelli, D., Agostinelli, T., Generosi, A., Mengoni, M.. - In: IEEE ACCESS. - ISSN 2169-3536. - (2026), pp. 87241-87251. [10.1109/access.2026.3696778]

Is Geometry Enough? An Evaluation of Landmark-Based Gaze Estimation

Agostinelli, Daniele^Primo;Agostinelli, Thomas^Secondo;Generosi, Andrea^Penultimo;Mengoni, Maura^Ultimo

2026-01-01

Abstract

Appearance-based gaze estimation frequently relies on deep Convolutional Neural Networks (CNNs). These models are accurate, but computationally expensive and act as ‘‘black boxes’’, offering little interpretability. Geometric methods based on facial landmarks are lightweight alternatives. However, their performance limits and generalization capabilities remain underexplored in modern benchmarks. In this study, we conduct a comprehensive evaluation of the landmark-based gaze estimation. We introduce a standardized pipeline to extract and normalize landmarks from three large-scale datasets (Gaze360, ETH-XGaze, and GazeGene) and train lightweight regression models, specifically extreme gradient boosted trees and two neural architectures: a holistic multilayer perceptron (MLP) and a Siamese MLP designed to capture binocular geometry. We find that landmark-based models exhibit lower performance in the within-domain evaluation, likely due to noise introduced into the datasets by the landmark detector. Never-theless, in the cross-domain evaluation, the proposed MLP architectures show generalization capabilities comparable to those of the ResNet18 baselines. These findings suggest that sparse geometric features encode sufficient information for robust gaze estimation, paving the way for efficient, interpretable, and privacy-friendly edge applications. The source code and generated landmark-based datasets are available at: https://github.com/daniele-agostinelli/LandmarkGaze.git.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2026
			
	Rivista su cui è pubblicata l'opera
	
				IEEE ACCESS
			
	Codice DOI
	
				https://dx.doi.org/10.1109/access.2026.3696778
			
	Parole chiave
	
				Deep learning; Facial landmarks; Gaze estimation; Machine learning; Pattern recognition
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Agostinelli_Is-Geometry-Enough-Evaluation-Landmark_2026.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza d'uso: Creative commons Dimensione 816.73 kB Formato Adobe PDF Visualizza/Apri	816.73 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/358981

Citazioni

ND

0

0

social impact