Martin et al., 2022 - Google Patents

Scangan360: A generative model of realistic scanpaths for 360 images

Martin et al., 2022

Document ID: 15006901751013501858
Author: Martin D; Serrano A; Bergman A; Wetzstein G; Masia B
Publication year: 2022
Publication venue: IEEE Transactions on Visualization and Computer Graphics

External Links

Cited by

Snippet

Understanding and modeling the dynamics of human gaze behavior in 360° environments is crucial for creating, improving, and developing emerging virtual reality applications. However, recruiting human observers and acquiring enough data to analyze their behavior …

Continue reading at par.nsf.gov (PDF) (other versions)

241000282414 Homo sapiens 0 abstract description 41

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run

Similar Documents

Publication	Publication Date	Title
Martin et al.	2022	Scangan360: A generative model of realistic scanpaths for 360 images
Xu et al.	2018	Predicting head movement in panoramic video: A deep reinforcement learning approach
Yi et al.	2020	Audio-driven talking face video generation with learning-based personalized head pose
US12053301B2 (en)	2024-08-06	Classifying facial expressions using eye-tracking cameras
Park et al.	2022	Synctalkface: Talking face generation with precise lip-syncing via audio-lip memory
JP7147078B2 (en)	2022-10-04	Video frame information labeling method, apparatus, apparatus and computer program
Qiao et al.	2020	Viewport-dependent saliency prediction in 360 video
US10614289B2 (en)	2020-04-07	Facial tracking with classifiers
Supancic III et al.	2017	Tracking as online decision-making: Learning a policy from streaming videos with reinforcement learning
JP7476428B2 (en)	2024-04-30	Image line of sight correction method, device, electronic device, computer-readable storage medium, and computer program
Sharp et al.	2015	Accurate, robust, and flexible real-time hand tracking
Zhu et al.	2021	Viewing behavior supported visual saliency predictor for 360 degree videos
Wang et al.	2020	Predicting camera viewpoint improves cross-dataset generalization for 3d human pose estimation
Chen et al.	2014	A probabilistic approach to online eye gaze tracking without explicit personal calibration
Elhayek et al.	2018	Fully automatic multi-person human motion capture for vr applications
Chen et al.	2022	3D face reconstruction and gaze tracking in the HMD for virtual interaction
Bernal-Berdun et al.	2022	SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360∘ videos
Corona et al.	2024	VLOGGER: Multimodal diffusion for embodied avatar synthesis
Li et al.	2014	Real-time gaze estimation using a kinect and a HD webcam
Li et al.	2021	Predicting user visual attention in virtual reality with a deep learning model
Shi et al.	2021	I understand you: Blind 3d human attention inference from the perspective of third-person
Wu et al.	2018	Foveated convolutional neural networks for video summarization
Van Gemeren et al.	2018	Hands-on: deformable pose and motion models for spatiotemporal localization of fine-grained dyadic interactions
Ni	2023	Application of motion tracking technology in movies, television production and photography using big data
Malladi et al.	2022	EG-SNIK: a free viewing egocentric gaze dataset and its applications