{"title":"Eye tracking off the shelf","authors":"D. Hansen, D. MacKay, J. P. Hansen, M. Nielsen","doi":"10.1145/968363.968375","DOIUrl":null,"url":null,"abstract":"What if eye trackers could be downloaded and used immediately with standard cameras connected to a computer, without the need for an expert to setup the system? This has already the case for head trackers, so why not for eye trackers?Using components off-the-shelf (COTS) for camera-based eye tracking tasks has many advantages, but it certainly introduces several new problems as less assumptions on the system can be made. As a consequence of using COTS the price for eye tracking devices can be reduced while increasing the accessibility of these systems. Eye tracking based on COTS holds potential for a large number of possible applications such as in the games industry and eye typing [Majaranta and Räihä 2002]. Different cameras may be purchased depending on the need and the amount of money the user is willing to spend on the camera. In this framework it is not possible to use IR light sources and other novel engineered devices as they cannot be bought in a common hardware store. Very little control over the cameras and the geometry of the setup can be expected. The methods employed for eye tracking should therefore be able to handle changes in light conditions and image defocusing and scale changes [Hansen and Pece 2003]. On the same token pan-and-tilt cameras cannot be used, thus forcing such systems to be passive. Figure 1 shows a possible setup of a COTS-based eye tracker. When designing systems for the general public, it is unrealistic to assume that people are able to do camera calibration and make accurate setups of camera, monitor and user. Since little is known about the setup, would this then require a vast amount of calibration points needed for gaze estimation? That is, how many calibration points are really needed? Obviously the more calibration points are used the better the chances are to be able to infer the mapping from the image to gaze direction. It would even be possible to sample the entire function space provided sufficiently many calibration points are given. From the point of view of the users, a low number of calibration points is preferred as calibration may be considered as a tedious procedure. Systems that require many calibration points for every session are therefore not likely to succeed. It is also important to know the accuracy in gaze determination when using COTS to determine their applicability for various tasks.","PeriodicalId":127538,"journal":{"name":"Eye Tracking Research & Application","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"36","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eye Tracking Research & Application","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/968363.968375","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 36
Abstract
What if eye trackers could be downloaded and used immediately with standard cameras connected to a computer, without the need for an expert to setup the system? This has already the case for head trackers, so why not for eye trackers?Using components off-the-shelf (COTS) for camera-based eye tracking tasks has many advantages, but it certainly introduces several new problems as less assumptions on the system can be made. As a consequence of using COTS the price for eye tracking devices can be reduced while increasing the accessibility of these systems. Eye tracking based on COTS holds potential for a large number of possible applications such as in the games industry and eye typing [Majaranta and Räihä 2002]. Different cameras may be purchased depending on the need and the amount of money the user is willing to spend on the camera. In this framework it is not possible to use IR light sources and other novel engineered devices as they cannot be bought in a common hardware store. Very little control over the cameras and the geometry of the setup can be expected. The methods employed for eye tracking should therefore be able to handle changes in light conditions and image defocusing and scale changes [Hansen and Pece 2003]. On the same token pan-and-tilt cameras cannot be used, thus forcing such systems to be passive. Figure 1 shows a possible setup of a COTS-based eye tracker. When designing systems for the general public, it is unrealistic to assume that people are able to do camera calibration and make accurate setups of camera, monitor and user. Since little is known about the setup, would this then require a vast amount of calibration points needed for gaze estimation? That is, how many calibration points are really needed? Obviously the more calibration points are used the better the chances are to be able to infer the mapping from the image to gaze direction. It would even be possible to sample the entire function space provided sufficiently many calibration points are given. From the point of view of the users, a low number of calibration points is preferred as calibration may be considered as a tedious procedure. Systems that require many calibration points for every session are therefore not likely to succeed. It is also important to know the accuracy in gaze determination when using COTS to determine their applicability for various tasks.
如果眼动仪可以下载并立即与连接到电脑上的标准摄像头一起使用,而不需要专家来设置系统,那会怎么样?这已经适用于头部追踪器,那么为什么不适用眼动追踪器呢?在基于摄像头的眼动追踪任务中使用现成组件(COTS)有很多优点,但由于对系统的假设较少,它肯定会引入一些新问题。由于使用COTS,眼动追踪设备的价格可以降低,同时增加了这些系统的可访问性。基于COTS的眼动追踪具有大量潜在应用的潜力,例如在游戏行业和眼睛输入[Majaranta and Räihä 2002]。根据需要和用户愿意在相机上花费的金额,可以购买不同的相机。在这个框架中,不可能使用红外光源和其他新颖的工程设备,因为它们不能在普通的五金店买到。对相机和几何设置的控制很少,可以预期。因此,眼动追踪所采用的方法应该能够处理光线条件的变化、图像散焦和比例变化[Hansen and Pece 2003]。出于同样的原因,平移和倾斜相机不能使用,从而迫使这种系统是被动的。图1显示了基于cots的眼动仪的可能设置。在为大众设计系统时,假设人们能够进行摄像机校准并准确设置摄像机,监视器和用户是不现实的。由于对设置知之甚少,这是否需要大量的凝视估计所需的校准点?也就是说,到底需要多少个校准点?显然,使用的校准点越多,就越有可能推断出从图像到凝视方向的映射。如果给出足够多的校准点,甚至可以对整个函数空间进行采样。从用户的角度来看,较少的校准点是最好的,因为校准可能被认为是一个繁琐的过程。因此,每次会话都需要许多校准点的系统不太可能成功。在使用COTS确定其对各种任务的适用性时,了解注视确定的准确性也很重要。