This research presents an algorithm for three-dimensional (3-D) pose tracking of a rigid object by processing sequences of monocular images. The pose trajectory of the object is estimated by performing linear correlation between the current scene and a filter bank constructed from different views of a 3-D model of the target, which are created synthetically with computer graphics. The pose tracking is guided by particle filters that dynamically adapt the filter bank by taking into account the kinematics of the target in the scene. Experimental results obtained with the proposed algorithm in processing synthetic and real images are presented and discussed. These results show that the proposed algorithm achieves a higher accuracy of pose tracking in terms of objective metrics, in comparison with that of existing similar algorithms.
A visual approach in environment recognition for robot navigation is proposed. This work includes a template matching filtering technique to detect obstacles and feasible paths using a single camera to sense a cluttered environment. In this problem statement, a robot can move from the start to the goal by choosing a single path between multiple possible ways. In order to generate an efficient and safe path for mobile robot navigation, the proposal employs a pseudo-bacterial potential field algorithm to derive optimal potential field functions using evolutionary computation. Simulation results are evaluated in synthetic and real scenes in terms of accuracy of environment recognition and efficiency of path planning computation.
A reliable approach for object segmentation based on template-matching filters is proposed. The system employs an adaptive strategy for the generation of space-variant filters which take into account several versions of the target and local statistical properties of the input scene. Moreover, the proposed method considers the geometric modifications of the target while is moving through a video sequence. The detection accuracy of the matched filter brings the location of the target of interest. The estimated location coordinates are used to compute the support area covered by the target using watershed segmentation technique. In each frame, the filter adapts according the geometrical changes of the target in order to estimate its current support region. Experimental tests carried out in a video sequence show that the proposed system yields a very good performance for accuracy detection, and object segmentation efficiency in real-life scenes.
The problem of 3D pose recognition of a rigid object is difficult to solve because the pose in a 3D space can vary with multiple degrees of freedom. In this work, we propose an accurate method for 3D pose estimation based on template matched filtering. The proposed method utilizes a bank of space-variant filters which take into account different pose states of the target and local statistical properties of the input scene. The state parameters of location coordinates, orientation angles, and scaling parameters of the target are estimated with high accuracy in the input scene. Experimental tests are performed for real and synthetic scenes. The proposed system yields good performance for 3D pose recognition in terms of detection efficiency, location and orientation errors.
Computer vision is an important task in robotics applications. This work proposes an approach for autonomous mobile robot navigation using the integration of the template-matching filters for obstacle detection and the evolutionary artificial potential field method for path planning. The recognition system employs a digital camera to sense the environment of a mobile robot. The captured scene is processed by a bank of space variant filters in order to find the obstacles and a feasible area for the robot navigation. The path planning employs evolutionary artificial potential fields to derive optimal potential field functions using evolutionary computation. Simulation results to validate the analysis and implementation are provided; they were specifically made to show the effectiveness and the efficiency of the proposal.
A reliable method for three-dimensional digitization of human faces based on the fringe projection technique
is presented. The proposed method employs robust fringe analysis algorithms for robust phase computation.
The quality of the resultant 3D face model is characterized in terms of accuracy of surface computation using
objective metrics. We present experimental results obtained with real and synthetic laboratory objects. The
potential of this method to be used in the field of face recognition is discussed.
An accurate algorithm for three-dimensional (3-D) pose recognition of a rigid object is presented. The algorithm is based on adaptive template matched filtering and local search optimization. When a scene image is captured, a bank of correlation filters is constructed to find the best correspondence between the current view of the target in the scene and a target image synthesized by means of computer graphics. The synthetic image is created using a known 3-D model of the target and an iterative procedure based on local search. Computer simulation results obtained with the proposed algorithm in synthetic and real-life scenes are presented and discussed in terms of accuracy of pose recognition in the presence of noise, cluttered background, and occlusion. Experimental results show that our proposal presents high accuracy for 3-D pose estimation using monocular images.
Proc. SPIE. 9598, Optics and Photonics for Information Processing IX
KEYWORDS: 3D acquisition, Detection and tracking algorithms, Image processing, Computing systems, 3D modeling, Light sources and illumination, Image filtering, Object recognition, 3D image processing, RGB color model
In this paper we solve the problem of pose recognition of a 3D object in non-uniformly illuminated and noisy scenes. The recognition system employs a bank of space-variant correlation filters constructed with an adaptive approach based on local statistical parameters of the input scene. The position and orientation of the target are estimated with the help of the filter bank. For an observed input frame, the algorithm computes the correlation process between the observed image and the bank of filters using a combination of data and task parallelism by taking advantage of a graphics processing unit (GPU) architecture. The pose of the target is estimated by finding the template that better matches the current view of target within the scene. The performance of the proposed system is evaluated in terms of recognition accuracy, location and orientation errors, and computational performance.
Light interactions with matter is of remarkable complexity. An adequate modeling of global illumination is a vastly studied topic since the beginning of computer graphics, and still is an unsolved problem. The rendering equation for global illumination is based of refraction and reflection of light in interaction with matter within an environment. This physical process possesses a high computational complexity when implemented in a digital computer. The appearance of an object depends on light interactions with the surface of the material, such as emission, scattering, and absorption. Several image-synthesis methods have been used to realistically render the appearance of light incidence on an object. Recent global illumination algorithms employ mathematical models and computational strategies that improve the efficiency of the simulation solution. This work presents a review the state of the art of global illumination algorithms and focuses on the efficiency of the solution in a computational implementation in a graphics processing unit. A reliable system is developed to simulate realistics scenes in the context of real-time object recognition under different lighting conditions. Computer simulations results are presented and discussed in terms of discrimination capability, and robustness to additive noise, when considering several lighting model reflections and multiple light sources.
A real-time system for illumination-invariant object tracking is proposed. The system is able to estimate at
high-rate the position of a moving target in an input scene when is corrupted by the presence of a high cluttering
background and nonuniform illumination. The position of the target is estimated with the help of a filter bank of
space-variant correlation filters. The filters in the bank, adapt their parameters according to the local statistical
parameters of the observed scene in a small region centered at coordinates of a predicted position for the target
in each frame. The prediction is carried out by exploiting information of present and past frames, and by using
a dynamic motion model of the target in a two-dimensional plane. Computer simulation results obtained with
the proposed system are presented and discussed in terms of tracking accuracy, computational complexity, and
tolerance to nonuniform illumination.
A real-time system for multiclass object recognition is proposed. The system is able to identify and correctly
classify several moving targets from an input scene by using a bank of adaptive correlation filters with complex
constraints implemented on a graphics processing unit. The bank of filters is synthesized with the help of
an iterative algorithm based on complex synthetic discriminant functions. At each iteration, the algorithm
optimizes the discrimination capability of each filter in the bank by using all available information about the
known patterns to be recognized and unwanted patterns to be rejected such as false objects or a background.
Computer simulation results obtained with the proposed system in real and synthetic scenes are presented and
discussed in terms of pattern recognition performance and real-time operation speed.