Consider the 3-dimensional world projected through some point in space
(the camera) onto some 2-dimensional plane in space (the image).
This is the basic pin-hole camera model most of our work assumes.
This model allows us to formalize computer vision
using a geometric and algebraic language.
In this language we prove fundamental constraints and invariants
between images and objects that can be used to develop algorithms for wide
array of tasks.
Most of the work we do is for the uncalibrated case.
Plane + Parallax provides the
basis for 3D scene analysis even in difficult scenarios where estimation
of scene geometry or camera geometry is difficult.
Direct estimation of 3D motion is a difficult and ill-conditioned
problem due to the large number of degrees of freedom. On the other hand
planer motion estimation is a much easier problem with only 8 unknown
variables in the projective model.
Plane + Parallax builds on top of a 2D technique of motion estimation by
viewing 3D space relative to some arbitrary planar surface in the scene.
The core of the method is detecting a single planar surface in the scene
directly from image intensities and computing its 2D motion in the
image plane. The detected 2D motion is used to register the images so that
the planer surface seems stationary.
The parallax is defined as the resulting residual image displacement field.
In affect the registration cancels camera rotation, thus the disparity
field is affected only by 3D translation of the camera.
The magnitude of each parallax displacement is directly related to
its perpendicular distance from the planer surface defining a 3D structure
relative to our plane.
This approach can be applied to many image tasks such as calculation of
ego-motion
and detection of
moving objects.
Algorithms based on this method are robust and allow
multiple moving objects and occlusions.
Selected Papers
Michal Irani, Benny Rousso, Shmuel Peleg
Robust Recovery of Ego-Motion
CAIP 1993, Budapest, 13-15 Sept 1993.
Michal Irani, Benny Rousso, Shmuel Peleg
Computing Occluding and Transparent Motions
IJCV, Vol 12 No. 1, January 1994
A. Shashua and N. Navab.
Relative Affine Structure: Canonical Model for 3D from 2D
Geometry and Applications.
PAMI Vol. 18(9), 1996.
Given a few images of a 3D scene, the relationships exist the
images for a given point in the world are studied.
For a projective transformation from 3D space to image plane, there are
relationships between images that satisfy trilinear constraints.
The Trilinear Tensor is a set of coefficients for these trilinear constraints
that solve the unique geometric relations between
three uncalibrated projective cameras.
It is a powerful tool for the analysis of multiple images of a scene.
Given seven matching triplets across three views we can calculate
these 27 coefficients by solving linear equations.
The Trilinear Tensor is uniquely determined for any camera configuration
of three views, including the case when the three camera centers are
collinear, its coefficients can be calculated directly from this configuration.
This tensor contains within much information the most important being:
the projection of any scene point across the three views must satisfy this set
of trilinear equations.
We list some of the advantages of the Trilinear Tensor.
The Trilinear Tensor has no critical surface and therefore numerically stable.
The Trilinear Tensor is structure invariant, no segmentation is needed and
all scene points can participate in its calculation.
Many interesting geometric relationships can be derived from
this tensor: camera translation, camera rotation, epipolar geometry,
fundamental matrix, intrinsic homographies, relative affine structure.
Currently the tensor assumes prior correspondence of points for it calculation,
in development is use of the constant brightness equation for direct
calculation of the tensor from image intensities.
One of our successful algorithm based on the tensor is a new method to
represent scenes that avoid 3D.
Selected Papers
A. Shashua.
Algebraic Functions For Recognition.
IEEE Transactions on Pattern Analysis and Machine
Intelligence (PAMI) Vol. 17(8), pp. 779--789, 1995.
A. Shashua and M. Werman
Trilinearity of Three Perspective Views and its Associated Tensor
Int. Conf. Computer Vision, Boston, June 1995.
A. Shashua and S. Avidan.
The Rank4 Constraint In Multiple View Geometry.
To appear in ECCV, April 1996.
Benny Rousso, Shai Avidan, Amnon Shashua, Shmuel Peleg
Robust recovery of camera rotation from three frames
ARPA Image Understanding Workshop, February 1996.
A.Shashua and P. Anandan.
Trilinear Constraints Revisited: Generalized Trilinear
Constraints And The Tensor Brightness Constraint.
In Proc. of the ARPA Image Understanding Workshop, Palm Springs, Feb.
1996.