Camera resectioning

Camera resectioning is the process of estimating the parameters of a pinhole camera model approximating the camera that produced a given photograph or video. Usually, the pinhole camera parameters are represented in a 3 × 4 matrix called the camera matrix.

This process is often called camera calibration, although that term can also refer to photometric camera calibration.

Camera resectioning

Homogeneous coordinates

In this context, we use $[u\ v\ 1]^{T}$ to represent a 2D point position in pixel coordinates and $[x_{w}\ y_{w}\ z_{w}\ 1]^{T}$ is used to represent a 3D point position in world coordinates. In both cases, they are represented in homogeneous coordinates (i.e. they have an additional last component, which is initially, by convention, a 1), which is the most common notation in robotics and rigid body transforms.

Projection

Referring to the pinhole camera model, a camera matrix $M$ is used to denote a projective mapping from world coordinates to pixel coordinates.

z_{c}{\begin{bmatrix}u\\v\\1\end{bmatrix}}=K\,{\begin{bmatrix}R&T\end{bmatrix}}{\begin{bmatrix}x_{w}\\y_{w}\\z_{w}\\1\end{bmatrix}}=M{\begin{bmatrix}x_{w}\\y_{w}\\z_{w}\\1\end{bmatrix}}

where $M=K\,{\begin{bmatrix}R&T\end{bmatrix}}$ .

Intrinsic parameters

K={\begin{bmatrix}\alpha _{x}&\gamma &u_{0}&0\\0&\alpha _{y}&v_{0}&0\\0&0&1&0\end{bmatrix}}

The intrinsic matrix $K$ contains 5 intrinsic parameters of the specific camera model. These parameters encompass focal length, image sensor format, and principal point. The parameters $\alpha _{x}=f\cdot m_{x}$ and $\alpha _{y}=f\cdot m_{y}$ represent focal length in terms of pixels, where $m_{x}$ and $m_{y}$ are the scale factors relating pixels to distance and $f$ is the focal length in terms of distance. [1] $\gamma$ represents the skew coefficient between the x and the y axis, and is often 0. $u_{0}$ and $v_{0}$ represent the principal point, which would be ideally in the center of the image.

Nonlinear intrinsic parameters such as lens distortion are also important although they cannot be included in the linear camera model described by the intrinsic parameter matrix. Many modern camera calibration algorithms estimate these intrinsic parameters as well in the form of non-linear optimisation techniques. This is done in the form of optimising the camera and distortion parameters in the form of what is generally known as bundle adjustment.

Extrinsic parameters

${}{\begin{bmatrix}R_{3\times 3}&T_{3\times 1}\\0_{1\times 3}&1\end{bmatrix}}_{4\times 4}$

$R,T$ are the extrinsic parameters which denote the coordinate system transformations from 3D world coordinates to 3D camera coordinates. Equivalently, the extrinsic parameters define the position of the camera center and the camera's heading in world coordinates. $T$ is the position of the origin of the world coordinate system expressed in coordinates of the camera-centered coordinate system. $T$ is often mistakenly considered the position of the camera. The position, $C$ , of the camera expressed in world coordinates is $C=-R^{-1}T=-R^{T}T$ (since $R$ is a rotation matrix).

Camera calibration is often used as an early stage in computer vision.

When a camera is used, light from the environment is focused on an image plane and captured. This process reduces the dimensions of the data taken in by the camera from three to two (light from a 3D scene is stored on a 2D image). Each pixel on the image plane therefore corresponds to a shaft of light from the original scene.

Camera resectioning

Camera resectioning determines which incoming light is associated with each pixel on the resulting image. In an ideal pinhole camera, a simple projection matrix is enough to do this. With more complex camera systems, errors resulting from misaligned lenses and deformations in their structures can result in more complex distortions in the final image.

The camera projection matrix is derived from the intrinsic and extrinsic parameters of the camera, and is often represented by the series of transformations; e.g., a matrix of camera intrinsic parameters, a 3 × 3 rotation matrix, and a translation vector. The camera projection matrix can be used to associate points in a camera's image space with locations in 3D world space.

Camera resectioning is often used in the application of stereo vision where the camera projection matrices of two cameras are used to calculate the 3D world coordinates of a point viewed by both cameras.

Some people call this camera calibration, but many restrict the term camera calibration for the estimation of internal or intrinsic parameters only.

Algorithms

There are many different approaches to calculate the intrinsic and extrinsic parameters for a specific camera setup. The most common ones are:

Direct linear transformation (DLT) method
Zhang's method
Tsai's method
Selby's method (for X-ray cameras)

Zhang's method

Zhang model [2][3] is a camera calibration method that uses traditional calibration techniques (known calibration points) and self-calibration techniques (correspondence between the calibration points when they are in different positions). To perform a full calibration by the Zhang method at least three different images of the calibration target/gauge are required, either by moving the gauge or the camera itself. If some of the intrinsic parameters are given as data (orthogonality of the image or optical center coordinates) the number of images required can be reduced to two.

In a first step, an approximation of the estimated projection matrix $H$ between the calibration target and the image plane is determined using DLT method.[4] Subsequently, applying self-calibration techniques to obtained the image of the absolute conic matrix [Link]. The main contribution of Zhang method is how to extract a constrained instrinsic $K$ and $n$ numbers of $R$ and $T$ calibration parameters from $n$ pose of the calibration target.

Derivation

Assume we have a homography ${\textbf {H}}$ that maps points $x_{\pi }$ on a "probe plane" $\pi$ to points $x$ on the image.

The circular points $I,J={\begin{bmatrix}1&\pm j&0\end{bmatrix}}^{\mathrm {T} }$ lie on both our probe plane $\pi$ and on the absolute conic $\Omega _{\infty }$ . Lying on $\Omega _{\infty }$ of course means they are also projected onto the image of the absolute conic (IAC) $\omega$ , thus $x_{1}^{T}\omega x_{1}=0$ and $x_{2}^{T}\omega x_{2}=0$ . The circular points project as

{\begin{aligned}x_{1}&={\textbf {H}}I={\begin{bmatrix}h_{1}&h_{2}&h_{3}\end{bmatrix}}{\begin{bmatrix}1\\j\\0\end{bmatrix}}=h_{1}+jh_{2}\\x_{2}&={\textbf {H}}J={\begin{bmatrix}h_{1}&h_{2}&h_{3}\end{bmatrix}}{\begin{bmatrix}1\\-j\\0\end{bmatrix}}=h_{1}-jh_{2}\end{aligned}}

.

We can actually ignore $x_{2}$ while substituting our new expression for $x_{1}$ as follows:

{\begin{aligned}x_{1}^{T}\omega x_{1}&=\left(h_{1}+jh_{2}\right)^{T}\omega \left(h_{1}+jh_{2}\right)\\&=\left(h_{1}^{T}+jh_{2}^{T}\right)\omega \left(h_{1}+jh_{2}\right)\\&=h_{1}^{T}\omega h_{1}+j\left(h_{2}^{T}\omega h_{2}\right)\\&=0\end{aligned}}

Tsai's Algorithm

It is a 2-stage algorithm, calculating the pose (3D Orientation, and x-axis and y-axis translation) in first stage. In second stage it computes the focal length, distortion coefficients and the z-axis translation.[5]

Selby's method (for X-ray cameras)

Selby's camera calibration method[6] addresses the auto-calibration of X-ray camera systems. X-ray camera systems, consisting of the X-ray generating tube and a solid state detector can be modelled as pinhole camera systems, comprising 9 intrinsic and extrinsic camera parameters. Intensity based registration based on an arbitrary X-ray image and a reference model (as a tomographic dataset) can then be used to determine the relative camera parameters without the need of a special calibration body or any ground-truth data.

gollark: On the plus side, at least the encouragey bit has been reworded from Lignum's version where the player who installs software they don't understand is the one breaking the rules.

gollark: Not really.

gollark: Especially since I'm used to `rm` on Linux actually deleting multiple things, and it doesn't error if you pass it two arguments... it's very confusing.

gollark: Never heard of it.

gollark: If you have `thingy` and `stuff`, and you want to delete both for some reason, multiple arguments is nice.

References

Richard Hartley and Andrew Zisserman (2003). Multiple View Geometry in Computer Vision. Cambridge University Press. pp. 155–157. ISBN 0-521-54051-8.
Z. Zhang, "A flexible new technique for camera calibration'", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.22, No.11, pages 1330–1334, 2000
P. Sturm and S. Maybank, "On plane-based camera calibration: a general algorithm, singularities, applications'", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 432–437, Fort Collins, CO, USA, June 1999
Abdel-Aziz, Y.I., Karara, H.M. "Direct linear transformation from comparator coordinates into object space coordinates in close-range photogrammetry", Proceedings of the Symposium on Close-Range Photogrammetry (pp. 1-18), Falls Church, VA: American Society of Photogrammetry, (1971)
Roger Y. Tsai, "A Versatile Camera Calibration for High-Accuracy 3D Machine Vision Metrology Using Off-the-Shelf TV Cameras and Lenses'", IEEE Journal of Robotics and Automation, Vol. RA-3, No.4, August, 1987
Boris Peter Selby et al., "Patient positioning with X-ray detector self-calibration for image guided therapy", Australasian Physical & Engineering Science in Medicine, Vol.34, No.3, pages 391–400, 2011

External links

Zhang's Camera Calibration and Tsai's Calibration Software on LGPL licence
Zhang's Camera Calibration Method with Software
C++ Camera Calibration Toolbox with source code
Camera Calibration Toolbox for Matlab
The DLR CalDe and DLR CalLab Camera Calibration Toolbox
Camera Calibration - Augmented reality lecture at TU Muenchen, Germany
Tsai's Approach
Camera calibration (using ARToolKit)
A Four-step Camera Calibration Procedure with Implicit Image Correction

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[1] Richard Hartley and Andrew Zisserman (2003). Multiple View Geometry in Computer Vision. Cambridge University Press. pp. 155–157. ISBN 0-521-54051-8.

[2] Z. Zhang, "A flexible new technique for camera calibration'", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.22, No.11, pages 1330–1334, 2000

[3] P. Sturm and S. Maybank, "On plane-based camera calibration: a general algorithm, singularities, applications'", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 432–437, Fort Collins, CO, USA, June 1999

[4] Abdel-Aziz, Y.I., Karara, H.M. "Direct linear transformation from comparator coordinates into object space coordinates in close-range photogrammetry", Proceedings of the Symposium on Close-Range Photogrammetry (pp. 1-18), Falls Church, VA: American Society of Photogrammetry, (1971)

[5] Roger Y. Tsai, "A Versatile Camera Calibration for High-Accuracy 3D Machine Vision Metrology Using Off-the-Shelf TV Cameras and Lenses'", IEEE Journal of Robotics and Automation, Vol. RA-3, No.4, August, 1987

[6] Boris Peter Selby et al., "Patient positioning with X-ray detector self-calibration for image guided therapy", Australasian Physical & Engineering Science in Medicine, Vol.34, No.3, pages 391–400, 2011

Camera resectioning

Camera resectioning

Homogeneous coordinates

Projection

Intrinsic parameters

Extrinsic parameters

Camera resectioning

Algorithms

Zhang's method

Derivation

Tsai's Algorithm

Selby's method (for X-ray cameras)

See also

References

External links