3D rotations matrices can make your head spin. I know it is a bad pun but truth can sometimes be very punny! -- https://www.learnopencv.com/rotation-matrix-to-euler-angles/
It has always been a burden to understand the physical meaning of Euler angle and do the rotation manipulation in code right, for a really long time. Each and every time I thought I was right and tweaked the codes and later found that I was wrong in one way or another.
I will try to show how to do Euler angle right, both intuitively and with code. Hope this time I am right.
Euler Sequence matters
Rotations given as a set of three angles can refer to any of 24 different ways of applying these rotations, or equivalently, 24 conventions for rotation angles. 1
For example, in intrinsic
scenario -- the axes move with the rotations like we rotate one dice. By only rotating around two axes, (X=90°, Y=90°) and (Y=90°, X=90°), we will get a totally different pose, as shown in .
Notations we use
In most of our scenarios (camera relocalization workflow for the most time), we rotate cameras a lot, and intrinsic
representation makes a great scene.
There are some other representations of rotations like axis-angle and quaternion, but they are less intuitive in the hardware transformation view. And the rotation matrix is chosen for mathematical manipulating, which is also adopted by OpenCV in their pose estimation functions.
UnrealCV as an example
We use unrealcv2 extensively for our ACR (Active Camera Relocalization), and it alone is one interesting resource for many other computer vision projects.
For rotating the camera as will, we need to find out the sequence of rotation in UnrealCV and do the transformation if necessary. We propose to rotate with a rather large angle to find out the rotation sequence in virtual env. E.g. rotate with (X=90°, Y=90°, Z=0), (X=90°, Y=0, Z=90°), (X=0, Y=90°, Z=90°) or (X=90°, Y=90°, Z=90°), with them, we can find the rotation sequence in UnrealCV is YXZ (yaw first, then pitch and roll).
After we get the sequence, for demonstration, as in Fig , we transform (pitch(x)=10°, yaw(y)=20°, roll(z)=30°) in different order into YXZ, perform the rotation in UnrealCV and show the captured images as visual clue.
So, we need to transform the rotation matrix (or one specific Euler sequence) into YXZ for image capturing. And the underlying mechanism stays the same when we choose to do camera rotation in other game engine or hardware.
Conclusion
Euler angle is one of the representations of 3D rotation and pretty intuitive one. However, the sequence of Euler angle makes a great difference, we need to figure out the right sequence before using it. And it makes no good to just GUESS
and see how the result comes -- which we did a lot when implementing ACR at first. Codes designed carefully should be better than tweaked one.