forked from jonazpiazu/PTAM
-
Notifications
You must be signed in to change notification settings - Fork 0
/
PatchFinder.h
143 lines (116 loc) · 6.64 KB
/
PatchFinder.h
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
// -*- c++ -*-
// Copyright 2008 Isis Innovation Limited
//
// This header declares the PatchFinder class.
// This is quite a low-level class.
//
// The purpose of the PatchFinder is to find a map point in a new view.
// This proceeds in multiple stages:
// 1. A warping matrix appropriate for the current view is calculated,
// 2. A `coarse' matching template of the map point is generated by warping;
// 3. The new view is searched for this template at corner locations;
// 4. An inverse-composition (`fine') matching template is generated;
// 5. A sub-pixel accurate position is calculated using inverse composition.
//
// To clarify points 1 and 2 above: Each map point does _not_ store a `patch'
// in the style of many other SLAM implementations! Each map point came from
// a key-frame, and that key-frame is stored in the map. Each map point only
// stores the information on where it came from in what key-frame. Patches
// are then generated from the pixels of the source key-frame. According to the
// relative camera poses now and of the source key-frame, a warp has to be
// applied to generate an NxN pixel square template appropriate for the current
// view. Thus, the matching template is always NxN, but the region of source
// pixels used can change size and shape! This is all described in the
// Klein & Murray ISMAR 2007.
//
// Most of the above stages are optional, and there are different versions
// of stages 1/2, including some for operating when there is not yet a 3D map
// point. The class provides no safety checks to ensure that the operations
// are carried out in the right order - it's the caller's responsibility.
//
// The patch finder uses zero-mean SSD as its difference metric.
//
// Although PatchFinder can use arbitrary-sized search templates (it's determined
// at construction), the use of 8x8 pixel templates (the default) is highly
// recommended, as the coarse search for this size is SSE-optimised.
#ifndef __PATCHFINDER_H
#define __PATCHFINDER_H
#include <TooN/TooN.h>
using namespace TooN;
#include <TooN/se3.h>
#include <cvd/image.h>
#include <cvd/byte.h>
#include "MapPoint.h"
#include "LevelHelpers.h"
class PatchFinder
{
public:
// Constructor defines size of search patch.
PatchFinder(int nPatchSize = 8);
// Step 1 Function.
// This calculates the warping matrix appropriate for observing point p
// from the current view (described as an SE3.) This also needs the camera
// projection derivates at level zero for the point's projection in the new view.
// It also calculates which pyramid level we should search in, and this is
// returned as an int. Negative level returned denotes an inappropriate
// transformation.
int CalcSearchLevelAndWarpMatrix(MapPoint &p, SE3<> se3CFromW, Matrix<2> &m2CamDerivs);
inline int GetLevel() { return mnSearchLevel; }
inline int GetLevelScale() { return LevelScale(mnSearchLevel); }
// Step 2 Functions
// Generates the NxN search template either from the pre-calculated warping matrix,
// or an identity transformation.
void MakeTemplateCoarseCont(MapPoint &p); // If the warping matrix has already been pre-calced, use this.
void MakeTemplateCoarse(MapPoint &p, SE3<> se3CFromW, Matrix<2> &m2CamDerivs); // This also calculates the warp.
void MakeTemplateCoarseNoWarp(MapPoint &p); // Identity warp: just copies pixels from the source KF.
void MakeTemplateCoarseNoWarp(KeyFrame &k, int nLevel, CVD::ImageRef irLevelPos); // Identity warp if no MapPoint struct exists yet.
// If the template making failed (i.e. it needed pixels outside the source image),
// this bool will return false.
inline bool TemplateBad() { return mbTemplateBad;}
// Step 3 Functions
// This is the raison d'etre of the class: finds the patch in the current input view,
// centered around ir, Searching around FAST corner locations only within a radius nRange only.
// Inputs are given in level-zero coordinates! Returns true if the patch was found.
bool FindPatchCoarse(CVD::ImageRef ir, KeyFrame &kf, unsigned int nRange);
int ZMSSDAtPoint(CVD::BasicImage<CVD::byte> &im, const CVD::ImageRef &ir); // This evaluates the score at one location
// Results from step 3:
// All positions are in the scale of level 0.
inline CVD::ImageRef GetCoarsePos() { return CVD::ImageRef((int) mv2CoarsePos[0], (int) mv2CoarsePos[1]);}
inline Vector<2> GetCoarsePosAsVector() { return mv2CoarsePos; }
// Step 4
void MakeSubPixTemplate(); // Generate the inverse composition template and jacobians
// Step 5 Functions
bool IterateSubPixToConvergence(KeyFrame &kf, int nMaxIts); // Run inverse composition till convergence
double IterateSubPix(KeyFrame &kf); // Single iteration of IC. Returns sum-squared pixel update dist, or negative if out of imag
inline Vector<2> GetSubPixPos() { return mv2SubPixPos; } // Get result
void SetSubPixPos(Vector<2> v2) { mv2SubPixPos = v2; } // Set starting point
// Get the uncertainty estimate of a found patch;
// This for just returns an appropriately-scaled identity!
inline Matrix<2> GetCov()
{
return LevelScale(mnSearchLevel) * Identity;
};
int mnMaxSSD; // This is the max ZMSSD for a valid match. It's set in the constructor.
protected:
int mnPatchSize; // Size of one side of the matching template.
// Some values stored for the coarse template:
int mnTemplateSum; // Cached pixel-sum of the coarse template
int mnTemplateSumSq; // Cached pixel-squared sum of the coarse template
inline void MakeTemplateSums(); // Calculate above values
CVD::Image<CVD::byte> mimTemplate; // The matching template
CVD::Image<std::pair<float,float> > mimJacs; // Inverse composition jacobians; stored as floats to save a bit of space.
Matrix<2> mm2WarpInverse; // Warping matrix
int mnSearchLevel; // Search level in input pyramid
Matrix<3> mm3HInv; // Inverse composition JtJ^-1
Vector<2> mv2SubPixPos; // In the scale of level 0
double mdMeanDiff; // Updated during inverse composition
CVD::ImageRef mirPredictedPos; // Search center location of FindPatchCoarse in L0
Vector<2> mv2CoarsePos; // In the scale of level 0; hence the use of vector rather than ImageRef
CVD::ImageRef mirCenter; // Quantized location of the center pixel of the NxN pixel template
bool mbFound; // Was the patch found?
bool mbTemplateBad; // Error during template generation?
// Some cached values to avoid duplicating work if the camera is stopped:
MapPoint *mpLastTemplateMapPoint; // Which was the last map point this PatchFinder used?
Matrix<2> mm2LastWarpMatrix; // What was the last warp matrix this PatchFinder used?
};
#endif