- C++ (GNU 7.5)
- ZeroMQ ()
- OpenCV (3.4.1)
- Boost Library (1.65)
- DLib
- DBoW2
Tested on Linux machine with Ubuntu 18.04 LTS.
To be added soon.
The software can be run by passing additional arguments. The arguments have pre-defined positions. The minimum number of parameters is 6. The maximum number is 19. The software does not run if the number of arguments is less than the minimum or higher than the maximum, and a warning and help is provided. The arguments are listed below according to their position along with their brief explanation and default values.
- agent_id: the unique identifier of the agent/camera (e.g., 1)
- rcv_tcp_port: the port number to set in ZMQ for the receiver (request-reply mode). Default: 5555.
- snd_tcp_port: the port number to set in ZMQ for the sender. Default: 5556.
- videopath: . Default: "".
- respath: . Default: "../results/";
- strVocFile: . Default: "./Vocabulary/ORBvoc.txt";
- num_max_frames: the maximum number of frames to process in a recorded video. Default: -1 (no max number, the whole image sequence is processed).
- max_num_kps: the maximum target number of keypoints that can be localised in an image. Default: 1000.
- matching_th: threshold when matching binary local features. Default: -1.
- b_snn: boolean to use Lowe's ratio test to filter out ambiguous matches for binary features. The test defines a distance ratio between the closest and the second closest binary feature (or second nearest neighbour, SNN). Deafult: false.
- snn_th: threshold on Lowe's ratio test. Matches whose test is lower than this threshold are discarded. When the threshold is lower (e.g., 0.6), the test is more restrictive, enforcing a larger distance between the first and second closest neighbours for a query binary feature (fewer matches). When the threshold is higher (e.g., 0.8), the test is more permissive, allowing more matches that can be also erroneous. The recommended value is between 0.6 and 0.8. Default: 0.6.
- gf_type: DBoW, NetVLAD, DeepBit. Default: DBoW.
- gf_dim: the dimensionality of the global feature (vector representing the whole image). Default: -1.
- featpath: . Default: "../feats/backyard/view1/netvlad/netvlad_feats.txt";
- collaborative_mode: . Default: true.
- init_wnd: Number of frames for initialisation window. Default: 30.
- feat_sharing_freq: Frequency of sharing features: Default: 5.
- synch_mode: . Default: false.
- vprcallfilename: . Default: "";
Each agent (camera) is outputting three files in .txt format, and :
- agent(ID)_vpr_res.txt
- running_times_agent(ID).txt
- running_times_VPR_agent(ID).txt
The first output file (agent(ID)_vpr_res.txt) provides a summary of the detection results. The file contains the following columns:
- Index of the current processed frame
- Index of the frame from which features has been shared (i.e., the current frame index in this implementation)
- Index of the other agent/camera
- Index of the current processed frame by the other camera/agent
- Index of the the detected overlapping frame in the other camera
- Number of matched (ORB) binary features
A. Xompero and A. Cavallaro, Cross-Camera View-Overlap Recognition, International Workshop on Distributed Smart Cameras (IWDSC), European Conference on Computer Vision Workshops, 24 October 2022.
@InProceedings{Xompero2022IWDSC,
title = {Cross-Camera View-Overlap Recognition},
author = {Xompero, A. and and Cavallaro, A.},
booktitle = {Proceedings of the European Conference on Computer Vision Workshops},
note = {Internation Workshop on Distributed Smart Cameras},
address={Online},
month="24" # OCT,
year = {2022},
}
If you have any further enquiries, question, or comments, please contact [email protected]. If you would like to file a bug report or a feature request, use the Github issue tracker.
This work is licensed under the MIT License. To view a copy of this license, see LICENSE.