-
Notifications
You must be signed in to change notification settings - Fork 655
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question of tracking the untrackable #1
Comments
@abhineet123 Did you mean that is the input of target RNN(O) and score is the output? But in Figure 2. , is the output of fc layer following RNN(O). |
@abhineet123 I think I can understand it now. Thank you very much for your elaborate explanation. |
Yes that is what I mean. |
@abhineet123 I am really grateful for your reply. Later I will ask the author for more details. |
Glad to be of assistance and please let me know what the authors have to say about this. |
Hi, I asked the author but haven't received response yet. Here I have another question: do you know how to train the LSTM in this appearance model? Is LSTM included in siamese CNN? I'm still connused about the training procedure. Thank you very much. |
No I am also waiting for the authors to release their code to get the details of the training procedure. |
Hi, |
Probably some kind of optical flow algorithm like cvCalcOpticalFlowPyrLK. |
Hi, In 3.5 Target, any idea what is the input sequence to the Target RNN. The authors mention that the output of appearance, motion and interaction are concatenated and passed to Target RNN. But then how do that result in a sequence? |
Has the code been released yet or any implementation available? |
Not as far as I know. |
hi, is the the paper Tracking The Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies have code to be implementation |
Not that I am aware of. |
Do you have any intuition about input image size fed into CNN? VGG16 takes 224*224 size as input and produces 28055 as first FC layer. But, a person's size(height and width) would not be a square. So how they cropped the image? If input image size is different, then first FC layer will be different. |
In their earlier paper, they extract the patch and then resize it to a fixed size (224*224 in your case) without preserving the aspect ratio. Though the patch becomes distorted to human eyes, it probably doesn't make any difference to the CNN as long as test patches are distorted in the same way as training ones. |
hi, is the the paper Tracking The Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies have code to be implementation? |
Not that I'm aware of. |
Is there a tracking method using self-attention(such as transformer、BERT) recently? |
Hi,
I just read the paper Tracking The Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies and I have a question which I hope you can give me some hints:
What is the dimension of the similarity score ( vector or number)? Say if t_i connects d_j, is the score denoted as or is the output of some process of ?
I am looking forward to your answer. Thank you very much,
The text was updated successfully, but these errors were encountered: