FSOD-code icon indicating copy to clipboard operation
FSOD-code copied to clipboard

Question about the FSOD paper

Open NeuZhangQiang opened this issue 5 years ago • 4 comments

Thank you very much for sharing the FSOD code. I have some question when reading the FSOD paper, and I need your help:

image

  1. Does step 1 indicates the encoder features from original image? Does the step 2 indicates the ROI pooling features?

  2. What is the output of Multi-Relation Head (step 3)? Could you please tell me the shape of the output?

  3. For application, we only need the output ROI from step 2. Am I right?

  4. Is the output of step 1 probability? If not, why the "Attention RPN" can calculation the attention? In my understanding, the attention should be 0~1.

I am looking forward to your reply.

NeuZhangQiang avatar Sep 10 '20 07:09 NeuZhangQiang

Good question. Waiting answers with you.

fhong-jpg avatar Sep 14 '20 13:09 fhong-jpg

@fhong-jpg I still don't know the output of Multi-Relation Head (Question 2).

  1. The step 2 does indicate the ROI pooling features.

  2. For application, we need the output of step 4, because it can keep the true object and remove others.

  3. The output of step 1 is not probability. The "Attention" actually means correlation.

Hope someone could give some answers for Question 2.

NeuZhangQiang avatar Sep 15 '20 01:09 NeuZhangQiang

@NeuZhangQiang "The multi-relation detector then matches the query proposals and the support object" , according to the paper. The outputs of RoI Pooling are small matrices, like 4 by 4, then the multi-relation head "match" the matrices. For each matcing pair, I guess multi-relation head outputs probability(the pair match or not).

fhong-jpg avatar Sep 16 '20 02:09 fhong-jpg

@fhong-jpg However, according to the picture, the output of multi-relation detector (step 3) is the input of "Match" (step 4). If the output of multi-relation head is probability, how can it be the input of "Match"?

NeuZhangQiang avatar Sep 16 '20 07:09 NeuZhangQiang