
Conversation

@ekagra-ranjan (Contributor)

This PR addresses #1027 by:

  1. adding checks on boxes, masks, keypoints and labels attributes of target passed to detection models.
  2. updating docs

@codecov-io commented Jul 4, 2019

Codecov Report

Merging #1091 into master will decrease coverage by 0.12%.
The diff coverage is 0%.


@@            Coverage Diff             @@
##           master    #1091      +/-   ##
==========================================
- Coverage   64.65%   64.53%   -0.13%     
==========================================
  Files          68       68              
  Lines        5410     5417       +7     
  Branches      830      834       +4     
==========================================
- Hits         3498     3496       -2     
- Misses       1662     1669       +7     
- Partials      250      252       +2
Impacted Files Coverage Δ
torchvision/models/detection/faster_rcnn.py 74.39% <ø> (ø) ⬆️
torchvision/models/detection/mask_rcnn.py 81.35% <ø> (ø) ⬆️
torchvision/models/detection/keypoint_rcnn.py 81.81% <ø> (ø) ⬆️
torchvision/models/detection/roi_heads.py 55.93% <0%> (-0.97%) ⬇️
torchvision/transforms/transforms.py 80.94% <0%> (-0.56%) ⬇️
torchvision/models/densenet.py 86.79% <0%> (ø) ⬆️
torchvision/models/mnasnet.py 82.71% <0%> (ø) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d762537...3fb2a9e.

@fmassa (Member) left a comment

Thanks a lot for the PR!

I have a question regarding how to support fp16 training in the future. Let me know what you think.

"""
if targets is not None:
for t in targets:
assert t["boxes"].dtype == torch.float32, 'target boxes must of float type'
@fmassa (Member)

Those checks might not work if we want to perform fp16 training in the future.
Maybe it would be better to use something like t["boxes"].dtype.is_floating_point?
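The suggested check can be sketched as follows; `check_targets` is a hypothetical helper for illustration, not the function in the PR:

```python
import torch

def check_targets(targets):
    # Hypothetical helper: dtype.is_floating_point is True for
    # float16/float32/float64, so fp16 targets would still pass,
    # unlike a hard `== torch.float32` comparison.
    for t in targets:
        assert t["boxes"].dtype.is_floating_point, \
            "target boxes must be of floating-point type"
        assert t["labels"].dtype == torch.int64, \
            "target labels must be of int64 type"

targets = [{"boxes": torch.zeros(3, 4, dtype=torch.float16),
            "labels": torch.zeros(3, dtype=torch.int64)}]
check_targets(targets)  # passes even with fp16 boxes
```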

@ekagra-ranjan (Contributor, Author)

Yes, you are right! Will change it.

assert t["boxes"].dtype == torch.float32, 'target boxes must of float type'
assert t["labels"].dtype == torch.int64, 'target labels must of int64 type'
if self.has_mask:
assert t["masks"].dtype == torch.uint8, 'target masks must of uint8 type'
@fmassa (Member)

Do the masks need to be uint8? I thought the code worked for other types as well, since in the only place where the masks are used in the model we perform a cast to float.

@ekagra-ranjan (Contributor, Author)

I was following the tutorial, where the dtype of the mask was specified as uint8. Should I remove that check?

@fmassa (Member)

I don't think having the masks be uint8 is a hard restriction, so yes, maybe remove this check for now.
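The point above can be illustrated: since the masks are cast to float where the model consumes them, uint8, bool, and float inputs end up identical after the cast (a sketch, not the model's actual code path):

```python
import torch

m_uint8 = torch.tensor([[0, 1], [1, 0]], dtype=torch.uint8)
m_bool  = m_uint8.to(torch.bool)
m_float = m_uint8.to(torch.float32)

# After the cast the model performs, all three agree
assert torch.equal(m_uint8.float(), m_float)
assert torch.equal(m_bool.float(), m_float)
```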

@ekagra-ranjan (Contributor, Author)

Done.

During training, the model expects both the input tensors, as well as a targets (list of dictionary),
containing:
-    - boxes (Tensor[N, 4]): the ground-truth boxes in [x0, y0, x1, y1] format, with values
+    - boxes (FloatTensor[N, 4]): the ground-truth boxes in [x0, y0, x1, y1] format, with values
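A minimal example of what the documented FloatTensor[N, 4] shape means in practice (box values made up for illustration):

```python
import torch

# Two ground-truth boxes in [x0, y0, x1, y1] format;
# "FloatTensor[N, 4]" just means a float32 tensor of shape (N, 4)
boxes = torch.tensor([[10.0, 20.0, 50.0, 80.0],
                      [ 0.0,  0.0, 30.0, 30.0]])
assert boxes.shape == (2, 4)
assert boxes.dtype == torch.float32
# Each box must satisfy x1 > x0 and y1 > y0
assert bool(((boxes[:, 2] > boxes[:, 0]) & (boxes[:, 3] > boxes[:, 1])).all())
```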
@fmassa (Member)

Those are great changes, thanks!

But I'm a bit concerned that this looks like the legacy interface of torch.FloatTensor, etc.
I wonder if there is a better way of representing this? For example, in numpy everything is an ndarray, but with different types.

Thoughts?

@ekagra-ranjan (Contributor, Author) commented Jul 9, 2019

Would descriptions like boxes (Tensor[N, 4], dtype=torch.float) or boxes (FloatTensor(N, 4)) be better?

(Tensor[N, 4], dtype=torch.float) is not valid syntax but conveys the requirement, whereas FloatTensor(N, 4) would actually create a float tensor with dim = (N, 4).

@fmassa (Member)

Let's just keep this as is for now. But I'd like to remove the FloatTensor changes and only keep Int64Tensor, because boxes is not required to be float.

@ekagra-ranjan (Contributor, Author)

Sorry for the delayed response. Please have a look at my response, @fmassa.

@ekagra-ranjan (Contributor, Author)

@fmassa Made the changes!

@fmassa (Member) left a comment

Thanks!
