diff --git a/docs/source/object_detection.mdx b/docs/source/object_detection.mdx index 083803523d0..75d9dbb61f7 100644 --- a/docs/source/object_detection.mdx +++ b/docs/source/object_detection.mdx @@ -2,9 +2,9 @@ Object detection models identify something in an image, and object detection datasets are used for applications such as autonomous driving and detecting natural hazards like wildfire. This guide will show you how to apply transformations to an object detection dataset following the [tutorial](https://albumentations.ai/docs/examples/example_bboxes/) from [Albumentations](https://albumentations.ai/docs/). -To run these examples, make sure you have up-to-date versions of `albumentations` and `cv2` installed: +To run these examples, make sure you have up-to-date versions of [albumentations](https://albumentations.ai/docs/) and [cv2](https://docs.opencv.org/4.10.0/) installed: -``` +```bash pip install -U albumentations opencv-python ``` @@ -40,12 +40,12 @@ The dataset has the following fields: - `objects`: A dictionary containing bounding box metadata for the objects in the image: - `id`: The annotation id. - `area`: The area of the bounding box. - - `bbox`: The object's bounding box (in the [coco](https://albumentations.ai/docs/getting_started/bounding_boxes_augmentation/#coco) format). + - `bbox`: The object's bounding box (in the [coco](https://albumentations.ai/docs/3-basic-usage/bounding-boxes-augmentations/#understanding-bounding-box-formats) format). - `category`: The object's category, with possible values including `Coverall (0)`, `Face_Shield (1)`, `Gloves (2)`, `Goggles (3)` and `Mask (4)`. You can visualize the `bboxes` on the image using some internal torch utilities. To do that, you will need to reference the [`~datasets.ClassLabel`] feature associated with the category IDs so you can look up the string labels: - + ```py >>> import torch >>> from torchvision.ops import box_convert @@ -68,7 +68,7 @@ You can visualize the `bboxes` on the image using some internal torch utilities. ```
+
+
+