-
Notifications
You must be signed in to change notification settings - Fork 0
Classification and Segmentation Tests
Sergey 'Jin' Bostandzhyan edited this page Oct 28, 2023
·
4 revisions
Testing an image that shows multiple deer with various models.
As far as I can tell the best result was achieved with Grounded-SAM.
https://huggingface.co/spaces/IDEA-Research/Grounded-SAM (https://github.com/IDEA-Research/Grounded-Segment-Anything)




Running Grounded SAM locally in an attempt to focus on deer faces:
python grounded_sam_demo.py \
--config GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py \
--grounded_checkpoint groundingdino_swint_ogc.pth \
--sam_checkpoint sam_vit_h_4b8939.pth \
--input_image ~/intro_thumbnail.jpg \
--output_dir "outputs" \
--box_threshold 0.3 \
--text_threshold 0.25 \
--text_prompt "deer face" \
--device "cuda"
Original image:

Grounded SAM output:

Mask:
