Commit

Update readme.md
mehrdad-dev committed Jun 12, 2022
1 parent 2824de9 commit 6e9346b
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions readme.md
@@ -7,6 +7,8 @@ CLIP model was proposed by the OpenAI company, to understand the semantic simila
It is used to perform zero-shot learning tasks, such as finding objects in an image based on an input query.
![Mehrdad Mohammadian](https://raw.githubusercontent.com/mehrdad-dev/SBO/main/assets/clip.png)

CLIP pre-trains an image encoder and a text encoder to predict which images were paired with which texts in its training dataset. This behavior is then used to turn CLIP into a zero-shot classifier: all of a dataset’s classes are converted into captions such as “a photo of a dog”, and the predicted class is the one whose caption CLIP estimates best pairs with a given image.
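The zero-shot step can be sketched in a few lines. This is a minimal illustration of the scoring logic only, not the repository's code: the 3-dimensional vectors are made-up stand-ins for real encoder outputs, `zero_shot_classify` is a hypothetical helper name, and the temperature of 100 is an assumed scaling factor for the cosine similarities.

```python
import math


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def zero_shot_classify(image_embedding, caption_embeddings, temperature=100.0):
    """Return (best_index, probabilities) over the candidate captions.

    Hypothetical helper: scores each caption by cosine similarity with the
    image embedding, then applies a softmax over the scaled similarities.
    """
    img = normalize(image_embedding)
    logits = []
    for cap in caption_embeddings:
        cap = normalize(cap)
        cosine = sum(a * b for a, b in zip(img, cap))
        logits.append(temperature * cosine)

    # Numerically stable softmax.
    top = max(logits)
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs


# Turn class names into captions, as described above.
classes = ["dog", "cat", "car"]
captions = [f"a photo of a {c}" for c in classes]

# Assumption: toy embeddings standing in for the text and image encoders.
caption_embeddings = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0], [0.0, 0.1, 0.9]]
image_embedding = [0.8, 0.2, 0.1]

best, probs = zero_shot_classify(image_embedding, caption_embeddings)
print(captions[best])
```

With real encoders, the only change would be where the embeddings come from; the comparison and softmax stay the same.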

YOLOv5 is used in the first step of the method to detect the locations of the objects in an image.
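One way to picture the two-stage flow (detect first, then match each detected region against the query) is the sketch below. It is an assumption about how the stages fit together, not the project's implementation: `find_object`, `toy_detect`, and `toy_embed` are hypothetical names, the detector and encoder are replaced by trivial stubs, and the image is a small grid of brightness values.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def crop(image, box):
    """Cut the region (x1, y1, x2, y2) out of a row-major 2D grid."""
    x1, y1, x2, y2 = box
    return [row[x1:x2] for row in image[y1:y2]]


def find_object(image, query_embedding, detect, embed_image):
    """Return (box, score) for the detected region that best matches the query.

    `detect` stands in for an object detector and `embed_image` for an
    image encoder; both are passed in so real models could be swapped in.
    """
    scored = []
    for box in detect(image):
        score = cosine(embed_image(crop(image, box)), query_embedding)
        scored.append((box, score))
    return max(scored, key=lambda pair: pair[1])


# Stub detector: always proposes the left and right halves of a 4x4 image.
def toy_detect(image):
    return [(0, 0, 2, 4), (2, 0, 4, 4)]


# Stub encoder: embeds a crop as [brightness, darkness] in the range 0..1.
def toy_embed(region):
    pixels = [p for row in region for p in row]
    mean = sum(pixels) / len(pixels) / 9
    return [mean, 1.0 - mean]


image = [
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
]
query_embedding = [1.0, 0.0]  # toy stand-in for the text query "something bright"

box, score = find_object(image, query_embedding, toy_detect, toy_embed)
print(box, round(score, 3))
```

Passing the detector and encoder as arguments keeps the matching logic independent of any particular model library.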

