CLIP has no explicit OCR support, but it does have a slight, somewhat incidental understanding of text: training captions often contain (some of) the text that appears in the image, so the model picks up an association between rendered text and its transcription.
I think the SigLIP models' training dataset (WebLI) includes OCR-derived text too, so they have very good text understanding. I tested a bunch of models for my own meme search engine.