Perception Encoder: The best visual embeddings are not at the output of the network

"we find that contrastive vision-language training alone can produce strong, general embeddings for all of these downstream tasks. There is only one caveat: these embeddings are hidden within the intermediate layers of the network."
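A minimal sketch of what "hidden within the intermediate layers" can mean in practice: tapping an intermediate layer's activations with a forward hook instead of using the final output. The toy model and layer index below are illustrative stand-ins, not the actual Perception Encoder architecture.

```python
import torch
import torch.nn as nn

# Toy stand-in for a vision trunk; the real model and which layer to tap
# are assumptions here, not details from the post or paper.
model = nn.Sequential(*[nn.Linear(16, 16) for _ in range(6)])

captured = {}

def make_hook(name):
    def hook(module, inputs, output):
        # Stash this layer's activation as a candidate embedding.
        captured[name] = output.detach()
    return hook

# Tap an intermediate layer rather than the network's final output.
model[3].register_forward_hook(make_hook("layer3"))

x = torch.randn(2, 16)
final = model(x)

intermediate = captured["layer3"]  # same batch, taken mid-network
```

In this framing, `intermediate` (not `final`) would be the embedding you evaluate on downstream tasks.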
@iScienceLuvr The best analogy is the matryoshka embeddings principle: the leading dimensions matter more than the later ones. It's like a super-resolution image, where the low-resolution base image carries the essential content. I made a small web app for understanding the concept. Please check it out 🙏
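The matryoshka idea above can be sketched mechanically: slice off a prefix of the embedding and re-normalize it for cheaper comparison. This only pays off when the embeddings were trained matryoshka-style so the leading dimensions carry the coarsest information; the function name and sizes here are illustrative.

```python
import numpy as np

def truncate_and_renormalize(emb, k):
    """Keep the first k dimensions and re-normalize to unit length.

    For matryoshka-trained embeddings, this prefix remains usable for
    retrieval at a fraction of the storage and compute cost.
    Illustrative helper, not from any specific library.
    """
    prefix = emb[..., :k]
    return prefix / np.linalg.norm(prefix, axis=-1, keepdims=True)

# Unit-normalized full embeddings (random data, just to show shapes).
full = np.random.randn(4, 64)
full = full / np.linalg.norm(full, axis=-1, keepdims=True)

small = truncate_and_renormalize(full, 16)  # 16-dim prefix, unit norm
```

Cosine similarities computed on `small` approximate those on `full` when the model was trained with the matryoshka objective.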