Abstract: Text-guided object counting aims to count objects specified by textual descriptions in images, but current methods often rely on simple cosine similarity to align visual and textual ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results