VisualRank

VisualRank is a system for finding and ranking images by analysing and comparing their content, rather than by searching image names, Web links or other text. Google scientists made the work public in 2008 at the International World Wide Web Conference in Beijing, in a paper describing the application of PageRank to Google image search. [1]

The authors summarise the approach as follows: "We cast the image-ranking problem into the task of identifying 'authority' nodes on an inferred visual similarity graph and propose VisualRank to analyze the visual link structures among images. The images found to be 'authorities' are chosen as those that answer the image-queries well."

Methods

The VisualRank algorithm uses both computer vision techniques and locality-sensitive hashing (LSH). Consider an image search initiated by a text query. An existing search technique based on image metadata and surrounding text is used to retrieve the initial result candidates (PageRank). These candidates, along with other images in the index, are clustered in a graph according to their precomputed similarity. Centrality is then measured on the clustering, which returns the most canonical image or images with respect to the query. The idea is that agreement between users of the web about an image and its related concepts will result in those images being deemed more similar.

VisualRank is defined iteratively by $VR=S^{*}\times VR$, where $S^{*}$ is the image similarity matrix and $VR$ is the vector of rank scores. Because the definition is a matrix equation of this form, the measure applied is eigenvector centrality: repeated multiplication of $VR$ by $S^{*}$ produces the required eigenvector. The image similarity measure is therefore crucial to the performance of VisualRank, since it determines the underlying graph structure.
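The iteration $VR=S^{*}\times VR$ can be illustrated with a short power-iteration sketch in Python. This is a minimal illustration rather than the published implementation, and it rests on several assumptions that the text above does not state: the similarity matrix is column-normalized before use, the scores are rescaled to sum to 1 after each step, and an optional PageRank-style damping term is available (with the default `damping=1.0` the update reduces to the plain formula). The function names and the toy similarity values are hypothetical.

```python
def column_normalize(similarity):
    """Scale each column of a square similarity matrix so it sums to 1."""
    n = len(similarity)
    col_sums = [sum(similarity[r][c] for r in range(n)) for c in range(n)]
    return [
        [similarity[r][c] / col_sums[c] if col_sums[c] else 0.0 for c in range(n)]
        for r in range(n)
    ]


def visual_rank(similarity, damping=1.0, iterations=100, tol=1e-10):
    """Power iteration for VR = S* x VR.

    damping < 1 mixes in a uniform jump, as in PageRank; that term is an
    assumption here and is not part of the formula quoted in the text.
    """
    n = len(similarity)
    s_star = column_normalize(similarity)
    vr = [1.0 / n] * n  # start from a uniform score vector
    for _ in range(iterations):
        new_vr = [
            damping * sum(s_star[r][c] * vr[c] for c in range(n))
            + (1.0 - damping) / n
            for r in range(n)
        ]
        total = sum(new_vr) or 1.0  # avoid dividing by zero on an empty graph
        new_vr = [value / total for value in new_vr]
        converged = max(abs(a - b) for a, b in zip(new_vr, vr)) < tol
        vr = new_vr
        if converged:
            break
    return vr


# Hypothetical toy data: images 0-2 resemble each other, image 3 is an outlier.
toy_similarity = [
    [0.0, 0.9, 0.8, 0.1],
    [0.9, 0.0, 0.7, 0.1],
    [0.8, 0.7, 0.0, 0.1],
    [0.1, 0.1, 0.1, 0.0],
]
scores = visual_rank(toy_similarity)
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy graph the outlier image receives the lowest score, because it is only weakly similar to every other image, while the three mutually similar images share most of the rank.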

The main VisualRank system begins by extracting local feature vectors from images using the scale-invariant feature transform (SIFT). Local feature descriptors are used instead of color histograms because they allow similarity to be considered between images that differ by rotation, scale, or perspective transformations. Locality-sensitive hashing is then applied to these feature vectors using the p-stable distribution scheme, with a Gaussian distribution used under the $\ell _{2}$ norm. In addition, LSH amplification using AND/OR constructions is applied.
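The hashing step can be sketched as follows. The example uses the standard p-stable hash for the $\ell _{2}$ norm, $h(v)=\lfloor (a\cdot v+b)/w\rfloor$ with Gaussian $a$ and uniform $b$, concatenates `k` such hashes per table (the AND construction) and treats two vectors as candidates if any table agrees (the OR construction). The class name, the parameter values and the short vectors are hypothetical, chosen only for illustration; this is not the configuration used by VisualRank.

```python
import math
import random


class L2HashTable:
    """One hash table: the AND of k p-stable hashes for the l2 norm."""

    def __init__(self, dim, k, w, rng):
        self.w = w  # bucket width
        # Gaussian entries: the normal distribution is 2-stable.
        self.projections = [
            [rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(k)
        ]
        self.offsets = [rng.uniform(0.0, w) for _ in range(k)]

    def key(self, vector):
        """Return the k-tuple of bucket indices floor((a.v + b) / w)."""
        return tuple(
            math.floor(
                (sum(a_i * v_i for a_i, v_i in zip(a, vector)) + b) / self.w
            )
            for a, b in zip(self.projections, self.offsets)
        )


def candidate_match(tables, u, v):
    """OR construction: u and v are candidates if any table gives equal keys."""
    return any(table.key(u) == table.key(v) for table in tables)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
dim, k, num_tables, w = 8, 4, 6, 4.0  # illustrative values only
tables = [L2HashTable(dim, k, w, rng) for _ in range(num_tables)]

descriptor = [rng.random() for _ in range(dim)]
nearby = [x + 0.01 for x in descriptor]  # a slightly perturbed copy

same_vector_matches = candidate_match(tables, descriptor, descriptor)
nearby_is_candidate = candidate_match(tables, descriptor, nearby)
```

Raising `k` makes each table more selective, so fewer distant vectors collide, while adding tables recovers near neighbours that a single selective table would miss.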

References

[1] Jing, Yushi; Baluja, S. (2008). "VisualRank: Applying PageRank to Large-Scale Image Search". IEEE Transactions on Pattern Analysis and Machine Intelligence. 30 (11): 1877–1890. CiteSeerX 10.1.1.309.741. doi:10.1109/TPAMI.2008.121. ISSN 0162-8828. PMID 18787237.
