Research Statement

Social networks such as Facebook, video and photo hosting sites like YouTube and Flickr, file-hosting service sites like Dropbox, business review sites like Yelp, and e-commerce sites like eBay, have become more and more popular, allowing people to easily upload, share and annotate massive amounts of images and videos. Web-scale visual search thus has recently become a very active inter-disciplinary area, involving computer vision, multimedia, machine learning, information retrieval, and data mining. Most content based image/video processing, such as retrieval or classification, are performed on image "features", which are much more compact relative to the raw data. Yet the volumes of such image features still can be huge due to the amount and the dimensionality.

Hence, my research is focused on addressing some essential aspects of the problem, including scalable clustering, approximate nearest neighbor search, and fast bin matching for near-duplicate image/video detection/mining. The experimental results have shown the proposed methods are comparable or better than existing solutions in terms of trade-off among search time, memory usage, database storage, and search accuracy.

Visual Search: The Future

Google's search by images
Google's similar image search seems to be built upon their powerful keyword-based image search results. That is the initial search results are refined by certain real content based techniques.
Baidu's search by images (百度识图)
Baidu's content based image search is featured by Deep Learning techniques plus memory-based indexing search.
Orbeus ReKognition
Orbeus is highlighted for their cloud-computing framework; they provide face recognition, scene understanding, and fairly impressive object recognition.
INRIA's toy image search engine
This demo image search engine shows power of some advanced nearest neighbor search techniques, which was developed by Herve Jegou et al..

Academic Resources

Software Libraries

Dataset Links