Vector retrieval is the process of finding the K points nearest to a given point P among a set of known points. The points can have any number of dimensions: one, two, three, or many more (the image search example below uses 256). They are collectively called vectors.
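To make the definition concrete, here is a minimal brute-force sketch of K-nearest-point search. It assumes Euclidean distance as the measure of "nearest", which the definition above does not specify, and the function name `k_nearest` and the sample points are purely illustrative. Production systems typically avoid a full scan like this, but the scan shows what result is being computed.

```python
import heapq
import math


def k_nearest(points, p, k):
    """Return the k points closest to p (brute force, Euclidean distance).

    Euclidean distance is an assumption made for this sketch.
    """
    return heapq.nsmallest(k, points, key=lambda q: math.dist(q, p))


# Illustrative two-dimensional data; real vectors are usually much longer.
points = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (5.0, 5.0)]
nearest = k_nearest(points, (1.2, 0.9), k=2)
print(nearest)
```

The scan compares P against every known point, so its cost grows linearly with the size of the library.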
Image search is one example. Pailitao, a tool next to the search box on Taobao, lets you photograph an item you are interested in and then searches the platform for similar goods. The technology uses multiple deep CNN models to extract features from each image, and the extracted features are high-dimensional vectors, for example 256-dimensional vectors. At search time, the same feature extraction is applied to the photo you took, producing a 256-dimensional query vector. That vector is then used to search the whole library for similar vectors. The following figure shows the process:
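As a rough code-level illustration of the search step only, the sketch below stores product feature vectors in a toy in-memory library and ranks them against a query vector. Everything here is an assumption for illustration: the class `FeatureLibrary`, the product ids, the use of cosine similarity, and the hand-written 4-dimensional vectors that stand in for the 256-dimensional CNN features. The feature extraction itself is not shown.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class FeatureLibrary:
    """Hypothetical in-memory library mapping product ids to feature vectors."""

    def __init__(self):
        self._items = {}

    def add(self, product_id, features):
        self._items[product_id] = features

    def search(self, query, k):
        # Score every stored vector against the query, best match first.
        scored = [(cosine_similarity(query, vec), pid)
                  for pid, vec in self._items.items()]
        scored.sort(reverse=True)
        return [pid for _, pid in scored[:k]]


library = FeatureLibrary()
# Hand-written stand-ins for features a CNN would extract from product images.
library.add("red_dress", [0.9, 0.1, 0.0, 0.2])
library.add("blue_jeans", [0.1, 0.8, 0.3, 0.0])
library.add("red_skirt", [0.8, 0.2, 0.1, 0.3])

# Stand-in for the features extracted from the photo the user took.
query_features = [0.9, 0.1, 0.0, 0.1]
similar = library.search(query_features, k=2)
print(similar)
```

The key point is that the library vectors and the query vector must come from the same feature extractor, otherwise their similarity scores are not meaningful.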
The personalized recommendation scenario is similar to image search. The difference is that recommendation searches for products a user is likely to be interested in, based on the user's features. It is implemented with a twin-tower (two-tower) model, which extracts user features and product features. In the final search stage, the user features are used as the query against the product feature library. This is the item retrieval process of personalized recommendation. The following figure shows the process:
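The retrieval stage described here can be sketched as a top-K search over item vectors using a user vector as the query. The sketch assumes the inner product as the matching score, a common choice for twin-tower models but not one the text confirms. The name `retrieve_items`, the item ids, and the 3-dimensional vectors are hypothetical, and the two towers that would produce the vectors are not shown.

```python
import heapq


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def retrieve_items(user_vec, item_vecs, k):
    """Return the ids of the k items scoring highest against user_vec.

    The inner product is an assumed scoring function for this sketch.
    """
    return heapq.nlargest(
        k, item_vecs, key=lambda item_id: dot(user_vec, item_vecs[item_id])
    )


# Hypothetical output of the product tower.
item_vecs = {
    "sneakers": [0.9, 0.1, 0.0],
    "novel": [0.0, 0.2, 0.9],
    "backpack": [0.6, 0.5, 0.1],
}
# Hypothetical output of the user tower.
user_vec = [0.8, 0.3, 0.1]

recommended = retrieve_items(user_vec, item_vecs, k=2)
print(recommended)
```

Because the product vectors do not depend on the user, they can be computed ahead of time, leaving only the user vector and the search itself to be computed per request.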
Vector retrieval is also widely applied in text search, such as searching on Alipay or Taobao. The recently popular Boolean model also applies to these scenarios. It uses a deep model to extract features of the vocabulary, and these features are used in the retrieval process. Vector retrieval then finds the products that match the queried content. The following figure shows the process:
The scenarios above show how widely vector retrieval is used in search, recommendation, and other areas. Driven by deep learning, vector retrieval has developed rapidly in recent years.