If your business data is stored in PolarDB-X 1.0 and you want to run full-text searches and semantic analytics on it, you can synchronize the data to Alibaba Cloud Elasticsearch. Alibaba Cloud Elasticsearch is a distributed search and analytics engine based on Lucene that lets you store, query, and analyze large volumes of data in near real time. You can use Logstash or DataWorks to synchronize data from PolarDB-X 1.0 to Alibaba Cloud Elasticsearch. This topic describes both synchronization methods and their use scenarios so that you can select one based on your business requirements.

| Synchronization method | Description | Use scenario | Limit | References |
|---|---|---|---|---|
| Use the logstash-input-jdbc plug-in to synchronize data | You can use the logstash-input-jdbc plug-in to query multiple data records in a PolarDB-X 1.0 database and synchronize the data to an Elasticsearch cluster. During data synchronization, the plug-in polls the PolarDB-X 1.0 database at regular intervals to identify the most recently inserted or updated data records. It then queries all identified records in a single batch and synchronizes them to the Elasticsearch cluster. | | | None |
| Use DataWorks to synchronize offline data | DataWorks is a comprehensive service that provides various modules such as Data Integration, DataStudio, and Data Quality. You can use DataWorks to import and store structured data, convert and develop the data, and then synchronize the processed data to Elasticsearch clusters or other data systems. | | | Use DataWorks to synchronize data from a DRDS database to an Elasticsearch cluster in offline mode |
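
The periodic "find recently changed records, then copy them in one batch" behavior described for the logstash-input-jdbc plug-in can be illustrated with a minimal sketch. This is not the plug-in's implementation: it assumes a hypothetical `articles` table with an `update_time` column, uses an in-memory `sqlite3` database as a stand-in for PolarDB-X 1.0, and uses a plain dictionary as a stand-in for an Elasticsearch index.

```python
import sqlite3


def sync_changed_rows(conn, index, last_seen):
    """Copy rows modified after `last_seen` into `index`; return the new checkpoint."""
    rows = conn.execute(
        "SELECT id, title, update_time FROM articles "
        "WHERE update_time > ? ORDER BY update_time",
        (last_seen,),
    ).fetchall()
    for row_id, title, update_time in rows:
        # Keying by primary key means an updated row overwrites its old document.
        index[row_id] = {"title": title, "update_time": update_time}
        last_seen = update_time
    return last_seen


# Stand-in source database (hypothetical schema, not a real PolarDB-X 1.0 table).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE articles (id INTEGER PRIMARY KEY, title TEXT, update_time INTEGER)"
)
conn.executemany(
    "INSERT INTO articles VALUES (?, ?, ?)",
    [(1, "first", 100), (2, "second", 110)],
)

index = {}  # stand-in for the Elasticsearch index

# First poll: both rows are newer than the initial checkpoint of 0.
checkpoint = sync_changed_rows(conn, index, 0)

# A row is updated in the source between polls.
conn.execute(
    "UPDATE articles SET title = 'first (edited)', update_time = 120 WHERE id = 1"
)

# Second poll: only the row changed since the last checkpoint is copied.
checkpoint = sync_changed_rows(conn, index, checkpoint)
```

Tracking a checkpoint on a monotonically increasing column keeps each poll limited to the rows that changed since the previous one. One consequence of this design is that rows deleted from the source are never selected, so a sketch like this does not remove them from the target.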