
ApsaraDB for MongoDB:Use MongoDB change streams to capture data changes in real time

Last Updated: Mar 11, 2026

You can subscribe to MongoDB change streams to respond to data changes in your database in real time. This topic describes MongoDB change streams, explains how to use them, and provides best practices.

What is a change stream?

A change stream provides a real-time stream of change events from a database. Clients can subscribe to this stream to receive immediate notifications when data is inserted, updated, or deleted. Common scenarios include the following:

  • Cross-cluster data synchronization: Replicate incremental data between MongoDB clusters.

  • Auditing: Track high-risk operations, such as deleting a database or collection.

  • Event-driven architecture: Push changes to downstream systems for real-time analysis, cache updates, or notifications.
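For orientation, the following sketch shows the approximate shape of an insert event as a plain object. All field values here are illustrative; in a real stream, _id is an opaque resume token and clusterTime is a BSON timestamp:

```javascript
// A simplified insert change event (illustrative values, not real output).
const event = {
  _id: { _data: "8263A5..." },              // resume token (truncated, illustrative)
  operationType: "insert",                  // insert | update | delete | drop | ...
  clusterTime: { t: 1671700000, i: 1 },     // oplog timestamp of the change
  ns: { db: "test", coll: "myCollection" }, // namespace the change applies to
  documentKey: { _id: 1 },                  // _id (plus shard key, if sharded)
  fullDocument: { _id: 1, name: "alice" }   // the inserted document
};

// Consumers typically dispatch on operationType and namespace:
function describeEvent(ev) {
  return `${ev.operationType} on ${ev.ns.db}.${ev.ns.coll}`;
}
console.log(describeEvent(event)); // "insert on test.myCollection"
```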

Version history


MongoDB 3.6

  • Initial release.

  • Only supports subscriptions at the collection level.

  • Supports a limited set of event types.

  • Supports fault recovery.

  • Supports post-images (through the fullDocument: 'updateLookup' option).

MongoDB 4.0

  • Supports subscriptions at the database and cluster levels.

  • Supports drop, dropDatabase, and rename events.

  • The format of resumeToken changed from BinData to Hex.

MongoDB 4.2

  • Supports more pipeline operators, such as $set and $unset.

  • Adds the startAfter option, which starts a new change stream after a specified resume token (for example, to resume after an invalidate event).

  • Modifying the _id field of an event causes the change stream to throw an exception.

  • Removes the dependency on {readConcern: majority}.

MongoDB 5.1

  • Improves the execution efficiency of some stages in the aggregation framework.

  • Improves resource utilization.

MongoDB 5.3

  • Supports filtering updates to orphaned documents during chunk migration.

MongoDB 6.0

  • Supports pre-images.

  • Supports DDL statements such as create, createIndexes, modify, and shardCollection. To use this feature, you must set showExpandedEvents to true. For more information, see Change Events.

  • Change events add the wallTime field. Timestamps support multiple conversion and display operators, such as $toDate, $tsSeconds, and $tsIncrement, to simplify consumption by services.

MongoDB 7.0

  • Supports large change events (> 16 MB). You can use the new $changeStreamSplitLargeEvent operator to split large change events.

  • Change events support refineCollectionShardKey and reshardCollection events.

MongoDB 8.0

  • The $queryStats command enhances metrics related to change streams.

  • The movePrimary command no longer causes invalid events for collections with an open change stream. This means a change stream can now continuously process data migration caused by the movePrimary command.

Limits

The instance must be a replica set instance or a sharded cluster instance.

Configure change streams

Listen for more DDL events

Prerequisites

MongoDB 6.0 or later (Upgrade guide).

Procedure

  1. Connect to the database using mongosh.

  2. Run the watch command and set showExpandedEvents: true:

    // mongo shell or mongosh v1.x
    cursor = db.getSiblingDB("test").watch([],
      {
        showExpandedEvents: true       // Enable listening for more DDL events
      }
    );
    cursor.next();

Verify the result

  1. In a new mongosh window, run a DDL command, such as db.createCollection("myCollection1").

  2. In the original mongosh window, cursor.next() returns the corresponding change event.
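With showExpandedEvents enabled, the createCollection command above surfaces as a create event. The sketch below hand-builds a roughly shaped event (values are illustrative) and shows a simple filter an auditing consumer might apply:

```javascript
// Roughly what cursor.next() returns for db.createCollection("myCollection1")
// when showExpandedEvents is true (simplified; values are illustrative).
const ddlEvent = {
  _id: { _data: "82..." },                   // resume token (truncated, illustrative)
  operationType: "create",                   // expanded DDL event type
  ns: { db: "test", coll: "myCollection1" },
  wallTime: new Date()                       // wall-clock time of the change (6.0+)
};

// Keep only the DDL events an auditing consumer cares about.
const DDL_TYPES = new Set(["create", "createIndexes", "modify", "shardCollection"]);
function isAuditedDdl(ev) {
  return DDL_TYPES.has(ev.operationType);
}
```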

Enable pre-images

A MongoDB pre-image is a complete snapshot of a document before it is modified or deleted. It records the original value of the data before the change.

Prerequisites

MongoDB 6.0 or later (Upgrade guide).

Procedure

  1. Configure pre-image retention at the cluster level:

    db.adminCommand({
      setClusterParameter: {
        changeStreamOptions: {
          preAndPostImages: { expireAfterSeconds: "off" } // "off" indicates that the oplog retention period is used
        }
      }
    })
  2. Enable pre-images at the collection level:

    Note

    Pre-images must be enabled for each collection individually; there is no single database-wide switch. The cluster parameter set in the previous step controls only how long pre-images are retained.

    Modify an existing collection

    db.runCommand({
      collMod: "myCollection",
      changeStreamPreAndPostImages: { enabled: true }
    })

    Specify when creating a new collection

    db.createCollection("myCollection", { changeStreamPreAndPostImages: { enabled: true }})
  3. Create a change stream listener and specify the pre-image option:

    // Create a listener on the target collection
    cursor = db.getCollection("myCollection").watch([],
      {
        fullDocument: 'required', // or 'whenAvailable'
        fullDocumentBeforeChange: 'required' // or 'whenAvailable'
      }
    )
    cursor.next();
    • required: The server must return a pre-image or post-image. Otherwise, an error is reported.

    • whenAvailable: The server attempts to return the image, but it is not guaranteed.

Verify the result

  1. Check whether the cluster parameter is set:

    db.adminCommand( { getClusterParameter: "changeStreamOptions" } )

    If it is set, the returned changeStreamOptions document shows the configured preAndPostImages value.

  2. Check the collection configuration:

    db.getCollectionInfos({name: "myCollection"}) // or db.runCommand({listCollections: 1})

    Expected output: In the returned document, look for a field similar to "options" : { "changeStreamPreAndPostImages" : { "enabled" : true } }.


  3. In another mongosh window, update a document in myCollection.

  4. Observe the event returned by the cursor. It should contain the fullDocumentBeforeChange field, which is the pre-image.

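With both images on the event, downstream code can compute exactly which fields changed. A minimal sketch (the event object here is hand-built for illustration):

```javascript
// List top-level fields whose values differ between pre-image and post-image.
function changedFields(before, after) {
  const keys = new Set([...Object.keys(before), ...Object.keys(after)]);
  return [...keys].filter(
    k => JSON.stringify(before[k]) !== JSON.stringify(after[k])
  );
}

// Hand-built update event carrying both images (illustrative values):
const ev = {
  operationType: "update",
  fullDocumentBeforeChange: { _id: 1, name: "alice", age: 30 },
  fullDocument:             { _id: 1, name: "alice", age: 31 }
};
console.log(changedFields(ev.fullDocumentBeforeChange, ev.fullDocument)); // [ 'age' ]
```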

For more information, see Change Streams with Document Pre- and Post-Images.

Enable post-images

A MongoDB post-image is a complete snapshot of a document after a change occurs. It records the full content of the document after the change.

Prerequisites

MongoDB 3.6 or later (Upgrade guide).

Procedure

When you run the watch command, set fullDocument: 'updateLookup'.

cursor = db.getSiblingDB("test").myCollection.watch([], 
  {
    fullDocument: 'updateLookup'
  }
);
cursor.next();

Verify the result

  1. In another mongosh window, add or update a document in myCollection.

  2. Observe the event returned by the cursor. It should contain the fullDocument field, which is the post-image.

Note

The returned full document might be empty or not a point-in-time snapshot. For example:

  • If you update the same document multiple times in quick succession, the change event for the first update might return the document's content after the final update is complete.

  • If a document is deleted immediately after an update, the corresponding change event has an empty `fullDocument` field because the document no longer exists.

For more information, see Lookup Full Document for Update Operations.
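Because fullDocument can be missing or stale under updateLookup, consumers should treat it as optional. A defensive sketch (the events and the _missing placeholder field are illustrative):

```javascript
// Fall back to documentKey when the looked-up full document is unavailable
// (for example, the document was deleted before the lookup ran).
function extractDocument(ev) {
  if (ev.fullDocument != null) return ev.fullDocument;
  return { _id: ev.documentKey._id, _missing: true }; // placeholder record
}

const updated = { operationType: "update", documentKey: { _id: 1 },
                  fullDocument: { _id: 1, status: "done" } };
const deleted = { operationType: "delete", documentKey: { _id: 2 } };

console.log(extractDocument(updated)); // { _id: 1, status: 'done' }
console.log(extractDocument(deleted)); // { _id: 2, _missing: true }
```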

Handle large change events (> 16 MB)

Prerequisites

MongoDB 7.0 or later (Upgrade guide).

Procedure

You can include the $changeStreamSplitLargeEvent stage in the watch() pipeline. It must be the last stage in the pipeline:

myChangeStreamCursor = db.myCollection.watch(
  [ { $changeStreamSplitLargeEvent: {} } ], // Add the splitting stage
  {
    fullDocument: "required",
    fullDocumentBeforeChange: "required"
  }
)

Verify the result

  • Run an operation that causes a change event larger than 16 MB, such as updating a document that contains a very large array.

  • Observe the returned event stream. It is split into multiple consecutive fragment events, ending with a final fragment event.
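Each fragment carries a splitEvent field of the form { fragment: n, of: m }, and the fragments hold disjoint subsets of the original event's fields. Client-side reassembly can be sketched as follows (the fragment objects are hand-built stand-ins):

```javascript
// Merge consecutive fragments back into one logical change event.
function reassemble(fragments) {
  const total = fragments[0].splitEvent.of;
  if (fragments.length !== total) {
    throw new Error(`expected ${total} fragments, got ${fragments.length}`);
  }
  const merged = {};
  for (const frag of fragments) {
    for (const [k, v] of Object.entries(frag)) {
      if (k !== "splitEvent") merged[k] = v; // fields are disjoint across fragments
    }
  }
  return merged;
}

// Hand-built stand-ins for two fragments of one update event:
const frags = [
  { _id: { _data: "82..." }, operationType: "update",
    splitEvent: { fragment: 1, of: 2 } },
  { fullDocument: { _id: 1, payload: "large payload here" },
    splitEvent: { fragment: 2, of: 2 } }
];
const whole = reassemble(frags);
```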

Reduce pre-image storage overhead

By default, pre-images expire with the oplog. Set a shorter expiration time to save storage space:

Warning

If you set expireAfterSeconds to a very short duration and downstream consumption is slow, a ChangeStreamHistoryLost error may occur because the pre-image expires too early. For more information, see Change Streams with Document Pre- and Post-Images.

db.adminCommand({
  setClusterParameter: {
    changeStreamOptions: {
      preAndPostImages: { expireAfterSeconds: 100 } // Unit: seconds
    }
  }
})

Change stream best practices

  1. Use pre-images and post-images with caution:

    • Enabling fullDocumentBeforeChange (pre-image) and fullDocument (post-image) increases storage overhead (in the config.system.preimages collection) and request latency.

    • Enable these options only when your service requires the full document content before and after a change.

  2. Key points for sharded cluster deployments:

    • Always create change stream listeners on a mongos instance to ensure the global order of events.

    • Under high write loads, the change stream can become a bottleneck because the mongos instance needs to sort and merge events from different shards.

    • Unbalanced writes across shards, for example, due to a poorly designed sharding key, can significantly increase change stream latency.

  3. Avoid updateLookup:

    • The updateLookup option executes a separate findOne query for each update event, which is inefficient.

    • In a sharded cluster, the moveChunk operation can exacerbate the latency issues caused by updateLookup.

  4. Prevent change stream interruptions:

    • ⚠️ The following scenarios can cause the change stream cursor to become invalid (operationType: "invalidate") or throw an error.

      • Slow downstream consumption: The consumption speed is slower than the event generation speed, causing the resumeToken to fall outside the oplog window.

      • Invalid resumeToken: Attempting to resume with an old resumeToken whose corresponding timestamp is no longer in the oplog.

      • Failover impact: After a failover, the oplog of the new primary node might not contain the original resumeToken.

      • Metadata changes: Operations such as drop, rename, and dropDatabase can trigger an invalidate event.

      • Pre-image expiration: A short expireAfterSeconds setting combined with slow consumption can cause pre-images to be lost.

    • Solutions:

      • Monitor change stream latency.

      • Ensure the oplog window is large enough.

      • Design robust error handling and recovery logic. This includes catching invalidate events, recording the last valid resumeToken, and recreating the listener.

      • Set a reasonable value for expireAfterSeconds.

  5. Scope selection strategy:

    • Single change stream vs. multiple collection-level change streams:

      • Single stream (database or instance level): This approach has lower resource overhead because it uses a single thread to pull from the oplog. However, the downstream system must filter and distribute events itself. Under high event volumes, the mongos instance can become a bottleneck.

      • Multiple collection-level streams: This approach uses server-side filtering to reduce network traffic and provides better concurrency. However, too many streams increase oplog read contention and resource consumption.

    • Recommendation: Test and choose the optimal solution based on your service workload, including event volume and the number of collections. Typically, use separate streams for a few highly active collections. For many less active collections, use a database-level or instance-level stream combined with downstream filtering.
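The recovery logic in item 4 and the downstream filtering in item 5 can be sketched together. watchFn below is a stand-in for a driver's watch() call (for example, collection.watch([], { resumeAfter: token })); it is assumed to return an iterable of events whose _id is the resume token:

```javascript
// Item 5: route events from one database-level stream to per-collection handlers.
function makeDispatcher(handlers) {
  return ev => {
    const handler = handlers[ev.ns && ev.ns.coll];
    if (handler) handler(ev);          // events without a handler are dropped
  };
}

// Item 4: record the last valid resumeToken and recreate the listener on errors.
function consumeWithResume(watchFn, handle, maxRetries = 3) {
  let resumeToken = null;
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      for (const ev of watchFn(resumeToken)) {
        if (ev.operationType === "invalidate") return resumeToken; // stream ended
        handle(ev);
        resumeToken = ev._id;          // persist only after successful handling
      }
      return resumeToken;              // stream drained normally
    } catch (err) {
      if (attempt === maxRetries) throw err; // give up; surface the error
      // otherwise retry, resuming from the last persisted token
    }
  }
  return resumeToken;
}
```

In production, persist resumeToken to durable storage rather than a local variable, so that a restarted consumer can resume from where it left off.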

FAQ

1. Why do change stream slow query logs always show COLLSCAN?

This is normal and expected behavior. No optimization is required. A change stream cursor is ultimately created on local.oplog.rs, which is the only collection that records all instance modifications. This collection does not have an index. Therefore, the operation is always a COLLSCAN, which cannot be avoided or optimized.

If you use COLLSCAN as a keyword to filter logs when searching for queries to optimize, also filter out the $changeStream keyword.

Focus on change stream-related slow query logs only when you encounter performance issues, such as increased latency in downstream consumption.
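The filtering advice above can be expressed as a simple predicate. A sketch, assuming one log entry per line:

```javascript
// A slow query log line is worth investigating only if it reports a
// COLLSCAN and is not a change stream pulling from local.oplog.rs.
function isOptimizationCandidate(logLine) {
  return logLine.includes("COLLSCAN") && !logLine.includes("$changeStream");
}
```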

2. On a sharded cluster, why are change stream cursors observed on shards other than the primary shard when a listener is created for an unsharded collection?

These cursors prepare for a potential shardCollection operation. You can run shardCollection at any time on an unsharded collection that has an open listener, which distributes its data across all shards (similar to movePrimary). Creating change stream cursors on the other shards in advance prepares for this scenario. Typically, these cursors do not return any change events and have minimal performance overhead.

Similarly, when a listener is established on a sharded cluster, mongos creates related cursors on the Config Server to handle potential addShard or removeShard operations.

3. Why do slow query logs for a change stream cursor appear on the primary node, even when readPreference:secondary was specified during cursor creation?

After a cursor is created, it is "pinned" to a specific node and does not automatically migrate if the node's role changes. Therefore, once a change stream cursor is established, it remains on a specific mongod node (primary or secondary) and continuously consumes events using getMore. If a primary/secondary switchover, configuration change, migration, or other event that could trigger a switchover occurs, a cursor that was originally on a secondary node might move to the primary node.

If you do not want this load to continue consuming resources on the primary node, use killCursors to close the relevant change stream cursor. The downstream consumption logic will use the resumeToken and readPreference to re-establish the cursor on a secondary node. This operation does not cause loss or interruption of change events.

db.runCommand(
   {
     killCursors: <collection>,
     cursors: [ <cursor id1>, ... ], comment: <any>
   }
)

db.getSiblingDB("<testDB>").runCommand( { killCursors: "<testColl>", cursors: [NumberLong("2452840976689696187") ] } ) 

4. Why do I see many 1000 ms slow query logs for change streams on mongos?

2020-08-26T04:34:45.045+0000 I COMMAND [conn21283] command altconfig-b2b-perf.oplog command: getMore { getMore: 3513599116181216748, collection: "oplog", $db: "altconfig-b2b-perf", $clusterTime: { clusterTime: Timestamp(1598416483, 1), signature: { hash: BinData(0, EC3841EB1FB7A34F897688BB5983E32E2ADF6763), keyId: 6855385090000683010 } }, lsid: { id: UUID("72864d11-c0f8-48bb-823e-de5f18a9c409") } } originatingCommand: { find: "oplog", filter: { timestamp: { $gte: new Date(1597895390009) } }, tailable: true, awaitData: true, $db: "altconfig-b2b-perf", $clusterTime: { clusterTime: Timestamp(1597895487, 1), signature: { hash: BinData(0, AAAE33D5688F935C70469CBDB8EB1F6882749A40), keyId: 6855385090000683010 } }, lsid: { id: UUID("72864d11-c0f8-48bb-823e-de5f18a9c409") } } nShards:1 cursorid:3513599116181216748 numYields:0 nreturned:0 reslen:237 protocol:op_msg 1000ms

This is normal behavior in MongoDB versions earlier than 6.0. It indicates a wait timeout, not a performance bottleneck. In a sharded cluster architecture, change stream cursors created from mongos to shards specify tailable:true, awaitData:true, and maxTimeMS:1000 to return as many change events as possible within 1000 ms.

In major versions before 6.0, mongos recorded these 1000 ms slow query logs, which could be misleading. This behavior has been optimized. For details, see SERVER-50559.