
ApsaraMQ for Kafka: Connect an ApsaraMQ for Kafka instance to Filebeat as an output

Last Updated: Sep 14, 2023

An ApsaraMQ for Kafka instance can be connected to Filebeat as an output. This topic describes how to use Filebeat to send messages to ApsaraMQ for Kafka in a virtual private cloud (VPC).

Prerequisites

Before you start this tutorial, make sure that the following operations are complete:

Step 1: Obtain an endpoint

Filebeat establishes a connection to ApsaraMQ for Kafka by using an ApsaraMQ for Kafka endpoint.

  1. Log on to the ApsaraMQ for Kafka console.

  2. In the Resource Distribution section of the Overview page, select the region where your instance is deployed.

  3. On the Instances page, click the name of the instance that you want to connect to Filebeat as an output.

  4. On the Instance Details page, obtain an endpoint of the instance in the Endpoint Information section. In the Configuration Information section, obtain the values of the Username parameter and Password parameter.

    Note

    For information about the differences among endpoints, see Comparison among endpoints.
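
Before you configure Filebeat, you can optionally run a quick client-side connectivity check against the endpoint that you obtained. The following is a minimal sketch that assumes the kafka-python library and the default (non-SASL) VPC endpoint; the endpoint placeholder is illustrative, and if you use the SASL endpoint you must also configure the username and password obtained in this step.

  # Optional connectivity check (sketch): list the topics visible through the endpoint.
  # Assumes the kafka-python library; replace the endpoint with your own instance's endpoint.
  from kafka import KafkaConsumer

  consumer = KafkaConsumer(
      bootstrap_servers=[
          "alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092",
          "alikafka-pre-cn-zv**********-2-vpc.alikafka.aliyuncs.com:9092",
          "alikafka-pre-cn-zv**********-3-vpc.alikafka.aliyuncs.com:9092",
      ],
  )
  print(consumer.topics())  # Prints the set of topics that the endpoint exposes.
  consumer.close()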

Step 2: Create a topic

Perform the following operations to create a topic for storing messages:

  1. Log on to the ApsaraMQ for Kafka console.

  2. In the Resource Distribution section of the Overview page, select the region where your instance is deployed.

    Important

    You must create topics in the region where your application is deployed. When you create a topic, select the region where your Elastic Compute Service (ECS) instance is deployed. A topic cannot be used across regions. For example, if your message producers and consumers run on ECS instances that are deployed in the China (Beijing) region, create topics in the China (Beijing) region.

  3. On the Instances page, click the name of the instance that you want to manage.

  4. In the left-side navigation pane, click Topics.

  5. On the Topics page, click Create Topic.

  6. In the Create Topic panel, configure the parameters and click OK.

    The following list describes the parameters. Each entry shows the parameter, its description, and an example value.

    Name
      Description: The topic name.
      Example: demo

    Description
      Description: The topic description.
      Example: demo test

    Partitions
      Description: The number of partitions in the topic.
      Example: 12

    Storage Engine
      Description: The type of the storage engine that is used to store messages in the topic. ApsaraMQ for Kafka supports the following types of storage engines:
      • Cloud Storage: If you select this value, the system uses Alibaba Cloud disks for the topic and stores data in three replicas in distributed mode. This storage engine features low latency, high performance, durability, and high reliability. If you set the Instance Edition parameter to Standard (High Write) when you created the instance, you can set this parameter only to Cloud Storage.
      • Local Storage: If you select this value, the system uses the in-sync replicas (ISR) algorithm of open source Apache Kafka and stores data in three replicas in distributed mode.
      Note: You can select the type of the storage engine only if you use a Professional Edition instance. If you use a Standard Edition instance, Cloud Storage is used by default.
      Example: Cloud Storage

    Message Type
      Description: The message type of the topic. Valid values:
      • Normal Message: By default, messages that have the same key are stored in the same partition in the order in which they are sent. If a broker in the cluster fails, the order of the messages in the partitions may not be preserved. If you set the Storage Engine parameter to Cloud Storage, this parameter is automatically set to Normal Message.
      • Partitionally Ordered Message: By default, messages that have the same key are stored in the same partition in the order in which they are sent. If a broker in the cluster fails, the messages are still stored in the partitions in that order, but messages in the affected partitions cannot be sent until the partitions are restored. If you set the Storage Engine parameter to Local Storage, this parameter is automatically set to Partitionally Ordered Message.
      Example: Normal Message

    Log Cleanup Policy
      Description: The log cleanup policy that is used by the topic. If you set the Storage Engine parameter to Local Storage, you must configure this parameter. You can set the Storage Engine parameter to Local Storage only if you use an ApsaraMQ for Kafka Professional Edition instance. ApsaraMQ for Kafka provides the following log cleanup policies:
      • Delete: the default log cleanup policy. If sufficient storage space is available, messages are retained for the maximum retention period. After the storage usage exceeds 85%, the system deletes messages starting from the earliest stored message to ensure service availability.
      • Compact: the log compaction policy of Apache Kafka. For more information, see Kafka 3.4 Documentation. Log compaction ensures that the latest value is retained for each message key. This policy is suitable for scenarios such as restoring the system state after a failure or reloading the cache after a system restart. For example, when you use Kafka Connect or Confluent Schema Registry, you must store system status and configuration information in a log-compacted topic.
        Important: You can use log-compacted topics only in specific cloud-native components such as Kafka Connect and Confluent Schema Registry. For more information, see aliware-kafka-demos.
      Example: Compact

    Tag
      Description: The tags that you want to attach to the topic.
      Example: demo

    After the topic is created, it is displayed on the Topics page.
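
If you prefer to create the topic programmatically instead of in the console, the following sketch shows one way to do it through the Kafka Admin API. It assumes the kafka-python library, reuses the illustrative endpoint placeholder from this topic, and creates a topic named filebeat_test, the name used in the Filebeat configuration later in this topic. Whether Admin API topic creation is allowed depends on your instance settings, so the console procedure above remains the authoritative path.

  # Sketch: create a topic through the Kafka Admin API (assumes kafka-python).
  # Topic creation over the Admin API may be restricted on some instances; if it
  # fails, create the topic in the console as described above.
  from kafka.admin import KafkaAdminClient, NewTopic

  admin = KafkaAdminClient(
      bootstrap_servers=["alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092"],
  )
  topic = NewTopic(name="filebeat_test", num_partitions=12, replication_factor=3)
  admin.create_topics([topic])
  admin.close()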

Step 3: Use Filebeat to send a message

Start Filebeat on the server where Filebeat is installed to send a message to the topic that you created.

  1. Run the cd command to switch to the installation directory of Filebeat.

  2. Create a configuration file named output.conf.

    1. Run the vim output.conf command to create an empty configuration file.

    2. Press the I key to enter the insert mode.

    3. Enter the following content:

      # Read events from standard input.
      filebeat.inputs:
      - type: stdin

      # Send events to ApsaraMQ for Kafka.
      output.kafka:
        # The VPC endpoint of the ApsaraMQ for Kafka instance.
        hosts: ["alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092", "alikafka-pre-cn-zv**********-2-vpc.alikafka.aliyuncs.com:9092", "alikafka-pre-cn-zv**********-3-vpc.alikafka.aliyuncs.com:9092"]

        # The topic to which messages are sent.
        topic: 'filebeat_test'

        required_acks: 1            # Wait only for the local commit.
        compression: none           # Do not compress messages.
        max_message_bytes: 1000000  # Maximum message size in bytes.

      The following list describes the parameters. Each entry shows the parameter, its description, and an example value.

      hosts
        Description: The VPC endpoint of the ApsaraMQ for Kafka instance. ApsaraMQ for Kafka supports the following VPC endpoints:
        • Default endpoint
        • Simple Authentication and Security Layer (SASL) endpoint
        Example: alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092, alikafka-pre-cn-zv**********-2-vpc.alikafka.aliyuncs.com:9092, alikafka-pre-cn-zv**********-3-vpc.alikafka.aliyuncs.com:9092

      topic
        Description: The name of the topic.
        Example: filebeat_test

      required_acks
        Description: The reliability level of acknowledgments (ACKs). Valid values:
        • 0: no response
        • 1: wait for the local commit
        • -1: wait for all replicas to commit
        Default value: 1.
        Example: 1

      compression
        Description: The data compression codec. Default value: gzip. Valid values:
        • none: no compression
        • snappy: the Snappy compression algorithm
        • lz4: the LZ4 lossless compression algorithm, which provides fast compression and decompression
        • gzip: the gzip compression format from the GNU project
        Example: none

      max_message_bytes
        Description: The maximum size of a message. Unit: bytes. Default value: 1000000. The value must be smaller than the maximum message size that you specify for ApsaraMQ for Kafka.
        Example: 1000000

      For more information about parameter settings, see Kafka output plugin.

    4. Press the Esc key to exit the insert mode.

    5. Type :wq and press the Enter key to save the file and exit.

  3. Send a message to the topic that you created.

    1. Run the ./filebeat -c ./output.conf command.

    2. Enter test and press Enter.
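
To confirm that the test message was delivered before you check the console, you can consume from the topic with any Kafka client. The following is a minimal sketch that assumes the kafka-python library, the illustrative endpoint placeholder used above, and a consumer group named filebeat-verify (a hypothetical name; depending on your instance settings, you may need to create the consumer group in the console first).

  # Sketch: consume from the beginning of filebeat_test to verify delivery (assumes kafka-python).
  from kafka import KafkaConsumer

  consumer = KafkaConsumer(
      "filebeat_test",
      bootstrap_servers=["alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092"],
      group_id="filebeat-verify",    # Hypothetical consumer group used only for this check.
      auto_offset_reset="earliest",  # Start from the earliest available offset.
      consumer_timeout_ms=10000,     # Stop iterating after 10 seconds without new messages.
  )
  for record in consumer:
      print(record.partition, record.offset, record.value)
  consumer.close()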

Step 4: View the partitions of the topic

Perform the following operations to view the message that was sent to the topic:

  1. Log on to the ApsaraMQ for Kafka console.

  2. In the Resource Distribution section of the Overview page, select the region where your instance is deployed.

  3. On the Instances page, click the name of the instance that you want to manage.

  4. In the left-side navigation pane, click Topics.

  5. On the Topics page, find the topic whose partition status you want to view, and choose More > Partition Status in the Actions column.

    Table 1. Information about the status of a partition

    Partition ID: The ID of the partition.

    Minimum Offset: The earliest offset at which messages in the partition can be consumed.

    Maximum Offset: The latest offset at which messages in the partition can be consumed.

    Last Updated At: The most recent point in time when a message was stored in the partition.
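
The same minimum and maximum offsets can also be read programmatically. The following sketch assumes the kafka-python library and the illustrative endpoint placeholder used earlier in this topic.

  # Sketch: read the earliest and latest offsets of each partition (assumes kafka-python).
  from kafka import KafkaConsumer, TopicPartition

  consumer = KafkaConsumer(
      bootstrap_servers=["alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092"],
  )
  partitions = [TopicPartition("filebeat_test", p)
                for p in sorted(consumer.partitions_for_topic("filebeat_test"))]

  earliest = consumer.beginning_offsets(partitions)  # Minimum offset per partition.
  latest = consumer.end_offsets(partitions)          # Maximum offset per partition.

  for tp in partitions:
      print(tp.partition, earliest[tp], latest[tp])
  consumer.close()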

Step 5: Query the message by offset

You can query the sent message based on its partition ID and offset information.

  1. Log on to the ApsaraMQ for Kafka console.

  2. In the Resource Distribution section of the Overview page, select the region where your instance is deployed.

  3. On the Instances page, click the name of the instance that you want to manage.

  4. In the left-side navigation pane, click Message Query.

  5. On the Message Query page, select Search by offset from the Search Method drop-down list.

  6. Select a topic name from the Topic drop-down list and a partition from the Partition drop-down list, enter an offset value in the Offset field, and then click Search.

    Messages whose offset values are greater than or equal to the specified offset value are displayed. For example, if you specify 5 as the value of both the Partition parameter and Offset parameter, the system queries messages whose offset values are greater than or equal to 5 from Partition 5.

    Table 2. Parameters that are included in message query results

    Partition: The partition from which the message is obtained.

    Offset: The offset of the message.

    Key: The key of the message. The key is converted to a string.

    Value: The content of the message. The message content is converted to a string.

    Created At: The point in time when the message was produced. The value is the timestamp that the producer recorded when the message was sent, or the value of the timestamp field that you specified for ProducerRecord.

      Note
      • If you specified a value for the timestamp field, the specified value is displayed.

      • If you did not specify a value for the timestamp field, the local system time when the message was sent is displayed.

      • A value in the 1970/x/x x:x:x format indicates that the timestamp field is set to 0 or an invalid value.

      • You cannot specify a value for the timestamp field on clients of ApsaraMQ for Kafka V0.9 and earlier.

    Actions:
      • Click Download Key to download the key of the message.

      • Click Download Value to download the content of the message.

      Important
      • The ApsaraMQ for Kafka console can display up to 1 KB of content for each message. If the size of a message exceeds 1 KB, the excess content is omitted. If you want to view the complete message, download the message.

      • You can download up to 10 MB of message content. If the size of a message exceeds 10 MB, only the first 10 MB of message content can be downloaded.
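
The console search by offset can also be reproduced with a Kafka client by assigning a partition and seeking to an offset. The following sketch assumes the kafka-python library, the illustrative endpoint placeholder used earlier, partition 0, and offset 0; replace these with the partition and offset that you want to query.

  # Sketch: read messages at or after a given offset in one partition (assumes kafka-python).
  from kafka import KafkaConsumer, TopicPartition

  consumer = KafkaConsumer(
      bootstrap_servers=["alikafka-pre-cn-zv**********-1-vpc.alikafka.aliyuncs.com:9092"],
      consumer_timeout_ms=10000,  # Stop iterating after 10 seconds without new messages.
  )
  tp = TopicPartition("filebeat_test", 0)  # Partition to query; change as needed.
  consumer.assign([tp])
  consumer.seek(tp, 0)                     # Offset to start from; change as needed.

  for record in consumer:
      print(record.offset, record.key, record.value, record.timestamp)
  consumer.close()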