All Products
Search
Document Center

Content Moderation:Use the online testing feature

Last Updated:Dec 04, 2025

This product provides an online testing feature. You can use this feature to test our efficient detection services for content compliance, sensitive content, and prompt attacks.

Prerequisites

Go to the AI Guardrails activation page to activate the AI Guardrails service.

Note

Charges are incurred when you use the online testing feature. For more information, see Billing overview.

Procedure

  1. Log on to the AI Guardrails console.

  2. In the input box, enter the content that you want to test.

  3. Below the input box, select a policy template for detection.

    The following policy templates are available:

    AI input content moderation (query_security_check_intl)

    AI-generated content moderation (response_security_check_intl)

  4. Click the Run test button.

  5. View the Test results.

  6. The following figure shows an example.

    image

  7. Alternatively, select a sample template. A set of detection templates is provided for content compliance, sensitive content, and prompt attacks. After you select a template, click the Run test button to view the detection results.

Enable features

  1. After you run a test, if the status of Sensitive content detection or Prompt injection detection is Not enabled in the Test results, click Proceed to enable to activate the feature.

    image

  2. On the Check Item Configuration page, select the feature that you want to enable.

    image

  3. If you enable Sensitive content detection detection, take note of the message in the dialog box. This feature is billed separately. For more information, see Billing overview.

    image

  4. If you enable Prompt injection detection detection, take note of the message in the dialog box. This feature is in public preview and is available for a free trial. If you have any questions, contact your business manager.

    image