All Products
Search
Document Center

Alibaba Cloud Model Studio:Data import

Last Updated:Mar 10, 2026

To build a knowledge base for RAG (Retrieval-Augmented Generation) applications, import your source data into Alibaba Cloud Model Studio. This data serves as the initial source for your knowledge base.

Important

Model Studio supports three import methods:

Import methodSourceDestination
Upload local filesFiles from your computerFile tab, organized by categories
Upload local tablesSpreadsheets from your computerTable tab, organized by data tables
Import from OSSFiles from Object Storage Service (OSS)File tab, organized by categories
Note

Model Studio does not support direct import of JSON, CSV, or YAML files. Convert these files to XLSX or XLS format before importing.

For supported data formats and capacity limits, see Knowledge base quotas and limits.

Parsing methods

When importing files (local or OSS), select a parsing method that matches your content type. Choose Default Settings for the standard configuration, or Custom Settings to define parsing rules for specific formats.

Parsing methodBest forHow it works
Digital ParsingText-only documentsDoes not parse illustrations or charts in files.
Intelligent ParsingDocuments with illustrationsDetects images, extracts text from them, and generates summaries. These summaries and other text content are chunked, converted into vectors, and used for knowledge base retrieval.
LLM ParsingVisual Q&A over illustrations and chartsUses the Qwen-VL model to detect and interpret visual content, enabling agent applications to answer questions about illustrations and charts.
Qwen VL ParsingImage files onlySelect a Qwen-VL model and write a prompt to specify the layout, elements, and content for detection. Provides the same capabilities as LLM Parsing but is limited to image formats.
Note

If your documents contain figures, charts, or images, use Intelligent Parsing, LLM Parsing, or Qwen VL Parsing instead of Digital Parsing.

imageimage

How to make a Model Studio application display illustrations from a file in its answers

Import local files

  1. Go to the File tab.

  2. In Category Management on the left, select an existing category or click the image icon to create one.

    Note

    Model Studio uses categories to organize imported files.

  3. Click Import Data. On the Import Data page, set the import method to Upload Local File.

  4. Select a Parsing Method. For details, see Parsing methods.

  5. (Optional) Configure Tags for the file.

    Note

    Tags help filter files during knowledge base retrieval. When calling an application through an API, specify tags in the tags request parameter. For agent applications, set tags when debugging the knowledge base in the console.

  6. Click Confirm to start parsing and importing. Track progress on the page.

    Note

    Model Studio converts the file into a processable format. During peak hours, this may take several hours.

  7. After the import completes, click Details next to the file to review it.

Post-import behavior

  • Imported files are stored as independent replicas in free platform storage. Replicas are not linked to your original data, and no capacity limit applies.

  • Files imported within the last 90 days can be viewed. After 90 days, files remain stored but are no longer viewable.

  • Imported files are accessible only within the current workspace. Model Studio does not use them for commercial purposes or make them public.

Import local tables

  1. Go to the Table tab.

  2. In Table Management on the left, select an existing data table or click the image icon to create a new one.

    Note

    Model Studio manages imported tabular data using data tables.

Import to a new data table

  1. Enter a Table Name. Then configure the table schema by choosing either Upload Excel File or Custom Header:

    • Upload Excel File: Model Studio auto-detects the table header in the uploaded file, uses it to create the data table schema, and imports the remaining rows as data records.

    • Custom Header: Column Name and Type are required. Description is optional.

    Warning

    The table structure (column name, description, and type) cannot be modified after confirmation.

    Important
    • The schema of the uploaded file (column count and column names) must exactly match the target data table schema. Otherwise, the import fails. Add or remove fields by clicking New Columns or Delete in the Actions column.

    • Provide a clear, natural-language description for each field to help the model understand its meaning. For example, specify that the age field represents a user's age.

    • If the field type is image_url, the value must be a publicly accessible image URL (for example: https://example.com/downloads/pic.jpg). When creating a knowledge base, image_url fields generate an image index -- Model Studio accesses the image, extracts its features, converts them into a vector using image embedding, and saves it. During retrieval, this vector is compared with the user-uploaded image vector for similarity matching.

  2. Click the image icon to select and upload a file in XLSX or XLS format.

    Note

    The file must contain a table header. Otherwise, the import fails.

  3. Click OK to start the import. The new data table appears in the Table Management navigation tree on the left.

Import to an existing data table

  1. In Table Management on the left, select the target data table and click Import Data.

  2. Set the import type to Upload and Overwrite or Incremental Upload.

    Note

    Click Download Template to download a blank file that contains only the table header. Insert new data into this file and use it for an overwrite or incremental upload.

  3. Click the image icon to select and upload a file in XLSX or XLS format.

    Note

    The file must contain a table header that matches the structure of the current data table. Otherwise, the import fails.

Import OSS files

  1. Go to the File tab.

  2. In Category Management on the left, select an existing category or click the image icon to create one.

  3. Click Import Data. On the Import Data page, set the import method to OSS.

    Note

    First-time OSS imports require authorization. Complete the authorization as prompted and add the bailian-datahub-access tag to the target bucket. For details, see Configure file import from OSS.

    Important
    • Buckets with the Archive, Cold Archive, or Deep Cold Archive storage class are not supported.

    • Root directory access is not supported. Select an existing subdirectory or create a new one.

    • Buckets with content encryption and private buckets are supported.

    • If the bucket has Referer hotlink protection enabled, add *.console.aliyun.com to the Referer whitelist. For details, see Allow access only from trusted websites.

  4. Select a Parsing Method. For details, see Parsing methods.

  5. (Optional) Configure Tags for the file.

    Note

    Tags help filter files during knowledge base retrieval. When calling an application through an API, specify tags in the tags request parameter. For agent applications, set tags when debugging the knowledge base in the console.

  6. Click OK to start parsing and importing. Track progress on the page.

    Note

    Model Studio converts the file into a processable format. During peak hours, this may take several hours.

  7. After the import completes, click Details next to the file to view the results.

    Note

    Imported files are stored as independent replicas in free platform storage, separate from your original data with no capacity limit. Files are accessible only within the current workspace. Model Studio does not use them for commercial purposes or make them public.

Next step

Create a knowledge base

Configure file import from OSS

When importing files from OSS for the first time, grant Model Studio access to your OSS resources. The authorization flow differs for an Alibaba Cloud account and a RAM user.

Alibaba Cloud account authorization

  1. Click Authorize Now.

    image

  2. In the dialog box, click Confirm Authorization. This automatically creates an OSS service-linked role that grants Model Studio access to your OSS resources.

    Note

    The authorization usually takes effect within seconds, but a slight delay may occur during peak hours.

    Note

    If you receive the error "This request failed. Try submitting again or contact an administrator. Error code: 10041495", see Resolve error code 10041495.

    image

  3. Add the bailian-datahub-access tag to the target OSS bucket. This tag marks the buckets that Model Studio can access. Buckets without this tag are inaccessible.

    1. Log on to the OSS console. In the left navigation pane, click Buckets. Find the target bucket.

    2. Hover over the image icon and click Edit.

    3. On the Bucket Tag page, click Create Tag if no tags exist, or click Settings to modify existing tags.

    4. Click Tag, set the tag key to bailian-datahub-access and the tag value to read, then click Save.

    image

  4. Return to the Import Data page, reselect the target bucket, and retry the import.

    Important

    Model Studio does not support accessing files in the root directory of a bucket. Select an existing subdirectory or create a new one.

RAM user authorization

  1. Click Authorize Now.

    image

  2. In the dialog box, click Confirm Authorization. If you receive an Authorization Failed or No Permission error, grant the RAM user permission to create service-linked roles first:

    1. Log on to the RAM console. In the left navigation pane, choose Permissions > Policies, then click Create Policy.

    2. Click the JSON tab, paste the following policy, then click OK: ``json { "Action": [ "ram:CreateServiceLinkedRole" ], "Resource": "*", "Effect": "Allow", "Condition": { "StringEquals": { "ram:ServiceName": "datahub.sfm.aliyuncs.com" } } } `` image

    3. Enter a policy name and click OK. image

    4. In the left navigation pane, choose Identities > Users. Find the target RAM user and click Add Permissions in the Actions column.

    5. Select the custom policy you just created and click Grant permissions. image

  3. Grant the RAM user permission to access OSS through Model Studio:

    1. Return to the Import Data page and click Authorize Now. image

    2. In the dialog box, click Confirm Authorization to automatically create the required OSS service-linked role.

      Note

      The authorization usually takes effect within seconds, but a slight delay may occur during peak hours.

      Note

      If you receive the error "The request failed. Try to submit again or contact an administrator. Error code: 10041495", see Resolve error code 10041495.

      image

  4. Add the bailian-datahub-access tag to the target OSS bucket. This tag marks the buckets that Model Studio can access. Buckets without this tag are inaccessible.

    1. Log on to the OSS console. In the left navigation pane, click Buckets. Find the target bucket.

    2. Hover over the image icon and click Edit.

    3. On the Bucket Tag page, click Create Tag if no tags exist, or click Settings to modify existing tags.

    4. Click Tag, set the tag key to bailian-datahub-access and the tag value to read, then click Save.

    image

  5. Return to the Import Data page, reselect the target bucket, and retry the import.

    Important

    Model Studio does not support accessing files in the root directory of a bucket. Select an existing subdirectory or create a new one.

FAQ

Resolve "You are not authorized to access this module" during data import

By default, a RAM user cannot perform write operations such as data import or knowledge base creation. Use the Alibaba Cloud account to assign the RAM user page permissions for Administrator, or at minimum, both Application Data - Operations and Knowledge Base - Operations.

Resolve error code 10041495

This error typically means the Alibaba Cloud account has not activated OSS.

  1. Log on to the OSS console with the Alibaba Cloud account and activate OSS as prompted.

  2. Return to the Model Studio Import Data page and retry the authorization.