英語テキストアナライザー - OpenSearch - Alibaba Cloud ドキュメントセンター

英語単語ステミングアナライザー

概要

英語単語ステミングアナライザー (eng_standard) は、各英語単語を語根に還元します。このアナライザーは、英語のセマンティクスに基づく検索に適しています。

Example: If the value of a field is "英文分词器 english analyzer" in a document and the English word stemming analyzer is specified, the document can be retrieved when a user searches for "英文分词器", "english", "analyz", "analyzer", "analyzers", "analyze", "analyzed", or "analyzing." 
Take note that an English text analyzer analyzes consecutive Chinese characters as one word.

使用方法

このアナライザーは、TEXT データ型のフィールドにのみ適用されます。アナライザーを使用するには、スキーマを構成するときにアナライザーを eng_standard に設定します。

ステミングされていない英語単語アナライザー

概要

ステミングされていない英語単語アナライザー (eng_nostem) は、スペースと句読点に基づいてテキストをトークン化します。このアナライザーは、書籍のタイトルや著者名など、英語のセマンティクスに基づかない検索に適しています。

Example: If the value of a field is "英文分词器 english analyzer" in a document and the unstemmed English word analyzer is specified, the document can be retrieved when a user searches for "英文分词器", "english", or "analyzer." 
Take note that an English text analyzer analyzes consecutive Chinese characters as one word.

使用方法

このアナライザーは、TEXT データ型のフィールドにのみ適用されます。アナライザーを使用するには、スキーマを構成するときにアナライザーを eng_nostem に設定します。

英語最小粒度アナライザー

概要

英語最小粒度アナライザー (en_min) は、検索単位を使用して、英語のセマンティクスに基づいて英語テキストをトークン化します。アナライザーは、スペースなしで接続された単語列をトークン化できます。この英語アナライザーは、あらゆる業界の英語テキスト分析に適しています。

Example: If the value of a field is "dataprocess" in a document and the English minimum-granularity analyzer is specified, the analysis result is "data process". In this case, the document can be retrieved when a user searches for "dataprocess", "data process", "data", or "process".

使用方法

このアナライザーは、TEXT データ型のフィールドにのみ適用されます。アナライザーを使用するには、スキーマを構成するときにアナライザーを en_min に設定します。