A filter that provides access to (almost) all of the available stemming token filters through a single unified interface. For example:
{
    "index" : {
        "analysis" : {
            "analyzer" : {
                "my_analyzer" : {
                    "tokenizer" : "standard",
                    "filter" : ["standard", "lowercase", "my_stemmer"]
                }
            },
            "filter" : {
                "my_stemmer" : {
                    "type" : "stemmer",
                    "name" : "light_german"
                }
            }
        }
    }
}
The language / name parameter controls the stemmer, and accepts the following available values (the preferred filters are marked in bold):
| Language | Available values |
|---|---|
| Arabic | arabic |
| Armenian | armenian |
| Basque | basque |
| Brazilian Portuguese | brazilian |
| Bulgarian | bulgarian |
| Catalan | catalan |
| Czech | czech |
| Danish | danish |
| Dutch | dutch, dutch_kp |
| English | english, light_english, minimal_english, possessive_english, porter2, lovins |
| Finnish | finnish, light_finnish |
| French | french, light_french, minimal_french |
| Galician | galician, minimal_galician (Plural step only) |
| German | german, german2, light_german, minimal_german |
| Greek | greek |
| Hindi | hindi |
| Hungarian | hungarian, light_hungarian |
| Indonesian | indonesian |
| Irish | irish |
| Italian | italian, light_italian |
| Kurdish (Sorani) | sorani |
| Latvian | latvian |
| Lithuanian | lithuanian |
| Norwegian (Bokmål) | norwegian, light_norwegian, minimal_norwegian |
| Norwegian (Nynorsk) | light_nynorsk, minimal_nynorsk |
| Portuguese | portuguese, light_portuguese, minimal_portuguese, portuguese_rslp |
| Romanian | romanian |
| Russian | russian, light_russian |
| Spanish | spanish, light_spanish |
| Swedish | swedish, light_swedish |
| Turkish | turkish |
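
As a rough sketch of how another value from the table might be plugged in, the settings below select the minimal_english stemmer and spell the parameter as language rather than name. This assumes the two spellings are interchangeable, as the sentence introducing the table suggests. The names my_english_analyzer and my_english_stemmer are hypothetical, chosen only for illustration.

```json
{
    "index" : {
        "analysis" : {
            "analyzer" : {
                "my_english_analyzer" : {
                    "tokenizer" : "standard",
                    "filter" : ["lowercase", "my_english_stemmer"]
                }
            },
            "filter" : {
                "my_english_stemmer" : {
                    "type" : "stemmer",
                    "language" : "minimal_english"
                }
            }
        }
    }
}
```

Placing lowercase before the stemmer mirrors the ordering of the example at the top of this page, so the stemmer always sees lowercased tokens.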