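How to filter Elasticsearch data when a field contains ~
I have a bunch of documents like the one below. I want to filter the data where projectKey starts with ~. I did read some articles saying that ~ is an operator in Elastic queries, so it cannot really be used for filtering. Can someone help me form the search query for the /branch/_search API?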
{
"_index": "branch","_type": "_doc","_id": "GAz-inQBJWWbwa_v-l9e","_version": 1,"_score": null,"_source": {
"branchID": "refs/heads/feature/12345","displayID": "feature/12345","date": "2020-09-14T05:03:20.137Z","projectKey": "~user","repoKey": "deploy","isDefaultBranch": false,"eventStatus": "CREATED","user": "user"
},"fields": {
"date": [
"2020-09-14T05:03:20.137Z"
]
},"highlight": {
"projectKey": [
"~@kibana-highlighted-field@user@/kibana-highlighted-field@"
],"projectKey.keyword": [
"@kibana-highlighted-field@~user@/kibana-highlighted-field@"
],"user": [
"@kibana-highlighted-field@user@/kibana-highlighted-field@"
]
},"sort": [
1600059800137
]
}
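*** Update ***
I used the prefix query from prerana's answer below.
When I combine prefix and range, something is still wrong and I get the following error. What am I missing?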
GET /branch/_search
{
"query": {
"prefix": {
"projectKey": "~"
},"range": {
"date": {
"gte": "2020-09-14","lte": "2020-09-14"
}
}
}
}
{
"error": {
"root_cause": [
{
"type": "parsing_exception","reason": "[prefix] malformed query,expected [END_OBJECT] but found [FIELD_NAME]","line": 6,"col": 5
}
],"type": "parsing_exception","col": 5
},"status": 400
}
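Solution
If I understand your question correctly, I suggest creating a custom analyzer to search for the special character ~.
I tested this locally, replacing ~ with __SPECIAL__, as follows:
I created an index with a custom char_filter and added a multi-field to the projectKey field. The new multi-field is named special_characters.
Here is the mapping: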
PUT wildcard-index
{
"settings": {
"analysis": {
"char_filter": {
"special-characters-replacement": {
"type": "mapping","mappings": [
"~ => __SPECIAL__"
]
}
},"analyzer": {
"special-characters-analyzer": {
"tokenizer": "standard","char_filter": [
"special-characters-replacement"
]
}
}
}
},"mappings": {
"properties": {
"projectKey": {
"type": "text","fields": {
"special_characters": {
"type": "text","analyzer": "special-characters-analyzer"
}
}
}
}
}
}
Then I indexed the following documents:
"projectKey": "content1 ~"
"projectKey": "This ~ is a content"
"projectKey": "~ cars on the road"
"projectKey": "o~ngram"
Then the query is:
GET wildcard-index/_search
{
"query": {
"match": {
"projectKey.special_characters": "~"
}
}
}
The response is:
"hits" : [
{
"_index" : "wildcard-index","_type" : "_doc","_id" : "h1hKmHQBowpsxTkFD9IR","_score" : 0.43250346,"_source" : {
"projectKey" : "content1 ~"
}
},{
"_index" : "wildcard-index","_id" : "iFhKmHQBowpsxTkFFNL5","_score" : 0.3034693,"_source" : {
"projectKey" : "This ~ is a content"
}
},{
"_index" : "wildcard-index","_id" : "-lhKmHQBowpsxTkFG9Kg","_source" : {
"projectKey" : "~ cars on the road"
}
}
]
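Let me know if you have any questions.
Note: this approach works when ~ is separated from the surrounding text by whitespace. You can see from the response that the fourth document (o~ngram) is not returned.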
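To see why the fourth document is missed, the analysis chain can be approximated in a few lines of Python. This is only a rough sketch: `analyze` is a hypothetical helper, and the regular expression `\w+` merely stands in for the standard tokenizer, which it resembles but does not reproduce exactly. Because the char filter runs before tokenization, `o~ngram` ends up as the single token `o__SPECIAL__ngram`, which is not equal to the query token `__SPECIAL__`.

```python
import re

def analyze(text):
    """Rough stand-in for special-characters-analyzer (an approximation)."""
    # char_filter: the mapping "~ => __SPECIAL__" is applied before tokenizing.
    replaced = text.replace("~", "__SPECIAL__")
    # Assumption: \w+ approximates the standard tokenizer, which also keeps
    # underscores attached to the neighbouring letters and digits.
    return re.findall(r"\w+", replaced)

docs = ["content1 ~", "This ~ is a content", "~ cars on the road", "o~ngram"]

# A match query analyzes the query text with the same analyzer as the field.
query_tokens = set(analyze("~"))

# A document matches when it shares at least one token with the query.
matches = [doc for doc in docs if query_tokens & set(analyze(doc))]
print(matches)
```

The same reasoning suggests that a value such as ~user would also be missed by this analyzer, since it would become one token rather than a standalone `__SPECIAL__`.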
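While @hansley's answer works, it requires you to create a custom analyzer. Also, as you said, you only want the documents that start with ~, but his result contains every document that includes ~. So here is my answer, which needs very little configuration and works as required.
The index mapping is left at its defaults: just index the documents below and ES will create a default .keyword sub-field for every text field.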
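The difference between the two fields can be illustrated with a small Python sketch. It is an approximation under stated assumptions: `text_terms`, `keyword_term` and `prefix_matches` are hypothetical helpers, and the regular expression only imitates how the default standard analyzer drops punctuation such as ~ and lowercases tokens. The .keyword sub-field keeps the whole value as one term, which is why a prefix of ~ can match there but not on the analyzed text field.

```python
import re

def text_terms(value):
    """Approximate the terms a default text field indexes (assumption)."""
    # Punctuation such as "~" is dropped and tokens are lowercased.
    return [token.lower() for token in re.findall(r"[A-Za-z0-9]+", value)]

def keyword_term(value):
    """A keyword sub-field stores the whole value as a single term."""
    return value

def prefix_matches(terms, prefix):
    """A prefix query matches when any indexed term starts with the prefix."""
    return any(term.startswith(prefix) for term in terms)

value = "~user"
print(text_terms(value))                           # terms of the text field
print(prefix_matches(text_terms(value), "~"))      # prefix on the text field
print(prefix_matches([keyword_term(value)], "~"))  # prefix on the keyword sub-field
```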
Index the sample documents (the search query should fetch only the second one):
{
"title" : "content1 ~"
}
{
"title" : "~ staring with"
}
{
"title" : "in between ~ with"
}
Search query:
{
"query": {
"prefix" : { "title.keyword" : "~" }
}
}
Search result:
"hits": [
{
"_index": "pre","_type": "_doc","_id": "2","_score": 1.0,"_source": {
"title": "~ staring with"
}
}
]
For more information, refer to the prefix query documentation.
Update 1:
Index mapping:
{
"mappings": {
"properties": {
"date": {
"type": "date"
}
}
}
}
Index data:
{
"date": "2015-02-01","title" : "in between ~ with"
}
{
"date": "2015-01-01","title": "content1 ~"
}
{
"date": "2015-02-01","title" : "~ staring with"
}
{
"date": "2015-02-01","title" : "~ in between with"
}
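Search query (the prefix and range clauses are combined in a bool query, because a query object accepts only one top-level clause):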
{
"query": {
"bool": {
"must": [
{
"prefix": {
"title.keyword": "~"
}
},{
"range": {
"date": {
"lte": "2015-02-05","gte": "2015-01-11"
}
}
}
]
}
}
}
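Applied to the branch index from the question, the same pattern might look like the sketch below. It assumes that projectKey has the default .keyword sub-field (the highlight section of the sample document suggests it does) and that date is mapped as a date. `build_branch_query` is a hypothetical helper, and the clauses are placed under `filter` rather than `must` as a design choice, since relevance scoring is not needed for this kind of filtering.

```python
import json

def build_branch_query(prefix, start, end):
    """Build a search body: projectKey starts with `prefix`, date in range."""
    return {
        "query": {
            "bool": {
                # filter clauses must all match but do not affect scoring.
                "filter": [
                    {"prefix": {"projectKey.keyword": prefix}},
                    {"range": {"date": {"gte": start, "lte": end}}},
                ]
            }
        }
    }

body = build_branch_query("~", "2020-09-14", "2020-09-14")
print(json.dumps(body, indent=2))
```

The printed JSON can be sent as the body of GET /branch/_search.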