Elasticsearch在计算后将两个字段回填到一个新字段中

2条回答

网友

1楼 · 编辑于 2024-05-20 00:01:41

以下是我的大致情况：

我一直在与Python和bulk helpers一起工作，到目前为止我在这里：

doc = helpers.scan(es, query={
"query": {
"match_all": {}

},
"size":1000 
},index=INDEX, scroll='5m', raise_on_error=False)


    for x in doc:
x['_index'] = NEW_INDEX
try:
    time_sec = x['_source']['payload']['time_sec']
    time_nanosec=x['_source']['payload']['time_nanosec']
    duration = (time_sec * 10**9) + time_nanosec
except KeyError: pass

count = count + 1

x['_source']['payload']['duration'] = duration
new_index_data.append(x) 

helpers.bulk(es,new_index_data)

从这里开始，我将使用bulkpython帮助程序将其插入到一个新索引中。不过，我将尝试对现有索引进行批量更新和测试。在

这看起来是一个正确的方法？在

网友

2楼 · 编辑于 2024-05-20 00:01:41

Bulk helpers to pull a scroll ID (bulk _update?), iterate over each doc id, pull that data in from the two fields for each dock, do the math, and finish the update request with the new field data.

基本上，是的：

使用/_search?scroll获取文档
做你的手术
发送/_bulk更新请求

其他选项包括：

use the ^{} API
如果您不想创建新的索引，则可能不是很好
use the ^{} API

两者都支持脚本，如果我理解正确的话，这将是一个完美的选择，因为您的更新不依赖于外部因素，所以这也可以直接在服务器内完成。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

Elasticsearch在计算后将两个字段回填到一个新字段中

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >