Elasticsearch: applying lowercase to indexed data

I have documents indexed in my Elasticsearch cluster. A sample document looks like this:
{
  "_index": "processed_tweets",
  "_type": "processed",
  "_id": "830403820580663296",
  "_score": 1,
  "_source": {
    "at": [
      "@LouisDasch"
    ],
    "original_tweet_id": "830398288352403457",
    "id_str": "830403820580663296",
    "trigrams": [
      "blessed lourdes lady",
      "lourdes lady feast",
      "lady feast day",
      "feast day wishing"
    ],
    "hashtags": [
      "#Catholic"
    ],
    "id_tweet_creator": "487735029",
    "tokens": [
      "blessed",
      "lourdes",
      "lady",
      "feast",
      "day",
      "wishing"
    ],
    "bigrams": [
      "blessed lourdes",
      "lourdes lady",
      "lady feast",
      "feast day",
      "day wishing"
    ],
    "retweeted": true
  }
}
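I want to lowercase every hashtag that appears in the "hashtags" field, for all the documents I have already indexed. For example, I would go from "hashtags": ["#Catholic"] to "hashtags": ["#catholic"]. What is the best (least time-consuming) way to update each keyword to its lowercase equivalent while keeping the "#"?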
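What have you tried? – depperm
Do they all follow the same structure? –
@depperm Actually, my solution is a full reindex, but I would like to know whether there is an alternative – mel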
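
One possible alternative to a client-side reindex is Elasticsearch's Update By Query API with a Painless script, which rewrites the documents in place on the cluster. The following is only a sketch under a few assumptions: the index name is taken from the sample document, the script key is "source" on newer releases ("inline" on older 5.x clusters), and the helper names `build_update_by_query_body` and `lowercase_hashtags` are made up for illustration. The code only builds the request body; it does not contact a cluster.

```python
import json

INDEX = "processed_tweets"  # index name taken from the sample document

# Painless script: lowercase every entry of the "hashtags" array in place.
# The leading "#" has no case, so toLowerCase() leaves it untouched.
PAINLESS_SOURCE = (
    "if (ctx._source.hashtags != null) {"
    " for (int i = 0; i < ctx._source.hashtags.size(); i++) {"
    " ctx._source.hashtags[i] = ctx._source.hashtags[i].toLowerCase();"
    " } }"
)


def build_update_by_query_body(script_key="source"):
    """Return the JSON body for POST /<index>/_update_by_query.

    script_key is an assumption about the cluster version: newer releases
    expect "source", older 5.x releases expected "inline".
    """
    return {
        "script": {"lang": "painless", script_key: PAINLESS_SOURCE},
        # Only touch documents that actually have a "hashtags" field.
        "query": {"exists": {"field": "hashtags"}},
    }


def lowercase_hashtags(source):
    """Pure-Python mirror of the script, for checking the intent locally."""
    updated = dict(source)
    if updated.get("hashtags") is not None:
        updated["hashtags"] = [tag.lower() for tag in updated["hashtags"]]
    return updated


if __name__ == "__main__":
    print("POST /%s/_update_by_query" % INDEX)
    print(json.dumps(build_update_by_query_body(), indent=2))
    print(lowercase_hashtags({"hashtags": ["#Catholic"]}))
```

Note that Update By Query still reindexes each matching document internally, so it mainly saves the round trip of pulling the data out and pushing it back in. It would be worth trying on a small test index first.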