Elasticsearch 介绍 Elasticsearch 7:快速上手 Elasticsearch 7:关于 Index、Type、Document Elasticsearch 7:安装与启动 Elasticsearch 7:Kibana 的使用 Elasticsearch 7:下载历史版本 Elasticsearch 7:文档唯一性 Elasticsearch 7:默认端口与端口设置 Elasticsearch 7:创建和删除索引 Elasticsearch 7:自定义 mapping 和 settings Elasticsearch 7:设置索引副本数量和分片数量 Elasticsearch 7:查看所有索引 Elasticsearch 7:数据类型 Elasticsearch 7:字符串类型 keyword 、text Elasticsearch 7:数组 Elasticsearch 7:添加和更新文档 Elasticsearch 7:通过 _bulk 批量添加文档 Elasticsearch 7:使用 from 、size 进行分页查询 Elasticsearch 7:查询中使用 sort 进行排序 Elasticsearch 7:查询结果只展示部分字段 Elasticsearch 7:查询结果中展示 _version 字段 Elasticsearch 7:使用 ignore_above 限制字符串长度 Elasticsearch 7:动态映射 Elasticsearch 7:doc_values 属性 Elasticsearch 7:刷新周期 refresh_interval Elasticsearch 7:使用 _refresh 刷新索引 Elasticsearch 7:分片(shard)限制 Elasticsearch 7:使用 _cat thread_pool 查询线程池运行情况 Elasticsearch 7:事务日志 translog Elasticsearch 7:文档 _id 的长度限制 Elasticsearch 7:分片 shard Elasticsearch 7:滚动查询 Elasticsearch 7:聚合查询 Elasticsearch 7:索引模板 Elasticsearch 7:获取文档所属的 shard Elasticsearch 7:获取版本号 Elasticsearch 7:获取指定 shard 中的文档 Elasticsearch 7:获取 shard 统计信息 Elasticsearch 7:搜索实战 Elasticsearch 7:Python 客户端 Elasticsearch 7:Java TransportClient API 客户端 Elasticsearch 7:Java REST Client API 客户端 Elasticsearch:将 SQL 转换为 DSL Elasticsearch 6 快速上手 Elasticsearch 5 快速上手 Elasticsearch 5:禁止自动创建索引 Elasticsearch 5:禁止动态增加字段 Elasticsearch 产品版本支持周期 基于 Elasticsearch 的站内搜索引擎实战

Elasticsearch 7:文档唯一性


#Elasticsearch


一般说法是下面三个字段在一个ES实例/集群中是全局唯一的:

index + type + 文档 _id

但是实际上是:

index + type + 分片标识 + 文档 _id

以下为验证:

创建有10个分片的 index:

PUT student
{
  "mappings" : {
    "properties" : {
      "uid": {
        "type" : "integer"
      },
      "name" : {
        "type" : "keyword"
      },
      "age" : {
        "type" : "integer"
      }
    }
  },
  "settings" : {
    "index" : {
      "number_of_shards" : 10,
      "number_of_replicas" : 1
    }
  }
}

添加记录1:

POST student/_doc/1?routing=1
{
  "uid": 1,
  "name": "张三",
  "age": 10
}

查询中带上指定 explain 为 true,响应中能看到文档属于哪个 shard:

# 请求
GET student/_search
{
  "query": {
    "match": {
      "uid": 1
    }
  },
  "explain": true
}

# 响应
{
  "took" : 9,
  "timed_out" : false,
  "_shards" : {
    "total" : 10,
    "successful" : 10,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_shard" : "[student][8]",
        "_node" : "wFhSfuLwR3OX21eldbRIHg",
        "_index" : "student",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 1.0,
        "_routing" : "1",
        "_source" : {
          "uid" : 1,
          "name" : "张三",
          "age" : 10
        },
        "_explanation" : {
          "value" : 1.0,
          "description" : "uid:[1 TO 1]",
          "details" : [ ]
        }
      }
    ]
  }
}

添加记录2:

POST student/_doc/1?routing=2
{
  "uid": 1,
  "name": "张三",
  "age": 10
}

注意,和记录1相比,除了 routing ,其他均没有变化。

我们再次查询_id为1的记录,会发现有两条,唯一区别是 _shard_routing值不相同:

# 请求
GET student/_search
{
  "query": {
    "match": {
      "uid": 1
    }
  },
  "explain": true
}

# 响应
{
  "took" : 565,
  "timed_out" : false,
  "_shards" : {
    "total" : 10,
    "successful" : 10,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_shard" : "[student][7]",
        "_node" : "wFhSfuLwR3OX21eldbRIHg",
        "_index" : "student",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 1.0,
        "_routing" : "2",
        "_source" : {
          "uid" : 1,
          "name" : "张三",
          "age" : 10
        },
        "_explanation" : {
          "value" : 1.0,
          "description" : "uid:[1 TO 1]",
          "details" : [ ]
        }
      },
      {
        "_shard" : "[student][8]",
        "_node" : "wFhSfuLwR3OX21eldbRIHg",
        "_index" : "student",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 1.0,
        "_routing" : "1",
        "_source" : {
          "uid" : 1,
          "name" : "张三",
          "age" : 10
        },
        "_explanation" : {
          "value" : 1.0,
          "description" : "uid:[1 TO 1]",
          "details" : [ ]
        }
      }
    ]
  }
}


( 本文完 )