What are an Elasticsearch index, a Lucene index and an inverted index?

An Elasticsearch index consists of shards, each of which is a Lucene index, and a Lucene index in turn consists of inverted indexes, which map individual tokens to the documents that contain them. The terms below can be used interchangeably.

Elasticsearch shard = Lucene index
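
To make the inverted index part concrete, here is a minimal sketch in Python. It is not how Lucene implements it: the whitespace `tokenize` function and the `build_inverted_index` helper are names made up for this illustration, and a real analyzer does much more (stemming, stop words, positions and so on). It only shows the core idea of mapping each token to the documents that contain it.

```python
from collections import defaultdict


def tokenize(text):
    # Naive analyzer for illustration only: lowercase and split on whitespace.
    return text.lower().split()


def build_inverted_index(docs):
    # Map each token to the sorted list of document ids that contain it.
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            postings[token].add(doc_id)
    return {token: sorted(ids) for token, ids in postings.items()}


docs = {
    1: "Elasticsearch is based on Lucene",
    2: "Lucene builds an inverted index",
}
index = build_inverted_index(docs)
print(index["lucene"])         # [1, 2]
print(index["elasticsearch"])  # [1]
```

A search for a token then becomes a dictionary lookup instead of a scan over every document, which is why the structure is called "inverted".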

Elasticsearch is based on Lucene, and every Elasticsearch shard is a full-blown Lucene index; Lucene itself has no concept of shards. Unlike an Elasticsearch index, a Lucene index can't be distributed across multiple nodes. When an Elasticsearch index is distributed across multiple nodes, it simply means that its shards (Lucene indexes) are placed on different nodes.
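
The relationship between an index, its shards and the nodes can be sketched as a toy model. Everything here is an assumption made for illustration: the node names are hypothetical, the round-robin placement in `allocate_shards` is not Elasticsearch's real allocation logic, and `zlib.crc32` is only a stand-in for the Murmur3 hash that Elasticsearch applies to the routing value. The point is just that a document is routed to exactly one shard, and it is the shards that get spread over nodes.

```python
import zlib


def pick_shard(routing_value, num_primary_shards):
    # Simplified routing: hash the routing value (by default the document id)
    # and take it modulo the number of primary shards.
    # CRC32 is a stand-in here; Elasticsearch itself uses Murmur3.
    return zlib.crc32(routing_value.encode("utf-8")) % num_primary_shards


def allocate_shards(num_primary_shards, nodes):
    # Round-robin placement of shards on nodes, purely for illustration.
    return {shard: nodes[shard % len(nodes)] for shard in range(num_primary_shards)}


nodes = ["node-a", "node-b"]            # hypothetical node names
allocation = allocate_shards(3, nodes)  # an index with 3 primary shards

for doc_id in ["doc-1", "doc-2", "doc-3"]:
    shard = pick_shard(doc_id, 3)
    print(f"{doc_id} -> shard {shard} on {allocation[shard]}")
```

Each shard in `allocation` stands for one complete Lucene index living on a single node; only the Elasticsearch index as a whole spans several nodes.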

Index and shard are just logical names; the actual data is stored in Lucene segments.
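
As a rough illustration of what "stored in segments" means, the sketch below models a shard as a list of immutable segments, each being a small inverted index of its own. The `Segment` and `Shard` classes and their methods are hypothetical names for this example, not Lucene APIs, and the model leaves out deletes and the background merging of segments that Lucene performs.

```python
class Segment:
    """An immutable mini inverted index: token -> sorted doc ids."""

    def __init__(self, docs):
        postings = {}
        for doc_id, text in docs.items():
            for token in set(text.lower().split()):
                postings.setdefault(token, []).append(doc_id)
        # Tuples signal that a segment is never modified after it is written.
        self.postings = {t: tuple(sorted(ids)) for t, ids in postings.items()}


class Shard:
    """A toy 'Lucene index': a list of segments plus an in-memory buffer."""

    def __init__(self):
        self.segments = []
        self.buffer = {}

    def add(self, doc_id, text):
        # New documents are buffered first and are not yet searchable.
        self.buffer[doc_id] = text

    def flush(self):
        # Write buffered documents as a new segment; older segments are untouched.
        if self.buffer:
            self.segments.append(Segment(self.buffer))
            self.buffer = {}

    def search(self, token):
        # A search consults every segment and combines the hits.
        hits = []
        for segment in self.segments:
            hits.extend(segment.postings.get(token.lower(), ()))
        return sorted(hits)


shard = Shard()
shard.add(1, "lucene segment")
shard.flush()
shard.add(2, "another lucene segment")
shard.flush()
print(len(shard.segments))     # 2
print(shard.search("lucene"))  # [1, 2]
```

The two documents end up in two separate segments, yet a single search on the shard sees both, which is why "shard" and "index" can stay purely logical names on top of the segment files.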

P.S.: There are several articles on the Elasticsearch site that explain indexes, shards and segments in more detail; covering all of that here would be difficult and would only duplicate them.