Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Approach

No docs

No docs (incl. hidden docs)

Docs increase

Nested docs

2,563,630

27,284,158

x10.64278

Keyword

2,563,630

10,292,406

x4.014778

Info

Old index is ~2~1.65 times bigger greater than new index in terms of number of documents

...

Info

Not nested regions is slighly faster in most of the executions

Denormalise attributes

The aim of this recommendation is to get rid of nested fields (which create “hiden” Lucene documents) in order to prevent the uncontrolled growth in the size of the index.

Each attribute in the ”nested” field will be modelled as a specific column in the document, instead of nesting them in an “array/list” of documents.

This approach should:

  • Reduce the final number of documents.

  • Make aggregations by attribute value simpler and faster?

Index size stats

Approach

No docs

No docs (incl. hidden docs)

Docs increase

Nested docs

2,563,630

27,284,158

x10.64

Denormalised

2,563,630

19,580,736

x7.64

Info

Old index is ~0.39 times greater than new index in terms of number of documents