I have been searching for days now to find a good explanation on how indices are used by Graylog/Elasticsearch and when to create a new one. There is a lot of info about sharding indices but not much about the indices themselves apart from that they are a set of settings for how much data to retain and how to mange them, but not why. (or so it seems to me)
Background:
We are using Graylog 4.0 with Elasticsearch 7.10 and MongoDB 3.
We try to centralize the logs from 6 warehouse locations (all a few hundred km apart from on another). Each has 6 to 20 RFID gates which each have a log. Each gate has its on connector middleware to a central controller middleware and all of those have logs. Then there is the controller of the automated warehouse 'AWMS', WMS server, ERP server and their frontends. We also consider to collect at least some of the events from the Windows Eventlog of the servers those services run on.
Usually we need to analyse a problem in one subsystem so need to search one of those logs. Occasionally we need to look at the whole flow from RFID gate to AWMS, WMS and ERP.
At the moment I consider having a stream for each of those logs and using the relevant streams in the search. (or is that approach already flawed, if so why?)
Questions:
- Is an index set in Graylog just the settings about the retention strategy?
- What impact does it have if I have a lot or a few indices?
- in Elasticsearch Index Model it sounds like shard sizes and their distribution have the main impact on search performance, and indices are just a framework to manage the shards
- How many index sets should I have for my use case?
- Multiple per stream?
- One per stream?
- One per location?
- One per subsystem?
- One per retention time or size interval?
- One global one?
- Does it matter from a performance point of view?
- Where can I find more information about this stuff, that explains the 'Why', not just the 'How' to mange indices? (I have been looking at both the Graylog index model, Elasticsearch index model, Elasticsearch index templates)