Not a customer? Click the 'Start a free trial' link to begin a 30-day SaaS trial of our product and to join our community.
Existing Cisco AppDynamics customers should click the 'Sign In' button to authenticate to access the community
on 12-13-2017 05:33 PM - edited on 09-14-2018 01:39 PM by Nina.Wolinsky
Once in a while, you may come across an instance where your events-service cluster gets stuck during startup and its health state doesn't change for a long time.
There are many reasons for it, however one of the most likely reasons is a translog issue.
You can easily find out if a cluster is stuck on a translog issue by running the below command
curl -s -XGET http://<Events-service-hostname>:9200/_cat/recovery | grep translog | awk '{ print $1 }'
Note: Port 9200 might be disabled and so you need to enable it first by following this article Events-Service-debugging-port-disabled-by-default-in-version-4-3 to run the above command.
This will give a list of indexes that are stuck in translog. The fix is quite easy; just set replication=0 and then back to replication=2 or 1 (whatever your replication factor is). You can set the replication factor by running thru the following the steps:
To find out the replication factor, you can use the below command:
curl -XGET http://<Events-service-hostname>:9200/<index_name>/_settings
To update it, use the below command:
curl -XPUT http://<Events-service-hostname>:9200/<index_name>/_settings -d '{ "index" : { "number_of_replicas" : <replication factor> } }'
Thank you! Your submission has been received!
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form