AutoAction
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.auto.action.default.snooze.period.ms The time repeated violations are to be ignored for the violator, for example, app, user. If the violation is still occurring when awakened the AutoAction executes the action and the violator is once again snoozed. An AutoAction containing a kill or move action is never snoozed. 0: snooze is turned off > 0: snooze is on, there is no upper bound | ms | 3600000 (1 hour) | |
com.unraveldata.auto.action.include.customer.org Includes the customer organization name in the AutoAction. | string | ||
com.unraveldata.auto.action.policy.enforce.period.ms Maximum wait period for policy enforcement to be triggered. | ms | 180000 (3 min) | |
com.unraveldata.auto.action.publish.finish.apps.metrics.enabled Publish non-running application metrics to auto-action so that auto-action rules will be triggered. | Required | boolean | false |
AutoAction daemon
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.auto.action.enable.policy.enforce.in.jcs Enables AutoActions to be processed under the legacy (pre-4.5.2.0) mode of operation when AutoAction enforcement resided in JCS2 daemon. This works for on-prem mode only. When using this mode you can stop AutoAction service. | boolean | false | |
com.unraveldata.auto.action.max.buffered.group.count A message group represents one complete YARN metrics polling cycle. It is incomplete when it has missing messages, messages out of order, etc. This property controls how many message groups can be buffered while waiting for completion of the current group. Once the daemon has buffered the max.buffered.group.count , it either drops or accepts the incomplete group (see max.lost.messages.count ) and moves onto processing the next group. Setting this value higher than 1 may increase latency of enforcement of an AutoActions but can help to alleviate errors in message transport protocol. | count | 1 | |
com.unraveldata.auto.action.max.lost.messages.count Maximum number of lost messages a group can have and still be accepted for aggregation and policy enforcement. Accepting incomplete groups lowers the consistency of triggered violations but lets the daemon to operate with an “unstable” connection. 0: only complete groups are accepted. > 0: groups missing up to X messages are accepted. | count | 0 | |
com.unraveldata.auto.action.metric.discard.ms When an incoming metric's timestamp is older than this value, it isn't processed but is discarded. This mechanism is designed to deal with Kafka latency, consumer lags and other message delivery delays. It prevents the daemon from acting on outdated data and issuing false-positive violation events that are irrelevant at the present time | ms | 3600000 (1 hour) | |
com.unraveldata.auto.action.policy.db.update.skip.cycle Defines how often the daemon refreshes the AutoAction policy definitions when aggregating and enforcing per enforcement cycle (see enforce.period.ms ). 0: All policies are read every time the daemon's ready to aggregate and enforce policies. > 0: The policies are read every X cycle. For instance, 1: the daemon reads them every other cycle, 2: every third period, and so on. Changing this value makes sense only if there are a lot of policies defined. | count | 0 | |
com.unraveldata.auto.action.policy.enforce.period.ms Maximum wait period for policy enforcement to be triggered. AutoAction metric aggregation and enforcement cycle is driven by metric producers, i.e., the daemon synchronizes with the metric polling cycles for each monitored cluster. When no YARN metrics are delivered or accepted no policies are being evaluated for longer than this period, the daemon forcibly executes metric aggregation and policy enforcement to process other internal Unravel metrics, such as Workflow, Hive, Tez, Impala metrics. | ms | 180000 (3 min) | |
com.unraveldata.auto.action.transport.receiver.ttl.ms Identifies how long a transport protocol connection remains active without receiving any data from the cluster. If no metrics are received for longer than this value, the cluster is considered terminated and the connection is closed on the receiver (daemon's) side. | ms | 1800000 (30 minutes) | |
com.unraveldata.job.collector.running.load.conf This is required for Hive metrics in running state. | boolean | false |
AutoAction command line interface (CLI) for actions
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.auto.action.use.cli.enabled Enables use of Command Line Interface (CLI) for actions associated with AutoActions. | boolean | true |
AutoAction support for EMR cluster
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.auto.action.emr.cluster.enabled Enables ElasticSearch polling for collecting EMR cluster metrics. | Optional | boolean | false |
com.unraveldata.auto.action.emr.cluster.idle.lookback.millis Set the minimum look-back period for calculating the idleness of an EMR cluster. The EMR cluster idle duration is specific to the EMR Cluster Idle AutoAction policy. You must set this property if configured idle threshold for an EMR cluster is more than an hour. Ensure to specify the value in milliseconds. | Optional | long | 3600000 (1 hour) |
AutoAction email notifications to LDAP users
The following properties can be configured for email notifications to be sent to application owners who are LDAP users.
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.ldap.bind_dn LDAP bind account for LDAP lookup. For example: CN=sethbind, OU=seth, DC=unraveldata, DC=com | String | - | |
com.unraveldata.ldap.bind_pw LDAP bind account password. | String | - | |
com.unraveldata.ldap.realUserAttribute Enables a secondary LDAP lookup. When the AD object does not have the available email string, Unravel does a second lookup to retrieve the user's email address. This email address is used by AutoActions when sending an email to the application owner. | String | - | |
com.unraveldata.ldap.mailAttribute Mail attribute name in the LDAP response used by the Unravel server to extract the LDAP user's email address. If this is not configured, the Unravel server uses the attribute name "mail". | String | - |
Notification for SSL certifications and Kerberos keytabs expiry
Property/Description | Set by user | Unit | Default |
---|---|---|---|
com.unraveldata.monitoring.keytab.check.interval Keytab query interval in minutes. | Required | Mins | 3 minutes |
com.unraveldata.monitor.keystore.expire.days If certificate expiry falls within this number, an alert is sent. | Required | - | - |
com.unraveldata.monitor.truststore.expire.days If certificate expiry falls within this number, an alert is sent. | Required | - | - |
com.unraveldata.monitoring.keystore.check.interval Keystore query interval in minutes. | Required | Mins | 5 minutes |