Apache Hadoop with YARN: Configuring Pepperdata Activation (RPM/DEB)
Supported versions: See the Apache Hadoop entries for Pepperdata 8.1.x in the table of Supported Platforms by Pepperdata Version.
To activate Pepperdata (that is, to inject the necessary instrumentation) across a manually configured cluster, add a snippet that sources the activation script to each Hadoop service’s environment shell script.
Procedure
1. Beginning with any host, add the snippet below to the appropriate environment shell script(s), based on which services are configured to run on the host.
   - YARN ResourceManager/NodeManager: yarn-env.sh, typically located in /etc/hadoop/conf
   - Apache Spark: spark-env.sh (all hosts, including edge/client), typically located in /etc/spark/conf. If you are running multiple versions of Spark, add the snippet to the configuration file for each Spark version.
   Important: Add the snippet to the end of the script(s). This ensures that the activation script’s variable appends (YARN_NODEMANAGER_OPTS, YARN_RESOURCEMANAGER_OPTS, HBASE_REGIONSERVER_OPTS, and SPARK_SUBMIT_OPTS) are not overwritten by other assignments in the script(s).

       PEPPERDATA_ACTIVATE_SCRIPT_PATH="/opt/pepperdata/supervisor/lib/pepperdata-activate.sh"
       if [ -e $PEPPERDATA_ACTIVATE_SCRIPT_PATH ]; then
         . $PEPPERDATA_ACTIVATE_SCRIPT_PATH
       fi
2. Repeat step 1 on every host in the cluster.
3. Restart the following application daemons:
   - YARN ResourceManager
   - YARN NodeManagers (all in the cluster)
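
Because step 1 is repeated on every host, it can help to script the edit so that it is safe to run more than once. The following is a minimal sketch, not part of the Pepperdata tooling: the function name `append_activation_snippet`, the `.bak` backup suffix, and the use of the variable name as an "already installed" marker are all assumptions made for illustration. It appends the activation snippet to the end of a given environment script, as the Important note in step 1 requires, and skips files that are missing or that already contain the snippet.

```shell
#!/usr/bin/env bash
# Hypothetical helper (illustrative only): append the Pepperdata activation
# snippet to the END of an environment script, at most once per file.

append_activation_snippet() {
  local env_file="$1"

  # Skip services that are not configured on this host.
  if [ ! -f "$env_file" ]; then
    echo "skip (not found): $env_file"
    return 0
  fi

  # Assumption: the variable name is a good enough marker that the
  # snippet was already added, so repeated runs do not duplicate it.
  if grep -q 'PEPPERDATA_ACTIVATE_SCRIPT_PATH' "$env_file"; then
    echo "already present: $env_file"
    return 0
  fi

  # Keep a backup, then append so the snippet is the last thing in the file.
  cp -p "$env_file" "$env_file.bak"
  cat >> "$env_file" <<'EOF'

PEPPERDATA_ACTIVATE_SCRIPT_PATH="/opt/pepperdata/supervisor/lib/pepperdata-activate.sh"
if [ -e $PEPPERDATA_ACTIVATE_SCRIPT_PATH ]; then
  . $PEPPERDATA_ACTIVATE_SCRIPT_PATH
fi
EOF
  echo "appended: $env_file"
}

# Example call, using the typical locations named in step 1:
#   append_activation_snippet /etc/hadoop/conf/yarn-env.sh
#   append_activation_snippet /etc/spark/conf/spark-env.sh
```

The helper only checks whether the snippet is present. If a file already contains the snippet but other assignments were later added below it, move the snippet back to the end by hand. The daemons listed in step 3 still need to be restarted afterward.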