YARN daemons are ResourceManager, NodeManager, and WebAppProxy. HDFS daemons are NameNode, SecondaryNameNode, and DataNode. To configure the Hadoop cluster you will need to configure the environment in which the Hadoop daemons execute as well as the configuration parameters for the Hadoop daemons.
Site-specific configuration - etc/hadoop/core-site.xml, etc/hadoop/hdfs-site.xml, etc/hadoop/yarn-site.xml and etc/hadoop/mapred-site.xml.Īdditionally, you can control the Hadoop scripts found in the bin/ directory of the distribution, by setting site-specific values via the etc/hadoop/hadoop-env.sh and etc/hadoop/yarn-env.sh. Read-only default configuration - core-default.xml, hdfs-default.xml, yarn-default.xml and mapred-default.xml. Hadoop’s Java configuration is driven by two types of important configuration files: Running Applications in runC Containers.Running Applications in Docker Containers.