hadoop jar command Execution -
- We submit a jar file through the hollow jar command
- This master node
Copy the jar file into all slave nodes and start execution or how does it work and who does this? Job tracker or namedade?
The "hop jar" command tells the houop to provide jobs in the cluster. Output path jar file includes job config and all map and lower code.
Step:
-
The job client presents JobTracker (Jetties) for a job. In the background, it copes Binense, which contains information about config, mapper and reducer code, input and output paths, which is in a centralized area, which is close to the Task Trackers (TTS). Once TTS needs to use that code, they just download it locally on datyonode, so when the map and lowering work standups, they use that code to run on local data. We do.
-
The JT asks about data space about the data location and the data node that contains the data.
-
With this information, JT talks to TTS and considers TTS to form an execution plan, closest to the data, if they have available slots available for execution . If they are not, then this rack goes to the locality and available slots available to any TTS in the same rack. If still not able to know, then find any racks / TTs in any area.
-
Based on the execution plan, Jetties present work for TT. TTSs now reduce and execute stand-up maps and data.
-
TTS routinely reports JT to progress and heartbeat (every 5 seconds by default). According to the progress / perfection / error TT of each map and report, the report of Mapper and Reduce works according to the JTT report to TT and TT which tells Jetti if T.R. will die, then TTS will report to JT, JT has spin to deal with adopting other MR functions.
-
Once all the mappers work with their actions, TT Reducers to JT signals to run their reducer execution (i.e. at least () method)
-
Once all the Mappers and Reducers are finished, and the final output is written, JT update updates its status for SUCCESS and notifies the client.
-
Comments
Post a Comment