Pig – Architecture:
- Parser – It checks the syntax of the script, performs type checking, and outputs a logical plan of the script's operators.
- Optimizer – It applies logical optimizations to the script's logical plan, such as merging or splitting operators and rearranging steps around JOIN, ORDER BY, and GROUP BY. Its goal is to reduce the amount of data that is sent to the next stage.
- Compiler – It converts the optimized code into a series of MapReduce jobs.
- Execution – Finally, the jobs are submitted to Hadoop and executed. We can then use DUMP to show the output on the screen, or STORE to write the output to a text file or another type of file.
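
The four stages above can be pictured with a small toy model in Python. This is only an illustrative sketch, not how Pig is implemented: the mini-language (`LOAD`, `FILTER field min`, `ORDER field`), the `DATASETS` dictionary, and the functions `parse`, `optimize`, `compile_plan`, and `execute` are all hypothetical names invented for this example. The "optimizer" here applies just one rule, moving a filter ahead of a sort so that fewer rows reach the next stage.

```python
# Toy model of parse -> optimize -> compile -> execute.
# All names below are hypothetical; this is not Pig's real API.

DATASETS = {  # in-memory stand-in for input files
    "people": [
        {"name": "ann", "age": 41},
        {"name": "bob", "age": 25},
        {"name": "cy", "age": 33},
    ]
}

# Number of arguments each toy operator expects.
ARITY = {"LOAD": 1, "FILTER": 2, "ORDER": 1}


def parse(script):
    """Parser: check syntax and return a plan as a list of (op, args)."""
    plan = []
    for raw in script.strip().splitlines():
        parts = raw.strip().rstrip(";").split()
        if not parts:
            continue
        op, args = parts[0].upper(), parts[1:]
        if op not in ARITY or len(args) != ARITY[op]:
            raise SyntaxError(f"bad statement: {raw.strip()!r}")
        plan.append((op, args))
    return plan


def optimize(plan):
    """Optimizer: move FILTER ahead of ORDER so less data gets sorted."""
    plan = list(plan)
    changed = True
    while changed:
        changed = False
        for i in range(len(plan) - 1):
            if plan[i][0] == "ORDER" and plan[i + 1][0] == "FILTER":
                plan[i], plan[i + 1] = plan[i + 1], plan[i]
                changed = True
    return plan


def compile_plan(plan):
    """Compiler: turn each plan step into a callable 'job'."""
    jobs = []
    for op, args in plan:
        if op == "LOAD":
            jobs.append(lambda rows, name=args[0]: list(DATASETS[name]))
        elif op == "FILTER":
            field, minimum = args[0], int(args[1])
            jobs.append(
                lambda rows, f=field, m=minimum: [r for r in rows if r[f] >= m]
            )
        elif op == "ORDER":
            jobs.append(lambda rows, f=args[0]: sorted(rows, key=lambda r: r[f]))
    return jobs


def execute(jobs):
    """Execution: run the jobs in order, feeding each one's output to the next."""
    rows = []
    for job in jobs:
        rows = job(rows)
    return rows


script = """
LOAD people;
ORDER age;
FILTER age 30;
"""

plan = optimize(parse(script))
result = execute(compile_plan(plan))

print([op for op, _ in plan])  # ['LOAD', 'FILTER', 'ORDER']
print(result)                  # plays the role of DUMP: show output on screen
```

In this sketch the script asks for the sort before the filter, but the optimized plan filters first, so only two of the three rows are sorted. Printing `result` corresponds loosely to DUMP; writing it to a file would correspond to STORE.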