To be done. I watched the talk by Aaron Davidson and found it very helpful in understanding how spark jobs been executed and how partitions, shuffling really works.

Need sometimes to digest the content and hopefully can write a summary on my own.