Master's Thesis: Energy-based Multi-Modal Attention (EMMA). A novel method for improving the robustness of multi-modal deep learning.