PyTorch Implementation for "Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query"