Array-based Spectro-temporal Masking for Automatic Speech Recognition