Due to the effectiveness of Trans- former layer in capturing non-local long-range dependen- cies, the potential of Transformer is explored in conditional denoising of HSI. Unfortunately, the vanilla Transformer focuses only on spatial relationships between pixels while ne...