Can we pass output_attentions=True to a DiT model such as PixArt to get the attention outputs, the same way output_attentions=True works in transformers?