GAID: Frame-Level Gated Audio-Visual Integration with Directional perturbation for Text-Video Retrieval