### 🚀 Describe the new functionality needed - Add support for the ```/rerank``` API for NVIDIA Inference Provider. ### 💡 Why is this needed? What if we don't build it? - This allows Llama Stack users to use rerank NIMs, for example [nvidia/llama-3_2-nv-rerankqa-1b-v2](https://build.nvidia.com/nvidia/llama-3_2-nv-rerankqa-1b-v2). ### Other thoughts _No response_