Skip to content
View tonywu71's full-sized avatar

Block or report tonywu71

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tonywu71/README.md

Hi, I'm Tony 👋🏼

I'm a Research Engineer based in Paris 🇫🇷. I grew up in a small town near Disneyland Paris 🏰, and I'm lucky to have traveled around the world during my academic years (love you 🇧🇷🇭🇰🇬🇧).

  • 🎓 I studied a MEng CentraleSupélec in Paris-Saclay 🇫🇷 and the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge (Sidney Sussex College) 🇬🇧.
  • 💼 I am currently working at H Company on building state-of-the-art web agents.
  • 🔬 Research interests: LLM, Multimodal, Agents, Information Retrieval, RAG, Speech.

💬 Feel free to reach out to discuss research ideas (mostly active on X)!

Contact: tonywu.ai@outlook.com


GitHub X Hugging Face Scholar LinkedIn

Pinned Loading

  1. illuin-tech/colpali illuin-tech/colpali Public

    The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

    Python 2k 177

  2. illuin-tech/vidore-benchmark illuin-tech/vidore-benchmark Public

    Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

    Python 213 30

  3. AnswerDotAI/byaldi AnswerDotAI/byaldi Public

    Use late-interaction multi-modal models such as ColPali in just a few lines of code.

    Python 801 86

  4. colpali-cookbooks colpali-cookbooks Public

    Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳

    314 23

  5. huggingface/transformers huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python 146k 29.5k

  6. dotfiles dotfiles Public

    My personal dotfiles.

    Ruby