diff --git a/_includes/sections/about.html b/_includes/sections/about.html index e2d66c9..810b270 100644 --- a/_includes/sections/about.html +++ b/_includes/sections/about.html @@ -113,8 +113,8 @@

{{ site.author.first_name }} {{ site.author.last_n Responsible Foundation Models, encompassing jailbreaks, adversarial attacks, and risks associated with scientific foundation models. [USENIX Security '25] - [MemFM@ICML '25] - [arXiv] + [arXiv] + [MemFM@ICML '25]
  • Accessible Foundation Models, incorporating PEFT, quantization, and outlier removal for @@ -122,7 +122,7 @@

    {{ site.author.first_name }} {{ site.author.last_n [ICML '24] [ICML '25] [ES-FoMo@ICML '24] - [ES-FoMo@ICML '25] + [ES-FoMo@ICML '25]

  • Actionable Foundation Models, featuring Chain of Thought and Chain of Action methodologies. @@ -136,12 +136,12 @@

    {{ site.author.first_name }} {{ site.author.last_n [ICML '25] [Information Systems (2025)] [HuMob@SIGSPATIAL '24] - [ES-FoMo@ICML '25] + [ES-FoMo@ICML '25]

  • Memory retrieval, memory-enhanced models, and memory editing techniques. - [MemFM@ICML '25] + [MemFM@ICML '25] [arXiv]
  • diff --git a/_includes/sections/images/GERM.png b/_includes/sections/images/GERM.png new file mode 100644 index 0000000..c594e72 Binary files /dev/null and b/_includes/sections/images/GERM.png differ diff --git a/_includes/sections/miscellaneous.html b/_includes/sections/miscellaneous.html index 35f57a2..0f7655a 100644 --- a/_includes/sections/miscellaneous.html +++ b/_includes/sections/miscellaneous.html @@ -20,15 +20,15 @@

    30-30 Project

  • Visit 30 states in the US (34/30)
    ❄️AK, 🌉CA, 🏂 CO, 📃 CT, 🐼DC, 1⃣ DE, 🍊 FL, 🍑 GA, 🌋HI, 💨IL, 🏁IN, 🚜IA, 🏇KY, 🔮MA, 🐢MD, 🚘MI, ♈ MO, 🌟MN, ✈ NC, 🐍 NH, 💡NJ, 🏜️NV, 🗽NY, 🏈 OH, 🌹 OR, 🔔 PA, 🌊 RI, 🌴 SC, 🎸 TN, 🗼TX, 🚬 VA, 🍺WI, ☔WA, 🗻 WV.

  • -
  • Visit 30 National Parks in the US (12/30)
    - Indiana Dunes NP, Yosemite NP, Mount Rainier NP, North Cascades NP, Olympic NP, Great Smoky Mountains NP, Gateway Arch NP, Rocky Mountain NP, Mammoth Cave NP, Congaree NP, New River Gorge NP, Shenandoah NP

    +
  • Visit 30 National Parks in the US (14/30)
    + Indiana Dunes NP, Yosemite NP, Mount Rainier NP, North Cascades NP, Olympic NP, Great Smoky Mountains NP, Gateway Arch NP, Rocky Mountain NP, Mammoth Cave NP, Congaree NP, New River Gorge NP, Shenandoah NP, Crater Lake NP, Redwood NSP

  • Visit 30 Other National Parks Services in the US (51/30)
    Boston NHP, Manhattan Project NHP, Harpers Ferry NHP, First State NHP, Minute Man NHP, Independence NHP, Lewis & Clark NHP, San Francisco Maritime NHP, Hopewell Culture NHP, Golden Gate NRA, Ross Lake NRA, Lake Chelan NRA, Big South Fork NRNA, Gauley River NRA, Herbert Hoover NHS, Edgar Allan Poe NHS, Gloria Dei Church NHS, Lincoln Home NHS, Ulysses S. Grant NHS, Boston African American NHS, Fort Point NHS, Saugus Iron Works NHS, Salem Maritime NHS, Statue of Liberty NM, Fort Pulaski NM, Muir Woods NM, Fort Mchenry NM, Florissant Fossil Beds NM, Lewis & Clark NHT, Washington-Rochambeau Revolutionary Route NHT, Star-Spangled Banner NHT, Juan Bautista de Anza NHT, Sleeping Bear Dunes NL, Ice Age NST, Appalachian NST, Point Reyes NS, Obed WSR, Bluestone NSR, Korean War Veterans Memorial, Lincoln Memorial, Pullman Memorial, Pearl Harbor Memorial, Vietnam Veterans Memorial, White House, Alcatraz Island, Presidio of San Francisco, Washington Monument, World War II Memorial, Wing Luke Museum Affiliated Area, Blue Ridge Parkway, Baltimore-Washington Parkway

  • -
  • Visit 30 Airpots (42/30)
    -KORD, KJFK, ZSHC, ZSPD, KATL, KDTW, ZYHB, ZBTJ, ZBAA, ZJSY, ZLXY, KLAX, KSFO, KLEX, KEWR, KLGA, VHHH, KMIA, KSJC, KMCO, KMSP, KLAN, KDFW, KDEN, ZSAM, KIAH, KAUS, KBWI, KDCA, KSEA, KCID, KSLC, KLAS, KHNL, KIAD, KBRL, ZGGG, ZGSZ, VVTS, ZSYT, KBOS, KBDL

    +
  • Visit 30 Airports (44/30)
    +KORD, KJFK, ZSHC, ZSPD, KATL, KDTW, ZYHB, ZBTJ, ZBAA, ZJSY, ZLXY, KLAX, KSFO, KLEX, KEWR, KLGA, VHHH, KMIA, KSJC, KMCO, KMSP, KLAN, KDFW, KDEN, ZSAM, KIAH, KAUS, KBWI, KDCA, KSEA, KCID, KSLC, KLAS, KHNL, KIAD, KBRL, ZGGG, ZGSZ, VVTS, ZSYT, KBOS, KBDL, KPDX, CYVR

  • Take flights with 30 Airplane Companies (18/30)
    HU, CA, DL, AA, UA, CZ, MU, JD, CX, MF, KA, F9, NK, WN, AS, AC, 9K, NK

    diff --git a/_includes/sections/publication.html b/_includes/sections/publication.html index ac2b393..3b13290 100644 --- a/_includes/sections/publication.html +++ b/_includes/sections/publication.html @@ -5,25 +5,469 @@

    Publication

    -
    -
    -
    - {% for publication in site.data.index.publication %} - {% assign loopindex = forloop.index | modulo: 2 %} -
    - -
    - -

    {{ publication.name.detail }}

    -

    {{ publication.desc.detail }}

    - {{ publication.date.detail }} -
    {{ publication.job.detail }}
    -
    -
    - {% endfor %} -
    -
    -
    +
    + + + +
    + +
    + + Fast and Low-Cost Genomic Foundation Models via Outlier Removal + +
    + Haozheng Luo*, Chenghao Qiu*, Maojiang Su, Zhihan Zhou, Zoe Mehta, Guo Ye, Jerry Yao-Chieh Hu, Han Liu +
    + International Conference on Machine Learning (ICML) 2025 +
    + paper / + code / + model +

    +

    + GERM is a genomic foundation model optimized for low-resource settings by removing outliers, enhancing low-rank adaptation and quantization, achieving up to 64.34% efficiency gains and 37.98% better fine-tuning performance over baseline models. +

    +
    +
    + + +
    + +
    + + Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs' Refusal Boundaries + +
    + Jiahao Yu*, Haozheng Luo*, Jerry Yao-Chieh Hu, Yan Chen, Wenbo Guo, Han Liu, Xinyu Xing +
    + USENIX Security Symposium (USENIX Security) 2025 +
    + paper / + code +

    +

    + Mind the Inconspicuous is a study showing that appending multiple end-of-sequence (eos) tokens triggers context segmentation in aligned LLMs, shifting inputs toward refusal boundaries and enabling jailbreaks, with up to 16× increased attack success rates across 16 models and major APIs like OpenAI and Anthropic. +

    +
    +
    + + +
    + +
    + + GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models + +
    + Haozheng Luo*, Chenghao Qiu*, Yimin Wang, Shang Wu, Jiahao Yu, Han Liu, Binghui Wang, Yan Chen +
    + Preprint, 2025 +
    + paper / + code / + datasets +

    +

    + GenoArmory is the first unified adversarial attack benchmark for Genomic Foundation Models (GFMs), offering a comprehensive framework and the GenoAdv dataset to evaluate model vulnerabilities across architectures, quantization, and tasks, revealing that classification GFMs are more robust than generative ones and that attacks often target biologically meaningful regions. + +

    +
    +
    + + +
    + +
    + + Knowledge‑Distilled Memory Editing for Plug‑and‑Play LLM Alignment + +
    + Haozheng Luo*, Jiahao Yu*, Wenxin Zhang*, Jialong Li, Jerry Yao-Chieh Hu, Yan Chen, Binghui Wang, Xinyu Xing, Han Liu +
    + Workshop on MemFM @ ICML 2025 +
    + paper / + code +

    +

    + We propose a low-resource method to align LLMs for safety by distilling alignment-relevant knowledge from well-aligned models and identifying essential components via delta debugging, enabling plug-and-play integration into unaligned LLMs. + +

    +
    +
    + + +
    + +
    + + Efficient Temporal Tokenization for Mobility Prediction with Large Language Models + +
    + Haoyu He*, Haozheng Luo*, Yan Chen, Qi R. Wang +
    + Workshop on Efficient Systems for Foundation Models III @ ICML 2025 +
    + paper + +

    +

    + RHYTHM is a framework that uses hierarchical temporal tokenization and frozen LLMs to efficiently model human mobility, achieving 2.4% higher accuracy (5.0% on weekends) and 24.6% faster training by capturing spatio-temporal dependencies with reduced sequence lengths and enriched prompt embeddings. +

    +
    +
    + + +
    + +
    + + SMUTF: Schema Matching Using Generative Tags and Hybrid Features + +
    + Yu Zhang*, Mei Di*, Haozheng Luo*, Chenwei Xu, Richard Tzong-Han Tsai +
    + Information Systems Volume 133, 2025 +
    + paper / + code +

    +

    + SMUTF is a schema matching framework that combines rule-based features, pre-trained and generative LLMs with novel “generative tags” to enable effective cross-domain matching, achieving up to 11.84% F1 and 5.08% AUC gains over SOTA, with the new HDXSM dataset released to support large-scale open-domain schema matching. + +

    +
    +
    + + +
    + +
    + + Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models + +
    + Zhenyu Pan, Haozheng Luo, Manling Li, Han Liu +
    + International Conference on Learning Representations (ICLR) 2025 +
    + paper / + code +

    +

    + CoA is a Chain-of-Action framework for multimodal and retrieval-augmented QA that decomposes complex questions into reasoning steps with plug-and-play retrieval actions, reducing hallucinations and token usage while improving reasoning and factual accuracy across benchmarks and a Web3 case study. + +

    +
    +
    + + +
    + +
    + + Outlier Efficient Modern Hopfield Model for Large Transformer-Based Models + +
    + Jerry Yao-Chieh Hu*, Pei-Hsuan Chang*, Haozheng Luo*, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu +
    + International Conference on Machine Learning (ICML) 2024 +
    + paper / + code / + model +

    +

    + We debut an outlier-efficient modern Hopfield model, OutEffHop, providing robust outlier-reduction for large transformer-based models from associative memory models. + +

    +
    +
    + + +
    + +
    + + Fast Adaptation and Robust Quantization of Multi-Modal Foundation Models from Associative Memory - A Case Study in SpeechLM + +
    + Shang Wu*, Yen-Ju Lu*, Haozheng Luo*, Jerry Yao-Chieh Hu, Jiayi Wang, Najim Dehak, Jesus Villalba, Han Liu +
    + Workshop on Efficient Systems for Foundation Models II @ ICML 2024 +
    + paper + +

    +

    + SpARQ is an outlier-free SpeechLM framework that replaces attention with a stabilized layer to mitigate performance drops from cross-modal low-rank adaptation and quantization, achieving 41% and 45% relative improvements respectively, plus 1.33× faster training on OPT-1.3B across ASR, TTS, and multi-modal tasks. +

    +
    +
    + + +
    + +
    + + Open-Ended Multi-Modal Relational Reasoning for Video Question Answering + +
    + Haozheng Luo*, Ruiyang Qin*, Chenwei Xu, Guo Ye, Zening Luo +
    + IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) 2023 +
    + paper / + code +

    +

    + We introduce a robotic agent that combines video recognition and language models to assist users through language-based interactions in video scenes, showing improved human-robot interaction efficiency and achieving 2–3% gains over benchmark methods. + +

    +
    +
    + + +
    + +
    + + IGN: Implicit Generative Networks + +
    + Haozheng Luo, Tianyi Wu, Feiyu Han, Zhijun Yan +
    + IEEE International Conference on Machine Learning and Applications (ICMLA) 2022 +
    + paper / + code +

    +

    + IGN is a distributional reinforcement learning model that integrates GAN-based quantile regression with IQN, achieving state-of-the-art performance and risk-sensitive policy optimization across 57 Atari games. + +

    +
    +
    + + +
    + +
    + + Question Classification with Deep Contextualized Transformer + +
    + Haozheng Luo, Ningwei Liu, Charles Feng +
    + Future of Information and Communication Conference (FICC) 2022 +
    + paper +

    +

    + We present a Deep Contextualized Transformer model that enhances QA classification by handling aberrant expressions, achieving up to 83.1% accuracy on SQuAD and SwDA datasets—outperforming prior models for industry-level QA tasks. +

    +
    + diff --git a/images/BOOST.png b/images/BOOST.png new file mode 100644 index 0000000..c03f986 Binary files /dev/null and b/images/BOOST.png differ diff --git a/images/CoA.png b/images/CoA.png new file mode 100644 index 0000000..405662e Binary files /dev/null and b/images/CoA.png differ diff --git a/images/GERM.png b/images/GERM.png new file mode 100644 index 0000000..c594e72 Binary files /dev/null and b/images/GERM.png differ diff --git a/images/IGN.png b/images/IGN.png new file mode 100644 index 0000000..06332b6 Binary files /dev/null and b/images/IGN.png differ diff --git a/images/SMUTF.png b/images/SMUTF.png new file mode 100644 index 0000000..68b5fc6 Binary files /dev/null and b/images/SMUTF.png differ diff --git a/images/dapa.png b/images/dapa.png new file mode 100644 index 0000000..aff3d92 Binary files /dev/null and b/images/dapa.png differ diff --git a/images/geno.png b/images/geno.png new file mode 100644 index 0000000..80b963f Binary files /dev/null and b/images/geno.png differ diff --git a/images/open.png b/images/open.png new file mode 100644 index 0000000..6274857 Binary files /dev/null and b/images/open.png differ diff --git a/images/out.png b/images/out.png new file mode 100644 index 0000000..914da1b Binary files /dev/null and b/images/out.png differ diff --git a/images/qa.png b/images/qa.png new file mode 100644 index 0000000..781d7fa Binary files /dev/null and b/images/qa.png differ