FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
Kadhe, Swanand Ravindra, Halimi, Anisa, Rawat, Ambrish, Baracaldo, Nathalie
Year of Publication 12.12.2023
Year of Publication 12.12.2023
Get full text
Journal Article
MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
Cornacchia, Giandomenico, Zizzo, Giulio, Fraser, Kieran, Hameed, Muhammad Zaid, Rawat, Ambrish, Purcell, Mark
Year of Publication 26.09.2024
Year of Publication 26.09.2024
Get full text
Journal Article
Domain Adaptation for Time series Transformers using One-step fine-tuning
Khanal, Subina, Tirupathi, Seshu, Zizzo, Giulio, Rawat, Ambrish, Pedersen, Torben Bach
Year of Publication 12.01.2024
Year of Publication 12.01.2024
Get full text
Journal Article
Certified Federated Adversarial Training
Zizzo, Giulio, Rawat, Ambrish, Sinn, Mathieu, Maffeis, Sergio, Hankin, Chris
Year of Publication 20.12.2021
Year of Publication 20.12.2021
Get full text
Journal Article
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Foley, Myles, Rawat, Ambrish, Lee, Taesung, Hou, Yufang, Picco, Gabriele, Zizzo, Giulio
Year of Publication 15.06.2023
Year of Publication 15.06.2023
Get full text
Journal Article
Robust Learning Protocol for Federated Tumor Segmentation Challenge
Rawat, Ambrish, Zizzo, Giulio, Kadhe, Swanand, Epperlein, Jonathan P, Braghin, Stefano
Year of Publication 16.12.2022
Year of Publication 16.12.2022
Get full text
Journal Article