Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks
Wang, Bolun, Yao, Yuanshun, Shan, Shawn, Li, Huiying, Viswanath, Bimal, Zheng, Haitao, Zhao, Ben Y.
Published in 2019 IEEE Symposium on Security and Privacy (SP) (01.05.2019)
Published in 2019 IEEE Symposium on Security and Privacy (SP) (01.05.2019)
Get full text
Conference Proceeding
Backdoor Attacks Against Deep Learning Systems in the Physical World
Wenger, Emily, Passananti, Josephine, Bhagoji, Arjun Nitin, Yao, Yuanshun, Zheng, Haitao, Zhao, Ben Y.
Published in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2021)
Published in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2021)
Get full text
Conference Proceeding
A daily global mesoscale ocean eddy dataset from satellite altimetry
Faghmous, James H, Frenger, Ivy, Yao, Yuanshun, Warmka, Robert, Lindell, Aron, Kumar, Vipin
Published in Scientific data (09.06.2015)
Published in Scientific data (09.06.2015)
Get full text
Journal Article
"My face, my rules": Enabling Personalized Protection Against Unacceptable Face Editing
Xiao, Zhujun, Cryan, Jenna, Yao, Yuanshun, Cheo, Yi Hong Gordon, Shu, Yuanchao, Saroiu, Stefan, Zhao, Ben Y., Zheng, Haitao
Published in Proceedings on Privacy Enhancing Technologies (01.07.2023)
Published in Proceedings on Privacy Enhancing Technologies (01.07.2023)
Get full text
Journal Article
Learning to Watermark LLM-generated Text via Reinforcement Learning
Get full text
Paper
Journal Article
Fairness Without Harm: An Influence-Guided Active Sampling Approach
Pang, Jinlong, Wang, Jialu, Zhu, Zhaowei, Yao, Yuanshun, Chen, Qian, Liu, Yang
Published in arXiv.org (31.05.2024)
Published in arXiv.org (31.05.2024)
Get full text
Paper
Journal Article
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Shen, Wei, Zhang, Xiaoying, Yao, Yuanshun, Zheng, Rui, Guo, Hongyi, Liu, Yang
Published in arXiv.org (14.03.2024)
Published in arXiv.org (14.03.2024)
Get full text
Paper
Journal Article
Human-Instruction-Free LLM Self-Alignment with Limited Samples
Guo, Hongyi, Yao, Yuanshun, Shen, Wei, Wei, Jiaheng, Zhang, Xiaoying, Wang, Zhaoran, Liu, Yang
Year of Publication 06.01.2024
Year of Publication 06.01.2024
Get full text
Journal Article
Differentially Private Label Protection in Split Learning
Yang, Xin, Sun, Jiankai, Yao, Yuanshun, Xie, Junyuan, Wang, Chong
Published in arXiv.org (04.03.2022)
Published in arXiv.org (04.03.2022)
Get full text
Paper
Journal Article
Fair Classifiers that Abstain without Harm
Yin, Tongxin, Jean-François Ton, Guo, Ruocheng, Yao, Yuanshun, Liu, Mingyan, Liu, Yang
Published in arXiv.org (09.10.2023)
Published in arXiv.org (09.10.2023)
Get full text
Paper
Journal Article
Label Smoothing Improves Machine Unlearning
Di, Zonglin, Zhu, Zhaowei, Jia, Jinghan, Liu, Jiancheng, Takhirov, Zafar, Jiang, Bo, Yao, Yuanshun, Liu, Sijia, Liu, Yang
Year of Publication 11.06.2024
Year of Publication 11.06.2024
Get full text
Journal Article
Measuring and Reducing LLM Hallucination without Gold-Standard Answers
Jiaheng Wei, Yao, Yuanshun, Jean-Francois, Ton, Guo, Hongyi, Estornell, Andrew, Liu, Yang
Published in arXiv.org (06.06.2024)
Published in arXiv.org (06.06.2024)
Get full text
Paper
Journal Article
Defending against Reconstruction Attack in Vertical Federated Learning
Sun, Jiankai, Yao, Yuanshun, Gao, Weihao, Xie, Junyuan, Wang, Chong
Published in arXiv.org (21.07.2021)
Published in arXiv.org (21.07.2021)
Get full text
Paper
Journal Article