Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses
Goldblum, Micah, Tsipras, Dimitris, Xie, Chulin, Chen, Xinyun, Schwarzschild, Avi, Song, Dawn, Madry, Aleksander, Li, Bo, Goldstein, Tom
Published in IEEE transactions on pattern analysis and machine intelligence (01.02.2023)
Published in IEEE transactions on pattern analysis and machine intelligence (01.02.2023)
Get full text
Journal Article
Universal Guidance for Diffusion Models
Bansal, Arpit, Chu, Hong-Min, Schwarzschild, Avi, Sengupta, Soumyadip, Goldblum, Micah, Geiping, Jonas, Goldstein, Tom
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2023)
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2023)
Get full text
Conference Proceeding
Headless Horseman: Adversarial Attacks on Transfer Learning Models
Abdelkader, Ahmed, Curry, Michael J., Fowl, Liam, Goldstein, Tom, Schwarzschild, Avi, Shu, Manli, Studer, Christoph, Zhu, Chen
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Forcing Diffuse Distributions out of Language Models
Zhang, Yiming, Schwarzschild, Avi, Carlini, Nicholas, Kolter, Zico, Ippolito, Daphne
Year of Publication 16.04.2024
Year of Publication 16.04.2024
Get full text
Journal Article
Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective
Schwarzschild, Avi, Cembalest, Max, Rao, Karthik, Hines, Keegan, Dickerson, John
Year of Publication 23.03.2023
Year of Publication 23.03.2023
Get full text
Journal Article
Neural Auctions Compromise Bidder Information
Stein, Alex, Schwarzschild, Avi, Curry, Michael, Goldstein, Tom, Dickerson, John
Year of Publication 28.02.2023
Year of Publication 28.02.2023
Get full text
Journal Article
Rethinking LLM Memorization through the Lens of Adversarial Compression
Schwarzschild, Avi, Feng, Zhili, Maini, Pratyush, Lipton, Zachary C, Kolter, J. Zico
Year of Publication 23.04.2024
Year of Publication 23.04.2024
Get full text
Journal Article
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Hans, Abhimanyu, Schwarzschild, Avi, Cherepanova, Valeriia, Kazemi, Hamid, Saha, Aniruddha, Goldblum, Micah, Geiping, Jonas, Goldstein, Tom
Year of Publication 22.01.2024
Year of Publication 22.01.2024
Get full text
Journal Article
TOFU: A Task of Fictitious Unlearning for LLMs
Maini, Pratyush, Feng, Zhili, Schwarzschild, Avi, Lipton, Zachary C, Kolter, J. Zico
Year of Publication 11.01.2024
Year of Publication 11.01.2024
Get full text
Journal Article
The CLRS-Text Algorithmic Reasoning Language Benchmark
Markeeva, Larisa, McLeish, Sean, Ibarz, Borja, Bounsi, Wilfried, Kozlova, Olga, Vitvitskyi, Alex, Blundell, Charles, Goldstein, Tom, Schwarzschild, Avi, Veličković, Petar
Year of Publication 06.06.2024
Year of Publication 06.06.2024
Get full text
Journal Article
Adversarial Attacks on Machine Learning Systems for High-Frequency Trading
Goldblum, Micah, Schwarzschild, Avi, Patel, Ankit B, Goldstein, Tom
Published in arXiv.org (29.10.2021)
Published in arXiv.org (29.10.2021)
Get full text
Paper
Journal Article
Universal Guidance for Diffusion Models
Bansal, Arpit, Chu, Hong-Min, Schwarzschild, Avi, Sengupta, Soumyadip, Goldblum, Micah, Geiping, Jonas, Goldstein, Tom
Year of Publication 14.02.2023
Year of Publication 14.02.2023
Get full text
Journal Article
Effective Backdoor Mitigation Depends on the Pre-training Objective
Verma, Sahil, Bhatt, Gantavya, Schwarzschild, Avi, Singhal, Soumye, Das, Arnav Mohanty, Shah, Chirag, Dickerson, John P, Bilmes, Jeff
Year of Publication 25.11.2023
Year of Publication 25.11.2023
Get full text
Journal Article
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Ding, Mucong, Deng, Chenghao, Choo, Jocelyn, Wu, Zichu, Agrawal, Aakriti, Schwarzschild, Avi, Zhou, Tianyi, Goldstein, Tom, Langford, John, Anandkumar, Anima, Huang, Furong
Year of Publication 26.09.2024
Year of Publication 26.09.2024
Get full text
Journal Article
The Uncanny Similarity of Recurrence and Depth
Schwarzschild, Avi, Gupta, Arjun, Ghiasi, Amin, Goldblum, Micah, Goldstein, Tom
Year of Publication 22.02.2021
Year of Publication 22.02.2021
Get full text
Journal Article
Transformers Can Do Arithmetic with the Right Embeddings
McLeish, Sean, Bansal, Arpit, Stein, Alex, Jain, Neel, Kirchenbauer, John, Bartoldson, Brian R, Kailkhura, Bhavya, Bhatele, Abhinav, Geiping, Jonas, Schwarzschild, Avi, Goldstein, Tom
Year of Publication 27.05.2024
Year of Publication 27.05.2024
Get full text
Journal Article