Lifelong learning in costly feature spaces
Balcan, Maria-Florina, Blum, Avrim, Nagarajan, Vaishnavh
Published in Theoretical computer science (12.02.2020)
Journal Article
What do larger image classifiers memorise?
Lukasik, Michal, Nagarajan, Vaishnavh, Rawat, Ankit Singh, Menon, Aditya Krishna, Kumar, Sanjiv
Year of Publication 08.10.2023
Journal Article
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Jin, Tian, Clement, Nolan, Dong, Xin, Nagarajan, Vaishnavh, Carbin, Michael, Ragan-Kelley, Jonathan, Dziugaite, Gintare Karolina
Year of Publication 06.10.2023
Journal Article
Think before you speak: Training Language Models With Pause Tokens
Goyal, Sachin, Ji, Ziwei, Rawat, Ankit Singh, Menon, Aditya Krishna, Kumar, Sanjiv, Nagarajan, Vaishnavh
Year of Publication 03.10.2023
Journal Article
Assessing Generalization of SGD via Disagreement
Jiang, Yiding, Nagarajan, Vaishnavh, Baek, Christina, Kolter, J. Zico
Year of Publication 25.06.2021
Journal Article
On student-teacher deviations in distillation: does it pay to disobey?
Nagarajan, Vaishnavh, Menon, Aditya Krishna, Bhojanapalli, Srinadh, Mobahi, Hossein, Kumar, Sanjiv
Year of Publication 30.01.2023
Journal Article