On the Insufficiency of Existing Momentum Schemes for Stochastic Optimization
Kidambi, Rahul, Netrapalli, Praneeth, Jain, Prateek, Kakade, Sham
Published in 2018 Information Theory and Applications Workshop (ITA) (01.02.2018)
Conference Proceeding
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Swamy, Gokul, Dann, Christoph, Kidambi, Rahul, Wu, Zhiwei Steven, Agarwal, Alekh
Published in arXiv.org (13.06.2024)
Paper
Journal Article
Auctions with LLM Summaries
Dubey, Kumar Avinava, Feng, Zhe, Kidambi, Rahul, Mehta, Aranyak, Wang, Di
Published in arXiv.org (11.04.2024)
Paper
Journal Article
Enhancing Group Fairness in Online Settings Using Oblique Decision Forests
Somnath Basu Roy Chowdhury, Monath, Nicholas, Beirami, Ahmad, Kidambi, Rahul, Dubey, Avinava, Ahmed, Amr, Chaturvedi, Snigdha
Published in arXiv.org (28.04.2024)
Paper
Journal Article
MOReL: Model-Based Offline Reinforcement Learning
Kidambi, Rahul, Rajeswaran, Aravind, Netrapalli, Praneeth, Joachims, Thorsten
Published in arXiv.org (02.03.2021)
Paper
Journal Article
On Shannon capacity and causal estimation
Kidambi, Rahul, Kannan, Sreeram
Published in 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton) (01.09.2015)
Conference Proceeding
Counterfactual Learning To Rank for Utility-Maximizing Query Autocompletion
Block, Adam, Kidambi, Rahul, Hill, Daniel N, Joachims, Thorsten, Dhillon, Inderjit S
Published in arXiv.org (22.04.2022)
Paper
Journal Article
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage
Chang, Jonathan D, Uehara, Masatoshi, Sreenivas, Dhruv, Kidambi, Rahul, Sun, Wen
Published in arXiv.org (31.01.2022)
Paper
Journal Article
Deformable trellises on factor graphs for robust microtubule tracking in clutter
Kidambi, R., Shih, Min-Chi, Rose, K.
Published in 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI) (01.05.2012)
Conference Proceeding
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Wang, Kaiwen, Kidambi, Rahul, Sullivan, Ryan, Agarwal, Alekh, Dann, Christoph, Michi, Andrea, Gelmi, Marco, Li, Yunxuan, Gupta, Raghav, Dubey, Avinava, Ramé, Alexandre, Ferret, Johan, Cideron, Geoffrey, Hou, Le, Yu, Hongkun, Ahmed, Amr, Mehta, Aranyak, Hussenot, Léonard, Bachem, Olivier, Leurent, Edouard
Year of Publication 22.07.2024
Journal Article
Top-$k$ eXtreme Contextual Bandits with Arm Hierarchy
Sen, Rajat, Rakhlin, Alexander, Ying, Lexing, Kidambi, Rahul, Foster, Dean, Hill, Daniel, Dhillon, Inderjit
Year of Publication 15.02.2021
Journal Article
Making Paper Reviewing Robust to Bid Manipulation Attacks
Wu, Ruihan, Guo, Chuan, Wu, Felix, Kidambi, Rahul, van der Maaten, Laurens, Weinberger, Kilian Q
Published in arXiv.org (22.02.2021)
Paper
Journal Article
On the insufficiency of existing momentum schemes for Stochastic Optimization
Kidambi, Rahul, Netrapalli, Praneeth, Jain, Prateek, Kakade, Sham M
Published in arXiv.org (31.07.2018)
Paper
Journal Article
Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification
Jain, Prateek, Kakade, Sham M, Kidambi, Rahul, Netrapalli, Praneeth, Sidford, Aaron
Published in arXiv.org (31.07.2018)
Paper
Journal Article
Accelerating Stochastic Gradient Descent For Least Squares Regression
Jain, Prateek, Kakade, Sham M, Kidambi, Rahul, Netrapalli, Praneeth, Sidford, Aaron
Published in arXiv.org (31.07.2018)
Paper
Journal Article
A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)
Jain, Prateek, Kakade, Sham M, Kidambi, Rahul, Netrapalli, Praneeth, Pillutla, Venkata Krishna, Sidford, Aaron
Published in arXiv.org (21.07.2018)
Paper
Journal Article
Leverage Score Sampling for Faster Accelerated Regression and ERM
Agarwal, Naman, Kakade, Sham, Kidambi, Rahul, Lee, Yin Tat, Netrapalli, Praneeth, Sidford, Aaron
Published in arXiv.org (22.11.2017)
Paper
Journal Article
Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles
Mahajan, Dhruv, Gupta, Vivek, Keerthi, S Sathiya, Sundararajan, Sellamanickam, Narayanamurthy, Shravan, Kidambi, Rahul
Published in arXiv.org (15.11.2017)
Paper
Journal Article