Performance Implications of Failures in Large-Scale Cluster Scheduling
Zhang, Yanyong, Squillante, Mark S., Sivasubramaniam, Anand, Sahoo, Ramendra K.
Published in Job Scheduling Strategies for Parallel Processing (2005)
Published in Job Scheduling Strategies for Parallel Processing (2005)
Get full text
Book Chapter
Conference Proceeding
Failure Prediction in IBM BlueGene/L Event Logs
Yinglung Liang, Yanyong Zhang, Hui Xiong, Sahoo, R.
Published in Seventh IEEE International Conference on Data Mining (ICDM 2007) (01.10.2007)
Published in Seventh IEEE International Conference on Data Mining (ICDM 2007) (01.10.2007)
Get full text
Conference Proceeding
Temporal profiling of uplift rate along an active fault using river long profile in the Kuchchh region, Western India
Sonam, Sahoo, Ramendra, Singh, R.N., Jain, Vikrant
Published in Quaternary international (30.05.2021)
Published in Quaternary international (30.05.2021)
Get full text
Journal Article
A plural knowledges model to support sustainable management of dryland rivers in western India
Brierley, Gary, Sahoo, Sonam, Danino, Michel, Fryirs, Kirstie, Pandey, Chhavi N., Sahoo, Ramendra, Khan, Sana, Mohapatra, Pranab, Jain, Vikrant
Published in River research and applications (01.11.2023)
Published in River research and applications (01.11.2023)
Get full text
Journal Article
BlueGene/L Failure Analysis and Prediction Models
Liang, Y., Zhang, Y., Jette, M., Anand Sivasubramaniam, Sahoo, R.
Published in International Conference on Dependable Systems and Networks (DSN'06) (2006)
Published in International Conference on Dependable Systems and Networks (DSN'06) (2006)
Get full text
Conference Proceeding
EXPANDING METHOD FOR CONTINUOUSLY MONITORING REMOTELY ACCESSIBLE RESOURCES IN ORDER TO PREPARE FOR NODE FAILURE IN WIDE CLUSTERS, ESPECIALLY RELATED TO A METHOD FOR MANAGING EXPANDED RESOURCES, WHICH IS USEFUL IN A SYSTEM WHICH HAS MANY NODES AND ARE TOLERABLE TO A MANAGEMENT NODE FAILURE FOR RESOURCES LOCATED IN REMOTE NODES
Get full text
Patent
Blue Gene/L programming and operating environment
Moreira, J. E., Almasi, G., Archer, C., Bellofatto, R., Bergner, P., Brunheroto, J. R., Brutman, M., Castanos, J. G., Crumley, P. G., Gupta, M., Inglett, T., Lieber, D., Limpert, D., McCarthy, P., Megerian, M., Mendell, M., Mundy, M., Reed, D., Sahoo, R. K., Sanomiya, A., Shok, R., Smith, B., Stewart, G. G.
Published in IBM journal of research and development (01.03.2005)
Published in IBM journal of research and development (01.03.2005)
Get full text
Journal Article
Optimization of fast Fourier transforms on the Blue Gene/L supercomputer
Sabharwal, Yogish, Garg, Saurabh K., Garg, Rahul, Gunnels, John A., Sahoo, Ramendra K.
Published in Proceedings of the 15th international conference on High performance computing (17.12.2008)
Published in Proceedings of the 15th international conference on High performance computing (17.12.2008)
Get full text
Conference Proceeding
The Blue Gene/L Supercomputer: A Hardware and Software Story
Moreira, Jose E, Salapura, Valentina, Almasi, George, Archer, Charles, Bellofatto, Ralph, Bergner, Peter, Bickford, Randy, Blumrich, Mathias, Brunheroto, Jose R, Bright, Arthur A, Brutman, Michael, Castanos, Jose G, Chen, Dong, Coteus, Paul, Crumley, Paul, Ellis, Sam
Published in International journal of parallel programming (01.06.2007)
Published in International journal of parallel programming (01.06.2007)
Get full text
Journal Article
Evaluating cooperative checkpointing for supercomputing systems
Oliner, Adam, Sahoo, Ramendra
Published in Proceedings of the 20th international conference on Parallel and distributed processing (25.04.2006)
Published in Proceedings of the 20th international conference on Parallel and distributed processing (25.04.2006)
Get full text
Conference Proceeding
Lossless compression for large scale cluster logs
Balakrishnan, Raju, Sahoo, Ramendra K.
Published in Proceedings of the 20th international conference on Parallel and distributed processing (25.04.2006)
Published in Proceedings of the 20th international conference on Parallel and distributed processing (25.04.2006)
Get full text
Conference Proceeding