2011 Poster Sessions : Fast Crash Recovery in RAMCloud

Student Name : Diego Ongaro
Advisor : John K. Ousterhout
Research Areas: Computer Systems
Abstract:
RAMCloud is a DRAM-based storage system that provides inexpensive durability and availability by recovering quickly after crashes, rather than storing replicas in DRAM. RAMCloud scatters backup data across hundreds or thousands of disks, and it harnesses hundreds of servers in parallel to reconstruct lost data. The system uses a log-structured approach for all its data, in DRAM as well as on disk; this provides high performance both during normal operation and during recovery. RAMCloud employs randomized techniques to manage the system in a scalable and decentralized fashion. In a 36-node cluster,RAMCloud recovers 6.6 GB of server data in about 1 second; we argue that the approach will scale to recover larger memory sizes (64 GB or more) in the same time with larger clusters.

Bio:
Diego is a 2nd-year Ph.D. student in Computer Science, focusing on systems research under Professor John Ousterhout.