Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent

Authors: Guitart Fernández, Jordi
Subject:
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors; High performance computing; Cloud computing; Checkpoint/restore; Live migration; Diskless migration; Iterative migration; Networking migration; CRIU; RunC; Containerization; HPC cloud; Supercomputadors; Computació en núvol
Record type:
article in journal/newspaper
Language:
English

Additional information
- Contributors:
  Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors; Barcelona Supercomputing Center; Universitat Politècnica de Catalunya. CROMAI - Computing Resources Orchestration and Management for AI
- Publication year:
  2024
- Collection:
  Universitat Politècnica de Catalunya, BarcelonaTech: UPCommons - Global access to UPC knowledge
- Abstract:
  Checkpoint/Restore techniques have been widely used by the High Performance Computing (HPC) community in the context of failure recovery. Given the current trend in HPC to use containerization to obtain fast, customized, portable, flexible, and reproducible deployments of workloads, as well as efficient and reliable sharing and management of HPC Cloud infrastructures, there is a need to integrate Checkpoint/Restore with containerization in such a way that the freeze time of the application is minimal and live migrations are practicable. Whereas current Checkpoint/Restore tools (such as CRIU) support several options to accomplish this, most of them are rarely exploited in HPC Clouds and, consequently, their potential impact on performance is barely known. Therefore, this paper explores the use of CRIU’s advanced features to implement diskless, iterative (pre-copy and post-copy) migrations of containers with external network namespaces and established TCP connections, so that memory-intensive and connection-persistent HPC applications can live-migrate. Our extensive experiments to characterize the performance impact of those features demonstrate that properly configured live migrations incur low application downtime and memory/disk usage and are indeed feasible in containerized HPC Clouds. ; This research was partially supported by the Spanish Government under contract PID2019-107255GB-C22, by the Generalitat de Catalunya, Spain under contract 2021-SGR-00478, and by the EU-HORIZON programme under grant agreement 101092646. ; Peer Reviewed ; Postprint (published version)
- File Description:
  17 p.; application/pdf
- ISSN:
  1383-7621
- Relation:
  https://www.sciencedirect.com/science/article/pii/S1383762124000948; info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-107255GB-C22/ES/UPC-COMPUTACION DE ALTAS PRESTACIONES VIII/; Guitart, J. Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent. "Journal of systems architecture", Juliol 2024, vol. 152, article 103157.; http://hdl.handle.net/2117/408410
- DOI:
  10.1016/j.sysarc.2024.103157
- Online access:
  http://hdl.handle.net/2117/408410
  https://doi.org/10.1016/j.sysarc.2024.103157
- Rights:
  Attribution-NonCommercial 4.0 International ; http://creativecommons.org/licenses/by-nc/4.0/ ; Open Access
- Accession number:
  edsbas.12757084
