翻译自:http://www.pixelbeat.org/docs/openstack_libvirt_images/

The main stages of a Virtual Machine disk image as it transfers through OpenStack to be booted under libvirt are:

Initially the image is downloaded from glance and cached in libvirt base. We'll consider the options for handling a qcow2 image stored in glance, as that format can be downloaded quite efficiently from glance as it supports compression, and image sparseness can be maintained. This article will focus on the flow and transformations in "libvirt base", which is used to cache, preprocess and optionally back, VM disk images.

Configuration

First we'll summarize the config variables involved, before presenting the operations associated with each config combination, in each OpenStack release. Note I'm describing upstream OpenStack here, and not my employer's Red Hat OpenStack which has back-ported enhancements between versions where appropriate.

Config Default Release Description
use_cow_images True Cactus Whether to use CoW images for "libvirt instance disks"
force_raw_images True Essex Allows disabling convert to raw in "libvirt base" for operational reasons
libvirt_images_type 'default' Folsom Deprecates use_cow_images and allows selecting LVM libvirt images
[libvirt]/images_type 'default' Icehouse Deprecates libvirt_images_type in the [DEFAULT] section
preallocate_images 'none' Grizzly Instance disks preallocation mode. 'space' => fallocate images
resize_fs_using_block_device False Havana Allows enabling of direct resize for qcow2 images

The main reason that raw images are written in "libvirt base" by default (since Diablo), is to remove possible compression from the qcow2 image received from glance. Note compression in qcow2 images is read only, and so this will impact reads from unwritten portions of the qcow2 image. Users may want to change this option, depending on CPU resources and I/O bandwidth available. For example, systems with slower I/O or less space available, may want to trade the higher CPU requirements of compression, to minimize input bandwidth. Note raw images are used unconditionally with libvirt_images_type=lvm.

Whether to use CoW images for the "libvirt instance disks" also depends on I/O characteristics of the user's system. Without CoW, more space will be used for common parts of the disk image, but on the flip side depending on the backing store and host caching, there may be better concurrency achieved by having each VM operate on its own copy.

Enabling preallocation of space for the "libvirt instance disks" can help with both space guarantees and I/O performance. Even when not using CoW instance disks, the copy each VM gets is sparse and so the VM may fail unexpectedly at run time with ENOSPC. By running fallocate(1) on the instance disk images, we immediately and efficiently allocate the space for them in the file system (if supported). Also run time performance should be improved as the file system doesn't have to dynamically allocate blocks at run time, reducing CPU overhead and more importantly file fragmentation.

Disk image operations

For each release and config combination, here are the created files and associated operations in getting a qcow2 image from glance through to being booted in a libvirt Virtual Machine.

Folsom, force_raw_images=True, use_cow_images=True

This results in each instance booting from a CoW image, backed by a resized raw image.

Nova command Source code Notes
wget http://glance/$image -O base_/$hex.part images.fetch  
qemu-img convert -O raw $hex.part $hex.converted images.fetch_to_raw Creates sparse file
mv $hex.converted $hex; rm $hex.part images.fetch_to_raw  
  imagebackend.create_image  
cp $hex $hex_$size   libvirt.utils.copy_image Creates sparse file
qemu-img resize $hex_$size $size   disk.extend  
resize2fs $hex_size   disk.extend Unpartitioned ext[234]
qemu-img create -f qcow2 -o backing_file=... $instance_dir/disk   libvirt.utils.create_image  

Folsom, force_raw_images=True, use_cow_images=False

This results in each instance booting from a copy of a resized raw image.

Nova command Source code Notes
wget http://glance/$image -O base_/$hex.part images.fetch  
qemu-img convert -O raw $hex.part $hex.converted images.fetch_to_raw Creates sparse file
mv $hex.converted $hex; rm $hex.part images.fetch_to_raw  
  imagebackend.create_image  
cp $hex $instance_dir/disk   libvirt.utils.copy_image  
qemu-img resize disk   disk.extend  
resize2fs disk   disk.extend Unpartitioned ext[234]

Folsom, force_raw_images=False, use_cow_images=False

This results in each instance booting from a copy of a resized qcow2 image.

Nova command Source code Notes
wget http://glance/$image -O base_/$hex.part images.fetch  
mv $hex.part $hex images.fetch_to_raw  
  imagebackend.create_image  
cp $hex $instance_dir/disk   libvirt.utils.copy_image  
qemu-img resize disk   disk.extend  
resize2fs disk   disk.extend Ignored for qcow2 ¹

Folsom, force_raw_images=False, use_cow_images=True

This results in each instance booting from a CoW image, backed by a resized qcow2 image.

Nova command Source code Notes
wget http://glance/$image -O base_/$hex.part images.fetch  
mv $hex.part $hex images.fetch_to_raw  
  imagebackend.create_image  
cp $hex $hex_$size   libvirt.utils.copy_image  
qemu-img resize $hex_$size $size   disk.extend  
resize2fs $hex_size   disk.extend Ignored for qcow2 ¹
qemu-img create -f qcow2 -o backing_file=... $instance_dir/disk   libvirt.utils.create_image  

Grizzly, force_raw_images=True, use_cow_images=True

Grizzly introduces a change for use_cow_images=True, where it will resize in the $instance_dir rather than in base_. So the resize will not be cached, but this is minimal CPU tradeoff per instance boot, for the extra space saved in base_. We'll just present the default config values here which illustrates the only significant change from Folsom.
This results in each instance booting from a resized CoW image, backed by a raw image.

Nova command Source code Notes
wget http://glance/$image -O base_/$hex.part images.fetch  
qemu-img convert -O raw $hex.part $hex.converted images.fetch_to_raw Creates sparse file
mv $hex.converted $hex; rm $hex.part images.fetch_to_raw  
  imagebackend.create_image  
qemu-img create -f qcow2 -o backing_file=... $instance_dir/disk   libvirt.utils.create_image  
qemu-img resize disk $size   disk.extend  
resize2fs disk   disk.extend Grizzly always ignores ²

[² Update Sep 2013: Stanislaw Pitucha also noticed that the above referenced Grizzly change introduced a regression where unpartitioned qcow2 images were no longer resized. See the Havana resize_fs_using_block_device option below for details.]

Grizzly, preallocate_images='space'

Grizzly also has new fallocate functionality in this area controlled by the preallocate_images config option. If that is set to 'space', then after the operations above, the $instance_dir/ images will be fallocated to immediately determine if enough space is available, and to possibly improve VM I/O performance due to ongoing allocation avoidance, and better locality of block allocations.

¹ Havana, resize_fs_using_block_device=False

As noted in the first Grizzly change above, Stanislaw Pitucha noticed that change introduced a regression where unpartitioned qcow2 images were no longer resized. He supplied a fix to resize qcow directly rather than relying on the raw image being available, which would also cater for the force_raw_images=False case that even pre Grizzly did not. This new option can be used to enable this support, but there are some large performance and possible security issues so it's not enabled by default. This support will be available in the upcoming Havana release.

General performance considerations

Performance has improved in this area through each OpenStack release, with some of the main topics to consider, for past and future changes being:

Minimize I/O

Note these were implemented in Essex:

  • Copy images, then resize, rather than vice versa
  • Directly generating images in the $instance_dir/
  • Intelligent reading of sparse input
  • Reproduction of sparse input on output
  • Use compression

Note this was implemented in Folsom:

Minimize storage

  • Use compression
  • Use sparse output/generation
  • Avoid resized copies when not needed
  • Use CoW if appropriate

Improve caching

  • Avoid thrashing the page cache with large intermediate images
  • Improve low level caching through better storage allocation

Preprocessing

    • Preprocessing may be possible on images like preallocation=metadata which trades off initial CPU cost for possibly much better run time I/O performance
    • Such cost would be some what alleviated by having asynchronous population of the base_ cache

[To be translated] Nova:libvirt image 的生命周期的更多相关文章

  1. KVM 介绍(6):Nova 通过 libvirt 管理 QEMU/KVM 虚机 [Nova Libvirt QEMU/KVM Domain]

    学习 KVM 的系列文章: (1)介绍和安装 (2)CPU 和 内存虚拟化 (3)I/O QEMU 全虚拟化和准虚拟化(Para-virtulizaiton) (4)I/O PCI/PCIe设备直接分 ...

  2. [译] libvirt 虚机的生命周期 (Libvirt Virtual Machine Lifecycle)

    翻译自:http://wiki.libvirt.org/page/VM_lifecycle   这篇文章描述虚机生命周期的基本概念.其目的在于在一篇文章中提供完整的关于虚机创建.运行.停止.迁移和删除 ...

  3. react组件的生命周期

    写在前面: 阅读了多遍文章之后,自己总结了一个.一遍加强记忆,和日后回顾. 一.实例化(初始化) var Button = React.createClass({ getInitialState: f ...

  4. 浅谈 Fragment 生命周期

    版权声明:本文为博主原创文章,未经博主允许不得转载. 微博:厉圣杰 源码:AndroidDemo/Fragment 文中如有纰漏,欢迎大家留言指出. Fragment 是在 Android 3.0 中 ...

  5. C# MVC 5 - 生命周期(应用程序生命周期&请求生命周期)

    本文是根据网上的文章总结的. 1.介绍 本文讨论ASP.Net MVC框架MVC的请求生命周期. MVC有两个生命周期,一为应用程序生命周期,二为请求生命周期. 2.应用程序生命周期 应用程序生命周期 ...

  6. UIViewController生命周期-完整版

    一.UIViewController 的生命周期 下面带 (NSObject)的方法是NSObject提供的方法.其他的都是UIViewController 提供的方法. load   (NSObje ...

  7. angular2系列教程(十一)路由嵌套、路由生命周期、matrix URL notation

    今天我们要讲的是ng2的路由的第二部分,包括路由嵌套.路由生命周期等知识点. 例子 例子仍然是上节课的例子:

  8. Spring中Bean的作用域、生命周期

                                   Bean的作用域.生命周期 Bean的作用域 Spring 3中为Bean定义了5中作用域,分别为singleton(单例).protot ...

  9. Autofac - 生命周期

    实例生命周期决定在同一个服务的每个请求的实例是如何共享的. 当请求一个服务的时候,Autofac会返回一个单例 (single instance作用域), 一个新的对象 (per lifetime作用 ...

随机推荐

  1. 设计模式之 面向对象的养猪厂的故事,C#演示(二)

    (三) 优先使用聚合,而不是继承 有一段时间,养猪场的老板雇用了清洁工人来打扫猪舍.但有一天,老板忽然对自己说"不对啊,既然我有机器人,为什么还要雇人来做这件事情?应该让机器人来打扫宿舍!& ...

  2. linux常用命令之文件管理

    LS ls:list directory contents 默认情况 默认情况下显示的是mtime 选项 -a 列出全部文件及目录包括隐藏的 -l 列出详细信息,包括文件类型.权限.节点.owner. ...

  3. UML类图关系总结

    在UML类图中,常见的有以下几种关系: 泛化(Generalization) 实现(Realization) 关联(Association) 聚合(Aggregation) 组合(Compositio ...

  4. JavaScript学习(2):对象、集合以及错误处理

    在这篇文章里,我们讨论一下JavaScript中的对象.数组以及错误处理. 1. 对象 对象是JavaScript中的一种基本类型,它内部包含一些属性,我们可以对这些属性进行增删操作. 1.1 属性 ...

  5. 为友盟消息推送开发的PHP SDK(composer版):可以按省发Android push

    一直以来APP希望按省市县推送Android push,只能自己分析用户经纬度,打tag发送. 现在终于有服务商提供了. 友盟消息推送 可以“按省推送”,很方便. 我为友盟做了PHP SDK(comp ...

  6. Quartz.NET开源作业调度框架系列(三):IJobExecutionContext 参数传递

    前面写了关于Quartz.NET开源作业调度框架的入门和Cron Trigger , 这次继续这个系列, 这次想讨论一下Quartz.NET中的Job如何通过执行上下文(Execution Conte ...

  7. JavaScriptOO.com – 快速找到你需要的 JS 框架

    JavaScriptOO.com 集合了目前 Web 开发中最常用的422(截至目前)款 JavaScript 框架,你可以根据功能类别(Ajax,动画,图表,游戏等)进行过滤和排序,快速找到你需要的 ...

  8. [转]很详细的devexpress应用案例

    很详细的devexpress应用案例,留着以后参考. 注:转载自http://***/zh-CN/App/Feature.aspx?AppId=50021 UPMS(User Permissions ...

  9. SharePoint文档库,如何在新窗口打开中的文件

    默认情况下,点击文档库中的文件是在当前浏览器中打开的(如果你设置的是在客户端软件打开,则不符合本文情况).那么如果让他在新窗口中打开呢? 这里需要借助jQuery,关于如何将jQuery集成到Shar ...

  10. 转 用C API 操作MySQL数据库

    用C API 操作MySQL数据库 参考MYSQL的帮助文档整理 这里归纳了C API可使用的函数,并在下一节详细介绍了它们.请参见25.2.3节,“C API函数描述”. 函数 描述 mysql_a ...