<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: When the links go bouncing&#8230;</title>
	<atom:link href="http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/</link>
	<description>Ranting and raving about storage and technology</description>
	<lastBuildDate>Mon, 19 Dec 2011 15:36:52 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: brerrabbit</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8216</link>
		<dc:creator>brerrabbit</dc:creator>
		<pubDate>Tue, 13 Jul 2010 21:12:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8216</guid>
		<description>Just so you know, the effective limit on the number of hosts in an ESX cluster for 4.0 (haven&#039;t checked 4.1 since it just came out this morning) is 256 - # number of LUNs you intend to present to the hosts.

In other words, in 4.0 there&#039;s a hard limit of 256 LUNs per cluster, and each host brings a LUN into the count....ESX has a local VMFS LUN where the serv console runs, and I suspect that there&#039;s a hidden VMFS partition on ESXi, while probably tiny, still counts toward the limit</description>
		<content:encoded><![CDATA[<p>Just so you know, the effective limit on the number of hosts in an ESX cluster for 4.0 (haven&#8217;t checked 4.1 since it just came out this morning) is 256 &#8211; # number of LUNs you intend to present to the hosts.</p>
<p>In other words, in 4.0 there&#8217;s a hard limit of 256 LUNs per cluster, and each host brings a LUN into the count&#8230;.ESX has a local VMFS LUN where the serv console runs, and I suspect that there&#8217;s a hidden VMFS partition on ESXi, while probably tiny, still counts toward the limit</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Suiter</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8213</link>
		<dc:creator>Thomas Suiter</dc:creator>
		<pubDate>Mon, 12 Jul 2010 19:42:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8213</guid>
		<description>Not to be argumentative... but I think we are talking the same thing.  Clvm is this: lvm2 with an extra daemon (clvmd) to handle locking and storage.  It is basically part of lvm, just open /etc/lvm/lvm.conf on any RedHat box (Advanced Platform to Fedora) and you&#039;ll find references to it.  RedHat does separate the two and only includes the clvm daemon in certain channels, so if you want their binary you have pay for it but there&#039;s nothing really special or unique about it.</description>
		<content:encoded><![CDATA[<p>Not to be argumentative&#8230; but I think we are talking the same thing.  Clvm is this: lvm2 with an extra daemon (clvmd) to handle locking and storage.  It is basically part of lvm, just open /etc/lvm/lvm.conf on any RedHat box (Advanced Platform to Fedora) and you&#8217;ll find references to it.  RedHat does separate the two and only includes the clvm daemon in certain channels, so if you want their binary you have pay for it but there&#8217;s nothing really special or unique about it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dim</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8210</link>
		<dc:creator>dim</dc:creator>
		<pubDate>Sun, 11 Jul 2010 06:26:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8210</guid>
		<description>Well, you&#039;re in for a surprise - no clvm in there, plane lvm2 instead, with an extra daemon to handle locking and storage management.

If you are interested in actually seeing the product you might want to contact your sales rep, otherwise it&#039;s just guesses that don&#039;t lead anywhere</description>
		<content:encoded><![CDATA[<p>Well, you&#8217;re in for a surprise &#8211; no clvm in there, plane lvm2 instead, with an extra daemon to handle locking and storage management.</p>
<p>If you are interested in actually seeing the product you might want to contact your sales rep, otherwise it&#8217;s just guesses that don&#8217;t lead anywhere</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: InsaneGeek</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8209</link>
		<dc:creator>InsaneGeek</dc:creator>
		<pubDate>Sat, 10 Jul 2010 20:54:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8209</guid>
		<description>My understanding of RHEV with block storage, it generally uses lvm volumes to carve &amp; present the storage.  I could be wrong (as there is very limited docs on the tech behind the RHEV management layer... assuming it&#039;s similar tech if you weren&#039;t using their management) but I believe RHEV uses RHCS for mgmt of clustering Xen and now KVM with RHEL6, at least that&#039;s the pitch from our RHEL sales reps over the years.  Since RHCS has pretty much always used clvm from their Sistina purchase years ago for clustered logical volumes without contradictory info, I&#039;m assuming they are continuing to use it here with RHEV clusters (there are a number of guides on making XEN Linux clusters with clvm and/or GFS).</description>
		<content:encoded><![CDATA[<p>My understanding of RHEV with block storage, it generally uses lvm volumes to carve &amp; present the storage.  I could be wrong (as there is very limited docs on the tech behind the RHEV management layer&#8230; assuming it&#8217;s similar tech if you weren&#8217;t using their management) but I believe RHEV uses RHCS for mgmt of clustering Xen and now KVM with RHEL6, at least that&#8217;s the pitch from our RHEL sales reps over the years.  Since RHCS has pretty much always used clvm from their Sistina purchase years ago for clustered logical volumes without contradictory info, I&#8217;m assuming they are continuing to use it here with RHEV clusters (there are a number of guides on making XEN Linux clusters with clvm and/or GFS).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dim</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8208</link>
		<dc:creator>dim</dc:creator>
		<pubDate>Sat, 10 Jul 2010 06:29:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8208</guid>
		<description>What does clvm have to do with RHEV?</description>
		<content:encoded><![CDATA[<p>What does clvm have to do with RHEV?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: InsaneGeek</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8207</link>
		<dc:creator>InsaneGeek</dc:creator>
		<pubDate>Sat, 10 Jul 2010 04:08:20 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8207</guid>
		<description>Unfortunately technology can hardly ever fully cover a sysadmin mistake.  I don&#039;t know of any volume manager that will prevent an admin from adding different types of luns into a volume group (vxvm, ZFS, lvm, etc).  

If you want to play dangerous, you can also have a vmfs filesystem span across luns as well.  Generally against best practices (but it&#039;s there if you really want to do it) and it doesn&#039;t *have* to be the same protection type.  You do have to intentionally want to do it and know how to do it... so much, much less likely to happen (but possible).

As to the claim of avoiding the filesystem from a performance perspective... not completely sure on that claim.  We too have had a very large single cluster of ESX servers (52x nodes) running in production... and yes it was very much out of spec (not my idea but ran for year+ that way).  There are certain times when locks happen that have an affect on performance within VMFS: create vmdk, metadata change (i.e. grow vmdk or snapshot), poweron/off guest or host.  There are certain similar times when lvm2 cluster type locks also occur.  LVM2 snapshots growing don&#039;t have the same lock affects but other operations do.  

I haven&#039;t run a RHEV system, but having tried to run a 90TB gfs2 filesystem under RHEL 5... I have to say had nothing but problems with clvm (patch on top of patch on top of patch and still nothing but hanging problems) some rather painful as well where it required a full cluster outage.  VMFS I&#039;ve found just works... additionally if you are that twitchy about the filesystem being an issue then you can do RDM and avoid filesystems completely (but that is damn icky for management) and use raw luns for guests.</description>
		<content:encoded><![CDATA[<p>Unfortunately technology can hardly ever fully cover a sysadmin mistake.  I don&#8217;t know of any volume manager that will prevent an admin from adding different types of luns into a volume group (vxvm, ZFS, lvm, etc).  </p>
<p>If you want to play dangerous, you can also have a vmfs filesystem span across luns as well.  Generally against best practices (but it&#8217;s there if you really want to do it) and it doesn&#8217;t *have* to be the same protection type.  You do have to intentionally want to do it and know how to do it&#8230; so much, much less likely to happen (but possible).</p>
<p>As to the claim of avoiding the filesystem from a performance perspective&#8230; not completely sure on that claim.  We too have had a very large single cluster of ESX servers (52x nodes) running in production&#8230; and yes it was very much out of spec (not my idea but ran for year+ that way).  There are certain times when locks happen that have an affect on performance within VMFS: create vmdk, metadata change (i.e. grow vmdk or snapshot), poweron/off guest or host.  There are certain similar times when lvm2 cluster type locks also occur.  LVM2 snapshots growing don&#8217;t have the same lock affects but other operations do.  </p>
<p>I haven&#8217;t run a RHEV system, but having tried to run a 90TB gfs2 filesystem under RHEL 5&#8230; I have to say had nothing but problems with clvm (patch on top of patch on top of patch and still nothing but hanging problems) some rather painful as well where it required a full cluster outage.  VMFS I&#8217;ve found just works&#8230; additionally if you are that twitchy about the filesystem being an issue then you can do RDM and avoid filesystems completely (but that is damn icky for management) and use raw luns for guests.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dim</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8200</link>
		<dc:creator>dim</dc:creator>
		<pubDate>Wed, 07 Jul 2010 22:11:11 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8200</guid>
		<description>&gt;But in RHEV when you create a storage domain you’re not 
&gt;able to “VMotion” a system between domains right? 
You probably mean &quot;storage vmotion&quot; - that&#039;s currently an offline only feature (so you can move VMs that aren&#039;t running), but qemu has it in the upstream, so RHEV will as well soon enough


&gt;Also – since the VG’s are essentially LVM2, how do 
&gt;they handle locking when multiple hypervisors are 
&gt;hitting VM’s? 
that&#039;s what the SPM is for, and why it&#039;s so important.

Not having an FS makes RHEV storage handling much faster in the long run, and also removes the host number limitation (seen a cluster of ~70 hosts, and that&#039;s only because I ran out of machines to add)

&gt;So the mistake here was when the admin threw all 
&gt;storage into the same domain and didn’t bother to 
&gt;check on mirroring. 
yup, not a &quot;best practice&quot; in anyone&#039;s book</description>
		<content:encoded><![CDATA[<p>&gt;But in RHEV when you create a storage domain you’re not<br />
&gt;able to “VMotion” a system between domains right?<br />
You probably mean &#8220;storage vmotion&#8221; &#8211; that&#8217;s currently an offline only feature (so you can move VMs that aren&#8217;t running), but qemu has it in the upstream, so RHEV will as well soon enough</p>
<p>&gt;Also – since the VG’s are essentially LVM2, how do<br />
&gt;they handle locking when multiple hypervisors are<br />
&gt;hitting VM’s?<br />
that&#8217;s what the SPM is for, and why it&#8217;s so important.</p>
<p>Not having an FS makes RHEV storage handling much faster in the long run, and also removes the host number limitation (seen a cluster of ~70 hosts, and that&#8217;s only because I ran out of machines to add)</p>
<p>&gt;So the mistake here was when the admin threw all<br />
&gt;storage into the same domain and didn’t bother to<br />
&gt;check on mirroring.<br />
yup, not a &#8220;best practice&#8221; in anyone&#8217;s book</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jesse</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8199</link>
		<dc:creator>Jesse</dc:creator>
		<pubDate>Wed, 07 Jul 2010 22:03:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8199</guid>
		<description>But in RHEV when you create a storage domain you&#039;re not able to &quot;VMotion&quot; a system between domains right?  Doesn&#039;t that pose serious limitations?  I mean with VMWare it&#039;s easy to move a VM from one storge type to another without downtime, which I&#039;ve found to be invaluable in many instances...  (I&#039;ve run test boxes on NFS storage and then migrated it to FC storage when the test system goes into production.)

Also - since the VG&#039;s are essentially LVM2, how do they handle locking when multiple hypervisors are hitting VM&#039;s?  VMFS handles this very well to a point, (I think 16-20 machines is the recomended limit per lun) but I&#039;ve never known LVM2 to be particularly tolerant of block-level sharing...  

So the mistake here was when the admin threw all storage into the same domain and didn&#039;t bother to check on mirroring.  (I have not been allowed to have any kind of real view into the system, so I don&#039;t know, I just handed them the storage they requested.)</description>
		<content:encoded><![CDATA[<p>But in RHEV when you create a storage domain you&#8217;re not able to &#8220;VMotion&#8221; a system between domains right?  Doesn&#8217;t that pose serious limitations?  I mean with VMWare it&#8217;s easy to move a VM from one storge type to another without downtime, which I&#8217;ve found to be invaluable in many instances&#8230;  (I&#8217;ve run test boxes on NFS storage and then migrated it to FC storage when the test system goes into production.)</p>
<p>Also &#8211; since the VG&#8217;s are essentially LVM2, how do they handle locking when multiple hypervisors are hitting VM&#8217;s?  VMFS handles this very well to a point, (I think 16-20 machines is the recomended limit per lun) but I&#8217;ve never known LVM2 to be particularly tolerant of block-level sharing&#8230;  </p>
<p>So the mistake here was when the admin threw all storage into the same domain and didn&#8217;t bother to check on mirroring.  (I have not been allowed to have any kind of real view into the system, so I don&#8217;t know, I just handed them the storage they requested.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dim</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8198</link>
		<dc:creator>dim</dc:creator>
		<pubDate>Wed, 07 Jul 2010 19:58:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8198</guid>
		<description>Well, there are different ways of working with the storage domains. You have to understand what you are doing of course
Since a storage domain (SD) is, on the storage level, a VG, you can expand it with relevant LUNs (which are in effect PVs) thus making it larger. Here is where the mistake took place actually.
On the other hand, you can always add new SDs, if the LUNs (PVs) are based on different storage types, and create your VMs with virtual disks in the different SDs, according to the VM&#039;s requirements, e.g. have a NIS server VM on a SD that is based on a slower LUN, and put the database VMs on the faster LUNs. 

As long as all the hosts in the same cluster can access all the relevant LUNs (and they should), this will work.

I don&#039;t think you can actually &quot;sense&quot;, on the host side, if the LUNs you are presented with are different (might be a nice feature for dm-mp to have I guess -- does powerpath do that?) so you have to know your storage when creating a setup of course


note to self - this is good KB material :)</description>
		<content:encoded><![CDATA[<p>Well, there are different ways of working with the storage domains. You have to understand what you are doing of course<br />
Since a storage domain (SD) is, on the storage level, a VG, you can expand it with relevant LUNs (which are in effect PVs) thus making it larger. Here is where the mistake took place actually.<br />
On the other hand, you can always add new SDs, if the LUNs (PVs) are based on different storage types, and create your VMs with virtual disks in the different SDs, according to the VM&#8217;s requirements, e.g. have a NIS server VM on a SD that is based on a slower LUN, and put the database VMs on the faster LUNs. </p>
<p>As long as all the hosts in the same cluster can access all the relevant LUNs (and they should), this will work.</p>
<p>I don&#8217;t think you can actually &#8220;sense&#8221;, on the host side, if the LUNs you are presented with are different (might be a nice feature for dm-mp to have I guess &#8212; does powerpath do that?) so you have to know your storage when creating a setup of course</p>
<p>note to self &#8211; this is good KB material <img src='http://blog.50micron.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jesse</title>
		<link>http://blog.50micron.com/2010/07/07/when-the-links-go-bouncing/comment-page-1/#comment-8197</link>
		<dc:creator>Jesse</dc:creator>
		<pubDate>Wed, 07 Jul 2010 16:49:39 +0000</pubDate>
		<guid isPermaLink="false">http://blog.50micron.com/?p=640#comment-8197</guid>
		<description>Touche - as well as the guy who decided to simply hit the power button rather than actually troubleshoot the problem.  

The way I look at it, and from what i read, RHEV doesn&#039;t really give you the option of what to put where, does it?  can someone answer that question?  I mean with VMWare you simply don&#039;t stripe VMFS across luns, you create the luns to be the size you need them, and that&#039;s that.  So if/when there is a crash, there is no risk of any issues with consistency.</description>
		<content:encoded><![CDATA[<p>Touche &#8211; as well as the guy who decided to simply hit the power button rather than actually troubleshoot the problem.  </p>
<p>The way I look at it, and from what i read, RHEV doesn&#8217;t really give you the option of what to put where, does it?  can someone answer that question?  I mean with VMWare you simply don&#8217;t stripe VMFS across luns, you create the luns to be the size you need them, and that&#8217;s that.  So if/when there is a crash, there is no risk of any issues with consistency.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

