The market is depressed it’s not the “Depression”

The market had a huge day today with the Dow rising 936 points!  Following the biggest rise in the markets history I thought I would post a few figures from Friday for all the people who pulled thier money out the market last week in a panic and who most likely are questioning the move after today.

As of Friday’s closing bell the US martkets were down ~ 18%, EU markets ~ 25% and the Asian markets ~ 30%.  While an 18% slump sucks is it enough to make me panic?  Well let’s look at a few additional indicators today vs. the "Depression"

  Today "Depression"
Industrial Production

Down ~ 1% Down ~ 44%
Unemployment ~6% ~ 25%
Housing payments in the rears ~ 4% ~ 40%

The market conditions today are a long way from the conditions in 1929.  Things look much closer to the market crash of 1973/74 where the Dow lost ~ 45% of its value.

Now let’s look at an investmetment made in 1972 and present day value.  This would be representative of a thirty somethings investment today with a retirement target cashout.

I looked at an investment in XOM made 01/70 and sold today, even following last week the gain is ~ +843%.  The Dow inthe same time perios is ~ +1060%.

I also looked at the past 10 years from Oct 1998 til today which shows a ~ +107% for XOM and ~ +19% for the Dow.

Conclusion, the market is not a place to put your cash if you need liquidity.  As a long term investment the market is the place to be!

Thin Provisioning 101

I was asked by someone for a quick overview of Thin Provisioning and when I view this as an valuable or applicable technology.

So let me start with a quick simplified visual overview of both Thin Provisioning and Traditional (aka Thick)Provisioning.

thin_provisioning_101

So now that you understand the concepts of Thin Provisioning and Traditional (aka Thick) Provisioning let me quick talk about the only where I see Thin Provisioning as a valuable technology.

I look at Thin Provisioning in the same way that a Disaster Recovery (DR) provider looks at taking on new customers.  If storage is your business (i.e. - You are offering a shared storage model to customer co-located in your data center) then Thin Provisioning may be a key ingredient to your business model.  Let me expand on this, DR providers like SunGuard oversubscribe their data centers hedging that 100% of their customer will not have a disaster at the same time (BTW this has happened and put the provider out of business).  Thin Provisioning works in the same way by providing the user with the belief that they have 100% of the capacity while in fact the capacity may be over provisioned and the storage service provider (SSP) is hedging that 100% of the user co-located on the storage array will not demand 100% of the resources at the same time.

There are some very minor management benefits that I outlined in the pictorial above but IMO given some of the pitfalls associated with Thin Provisioning these do not provide a compelling reason consider Thin Provisioning.  Read and interesting article here that outlines one very real issue encountered with Thin Provisioning and NTFS.

So in conclusion if you are SSP of some sort consider Thin Provisioning otherwise go thick or go home :)

Thoughts on Super Bowl XLII

imageCongratulations to the NY Giants and NY Giants fans. Unbelievable showing!!!! Yes, the Giants looked possessed on defense but did the Patriots look like an 18-0 team? I have a theory on this; read on.? Regardless I could have cared less who won the game, neither team is my team and I did not have money on

the game, but everyone loves the underdog and frankly I am so freakin tired of the the Patriots. Congratulations to Eli Manning MVP of Super Bowl XLII. I would like to offer up that there should be a co-MVP or at least adefensive player of the game should be named… can

you guess who? image Gisele Bundchen co-MVP Super Bowl XLII  Maybe next year Tom Brady should leave Gisele at home. While she is not a bad trophy / consolation prize it may be a bit easier for Eli to share his trophy (The Vince Lombardi Tropy, shown here on the right) with the team and the fans. Congrats NY!

The perception is emerging but the reality is legacy…

The market perception seems to be that iSCSI is gaining tremendous steam and many customers who would have adopted Fibre Channel as their interconnect a year ago are now adopting iSCSI. I would agree that this is the case but iSCSI is not an emerging interconnect and might be better classified as a legacy interconnect which is now experiencing a newsworthy adoption rate. An early concern of iSCSI was the performance penalty associated with TCP operations and the impact of software based initiators, this spawned the TOE (TCP Offload Engine) which would offload TCP calculation from the system CPU to a dedicated onboard processor dedicated to TCP operations, today most iSCSI implementations leverage software initiators (today system CPU resources the most part are so under utilized that most environments will never notice a 10% CPU utilization increase that may be associated with iSCSI). Some vendors addressed the TCP concern by modifying the iSCSI protocol to ride on UDP as opposed to TCP thus increasing performance via proprietary protocols which resemble iSCSI (i.e.- LeftHand Networks, HammerStorage and Zetera). It is important to note that LeftHand has pretty much abandoned the proprietary protocol they started with and has now adopted the iSCSI standard, it is also interesting to note that their adoption rate seems to have increased since doing this, when you are a new player I think evangelizing your protocols superiority over the standard is probably a tall order. LeftHand has also changed morphed their business into a software play and embraced the VMware Virtual Appliance Markerplace as a way to propagate their technology.

There are a number of emerging interconnects that technologically out shine iSCSI the question is how quickly can the market makers move to adopt these technologies, are the market makers interested in accelerating the adoption curve? This is a complicated question, on one hand you could argue that technologies are more stable once they have been around longer (I remember reading papers on iSCSI in 2001, did it really take 7+ years to get iSCSI to where is today?) on the other hand if the market makers validate these technologies too early they run the risk of fierce competition from more nimble startups. It is a complex problem, my feeling is that for the most part the adoption cycle is slowed by the market makers as a way to recoup development cost and and slow competition. IMO the by product of this is a slower innovation cycle.

AoE and HyperSCSI both offer the interconnect price point of iSCSI without the performance burden associated with TCP. AoE and HyperSCSI ride on Layer 2 and do not experience the protocol overhead associated with Layer 3 protocols. SoIP (Storage over IP) uses UDP as the transport protocol as opposed to TCP. We are also seeing the emergence of iSER and iWARP, next generation TCP technology that closely resembles Infiniband. iWARP (Internet Wide Area RDMA Protocol) is a superset of the VI architecture and is aimed at reducing the overhead associated with TCP, iSER (iSCSI Extensions for RDMA) maps the iSCSI protocol over RDMA networks like Infiniband or iWARP. iSER address the overhead associated with TCP and out-of-order packet delivery. How many of us hear about these protocols?

It seems that the majority of marchitecture effort is being put into FCoE (Fibre Channel over Ethernet). Major players such as Brocade, Cisco, EMC, Emulex, QLogic, IBM, Intel, Sun and Mellanox have all gotten behind FCoE in a big way so most likely this will be the next big thing in storage interconnects. The emergence of technologies that further leverage Ethernet as the interconnect will change the game, it is hard to imagine that Cisco’s dominance will will not continue to grow. It is likely that more storage services (i.e. - replication, snapshots, etc…) will be handled at the network layer, as these services move into the network layer we will continue to see the further commoditization of the storage market. Should be interesting to watch over the next few years.

While on the surface protocols that sit on top of Layer 2 (AoE and FCoE) may seem to be superior there is a tremendous amount of functionality that is provided at Layer 3 so it is not a forgone conclusion that FCoE will will the battle. Right now the only forgone conclusion I can see it that Cisco wins regardless, the others will be battling for a piece of the pie. But who knows anything can happen, after all this is technology.

Oracle Storage Guy: Direct NFS on EMC NAS

I have been chomping at the bit to test VMware on dNFS on EMC NAS for a couple of reasons.  A number of my customers who are looking at EMC NAS in particular the NS20 would like to consolidate storage, servers, file services, etc… on to a unified platform and leverage a single replication technology like Celerra Replicator.   dNFS may offer this possibility, .vmdks can now reside on the a NFS volume, CIFS shares can be consolidated to the the NS20 and all can be replicated with Celerra Replicator.  The only downside to this solution that I can see is right now the replicated volumes will be crash consistent copies but I think with some VMware scripting even this concern can be addressed.  I hope to stand this configuration up in the lab in the next couple of weeks so I should have more detail and a better idea of is viability shortly.  You may be wondering why this post entitled Oracle Storage Guy…… the answer is I was searching the blogsphere for an unbiased opinion and some performance metrics of VMware and dNFS and this was the blog that I stumbled upon.

The performance numbers I have seen for VMware on dNFS come very close to the numbers I have seen for iSCSI, both technologies offer benefits but for the use case I mention above dNFS may become very compelling.  I recommend reading this post Oracle Storage Guy: Direct NFS on EMC NAS, is offers some great commentary on the performance characteristics and benefits of dNFS.

The Cache Effect

Following a fit of rage last night after I inadvertently deleted 2 hours worth of content I have now calmed down enough to recreate the post.

The story starts out like this, a customer who recently installed a EMC CX3–80 was working on a backup project roll out, the plan was to leverage ATA capacity in the CX3–80 as a backup-to-disk (B2D) target.  Once they rolled out the backup application they were experiencing very poor performance for the backup jobs that were running to disk, additionally the customer did some file system copies to this particular device and the performance appeared to slow.

The CX3–80 is actually a fairly large array but for the purposes of this post I will focus on the particular ATA RAID group which was the target of the backup job where the performance problem was identified.

I was aware that the customer only had on power rail due to some power constraints in their current data center.  The plan was to power up the CX using just the A side power until they could de-commission some equipment and power on the B side.  My initial though was that cache the culprit but I wanted to investigate further before drawing a conclusion.

My first step was to log into the system and validate that cache was actually disabled, which it was.  This was due to the fact that the SPS (supplemental power supply) only had one power feed and the batteries where not charging.  In this case write–back cache is disabled to protect from potential data loss.  Once I validated that cache was in fact disabled I thought that I would take a scientific approach to resolving the issue by base lining the performance without cache and then enabling cache and running the performance test again.

The ATA RAID group which I was testing on was configured as a 15 drive R5 group with 5 LUNs (50 – 54) ~ 2 TB in size.

Figure 1:  Physical disk layout

R5

My testing was run against drive f: which is LUN 50 which resides on the 15 drive R5 group depicted above.  LUNs 51, 52, 53 and 54 were not being used so the RG was only being used by the benchmark I was running on LUN 50.

Figure 2:  Benchmark results before cache was enabled

Pre-cache

As you can see the performance for writes is abysmal.  I will focus on the 64k test as we progress through the rest of this blog.  You will see above that the 64k test only push ~ 4.6 MB/s.  Very poor performance for a 15 drive stripe.  I have a theory for why this is but I will get to that later in the post.

Before cache couple be enabled we needed to power the second power supply on the the SPS, this was done by plugging the B power supply on the SPS into the A side power rail.  Once this was complete and the SPS battery was charged cache was enabled on the CX and the benchmark was run a second time.

Figure 3:  Benchmark results post cache being enabled (Note the scale on this chart differs from the above chart)

Post-cache

As you can see the performance increased from ~ 4.6 MB/s for 64k writes to ~ 160.9 MB/s for 64k writes.  I have to admit I would not have expected write cache to have this dramatic of an effect.

After thinking about it for a while I formulated some theories that I hope to fully prove out in the near future.  I believe that the performance characteristics that presented themselves in this particular situation was a combination of a number of things, the fact that the stripe width was 15 drives and cache being disabled created the huge gap in performance.

Let me explain some RAID basics so hopefully the explanation will become a bit clearer.

A RAID group had two key components that we need to be concerned with for the purpose of this discussion:

  1. Stripe width – which is typically synonymous with the number of drives in the the raid group
  2. Stripe depth – which is the size of the write that the controller performs before it round robin to the next physical spindle (Depicted in Figure 4)

Figure 4: Stripe Depth

Stripe_depth

The next concept is write cache, specifically two features of write cache know as write-back cache and write-gathering cache.

First lets examine the I/O pattern without the use of cache.  Figure 5 depicts a typical 16k I/O on an array with and 8k stripe depth and a 4 drive stripe width, with no write cache.

Figure 5:  Array with no write cache

No_cache

The effect of no write cache is two fold.  First there is no write-back so the I/O needs to be acknowledge by the physical disk, this is obviously much slower that and ack from memory.  Second, because there is no write-gathering full-stripe writes can not be facilitated which means more back-end I/O operations, affectionately referred to as the Read-Modify-Write penalty.

Now lets examine the same configuration with write-cache enabled.  Depicted in Figure 6.

Figure 6:  Array with write cache enabled

W_cache

Here you will note that acks are sent back to the host before they are written to physical spindles, this dramatically improves performance.  Second write-gathering cache is used to facilitate full-stripe writes which negates the read-modify-write penalty.

Finally my conclusion is that the loss of write cache could be somewhat negated by reducing stripe widths from 15 drives to 3 or 4 drives and creating a meta to accommodate larger LUN sizes.  With a 15 drive raid group the read-modify-write penalty can be severe as I believe we have seen in Figure 2.  This theory needs to be test, which I hope to do in the near future.  Obviously write-back cache also had an impact but I am not sure that is was as important as write-gathering in this case.  I could have probably tuned the stripe-depth and file system I/O size to improve the efficiency without cache as well.

This site is protected with Urban Giraffe's plugin 'HTML Purified' and Edward Z. Yang's Powered by HTML Purifier. 44 items have been purified.