Subscribe in a reader

Enter your email address:

Delivered by FeedBurner

 

Joe Kelly

 

 

Thursday
Jul292010

RecoverPoint 3.3 SP1 Released..



To whom it  may concern, here is a friendly reminder that RecoverPoint 3.3 SP1 has been released. Here is a sampling of what it encompasses, release notes here.. Carry on…

RecoverPoint/SE for Unisphere 1.0 management console support

The RecoverPoint/SE for Unisphere 1.0 management console integrates RecoverPoint/SE functionality into the CLARiiON CX Unisphere management interface. Using RecoverPoint/SE for Unisphere, you can manage a RecoverPoint/SE environment from the same central Unisphere management console that you use to manage your CLARiiON storage environment. You do not need to invoke the classic RecoverPoint interface to manage your RecoverPoint environment.

  • RecoverPoint/SE for Unisphere provides the same functionality as the RecoverPoint/SE functionality with the following exceptions:
  • RecoverPoint/SE for Unisphere does not provide the graphical representation of consistency group transfer and failover status provided by the Status tab of the Component pane of the standalone RecoverPoint/SE management application.
  • RecoverPoint/SE for Unisphere does not provide the vCenter Server management functionality provided by the standalone RecoverPoint/SE management application.
    Note: General availability of Unisphere 1.0 is expected in Q3 2010.

“Fake Sizing” support for CLARiiON splitter environments

RecoverPoint and RecoverPoint/SE 3.3 SP1 implement “fake sizing” for CLARiiON splitter environments. It is no longer required that a replica volume added to replication set be exactly the same size as the production volume it is replicating. If an exact size volume is not available, a volume that is larger than the production volume can be used as the replica volume instead.

Support for VMware vStorage API for Array Integration

As the first phase of providing support for VMware's vStorage API for Array Integration (VAAI), RecoverPoint now supports the proprietary VMware Atomic Test and Set SCSI command, which implements a LUN locking mechanism that is more efficient in the clustered host environment.
LUN remapping for Solaris and Windows with host-based splitter

Support for LUN remapping has been improved also for host-based splitters with Solaris or Windows hosts.

LUN remapping support allows users to remap replica LUNs using a less disruptive procedure and without risk of inconsistent data. If the remapping procedure is followed, only a volume sweep of remapped LUNs will be required.

Brocade updates

RecoverPoint and Brocade Storage Application Services (SAS) 3.4.1 provide the following enhancements:

  • increased stability of the process creating new Brocade bindings
  • improved handling of Brocade-to-RPA connectivity issues in real-time


More descriptive “paused by system” event handling


Events that are generated when RecoverPoint pauses transfer now contain a more detailed description of the cause of the pause in transfer, in the event Details section. A tooltip has also been added to the Transfer section of the Management Application consistency group Status Tab. When the state of Transfer is Paused by System and a user-actionable reason for the pause in transfer is detected, the tooltip automatically displays the reason.

Enhanced by Zemanta
Sunday
Jul182010

Quick Hit: ALUA additions to vSphere 4.1



Following some twitter banter, an added PSP to 4.1 was made known to me. PSP’s, as you know, are Path Selection Plug-in’s, which under the umbrella of NMP and ultimately PSA, provide physical path selection for I/O requests. As of 4.0, the following VMware NMP PSPs were supported…

  • MRU-Most Recently Used (VMW_PSP_MRU), common among Active/Passive arrays
  • Fixed-(VMW_PSP_FIXED), common among Active/Active arrays
  • Round Robin-(VMW_PSP_RR), common among Active/Active arrays or ALUA based arrays

As of 4.1, the Fixed PSP has extended its functionality to include the following path selection module..

  • VMW_PSP_FIXED_AP-which as noted in the SAN configuration guide, is a formally inclusive extension for Active/Passive and ALUA mode arrays.

Additionally there are some parameters specific to ALUA devices as noted here

Explicit ALUA devices allow the host to use the Set Target Port Group task management command to set the Target Port Group's state. In implicit ALUA, a device's Target Port Group states are managed by the target device itself.

  • implicit_support – states whether or not the LUN supports implicit ALUA, property of the LUN, can’t set.
  • explicit_support – states whether or not the LUN supports explicit ALUA, property of the LUN, can’t set.
  • explicit_allow - This parameter shows whether or not the user allows the SATP to exercise its explicit ALUA capability if the need arises during path failure. This only matters if the device actually supports explicit ALUA
  • alua_followover - This parameter shows whether or not the user allows the SATP to exercise the follow-over policy, which prevents path thrashing in multi-host setups.

 

Enhanced by Zemanta
Tuesday
Jul132010

VMware vSphere 4.1 Emancipated!!! “<++++++++++++1”



Its no secret, vSphere 4.1 is finally among us living! Its been covered over and over and over again. Any how I will spare you the “What’s New” copy and paste, although the list is impressive. Here are the features that caught the ‘ol eye. For more info visit the source, http://www.vmware.com/support/vsphere4/doc/vsp_41_new_feat.html.

  • VMware is touting ESXi big time with this release, its now time for us to depart from our old friend the Service Console.
  • ESXi Boot from SAN. iSCSI, FC and FCoE, this is big..
  • VAAI support, I blogged about this here, http://blog.virtualtacit.com/home/2010/7/10/vaai-vstorage-apis-for-array-integrationand-well-you.html. Very important vStorage API capability.
  • Storage IO Control, QoS for storage via shares and limits, this is big…
  • Network IO Control and Load based teaming, traffic management partitioning and dynamic load balancing on vDS’s..
  • DRS interop for HA and FT
  • VADP enhancements, another in the line of the vStorage APIs, 2008/R2 VSS quiescing to boot
  • Serial port redirection for a VM
  • USB pass through to the VM
  • Convertor Hyper-V import support..love it..
  • Admin password changes in host profiles
  • Orchestrator server support for 64bit OS
  • Historically tracking of licensing via the licensing reporting manager
  • Config maximums up’d, http://www.vmware.com/pdf/vsphere4/r41/vsp_41_config_max.pdf

 

 

 

 

 

 

 

 

Enhanced by Zemanta
Sunday
Jul112010

EMC RecoverPoint – Questions Answered



Here are some quick questions (w/ answers) I put together in regards to EMC’s RecoverPoint that I get from time to time. Perhaps this will help put you in the know as well as arm you with the right questions to ask when considering this product for your datacenter. Any discrepancies? Please feel free to politely comment.

What is RecoverPoint?

RecoverPoint is a continuous backup solution offered by EMC capable of providing asynchronous and synchronous replication across heterogeneous arrays. As of today it supports both block based storage protocols, Fiber Channel and iSCSI. All replication takes place over standard IP for asynchronous replication and fiber channel for synchronous replication. RecoverPoint handles all FC to IP conversions for asynch replication.

What does continuous backup mean?

Simple..every write is captured (or split/copied) and replicated in real time (depends on async or sync req.). In MOST situations every write is deemed a snapshot. These small aperture delineations allow you to roll back to any point in time. This is important as you don’t have to worry about rolling corruption as is the case with most mirrored based solutions. Continuous Backup is the best of both worlds folks. The ability to meet an RPO of zero as well as return to any point in time for recovery, this is the now and future of backup. I am with Mr. Preston on this, “..Things have got to change, people. We can't keep doing things the way we've been doing them..” <-Amen brother Curtis, Amen..

How are my writes replicated?

Splitters make a copy of the write I/O and send it to RecoverPoint (RPAs) for replication local or remote. To split these writes you need a write splitter, remember this isn’t complicated? Lets keep it simple…

There are three types of splitters..

  • Host splitter-code that is installed on the host itself
  • Fabric splitter-code that is installed within you FC fabric switches (Brocade and Cisco)
  • Array splitter-code that is installed on your array (Clariion Only)

The most widely used in my travels is the array or Clariion splitter. The assumption is..you guessed it..you have a Clariion. There are two components that are needed for the Splitting capabilities on the Clariion. They are..

  • The Splitter engine or driver as its called and the enabler package. As of today all current FLARE bundles include the Splitter driver/engine. So you just need to worry about enabling the splitter (.ena file). This is NDU requiring a staggered reboot of your SP’s. Take of advantage of the Navisphere Service Taskbar for this.. 
    • For CX3 arrays you will need to be at Flare release 26 patch .029, .031 + RP splitter driver 03.26.003.6.012 for RP3.3 support
    • For CX4 arrays you will need to be at Flare release 29 patch .006 + RP splitter driver 04.29.006.6.003.

Opinion time: It doesn’t get much easier and less error prone than the Array (Clariion) Splitter. The Fabric Splitter is complicated plain and simple, BUT capable of replicating your entire environment whether you are an EMC shop, an HP shop, an IBM shop, etc., all from a single pane of glass. So you will need to weigh both sides. And that’s right, I didn’t explicitly say it, but you don’t have to be an EMC shop to use RecoverPoint. In most cases replacing all your third party arrays with Clariion’s just won’t cut the mustard. So make sure up front that the splitter you choose is appropriate for your situation. Check out this post (under Modus Operandi) for a snapshot of what’s involved with the Cisco Fabric Splitter (SANTap), now close your eyes and think of a happy place. No really its not that bad, just make sure your chosen partner understands the pitfalls of said solution.

Is RecoverPoint just software or hardware as well?

RecoverPoint is intelligent software that operates on commodity based hardware from Dell. Current shipping hardware are R610’s. This hardware operates under the context of an appliance. Each site is capable of supporting 2-8 appliances in a single configuration. The appliances themselves are out of the data path and do not regulate or impede data flow under asynchronous situations. As part of your purchase you will receive the Dell appliances loaded with the RecoverPoint software. It goes without saying, but I’m saying make sure your implementer updates the appliances with the current code and make sure if your using say VMware Site Recovery Manager that the current RP code is supported (VMware’s Storage Partner Compatibility Matrix)

How is RecoverPoint licensed?

RP is licensed on a per replicated capacity basis. There are two flavors of RP-RP Full and RP/SE. As of the most current code, SE supports up to 150TB of replicated storage. RP full supports up to 600TB of replicated storage. SE is licensed in increments of 4TB, 8TB, 16TB, 24TB, 32TB, and then on a per TB basis up to 150TB. SE only supports the host and array based splitter. So for heterogeneous environments you will need to purchase RP full as well as any requirements set forth within your fabric splitter or host splitter.

RecoverPoint Terminology (Most Common)

  • RPA-RecoverPoint Appliance- this is another name for the dell commodity hardware supportive of the RP software. Each appliance has 2 copper gig Ethernet ports and 4 (8G) FC ports. Word on the street is these will be virtualized soon (and by soon I mean sometime between now and Dec. 2012)
  • Production Volume-this is what we all know and love our server data. This is the data that is worth protecting, worth replicating, peachy..
  • Repository Volume-Contains configuration information for your RP environment. 3G in size volume, light I/O. Vault drive worthy..
  • Journal volumes- these volumes are the core behind replication in RP. This volume is the first stop for all writes during replication local or remote. Depending on what side the journal volume is operating on each has a different function.
    • Production JVOL-responsible for tracking writes during periods of WAN connectivity loss. Secondarily and only during times of “failover and replicate back”, the PJVOL becomes the RJVOL.
    • Recovery JVOL-All writes being replicated hit this volume first. FIFO scenario. Every write is maintained for as long as possible based on the available capacity. The longer the Recovery Point Objective for your PVOL, the more JVOL space you will need. Look to snapshot consolidation to help.
  • Replica-Directly proportional to the size of the PVOL. As all incoming writes hit the RJVOL they are immediately rolled to the replica.
  • Cluster-A cluster is a band of 2 to 8 RPAs which accesses a single repository volume for their configuration.
  • Site-Geo locale. Each site can have multiple clusters but each cluster has a single repository volume.
  • System-Source and destination endpoints in a RecoverPoint environment whether its one site or two makes up a system
  • Consistency Group-Logical grouping of like volumes or a single volume with the intent of maintaining write order between volumes.
  • Replication Set-Defined on a per CG basis. Mapping of ProdVol to Replica. Additionally is the allocation of PJVOL and RJVOL’s for the CG in question. Remember you can have multiple Replication Sets within a single Consistency Group, with the assumption that they will share the same journal volumes.
  • CDP-Continuous Data Protection. No write left behind. Every write is captured and sent to RecoverPoint synchronously over fiber channel. Prominently known as continuous backup.
  • CRR-Continuous Remote Replication. Unidirectional asynchronous replication to a remote site.
  • CLR-Concurrent Local and Remote Replication. Combination of local (CDP ) and remote (CRR) replication for a single production volume.
  • Splitters-How do you copy a write from a production LUN in midstroke for Replication? Splitters! See above..

Is RP a HA/Scale Out solution, how is that accomplished?

RP itself is out of the data path, meaning the absence or loss of the appliances themselves will not cause you to lose access to your data. In synchronous situations, RP can regulate the application in an attempt to control data flow, but the appliances themselves are NOT in the data path.

Based on Dell commodity hardware, traditional HA capabilities are embedded and baked through with such architecture. HA and scale out is maintained via appliance clustering. As with typical clustering, each RecoverPoint appliance is privy to a shared cluster volume known as the repository volume. It is here where meta-data specific to the cluster itself is maintained. This includes, system and site information, Consistency group, replication set specifics, etc. If an appliance is lost, replication will pause momentarily and then failover over to an existing appliance in the cluster. This simply couldn’t happen without a shared cluster environment.

There is NO redundancy built into the networking, such as NIC teaming. There is a copper port for LAN and a copper port for WAN, that’s it. RP will continually bit map track all writes if the WAN interface is dropped. This mapping is held within the Production JVOL. Once the interface is up, all tracked block changes will be replicated. An important note on site control (or Virtual IP for the cluster), the VIP is passed between the first two RPA’s within a cluster, RPA1 and RPA2 for example. If RPA1 loses its LAN interface then it will pass site control to RPA2 and vice versa.

Each R610 has a single Quad port Qlogic FC card. The single card implies a SPOF, but think of this environment as a redundant array of nodes, where most HA capabilities are not defined on a per node basis, but rather on the cluster as a whole. Standard Clariion zoning applies for HA, of which I will go over later in subsequent posts.  

Beyond that, scale out is accomplished via the addition of more appliances up to 8 per cluster. With the introduction of RecoverPoint 3.3, the idea of distributed consistency groups has surfaced. Traditionally a single consistency group has been tied to a single RecoverPoint appliance. With DCG’s you now have the option of spreading the data load across up to four appliances effectively increasing the throughput from under 100Mb per appliance/per CG to over 200Mb per CG.

In summary, redundancy extends beyond a single appliance, keep your physical networking and storage infrastructure within your sights when designing such an environment. More to come..

 

Enhanced by Zemanta
Saturday
Jul102010

VAAI-vStorage APIs for Array Integration..and well you..



With vSphere 4.1 right around the corner for general release there are a lot of efficiencies and capabilities that are EXPECTED (but not guaranteed) to be presented. One of those is a subset of the vStorage API suite more appropriately known as VAAI. The main take away from this integration point between vendor storage arrays and VMware vSphere is storage offload functionality. This is effectively the equivalent of the CPU offload capabilities that AMD-V and Intel VT offers within the compute world, coming soon to your VAAI enabled array.

Note: What follows is information received during a recent EMC presentation by Chad Sakac called “Chad's Choice - Virtually Any Virtualization Topic!”.

As I mentioned, vStorage APIs are a suite of APIs designed to enhance and offload functionality to compliant arrays (beyond what vSphere can provide natively) via the following integration points..

  • Site Recovery Manager (VASRM?)
  • Data Protection (VADP)
  • Multipathing (VAMP) otherwise known as PSA or the Pluggable Storage Architecture of which I discuss here.

This graphic speaks volumes about where these points exist within a typical virtual environment. What’s in green is representative of said API’s. Items of note..

  • VAAI will only be supported on block based storage (iSCSI and FC). So VMFS and RDM’s are supported, NFS offloads will not be supported day one.
  • From the EMC perspective, only V-Max and Unified arrays. Enginuity 5875 and FLARE 30 respectively.
  • ALUA (ie. failover mode 4) must be configured for the initiator records of the participating vSphere host.

751e55ff-7b36-4c92-94e6-58a78b215a74

 

The APIs that comprise VAAI are as follows..

  • Hardware-Accelerated Locking-Main takeaway here is reducing or eliminating complete SCSI LUN locks (ie, metadata changes) which has always been a point of contention in larger clusters. With this mechanism, locks are block level induced and have no affect on other bordering hosts. The assumption here is you are running compliant hardware and this specific Advanced Parameter is enabled on the ESX server itself. Net effect? Bigger, badder, more scalable clusters…
  • Hardware-Accelerated Zero – Main takeaway here is how SCSI writes are written. Instead of identical small block writes of zeros, larger blocks of writes are written effectively reducing I/O overhead and time to creation for new VM’s.
  • Hardware-Accelerated Copy – Main takeaway here is reducing I/O operations (R/W) by using an extended copy function that moves data intra-array as opposed to array to host, host to array. Net effect? Shorter Storage VMotions and VM template deployments. 

Exciting information coming on this front, look forward to more detailed documentation and coverage.

Enhanced by Zemanta