Archive for Certification

Compare and contrast physical and virtual hardware resources

Before Virtualisation

  • Single O/S Image per machine
  • Software and Hardware tightly coupled
  • Running multiple applications on the same machine can create conflicts
  • Underutilised resources
  • Inflexible and costly infrastructure
  • Datacenter space taken up by multiple physical servers
  • High Management, Support and Operating costs


After Virtualisation

  • Hardware independence of operating system and applications
  • Virtual machines can be provisioned to any system
  • Can manage OS and application as a single unit by encapsulating them into virtual machines
  • Power Savings
  • Rack space savings
  • Feature rich flexibility (vMotion, Storage vMotion, DRS, HA)
  • Rapidly provision test and development servers


Virtual Infrastructure now

While virtualisation has been a part of the IT landscape for decades, it was only in 1998 that VMware delivered the benefits of virtualisation to industry-standard x86-based platforms, which now form the majority of desktop, laptop and server shipments. A key benefit of virtualisation is the ability to run multiple operating systems on a single physical system and share the underlying hardware resources.

There are two types of architecture: hosted architecture (e.g. VMware Workstation) and bare-metal architecture (ESX/ESXi).

We can now take advantage of features such as

  • The latest generation of x86-based systems feature processors with 64-bit extensions supporting very large memory capacities. This enhances their ability to host large, memory-intensive applications, as well as allowing many more virtual machines to be hosted by a physical server deployed within a virtual infrastructure.
  • The VMkernel runs an individual VMM for each VM, which is responsible for the execution of that VM
  • The VMM is a thin layer that presents virtual x86 hardware to the guest O/S, covering memory management, CPU scheduling, networking and storage
  • Hardware assist (Intel VT-x and AMD-V)
  • Paravirtualised hardware
  • DirectPath I/O
  • Direct Memory Access
  • AMD RVI and Intel EPT
  • Emulated Hardware
  • DRS
  • HA
  • sDRS
  • I/O Control for Networking and Storage
  • NPIV
  • Passthrough devices
  • Shares, Reservations and Limits
  • Resource Pools
  • Clustered file system VMFS
  • Raw Disk Mappings
  • 3 Virtual Disk types (Eager zeroed thick, Lazy Zeroed thick and Thin)
  • Memory reclamation techniques

Storage DRS

What is Storage DRS?

Virtual machine provisioning has traditionally imposed some operational challenges. Monitoring datastore capacity and I/O load has proven to be very difficult and is often neglected. The datastores used to host virtual disks for new virtual machines are often randomly selected, leading to hot spots and over- or under-utilized datastores. A new feature in vSphere 5.0, Storage DRS provides smart virtual machine placement and load balancing mechanisms based on I/O utilization and storage capacity.

Storage DRS can

  • Manage storage resources in a way comparable to how DRS manages compute resources in a cluster
  • Enable smart and rapid placement of new virtual machine disk files and load balancing of existing workloads
  • Storage vMotion VMs based on storage capacity, I/O latency and IOPS load
  • Be configured in Manual or Fully Automated mode
  • Use Affinity and Anti-Affinity rules
  • Use fully automated Storage Maintenance Mode to clear a datastore for maintenance


Migration Recommendations

  • When the I/O response time (latency) threshold is exceeded
  • When the space utilisation threshold is exceeded
  • Space utilisation is checked every 5 minutes
  • IOPS load history is checked every 8 hours
  • Load balancing is based on IOPS workload to ensure that no datastore exceeds a particular VMkernel I/O latency level

Storage DRS utilizes vCenter Server’s datastore utilization reporting mechanism to make recommendations for migrations whenever the configured utilized space threshold (default of 80% utilized) is exceeded. Storage DRS evaluates I/O load every 8 hours, and the default maximum I/O latency threshold is 15ms. To avoid reacting to transient peak loads, Storage DRS generates migration recommendations only when an imbalance persists for a long period of time (several hours out of the evaluation period).

It will take at least 16 hours of I/O Statistics before sDRS will make recommendations.
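The decision logic behind these thresholds is easy to summarise. The following is a minimal Python sketch of that logic, purely for illustration – the function name, datastore structure and sample figures are invented and are not part of Storage DRS or any VMware API.

```python
# Toy illustration of the Storage DRS thresholds described above.
# The datastore dictionaries and values are invented for the example.

SPACE_THRESHOLD = 0.80        # default utilised space threshold (80%)
LATENCY_THRESHOLD_MS = 15.0   # default I/O latency threshold (15 ms)

def needs_migration_recommendation(datastore):
    """Return True if either Storage DRS threshold is breached."""
    space_used = datastore["used_gb"] / datastore["capacity_gb"]
    over_space = space_used > SPACE_THRESHOLD
    # I/O load is evaluated over an 8-hour history, so use an average
    # observed latency rather than an instantaneous spike.
    over_latency = datastore["avg_latency_ms"] > LATENCY_THRESHOLD_MS
    return over_space or over_latency

datastores = [
    {"name": "DS01", "capacity_gb": 1000, "used_gb": 850, "avg_latency_ms": 9.0},
    {"name": "DS02", "capacity_gb": 1000, "used_gb": 400, "avg_latency_ms": 22.0},
    {"name": "DS03", "capacity_gb": 1000, "used_gb": 300, "avg_latency_ms": 5.0},
]

for ds in datastores:
    if needs_migration_recommendation(ds):
        print(f"{ds['name']}: candidate for a migration recommendation")
```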

Configuration of sDRS Settings

  • sDRS Automation
  • sDRS Runtime Rules
  • sDRS Scheduling
  • sDRS Rules for Affinity/Anti-Affinity
  • sDRS VM Settings

vSphere Storage DRS Demo

http://www.youtube.com/watch?v=FmVTY3_SQRw

Properly size a Virtual Machine based on application workload


Considerations

  • Make sure you know whether the application is multithreaded or single threaded in order to select the correct number of vCPUs
  • Make sure you know whether you should be adding RDMs or VMFS VMDKs, e.g. for Microsoft Clustering
  • Java, Oracle and SQL applications are very good at taking all the memory they are assigned and trying to manage it themselves. Be especially careful with Java, which does not mix well with ballooning and paging
  • Start off small and work upwards in terms of resources. Assigning huge resources can interfere with cluster and DRS calculations
  • Use the fastest network adapter you can for the O/S, preferably VMXNET3, to take advantage of all the new features
  • If using FT, use thick provisioned eager zeroed disks
  • Decide where to place the VM swap file
  • Decide on Reservations, Limits and Shares if required
  • Check the manufacturer’s recommendations before setting any advanced attributes
  • Use correctly RAIDed storage, e.g. RAID 5, RAID 10 etc
  • Choose the disk mode – dependent or independent (persistent or nonpersistent)

Modify Large Page Settings


VMware ESXi Server supports the use of large pages inside virtual machines. The large‐page support enables server applications to establish large‐page memory regions. Memory address translations use translation lookaside buffers (TLB) inside the CPU. The use of large pages can potentially increase TLB access efficiency and thus improve program performance.

Large pages improve the performance of many applications, but misuse of large pages may also hurt performance in some situations. The potential for performance degradation is a result of the fact that the number of large TLB entries is usually smaller than the number of small TLB entries. If the working set of an application is scattered over a wide range of address space, the application is likely to experience thrashing of a relatively small number of large TLB entries. This thrashing may result in worse overall performance with higher TLB miss rates.
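To see why page size matters, remember that TLB reach (the amount of memory the TLB can map at once) is simply the number of TLB entries multiplied by the page size. The snippet below uses invented entry counts purely to show the arithmetic; real TLB sizes vary by CPU model.

```python
# TLB reach = number of TLB entries x page size.
# The entry counts below are assumptions for illustration only.

KB = 1024
MB = 1024 * KB

small_page = 4 * KB      # standard x86 page size
large_page = 2 * MB      # x86 large page size

small_tlb_entries = 64   # assumed number of 4 KB TLB entries
large_tlb_entries = 32   # assumed number of 2 MB TLB entries (usually fewer)

print(f"4 KB pages: TLB reach = {small_tlb_entries * small_page / MB:.2f} MB")
print(f"2 MB pages: TLB reach = {large_tlb_entries * large_page / MB:.2f} MB")

# Even with fewer entries, large pages map far more memory per entry, which
# is why they can cut TLB miss rates for large, dense working sets - while a
# widely scattered working set can still thrash the smaller large-page TLB.
```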

Configuring Large Page Settings on the Host

  • Click on your host
  • Click the Configuration tab
  • Click Advanced Settings under Software
  • Select LPage


Configuring Large Page Settings on the O/S

Consult the documentation for your operating system for details on how to configure large page support

Enabling Large Page Support in Applications

Consult the documentation for your application for details on how to configure large page support. For example, the Oracle Database Platform Guide has details on how to enable large page support for an Oracle database

VMware Document on Large Pages

http://www.vmware.com/files/pdf/large_pg_performance.pdf

Identify pre-requisites for Hot-Add Features


What is Hot-Add?

Hot add options allow configuration changes to a virtual machine while it is powered on. Hot add options can be turned on or off for memory and number of CPU configurations for eligible virtual machines.


Pre-Requisites

  • You must disable CPU hot add if you plan to use USB device passthrough from an ESX/ESXi host to a virtual machine
  • When you configure multicore virtual CPUs for a virtual machine, CPU hot add/remove is disabled
  • Not enabled by default
  • Check guest O/S support
  • Memory and CPUs can be hot added (but not hot removed)
  • Enabled per VM and requires a reboot to take effect
  • Enable it on templates so newly deployed VMs inherit the setting
  • Requires virtual hardware version 7 or later
  • Not compatible with Fault Tolerance
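Hot add is enabled per VM while it is powered off. Below is a hedged pyVmomi sketch of the reconfiguration; the connection details and VM name are placeholders, and the VM lookup is deliberately simplified.

```python
# Sketch: enable CPU and memory hot add on a powered-off VM via pyVmomi.
# Connection details and the VM name are placeholders for illustration.
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

ctx = ssl._create_unverified_context()          # lab use only
si = SmartConnect(host="vcenter.example.local",
                  user="administrator@vsphere.local",
                  pwd="password",
                  sslContext=ctx)
try:
    view = si.content.viewManager.CreateContainerView(
        si.content.rootFolder, [vim.VirtualMachine], True)
    vm = next(v for v in view.view if v.name == "test-vm01")   # placeholder VM

    spec = vim.vm.ConfigSpec()
    spec.cpuHotAddEnabled = True      # allow vCPUs to be hot added
    spec.memoryHotAddEnabled = True   # allow memory to be hot added
    # The VM must be powered off for this change, and the guest O/S must
    # support hot add for the extra resources to be usable.
    task = vm.ReconfigVM_Task(spec=spec)
    print(f"Reconfigure task submitted: {task.info.key}")
finally:
    Disconnect(si)
```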

Identify VMware CPU Load Balancing Techniques


The VMkernel CPU scheduler is crucial to providing good performance in a consolidated environment. Most processors these days are equipped with multiple cores per processor, and controlling, managing and scheduling these multi-way processors well is essential. The scheduler’s job is to assign execution contexts (such as vCPUs) to physical processors.

The CPU Scheduler

The CPU Scheduler has the following features

  • Schedules the vCPUs on physical CPUs
  • Enforces the proportional-share algorithm for CPU usage
  • Supports SMP VMs
  • Uses relaxed co-scheduling for SMP VMs
  • Uses NUMA
  • Processor Topology/Cache aware
  • Hyperthreading

Schedules the vCPUs on physical CPUs

The Scheduler checks physical utilisation every 2-40ms and migrates vCPUs as necessary

Enforces the proportional-share algorithm for CPU usage

When CPUs are over-committed, the host time-slices the physical CPUs across all VMs, and each VM is prioritised by its resource allocation settings (Shares, Reservations and Limits).
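As a toy illustration of proportional-share allocation (not the VMkernel’s actual scheduler), the share values below determine the fraction of contended CPU time each VM is entitled to; the VM names and figures are invented.

```python
# Toy proportional-share calculation: contended CPU time is divided in
# proportion to each VM's shares. All values are invented for illustration.

vm_shares = {"web01": 1000, "db01": 2000, "batch01": 1000}

total_shares = sum(vm_shares.values())
contended_cpu_mhz = 8000   # assume 8 GHz of physical CPU is being contended for

for vm, shares in vm_shares.items():
    entitlement = contended_cpu_mhz * shares / total_shares
    print(f"{vm}: {shares} shares -> entitled to {entitlement:.0f} MHz")

# db01 holds half of all shares, so it is entitled to half the contended CPU.
# Reservations and Limits then act as a floor and ceiling on this figure.
```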

Supports SMP VMs

If a VM is configured with multiple processors then it believes that it is running on a dedicated physical multiprocessor. ESXi maintains this by using co-scheduling of the vCPUs.

Co-scheduling is a technique for scheduling, descheduling, pre-empting and blocking related vCPUs across multiple processors. Without it, the vCPUs of an SMP VM would be scheduled independently, breaking the guest’s assumption of uniform progress.

The CPU Scheduler takes “skew” into account when scheduling vCPUs. Skew is the difference in execution rates between two or more vCPUs in an SMP VM. The scheduler maintains a fine-grained cumulative skew value for each vCPU in a VM. Time spent in the hypervisor is excluded from the calculation, as such operations do not always benefit from being co-scheduled. A vCPU is considered to be skewed if its cumulative skew value exceeds a configurable threshold, usually a few seconds

Uses relaxed co-scheduling for SMP VMs

Relaxed co-scheduling is the technique used when vCPUs have become skewed and must be co-started: when any vCPU is scheduled, the scheduler ensures that any sibling vCPUs that have fallen behind are also scheduled.

The vCPUs that move too far ahead are descheduled and wait for the lagging vCPUs to catch up. An idle vCPU does not accumulate skew and is treated as if it were running normally
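A toy sketch of the skew bookkeeping described above; the threshold and progress figures are invented, and the real scheduler tracks skew at a much finer granularity.

```python
# Toy model of co-scheduling skew: a vCPU is considered skewed when it lags
# the furthest-ahead sibling by more than a threshold. Values are invented.

SKEW_THRESHOLD_MS = 3.0   # assumed threshold purely for illustration

# cumulative guest progress (ms) for the vCPUs of one SMP VM
vcpu_progress = {"vcpu0": 120.0, "vcpu1": 118.5, "vcpu2": 112.0, "vcpu3": 119.0}

leader = max(vcpu_progress.values())
skewed = {name: leader - progress
          for name, progress in vcpu_progress.items()
          if leader - progress > SKEW_THRESHOLD_MS}

if skewed:
    # Under relaxed co-scheduling only the lagging vCPUs need to be co-started
    # with the next scheduled sibling; vCPUs too far ahead are held back.
    print("Skewed vCPUs needing co-start:", skewed)
else:
    print("No vCPU exceeds the skew threshold")
```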

Uses NUMA

Please see this blog post for more information on NUMA

http://www.electricmonk.org.uk/2012/03/01/numa/

Processor Topology/Cache aware

Basically the CPU Scheduler uses processor topology information (socket, core and logical processor) to calculate and optimise the placement of vCPUs onto different sockets.

The CPU Scheduler also takes advantage of the shared last-level cache, which is shared between cores on the same processor. Because this cache sits on the processor itself rather than behind the main memory bus, it can be accessed far faster than main memory.

In some situations the CPU scheduler will spread the load across all sockets, while in others it can be beneficial to schedule all vCPUs onto the same socket; which is better depends on the workload and on whether the system is over- or under-committed

Hyperthreading

Hyper-Threading allows two threads to run on a single physical core. When a thread on the core in question stalls or enters a halt state, Hyper-Threading enables the core to work on a second thread instead. It makes the O/S think the processor has double the number of cores, and it often yields a performance improvement.

The applications most likely to benefit are 3D rendering programs, heavy-duty audio/video transcoding apps, and scientific applications built for maximum multithreaded performance, but you may also see a boost when encoding audio files in iTunes, playing 3D games or zipping/unzipping folders. The boost can be up to 30%, although there will also be situations where Hyper-Threading provides no boost at all.

Tune ESXi VM Storage Configuration


Tuning Configuration

  • Use the correct virtual hardware for the VM O/S
  • Use the paravirtualised PVSCSI adapter for I/O-intensive applications
  • Use LSI Logic SAS for newer O/S’s
  • Size the guest O/S queue depth appropriately
  • Make sure guest O/S partitions are aligned
  • Know which disk provisioning policy is best: thick provision lazy zeroed (the default), thick provision eager zeroed or thin provision (see the sketch at the end of this section)
  • Store the swap file on a fast or SSD datastore


  • When deploying a virtual machine, an administrator has a choice between three virtual disk modes. For optimal performance, independent persistent is the best choice. The virtual disk mode can be modified when the VM is powered off.


  • Choose whether to use VMFS or RDM disks. RDM disks are generally used by clustering software.


  • Use Disk Shares to configure more fine-grained resource control


  • In some cases large I/O requests issued by applications can be split by the guest storage driver. Changing the VM’s registry settings to issue larger block sizes can eliminate this splitting, thus enhancing performance. See http://kb.vmware.com/kb/9645697
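The three provisioning policies mentioned above correspond to two flags on a virtual disk’s backing in the vSphere API. The sketch below (pyVmomi, assuming a connection and VM object obtained as in the hot add sketch earlier in this post) simply reports which policy each disk uses.

```python
# Sketch: classify a virtual disk's provisioning policy from its backing flags.
# Assumes a pyVmomi connection and a vim.VirtualMachine object 'vm' obtained
# as in the earlier hot add sketch.
from pyVmomi import vim

def provisioning_policy(disk):
    """Return the provisioning policy of a vim.vm.device.VirtualDisk."""
    backing = disk.backing
    # RDM and other backings may not carry these flags, hence the defaults.
    if getattr(backing, "thinProvisioned", False):
        return "Thin provision"
    if getattr(backing, "eagerlyScrub", False):
        return "Thick provision eager zeroed"
    return "Thick provision lazy zeroed"

def report_vm_disks(vm):
    """Print the provisioning policy of every virtual disk on a VM."""
    for device in vm.config.hardware.device:
        if isinstance(device, vim.vm.device.VirtualDisk):
            print(f"{device.deviceInfo.label}: {provisioning_policy(device)}")
```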

Tune ESXi VM Network Configuration


Tuning Configuration

  • Use the VMXNET3 adapter, and if it is not supported use the VMXNET2 or VMXNET adapter


  • Use a network adapter that supports TCP checksum offload, TSO, jumbo frames, multiqueue support (also known as Receive Side Scaling in Windows), IPv6 offloads, and MSI/MSI-X interrupt delivery
  • Use the fastest Ethernet you can, preferably 10Gb
  • Ensure the speed and duplex settings on the network adapters are correct. For 10/100 NICs, set the speed and duplex manually, and make sure the duplex is set to full duplex
  • For Gigabit Ethernet or faster NICs, set the speed and duplex to auto-negotiate
  • DirectPath I/O (DPIO) provides a means of bypassing the vmkernel, giving a VM direct access to hardware devices by leveraging Intel VT-d and AMD-Vi (IOMMU) hardware support. Specific to networking, DPIO allows a VM to connect directly to the host’s physical network adapter without the overhead associated with emulation or paravirtualisation. The bandwidth increases associated with DPIO are nominal, but the savings in CPU cycles can be substantial for busy workloads. There are quite a few restrictions when utilising DPIO: for example, unless using Cisco UCS hardware, DPIO is not compatible with hot add, FT, HA, DRS or snapshots.
  • Use NIC teaming where possible – VMware’s proprietary NIC teaming or EtherChannel
  • Virtual Machine Communications Interface (VMCI) is a virtual device that promotes enhanced communication between a virtual machine and the host on which it resides, and between VMs running on the same host. VMCI provides a high-speed alternative to standard TCP/IP sockets. The VMCI SDK enables engineers to develop applications which take advantage of the VMCI infrastructure. With VMCI, application traffic between VMs on the same host bypasses the network layer, reducing communication overhead, and it’s not uncommon for inter-VM traffic to exceed 10 GB/s

Tune ESXi VM CPU Configuration


Tuning Configuration

  • Configuring multicore virtual CPUs: there are limitations and considerations on this subject, such as the ESXi host configuration, the VMware licence and guest O/S (licensing) restrictions. Only once these are understood can you decide on the number of virtual sockets and the number of cores per socket (see the sketch after this list)
  • CPU affinity is a technique that doesn’t necessarily imply load balancing, but it can be used to restrict a virtual machine to a particular set of processors. Affinity may not apply after a vMotion, and it can disrupt ESXi’s ability to apply and meet Shares and Reservations
  • Duncan Epping raises some good points in this link http://www.yellow-bricks.com/2009/04/28/cpu-affinity/
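The sketch referred to above: a small, purely illustrative helper for the sockets-versus-cores arithmetic. The only VMware-specific fact it relies on is that the total vCPU count must divide evenly by the cores-per-socket value.

```python
# Illustrative helper: derive the virtual socket count from a desired vCPU
# total and a cores-per-socket value, and flag invalid combinations.

def virtual_sockets(total_vcpus: int, cores_per_socket: int) -> int:
    """Return the number of virtual sockets, or raise if the split is invalid."""
    if total_vcpus % cores_per_socket != 0:
        raise ValueError(f"{total_vcpus} vCPUs cannot be split into sockets "
                         f"of {cores_per_socket} cores")
    return total_vcpus // cores_per_socket

for vcpus, cores in [(8, 4), (8, 2), (6, 4)]:
    try:
        print(f"{vcpus} vCPUs @ {cores} cores/socket -> "
              f"{virtual_sockets(vcpus, cores)} virtual sockets")
    except ValueError as err:
        print(err)
```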


  • You can use Hot Add to add vCPUs on the fly


  • Check Hyperthreading is enabled


  • Generally keep CPU/MMU Virtualisation on Automatic


  • You can adjust Limits, Reservations and Shares to control CPU Resources
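A hedged pyVmomi sketch of applying those CPU resource controls to a VM is shown below; it assumes a vim.VirtualMachine object obtained as in the hot add sketch earlier, and the share, reservation and limit values are examples only.

```python
# Sketch: set CPU Shares, Reservation and Limit on a VM via pyVmomi.
# Assumes a vim.VirtualMachine object 'vm' obtained as in the hot add sketch.
from pyVmomi import vim

def set_cpu_allocation(vm, shares=2000, reservation_mhz=500, limit_mhz=-1):
    """Apply example CPU resource controls; a limit of -1 means unlimited."""
    alloc = vim.ResourceAllocationInfo()
    alloc.reservation = reservation_mhz           # guaranteed MHz
    alloc.limit = limit_mhz                       # upper bound in MHz
    alloc.shares = vim.SharesInfo(level=vim.SharesInfo.Level.custom,
                                  shares=shares)  # relative priority
    spec = vim.vm.ConfigSpec(cpuAllocation=alloc)
    return vm.ReconfigVM_Task(spec=spec)
```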


Tune ESXi VM Memory Configuration


Tuning Configuration

  • Minimum memory size is 4MB for virtual machines that use BIOS firmware. Virtual machines that use EFI firmware require at least 96MB of RAM or they cannot power on.
  • The memory size must be a multiple of 4MB
  • vNUMA exposes NUMA technology to the Guest O/S. Hosts must have matching NUMA architecture and VMs must be running Hardware Version 8


  • Size VMs so they align with physical NUMA boundaries. If you have a system with 6 cores per NUMA node, then size your machines with a multiple of 6 vCPUs (see the sizing sketch after this list)
  • vNUMA can be enabled on smaller machines by adding numa.vcpu.maxPerVirtualNode=X (Where X is the number of vCPUs per vNUMA node)
  • Enable Memory Hot Add to be able to add memory to the VMs on the fly
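A small sketch pulling together the sizing rules above: the minimum and multiple-of-4MB memory rules and NUMA-aligned vCPU counts. The checks simply mirror the figures quoted above; the function names are invented.

```python
# Illustrative checks for the VM sizing rules quoted above.

def check_memory_size(memory_mb: int, firmware: str = "bios") -> list:
    """Return a list of problems with a proposed memory size."""
    problems = []
    minimum = 96 if firmware.lower() == "efi" else 4   # MB, per the rules above
    if memory_mb < minimum:
        problems.append(f"{memory_mb} MB is below the {minimum} MB minimum "
                        f"for {firmware} firmware")
    if memory_mb % 4 != 0:
        problems.append(f"{memory_mb} MB is not a multiple of 4 MB")
    return problems

def numa_aligned(vcpus: int, cores_per_numa_node: int) -> bool:
    """True if the vCPU count is a multiple of the host's cores per NUMA node."""
    return vcpus % cores_per_numa_node == 0

print(check_memory_size(1026, "bios"))   # not a multiple of 4 MB
print(check_memory_size(64, "efi"))      # below the EFI minimum
print(numa_aligned(12, 6))               # True - fits 6-core NUMA nodes evenly
print(numa_aligned(8, 6))                # False - spans NUMA nodes unevenly
```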


  • Use operating systems that support large memory pages, as ESXi will by default provide them to those O/S’s which request them
  • Store a VM’s swap file in a different, faster location than the VM’s working directory
  • Configure a special host cache on an SSD (if one is installed) to be used for the swap to host cache feature. Host cache is new in vSphere 5. If you have a datastore that lives on an SSD, you can designate space on that datastore as host cache. Host cache acts as a cache for all virtual machines on that particular host, as write-back storage for virtual machine swap files. This means that pages that need to be swapped to disk will swap to host cache first, and are then written back to the particular swap file for that virtual machine
  • Keep virtual machine swap files on low latency, high bandwidth storage systems
  • Do not store swap files on thin provisioned LUNs, as this can cause swap file growth to fail


  • You can use Limits, Reservations and Shares to control Resources per VM
