Hi John,
Yesterday I upgraded the memory on servers ex01 and ex02 to rule out "lack of memory" problems in the future, and I found that failover didn't work. Here is what happened:
Step 1: I ran the following commands on server ex02 to put it into maintenance mode, following the instructions here: http://blogs.technet.com/b/nawar/archive/2014/03/30/exchange-2013-maintenance-mode.aspx
1. Drain active mail queues on the mailbox server
Set-ServerComponentState EX02 -Component HubTransport -State Draining -Requester Maintenance
2. To help the transport services pick up the state change immediately, run:
Restart-Service MSExchangeTransport
Restart-Service MSExchangeFrontEndTransport
3. To redirect messages pending delivery in the local queues to another Mailbox server run:
Redirect-Message -Server EX02 -Target EX01
4. To prevent the node from being or becoming the PAM, pause the cluster node by running:
Suspend-ClusterNode EX02
5. To move all active databases currently hosted on the DAG member to other DAG members, run
Set-MailboxServer EX02 -DatabaseCopyActivationDisabledAndMoveNow $True
6. Get the status of the existing database copy auto-activation policy and note the value of DatabaseCopyAutoActivationPolicy; it is needed when taking the server out of maintenance mode later.
To prevent the server from hosting active database copies, run:
Set-MailboxServer EX02 -DatabaseCopyAutoActivationPolicy Blocked
7. To put the server in maintenance mode run:
Set-ServerComponentState EX02 -Component ServerWideOffline -State Inactive -Requester Maintenance
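For step 6, the current policy value can be read like this. It is only a minimal sketch, assuming the standard Get-MailboxServer cmdlet in the Exchange Management Shell:

```powershell
# Show the current auto-activation policy for EX02 so it can be noted
# before it is changed to Blocked.
Get-MailboxServer EX02 | Format-List Name,DatabaseCopyAutoActivationPolicy
```

The value shown is the one to restore when the server leaves maintenance mode.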
After running the commands above, I can see that server ex02 is in maintenance mode:
[PS] C:\Windows\system32>Get-ServerComponentState ex02 | ft Component,State -Autosize
Component                  State
---------                  -----
ServerWideOffline          Inactive
HubTransport               Inactive
FrontendTransport          Inactive
Monitoring                 Active
RecoveryActionsEnabled     Active
AutoDiscoverProxy          Inactive
ActiveSyncProxy            Inactive
EcpProxy                   Inactive
EwsProxy                   Inactive
ImapProxy                  Inactive
OabProxy                   Inactive
OwaProxy                   Inactive
PopProxy                   Inactive
PushNotificationsProxy     Inactive
RpsProxy                   Inactive
RwsProxy                   Inactive
RpcProxy                   Inactive
UMCallRouter               Inactive
XropProxy                  Inactive
HttpProxyAvailabilityGroup Inactive
ForwardSyncDeamon          Inactive
ProvisioningRps            Inactive
MapiProxy                  Inactive
But when I check the copy status of "Mailbox Database 01", I get:
Get-MailboxDatabaseCopyStatus "Mailbox Database 01"
Name                        Status       ContentIndexState
Mailbox Database 01\EX01    Mounted      Healthy
Mailbox Database 01\EX02    Healthy      Healthy
Is that normal?
Step 2: I shut down server ex02 to upgrade the RAM. After server ex02 was off, I checked the copy status of "Mailbox Database 01":
Get-MailboxDatabaseCopyStatus "Mailbox Database 01"
Name                        Status       ContentIndexState
Mailbox Database 01\EX01    Mounted      Healthy
Mailbox Database 01\EX02    ServiceDown  Unknown
Everything was fine at first, but a few seconds later the "Mailbox Database 01" copy on server ex01 was dismounted (server ex01 is the PrimaryActiveManager, as I showed in my previous replies). I checked the copy status again:
Get-MailboxDatabaseCopyStatus "Mailbox Database 01"
Name                        Status       ContentIndexState
Mailbox Database 01\EX01    Dismounted   Failed
Mailbox Database 01\EX02    ServiceDown  Unknown
I tried to mount the "Mailbox Database 01" copy on server ex01, but it failed:
Mount-Database -Identity "Mailbox Database 01"
Couldn't mount the database that you specified. Specified database: Mailbox Database 01; Error code: An Active Manager operation failed. Error: An Active Manager operation encountered an error. To perform this operation, the server must be a member of a database availability group, and the database availability group must have quorum. Error: Active Manager encountered an error while trying to access the cluster database. [Server: EX01.mydomain.com].
+ CategoryInfo : InvalidOperation: (Mailbox Database 01:ADObjectId) [Mount-Database], InvalidOperationException
+ FullyQualifiedErrorId : [Server=EX01,RequestId=719edf90-3a8a-47e1-a85a-0e37902f0253,TimeStamp=3/29/2015 4:33:26 AM] 9BA9C40F,Microsoft.Exchange.Management.SystemConfigurationTasks.MountDatabase
+ PSComputerName : ex01.mydomain.com
A process is holding onto a transport performance counter. processId : 2888, counter : time in resource per second Value=0 SpinLock=0 Lifetime=Type:...
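The error mentions quorum, so here is a rough sketch of the vote arithmetic as I understand it. It assumes my two-member DAG uses the Node and File Share Majority model, where each DAG member and the witness share hold one vote and more than half of the votes must be reachable. Test-Majority is a hypothetical helper written only for illustration; it is not an Exchange or cluster cmdlet.

```powershell
# Hypothetical helper for illustration only; not part of Exchange or FailoverClusters.
function Test-Majority {
    param(
        [int]$TotalVotes,      # all configured votes (DAG members + witness)
        [int]$ReachableVotes   # votes that are currently online and reachable
    )
    # A majority is more than half of all configured votes.
    $needed = [math]::Floor($TotalVotes / 2) + 1
    return ($ReachableVotes -ge $needed)
}

# Two DAG members + one file share witness = 3 votes, so 2 are needed.
Test-Majority -TotalVotes 3 -ReachableVotes 2   # ex01 + witness reachable -> True
Test-Majority -TotalVotes 3 -ReachableVotes 1   # only ex01 reachable      -> False
```

If that model applies here, ex01 alone would not be enough once ex02 is off, unless the witness share is reachable.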
So my users experienced a service outage of about 10 minutes. After I powered server ex02 back on (it was still in maintenance mode when it started up), the "Mailbox Database 01" copy on server ex01 was mounted and became healthy again automatically, and everything went back to normal. I then ran these commands on server ex02 to take it out of maintenance mode:
Set-ServerComponentState EX02 -Component ServerWideOffline -State Active -Requester Maintenance
Resume-ClusterNode EX02
Set-MailboxServer EX02 -DatabaseCopyActivationDisabledAndMoveNow $False
Set-MailboxServer EX02 -DatabaseCopyAutoActivationPolicy Unrestricted
Set-ServerComponentState EX02 -Component HubTransport -State Active -Requester Maintenance
Restart-Service MSExchangeTransport
Restart-Service MSExchangeFrontEndTransport
I then activated the "Mailbox Database 01" copy on server ex02 and repeated the same procedure on server ex01 to upgrade its RAM, and the same thing happened.
My witness server runs Windows 7 Pro 32-bit and is joined to the domain.
Witness directory: C:\Witness. "Exchange Trusted Subsystem" has been added to the local Administrators group on the witness server.
I don't know what is wrong with my DAG.
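These are the checks I can run on ex01 to collect more detail. It is only a sketch, assuming the standard Exchange Management Shell cmdlets and the FailoverClusters module are available on the server:

```powershell
# DAG view: witness settings, whether the witness share is in use, and the current PAM.
Get-DatabaseAvailabilityGroup -Status |
    Format-List Name,Servers,WitnessServer,WitnessDirectory,WitnessShareInUse,PrimaryActiveManager

# Replication and cluster health checks for one DAG member.
Test-ReplicationHealth -Identity EX01

# Cluster view of the quorum configuration and node state.
Import-Module FailoverClusters
Get-ClusterQuorum
Get-ClusterNode | Format-Table Name,State -AutoSize
```

I can post the output of these commands if it helps.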
Edited by Jack Chuong, Monday, March 30, 2015 4:00 AM