Showing posts with label Cluster. Show all posts

MSMQ error events in Failover Clustering Diagnostic Log

While checking the Failover Clustering diagnostic log on Windows Server 2012 cluster nodes, I found that every node was logging error events for MSMQ:

[RHS] s_RhsRpcCreateResType: ERROR_NOT_READY(21)' because of 'Startup routine for ResType MSMQ returned 21.'
[RCM] result of first load attempt for type MSMQ: 21
[RCM] Failed to load restype 'MSMQ': error 21.
[RHS] s_RhsRpcCreateResType: ERROR_NOT_READY(21)' because of 'Startup routine for ResType MSMQTriggers returned 21.'
[RCM] result of first load attempt for type MSMQTriggers: 21
[RCM] Failed to load restype 'MSMQTriggers': error 21.

These events can be safely ignored. When the Failover Clustering feature is installed, the MSMQ and MSMQTriggers resource types are registered with the cluster service, but the MSMQ feature itself is not installed, so both resource types fail to load with error 21 (ERROR_NOT_READY).

For more about these events, see:
http://blogs.msdn.com/b/clustering/archive/2013/04/05/10408075.aspx
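When reading through a diagnostic log for real problems, it can help to drop these known-benign lines first. Below is a minimal Python sketch, assuming the log has already been exported to a text file and that the benign lines have the same shape as the ones quoted above. The function names and the pattern are my own illustration, not part of any Microsoft tooling.

```python
import re

# Resource types that get registered with the cluster service even though
# the MSMQ feature is not installed (taken from the log lines quoted above).
BENIGN_RESTYPES = ("MSMQ", "MSMQTriggers")

# The three message shapes quoted above, all ending in error 21 (ERROR_NOT_READY).
_BENIGN = re.compile(
    r"Startup routine for ResType (?P<a>\w+) returned 21\."
    r"|result of first load attempt for type (?P<b>\w+): 21\b"
    r"|Failed to load restype '(?P<c>\w+)': error 21\."
)


def is_benign_msmq_line(line: str) -> bool:
    """Return True if the line is one of the ignorable MSMQ load failures."""
    match = _BENIGN.search(line)
    if match is None:
        return False
    restype = match.group("a") or match.group("b") or match.group("c")
    return restype in BENIGN_RESTYPES


def filter_log(lines):
    """Return the lines that are NOT known-benign MSMQ load failures."""
    return [line for line in lines if not is_benign_msmq_line(line)]
```

Only error 21 for the two MSMQ resource types is filtered; the same error for any other resource type is kept, since this post says nothing about those.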

Failover Clustering Event 1196 and 1228

In this case a Hyper-V failover cluster was installed on Windows Server 2012, and the node that was hosting the "Cluster Group" started to log the following error events in the System event log:

Event 1228:
Cluster network name resource 'Cluster Name' encountered an error enabling the network name on this node. The reason for the failure was:
'Unable to obtain a logon token'.
The error code was '1326'.
You may take the network name resource offline and online again to retry.

and Event 1196:
Cluster network name resource 'Cluster Name' failed registration of one or more associated DNS name(s) for the following reason: DNS bad key.
Ensure that the network adapters associated with dependent IP address resources are configured with at least one accessible DNS server.

I moved the "Cluster Group" to another node, but the same events were logged there. Live migration of VMs between nodes also failed, although quick migration worked fine. During a live migration attempt, the Failover Clustering diagnostic log showed the following error messages:
[RES] Network Name: [NNLIB] LogonUserEx fails for user 'Cluster Name': 1326 (useSecondaryPassword: 0)  
[RES] Network Name: [NNLIB] LogonUserEx fails for user 'Cluster Name': 1326 (useSecondaryPassword: 1)  
[RES] Network Name: [NNLIB] Logon failed for user 'Cluster Name' (Error 1326), DC \\dc.domain.name, domain domain.name  
[RES] Network Name <Cluster Name>: Identity: Obtaining Windows Token for Name: 'Cluster Name', SamName: 'Cluster Name', Type: Singleton, Result: 1326, LastDC: \\dc.domain.name  
… 
[RES] Network Name <Cluster Name>: Initializing Identity module failed with error 1326  
[RHS] Error 1326 from ResourceControl for resource Cluster Name.  
[RCM] ResourceControl(NETNAME_GET_VIRTUAL_SERVER_TOKEN) to Cluster Name returned 1326.  
[RES] Virtual Machine <Virtual Machine Name>: Live migration of 'Virtual Machine Name' failed.
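
The 1326 that repeats in these lines is the Win32 error ERROR_LOGON_FAILURE (the user name or password is incorrect), so the cluster could not log on as its own network name account. To pull these failures out of a longer log, here is a minimal Python sketch, assuming an exported text log whose lines match the "Logon failed for user" line quoted above. The function name and the returned field names are my own.

```python
import re

# Win32 error codes seen in this post; the name comes from winerror.h.
WIN32_ERRORS = {1326: "ERROR_LOGON_FAILURE"}

# Shape of the "[NNLIB] Logon failed for user ..." line quoted above.
_LOGON_FAILED = re.compile(
    r"Logon failed for user '(?P<user>[^']+)' \(Error (?P<code>\d+)\), "
    r"DC (?P<dc>\S+), domain (?P<domain>\S+)"
)


def find_logon_failures(lines):
    """Return one dict per network name logon failure found in the lines."""
    failures = []
    for line in lines:
        match = _LOGON_FAILED.search(line)
        if match is None:
            continue
        code = int(match.group("code"))
        failures.append({
            "user": match.group("user"),
            "code": code,
            "name": WIN32_ERRORS.get(code, "unknown"),
            "dc": match.group("dc"),
            "domain": match.group("domain"),
        })
    return failures
```

The domain controller in each result shows which DC rejected the logon, which is useful when checking whether the account password has replicated.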

I checked the permissions on the CNO DNS record and on the CNO AD object, and everything was fine, but the CNO password had somehow gone out of sync with AD. These are the steps I took to fix it:

Moved the CNO account to the Computers container
Logged on to one of the cluster nodes with an account that had the Reset Password right
Simulated multiple failures of the cluster Network Name resource until it went into a permanent failed state
Once it was in the failed state, right-clicked the resource and chose Repair under More Actions

The last action resets the CNO password in AD and brings the resource online. After that, the CNO DNS record was updated successfully, live migration of VMs started to work, and no more error events were logged on the 'Cluster Group' owner.

For more information about the CNO on Windows Server 2012, see: http://blogs.technet.com/b/askcore/archive/2012/09/25/cno-blog-series-increasing-awareness-around-the-cluster-name-object-cno.aspx

VSS Error Event 8194 on Windows Server 2012 Failover Cluster

In this case I was running a Windows Server 2012 Hyper-V failover cluster with CSV, and I was backing up the cluster nodes with DPM 2012 SP1 RU 2. I had scheduled BMR system protection for all cluster member nodes. The node backups finished successfully with no errors on the DPM server, but on every cluster node the Application log contained error event 8194 from the VSS source:

Volume Shadow Copy Service error: Unexpected error querying for the IVssWriterCallback interface. hr = 0x80070005, Access is denied.
This is often caused by incorrect security settings in either the writer or requestor process.
Operation:
Gathering Writer Data
Context:
Writer Class Id: {e8132975-6f93-4464-a53e-1050253ae220}
Writer Name: System Writer
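
The hr value in this event can be decoded by hand: an HRESULT carries a failure bit, a facility, and a code. A minimal Python sketch of that decoding follows; the function name is my own, and the bit layout is the standard HRESULT layout.

```python
def decode_hresult(hr: int) -> dict:
    """Split a 32-bit HRESULT into its failure bit, facility and code."""
    hr &= 0xFFFFFFFF  # treat the value as unsigned 32-bit
    return {
        "failure": bool(hr >> 31),       # bit 31: severity (1 = failure)
        "facility": (hr >> 16) & 0x7FF,  # bits 16-26: facility
        "code": hr & 0xFFFF,             # bits 0-15: error code
    }


result = decode_hresult(0x80070005)
# facility 7 is FACILITY_WIN32 and code 5 is ERROR_ACCESS_DENIED,
# which matches the "Access is denied" text in the event.
print(result)
```

So 0x80070005 is simply the Win32 error 5 (access denied) wrapped in an HRESULT, not a VSS-specific code.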
 
I opened a support case with Microsoft, and the support engineer confirmed that this behavior is a bug. It will probably not be changed, and the event should be ignored.

How to check EMBG (Unique Master Citizen Number) using regex

In this post, I will share my implementation of how to check if some number looks like EMBG or Unique Master Citizen Number. For those of yo...