Channel: SQL Server High Availability and Disaster Recovery forum

SQL Server 2012 cluster did not fail over


Hello

We recently had a SQL Server 2012 SE service go down for almost a minute. This is a clustered environment. What's interesting is that the cluster service did not fail over to the other node when the service went down; instead, it restarted the service on the same node. We are also not sure why the service went down in the first place. I checked cluster.log, and here is what it says. Can someone please help me understand this log?

2015/02/28-03:11:13.762 WARN  [RES] SQL Server <SQL Server>: [sqsrvres] Failed to retrieve data column. Return code -1
2015/02/28-03:11:14.745 ERR   [RES] SQL Server <SQL Server>: [sqsrvres] Failure detected, diagnostics heartbeat is lost
2015/02/28-03:11:14.745 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] IsAlive returns FALSE
2015/02/28-03:11:14.745 WARN  [RHS] Resource SQL Server IsAlive has indicated failure.
2015/02/28-03:11:14.745 INFO  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SQL Server', gen(0) result 1.
2015/02/28-03:11:14.745 INFO  [RCM] TransitionToState(SQL Server) Online-->ProcessingFailure.
2015/02/28-03:11:14.745 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL Server)
2015/02/28-03:11:14.745 INFO  [RCM] resource SQL Server: failure count: 1, restartAction: 2.
2015/02/28-03:11:14.745 INFO  [RCM] Will restart resource in 500 milliseconds.
2015/02/28-03:11:14.745 INFO  [RCM] TransitionToState(SQL Server) ProcessingFailure-->[WaitingToTerminate to DelayRestartingResource].
2015/02/28-03:11:14.745 INFO  [RCM] rcm::RcmGroup::UpdateStateIfChanged: (SQL Server (MSSQLSERVER), Online --> Pending)
2015/02/28-03:11:14.745 INFO  [RCM] TransitionToState(SQL Server Agent) Online-->[WaitingToTerminate to OnlineCallIssued].
2015/02/28-03:11:14.745 INFO  [RCM] TransitionToState(SQL Server Agent) [WaitingToTerminate to OnlineCallIssued]-->[Terminating to OnlineCallIssued].
2015/02/28-03:11:16.617 INFO  [RCM] HandleMonitorReply: TERMINATERESOURCE for 'SQL Server Agent', gen(0) result 0.
2015/02/28-03:11:16.617 INFO  [RCM] Restarting resource 'SQL Server Agent'.
2015/02/28-03:11:16.617 INFO  [RCM] TransitionToState(SQL Server) [WaitingToTerminate to DelayRestartingResource]-->[Terminating to DelayRestartingResource].
2015/02/28-03:11:16.617 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Request to terminate SQL Server
2015/02/28-03:11:16.617 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Stop service MSSQLSERVER immediately
2015/02/28-03:11:16.617 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Online worker was asked to terminate
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Online worker helper is stopped
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] SQLMoreResults() returns -1 with following information
2015/02/28-03:12:09.002 ERR   [RES] SQL Server <SQL Server>: [sqsrvres] ODBC Error: [08S01] [Microsoft][SQL Server Native Client 11.0]TCP Provider: The specified network name is no longer available.(64)
2015/02/28-03:12:09.002 ERR   [RES] SQL Server <SQL Server>: [sqsrvres] ODBC Error: [08S01] [Microsoft][SQL Server Native Client 11.0]Communication link failure (64)
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] No more diagnostics results
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Diagnostics is stopped
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Disconnect from SQL Server
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Extended Event logging is stopped
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Extended Event target state:
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Extended Event session summary: dropped buffers = 0, dropped events = 0
2015/02/28-03:12:09.002 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Online worker is stopped
2015/02/28-03:12:12.387 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Service was stopped successfully
2015/02/28-03:12:12.387 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Terminate handling is completed
2015/02/28-03:12:12.387 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] SQL Server resource state is changed from 'ClusterResourceOnline' to 'ClusterResourceFailed'
2015/02/28-03:12:12.387 WARN  [RHS] returning ResourceExitStateTerminate.
2015/02/28-03:12:12.387 INFO  [RCM] HandleMonitorReply: TERMINATERESOURCE for 'SQL Server', gen(1) result 0.
2015/02/28-03:12:12.387 INFO  [RCM] TransitionToState(SQL Server) [Terminating to DelayRestartingResource]-->DelayRestartingResource.
2015/02/28-03:12:12.902 INFO  [RCM] Delay-restarting SQL Server and any waiting dependents.
2015/02/28-03:12:12.902 INFO  [RCM] TransitionToState(SQL Server) DelayRestartingResource-->OnlineCallIssued.
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Request to bring SQL Server online
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] SQL Server resource state is changed from 'ClusterResourceFailed' to 'ClusterResourceOnlinePending'
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Online worker is started
2015/02/28-03:12:12.902 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL Server', gen(1) result 997.
2015/02/28-03:12:12.902 INFO  [RCM] TransitionToState(SQL Server) OnlineCallIssued-->OnlinePending.
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] XEvent session MSSQLSERVER is created with RolloverCount 10, MaxFileSizeInMBytes 100, and LogPath 'M:\MSSQL11.MSSQLSERVER\MSSQL\LOG\'
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Extended Event logging is started
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property VerboseLogging is 0
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property HealthCheckTimeout is 60000
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property FailureConditionLevel is 3
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property SqlDumperDumpFlags is 0x0
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property SqlDumperDumpTimeOut is 0
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The private property SqlDumperDumpPath is ''
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The property LogIsEnabled is 1
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The property LogFileRolloverCount is 10
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The property LogMaxFileSizeInMBytes is 100
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The property LogPath is ''
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Server name is SQLSERVER
2015/02/28-03:12:12.902 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Service name is MSSQLSERVER
2015/02/28-03:12:12.917 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Dependency expression for resource 'SQL Network Name (SQLServer)' is '([e7738d9a-695e-4250-8254-55b9e23e8726])'
2015/02/28-03:12:12.917 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Starting service MSSQLSERVER...
2015/02/28-03:12:13.183 INFO  [NM] Received request from client address DBServer2.
2015/02/28-03:12:14.197 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Service status checkpoint was changed from 0 to 1 (wait hint 20000). Pid is 6148
2015/02/28-03:12:14.789 INFO  [NM] Received request from client address DBServer2.
2015/02/28-03:12:15.211 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Service is started. SQL Server pid is 6148
2015/02/28-03:12:15.211 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Connect to SQL Server ...
2015/02/28-03:12:15.211 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The connection was established successfully
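
To make the timeline in the excerpt above easier to read, here is a minimal Python sketch that pulls the timestamps out of a few of those cluster.log lines and measures the gaps between them. The helper names (`parse_line`, `first_time`) and the choice of which messages mark the start and end of the outage are my own assumptions for illustration, not anything defined by the cluster service.

```python
import re
from datetime import datetime

# Assumed line layout, taken from the excerpt above:
# "<yyyy/mm/dd-hh:mm:ss.mmm> <LEVEL> [<COMPONENT>] <message>"
LINE_RE = re.compile(
    r"^(\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+(\w+)\s+\[(\w+)\]\s+(.*)$"
)

def parse_line(line):
    """Return (timestamp, level, component, message), or None if the line does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    stamp, level, component, message = m.groups()
    return datetime.strptime(stamp, "%Y/%m/%d-%H:%M:%S.%f"), level, component, message

def first_time(lines, needle):
    """Timestamp of the first parsed line whose message contains `needle`."""
    for line in lines:
        parsed = parse_line(line)
        if parsed and needle in parsed[3]:
            return parsed[0]
    return None

# Four lines copied from the log excerpt above.
sample = [
    "2015/02/28-03:11:14.745 ERR   [RES] SQL Server <SQL Server>: [sqsrvres] Failure detected, diagnostics heartbeat is lost",
    "2015/02/28-03:11:16.617 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Stop service MSSQLSERVER immediately",
    "2015/02/28-03:12:12.387 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] Service was stopped successfully",
    "2015/02/28-03:12:15.211 INFO  [RES] SQL Server <SQL Server>: [sqsrvres] The connection was established successfully",
]

failed = first_time(sample, "Failure detected")
stop_requested = first_time(sample, "Stop service")
stopped = first_time(sample, "Service was stopped successfully")
reconnected = first_time(sample, "connection was established")

# Gap from the detected failure to the resource DLL reconnecting to SQL Server.
outage_seconds = (reconnected - failed).total_seconds()
# Portion of that gap spent waiting for the service to stop.
stop_seconds = (stopped - stop_requested).total_seconds()

print(f"failure -> reconnect: {outage_seconds:.3f} s")
print(f"stop request -> stopped: {stop_seconds:.3f} s")
```

On these four lines the gap from the detected failure to the successful reconnect is about 60.5 seconds, and roughly 55.8 seconds of that falls between the stop request and the service actually stopping.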

However, in the Event Viewer System log I am seeing a number of iScsiPrt errors that say:

Target did not respond in time for a SCSI request. The CDB is given in the dump data.

Initiator could not find a match for the initiator task tag in the received PDU. Dump data contains the entire iSCSI header.

Initiator sent a task management command to reset the target. The target name is given in the dump data.

Warning:

The description for Event ID 129 from source iScsiPrt cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event: 

\Device\RaidPort1

the message resource is present but the message is not found in the string/message table

There is also a SQL Server service error:

A timeout (30000 milliseconds) was reached while waiting for a transaction response from the MSSQLSERVER service.

Can someone please help me understand this?

Thanks a ton.

