SAP Knowledge Base Article - Preview

3095218 - Stopping master's RepAgent with nowait caused Signal 11 and ASE crash - SRS 16.0

Symptom

  • "admin who_is_down" in the standby SRS shows:
     
       REP AGENT CI    Down                 xxx_xxx_R2.master
  • "sp_help_rep_agent master, 'process'" in the primary ASE shows:

      Replication Agent Coordinator Process Status dbname spid sleep_status state
      --------------------------------------------------------------------------------------
      (1 row affected)
  • Restarting this RepAgent in the primary ASE fails because the RepAgent is not completely down:
1> sp_start_rep_agent master
2> go
Msg 9431, Level 16, State 0:
Server 'xxx', Procedure 'sp_start_rep_agent', Line 281:
A Rep Agent Thread for database 'master' (dbid=1) is being stopped.
Msg 18421, Level 16, State 1:
Server 'xxx', Procedure 'sp_start_rep_agent', Line 286:
Failed to start the Replication Agent thread for database 'master'.
(return status = 1)
  • If the first sp_stop_rep_agent fails, a subsequent attempt to stop the RepAgent with nowait (forced stop) raises a signal 11 (SIGSEGV) and the ASE server crashes:

00:0049:00000:04282:2021/08/11 15:50:19.31 server  Error: 9414, Severity: 20, State: 1
00:0049:00000:04282:2021/08/11 15:50:19.31 server  RepAgent(1): Streaming replication stream CI Library error -1 with severity 2 and message 'Producer receiver 'master' failed to receive data, shutting down transport to 'nnnn:nn ssl=nnn"'.'. Rep Agent detected error 3 with message 'Unknown' at line 3653 in file ra_ci_scanner.c.
00:0000:00000:00000:2021/08/11 15:50:19.31 kernel  (CI-Info) Message: 76050, Severity: 0 Producer of Stream 'master' is exiting.
00:0049:00000:00753:2021/08/11 15:50:19.31 server  Rep Agent on database 'master' switched from mode 'sync' to mode 'async' because scanner reopened the stream after a retryable error.

……

00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Current process (0x0) infected with signal 11 (SIGSEGV)
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Current Process is running on a Non-Engine Thread
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Address 0x(nil) (), siginfo (code, address) = (1, 0x(nil))
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  **** Saved signal context (0x0x00007e8ec61a4ac0): ****
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  uc_flags: 0x7, uc_link: 0x(nil)
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  uc_sigmask: 0x7bfbf037 0xb 0x1 0x0
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  uc_stack: ss_sp: 0x(nil), ss_size: 0x0, ss_flags: 0x2
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  General Registers (uc_mcontext.gregs):
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       PC : (nil) ()
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel          RAX : 0x0000000000000025  RBX : 0x00007e8ec61a57f0
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel          RCX : (nil)  RDX : (nil)
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       RBP : 0x00007e8ec61a5b40  RSP : 0x00007e8ec61a57a8
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       R8  : (nil)  R9  : 0x00007e9839d50370
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       R10 : (nil)  R11 : 0x0000000000000206
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       R12 : (nil)  R13 : 0x00007e983a973d30
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       R14 : 0x00007e983a9714f0  R15 : 0x0000000000000001
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       RDI : 0x00007e983a9714f0  RSI : 0x00007e8ec61a57f0
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       RIP : (nil)  CSGSFS : 0x002b000000000033
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       TRAPNO : 0x000000000000000e  ERR : 0x0000000000000014
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel       EFL : 0x0000000000010246
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  **** end of signal context ****
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Spinlocks held by taskblock 0x7e8efb28aa08

00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Spinlock Kernel->kpprocspin[i] at address 0x00007e8ef9525400 owned by 0x7e8efb28aa08
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  Spinlock DES Name Hash Bucket Spin at address 0x00007e8efa01d200 owned by 0x7e8efb28aa08
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  Spinlock DES Name Hash Bucket Spin at address 0x00007e8efa057400 owned by 0x7e8efb28aa08
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  Spinlock default data cache at address 0x00007e8efa3a9480 owned by 0x7e8efb28aa08
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  End of spinlock display.
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  Exception in blocking call execution
00:0040:00000:00000:2021/08/13 11:37:38.51 kernel  ueshutdown: exiting
00:0000:00000:00000:2021/08/13 11:37:38.51 kernel  Main thread performing final shutdown.
00:0000:00000:00000:2021/08/13 11:37:38.51 kernel  Blocking call queue shutdown.
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000153b8a1 nthrd_backtrace+0x3b()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000001563612 kbctException+0x44()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000001561575 krtctskException+0x57()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000156999e kisignal+0xa22()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: (nil) ()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f00c40 ra__ci_call_CloseStream_unblocking+0x3f()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000224cd54 ubcExecute+0x54()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x00000000022591a3 kbctMain+0x73()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000002252f79 krtctskRun+0x39()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000001a8496c ktskRun+0x5c()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000002278daa kt__RTCMain+0xaa()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  end of stack trace
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  Blocking call executing on behalf of 440862010 failed with exception
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000001a7ce2b upsleepgeneric+0x5cb()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000224ce6f ubcWaitForCompletion+0x5f()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000224dccc undo_blockingcall_thread+0x3c()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000ef4a87 ra__ci_sleep_on_close_CI_stream+0x3d0()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000ef448b ra__ci_close_CI_stream+0x921()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000ef3835 ra_ci_close_CI_stream+0xfa()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f1d31e ra__ci_scanner_close_stream+0xa7()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f1e3a8 ra__ci_scanner_reopen_stream+0x55()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  [Handler pc: 0x0x0000000000f98589 ra_hdlr installed by the following function:-]
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f11dad ra__ci_scanner_execute+0x18a()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  [Handler pc: 0x0x0000000000f98589 ra_hdlr installed by the following function:-]
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  [Handler pc: 0x0x0000000000f98589 ra_hdlr installed by the following function:-]
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  [Handler pc: 0x0x0000000000f98589 ra_hdlr installed by the following function:-]
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f0ea74 ra_ci_scanner_thread+0x3ce()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  end of stack trace, spid 4282, kpid 440862010, suid 0
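
To compare another ASE errorlog against the excerpt above, the following is a minimal Python sketch. It only checks for the two markers visible in this article: the "infected with signal 11 (SIGSEGV)" line and the ra__ci_call_CloseStream_unblocking frame in the stack trace. The function name scan_errorlog, the result keys, and the SAMPLE text are hypothetical helpers introduced for illustration. A match only suggests similarity to this symptom and is not a confirmed diagnosis.

```python
import re

# Marker line printed by the kernel when the crash occurs (taken from the excerpt).
SIGSEGV_RE = re.compile(r"infected with signal 11 \(SIGSEGV\)")

# Stack frame lines look like: "pc: 0x0000000000f00c40 name+0x3f()".
# Lines such as "pc: (nil) ()" or "[Handler pc: ...]" are skipped on purpose.
FRAME_RE = re.compile(r"pc: 0x[0-9a-fA-F]+ (\w+)\+0x[0-9a-fA-F]+\(\)")

# Frame that appears directly below the failing address in this article's trace.
SIGNATURE_FRAME = "ra__ci_call_CloseStream_unblocking"


def scan_errorlog(text):
    """Return whether the log text shows both markers from this article."""
    sigsegv = False
    frames = []
    for line in text.splitlines():
        if SIGSEGV_RE.search(line):
            sigsegv = True
        match = FRAME_RE.search(line)
        if match:
            frames.append(match.group(1))
    return {
        "sigsegv": sigsegv,
        "frames": frames,
        "matches_kba": sigsegv and SIGNATURE_FRAME in frames,
    }


# Hypothetical sample: four lines copied from the excerpt above.
SAMPLE = """\
00:0090:00000:00000:2021/08/13 11:37:38.50 kernel  Current process (0x0) infected with signal 11 (SIGSEGV)
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x000000000156999e kisignal+0xa22()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: (nil) ()
00:0090:00000:00000:2021/08/13 11:37:38.51 kernel  pc: 0x0000000000f00c40 ra__ci_call_CloseStream_unblocking+0x3f()
"""

if __name__ == "__main__":
    result = scan_errorlog(SAMPLE)
    print(result["matches_kba"], result["frames"])
```

In practice the text would come from reading the server's errorlog file; the path to that file depends on the installation and is not given in this article.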



Environment

  • SAP Adaptive Server Enterprise (ASE) 16.0 with Always-on Option (HADR)
  • SAP Replication Server (SRS) 16.0

Product

SAP Replication Server 16.0

Keywords

sybase, 825379, CR825379, CR#825379, SEGV, segmentation, storage, access, violation, fault, segfault, stacktrace, KBA, BC-SYB-REP-SAP, Replication with SAP Suite / SAP BW, BC-DB-SYB, Business Suite on Adaptive Server Enterprise, BC-SYB-ASE, Sybase ASE Database Platform (non Business Suite), BC-SYB-REP, Sybase Replication Server (standalone), Known Error
