SreedharM3618 Member since 2019 7 posts
Transport for NSW
Posted: 2 months ago
Last activity: 2 months 2 weeks ago

Pega 8.4 kafka stability issues: CharlatanExceptions in logs

We recently upgraded to 8.4.1 from 7.2.2 and after that we have been constantly seeing lot of issues related to kafka on our Dev and SIT servers. 

Recently we have done multi node setup in our test environments (2 separate servers) with default node classification  (i.e Web, BackgroundProcessing, Search and Stream)

1) we started seeing strange issues like the below log which gets logged every 10 seconds.

We tried truncating PR_SYS_STATUSNODES and started the nodes and things looked fine for couple of days before surfacing again.

Stream status shows normal for both the nodes.

2) One more strange behaviour is that Admin studio is showing '0 nodes as running' though both the nodes are running fine and we're able to log into the applications on both the nodes.

The communication b/w both the nodes also seem to be working - I telnet'ed connecting to different ports used by pega from one node to other and the connection is established okay (unless I overlooked something).

Has anyone faced this and any help is really appreciated?



2020-08-11 22:35:55,215 [  New I/O worker #65] [  STANDARD] [                    ] [                    ] (ion.service.SessionServiceImpl) ERROR   - Failed to accept in coming connection for the session '-1375665877' com.pega.charlatan.utils.CharlatanException$SessionExpiredException: KeeperErrorCode = Session expired         at com.pega.charlatan.session.service.SessionServiceImpl.handleConnectRequest( ~[charlatan-server.jar:?]         at com.pega.charlatan.session.service.SessionServiceImpl.processRequest( ~[charlatan-server.jar:?]         at com.pega.charlatan.server.CharlatanNettyConnection.receiveMessage( ~[charlatan-server.jar:?]         at com.pega.charlatan.server.CharlatanNettyServer$CharlatanChannelHandler.processMessage( ~[charlatan-server.jar:?]         at com.pega.charlatan.server.CharlatanNettyServer$CharlatanChannelHandler.messageReceived( ~[charlatan-server.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at com.pega.charlatan.server.CharlatanNettyServer$CharlatanChannelHandler.handleUpstream( ~[charlatan-server.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at$DefaultChannelHandlerContext.sendUpstream( ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at org.jboss.netty.handler.codec.frame.FrameDecoder.unfoldAndFireMessageReceived( ~[netty-3.10.6.Final.jar:?]         at org.jboss.netty.handler.codec.frame.FrameDecoder.callDecode( ~[netty-3.10.6.Final.jar:?]         at org.jboss.netty.handler.codec.frame.FrameDecoder.messageReceived( ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at ~[netty-3.10.6.Final.jar:?]         at org.jboss.netty.util.internal.DeadLockProofWorker$ ~[netty-3.10.6.Final.jar:?]         at java.util.concurrent.ThreadPoolExecutor.runWorker( ~[?:1.8.0_77]         at java.util.concurrent.ThreadPoolExecutor$ ~[?:1.8.0_77]         at com.pega.dsm.dnode.util.PrpcRunnable$ ~[d-node.jar:?]         at com.pega.dsm.dnode.util.PrpcRunnable$ ~[d-node.jar:?]         at com.pega.dsm.dnode.util.PrpcRunnable.execute( ~[d-node.jar:?]         at com.pega.dsm.dnode.impl.prpc.PrpcThreadFactory$ ~[d-node.jar:?]



***Edited by Moderator Marissa to change type from General to Upgrade, update Platform Capability tags****

Pega Platform System Administration Low-Code App Development Upgrades Government Upgrade
Share this page LinkedIn