We are running tungsten-community-1.3-rc-2 with 2 nodes (one master and one slave w/ RW splitting enabled) Under our stress tests of 800 users concurrent we have seen a couple of cases where replication is breaking. Two examples are below...
Example 1:
pendingError: Event application failed: |
| com.mysql.jdbc.exceptions.MySQLN |
| Connection.close() has already |
| been called. Invalid operation in |
| this state. |
| pendingErrorCode: NONE |
| pendingErrorEventId: 000003:0000000047304340;0 |
| pendingErrorSeqno: 72892 |
| pendingExceptionMessage: com.mysql.jdbc.exceptions.MySQLN |
| Connection.close() has already |
| been called. Invalid operation in |
| this state. |
| resourceJdbcDriver: com.mysql.jdbc.Driver |
| resourceJdbcUrl: jdbc:mysql://decxpdb007:3306/${D |
| BNAME}?jdbcCompliantTruncation=f |
| alse&zeroDateTimeBehavior=conver |
| tToNull&tinyInt1isBit=false&allo |
| wMultiQueries=true&yearIsDateTyp |
Example 2:
[LOGICAL] /cluster/cluster2/> ls -l decxpdb007
COORDINATOR[decxpdb007:MANUAL:ONLINE]
ROUTERS:
+-----------------------------------------------------------------------+
|decxpdb006:10999:ONLINE |
|decxpdb007:10999:ONLINE |
+-----------------------------------------------------------------------+
DATASOURCES:
+-----------------------------------------------------------------------+
|decxpdb007(slave:ONLINE, progress=-1, latency=0.0) |
+-----------------------------------------------------------------------+
| activeConnectionCount: 28370 |
| appliedLatency: 0.0 |
|callableStatementsCreatedCount: 0 |
| connectionsCreatedCount: 14185 |
| dataServiceName: cluster2 |
| driver: com.mysql.jdbc.Driver |
| highWater: 2(000003:0000000047300711;0) |
| host: decxpdb007 |
| isAvailable: true |
| lastError: |
| lastShunReason: |
| name: decxpdb007 |
| precedence: 99 |
|preparedStatementsCreatedCount: 0 |
| role: slave |
| sequence: Sequence(0:0) |
| state: ONLINE |
| statementsCreatedCount: 0 |
| url: jdbc:mysql://decxpdb007:3306/${D |
| BNAME}?jdbcCompliantTruncation=f |
| alse&zeroDateTimeBehavior=conver |
| tToNull&tinyInt1isBit=false&allo |
| wMultiQueries=true&yearIsDateTyp |
| e=false |
| vendor: mysql |
+-----------------------------------------------------------------------+
+-----------------------------------------------------------------------+
|decxpdb007.corporate.connextions.net: EPLICATOR(role=slave, |
|state=OFFLINE:ERROR) |
+-----------------------------------------------------------------------+
| appliedLastEventId: NONE |
| appliedLastSeqno: -1 |
| appliedLatency: -1.0 |
| clusterName: cluster2 |
| currentEventId: NONE |
| host: decxpdb007.corporate.connextions |
|.net |
| latestEpochNumber: -1 |
| masterConnectUri: thl://decxpdb006/ |
| masterListenUri: thl://decxpdb007/ |
| maximumStoredSeqNo: -1 |
| minimumStoredSeqNo: -1 |
| pendingError: Event application failed: |
| com.mysql.jdbc.exceptions.MySQLN |
| Connection.close() has already |
| been called. Invalid operation in |
| this state. |
| pendingErrorCode: NONE |
| pendingErrorEventId: 000003:0000000047304340;0 |
| pendingErrorSeqno: 72892 |
| pendingExceptionMessage: com.mysql.jdbc.exceptions.MySQLN |
| Connection.close() has already |
| been called. Invalid operation in |
| this state. |
| resourceJdbcDriver: com.mysql.jdbc.Driver |
| resourceJdbcUrl: jdbc:mysql://decxpdb007:3306/${D |
| BNAME}?jdbcCompliantTruncation=f |
| alse&zeroDateTimeBehavior=conver |
| tToNull&tinyInt1isBit=false&allo |
| wMultiQueries=true&yearIsDateTyp |
| e=false |
| resourceVendor: mysql |
| role: slave |
| seqnoType: java.lang.Long |
| sourceId: decxpdb007 |
| state: OFFLINE:ERROR |
| timeInStateSeconds: 299.571 |
| uptimeSeconds: 82980.335 |
+-----------------------------------------------------------------------+
+-----------------------------------------------------------------------+
|decxpdb007

ATASERVER(state=ONLINE) |
+-----------------------------------------------------------------------+
| state: ONLINE |
+-----------------------------------------------------------------------+
We are able to bring them back online no problem w/ the cctrl command replicator decxpdb007 online but why is it breaking in the 1st place?! These servers have 16 cores and 128 GB of memory. And chose "large" on the install of Tungsten
Thank You!
Cody