Fix ledger replicated failed blocks bookie decommission process by hangc0276 · Pull Request #3917 · apache/bookkeeper

hangc0276 · 2023-04-13T11:53:19Z

Motivation

When I decommission one bookie (bk3), one ledger replicate failed and blocked decommission process.

This is the auto-recovery log:

2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.client.LedgerHandle - ReadEntries exception on ledgerId:904368 firstEntry:14 lastEntry:14 lastAddConfirmed:13
2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Received error: -1 while trying to read entry: 14 of ledger: 904368 in ReplicationWorker
2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Failed to read faulty entries, so giving up replicating ledgerFragment Fragment(LedgerID: 904368, FirstEntryID: 0[0], LastKnownEntryID: 14[14], Host: [betausc1-bk-10.betausc1-bk-headless.o-vaxkx.svc.cluster.local:3181], Closed: true)
2023-03-29T06:29:22,644+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - ReplicationWorker failed to replicate Ledger : 904368 for 6 number of times, so deferring the ledger lock release by 300000 msecs

The ledger's metadata:

ledgerID: 904368
2023-03-29T06:47:56,511+0000 [main] INFO  org.apache.bookkeeper.tools.cli.commands.
client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=3, writeQuorumSize=3, 
ackQuorumSize=2, state=OPEN, digestType=CRC32C, password=base64:, 
ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]},...}

The ledger (904368) has two ensembles, ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]}. However, the replication worker got the ledger's LAC is 13, but it got the replication fragment entry range is [0, 14]. When reading entry 14, it failed.

One question

Why the ensembles created a new ensemble starting with entryId = 15, but the ledger's lastAddConfirm is 13.

This question is related to two parts, one is how the new ensemble was created and the other is how the lastAddConfirm was generated.

1. How the new ensemble was created

The ensemble change is controlled on the bookie client side.

When one entry is ready to send to the bookie server, the bookie client will check whether need to do the ensemble change.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/PendingAddOp.java

Line 254 in 912896d

lh.maybeHandleDelayedWriteBookieFailure();

For the above case, when writing entry 15, one bookie is lost, it will trigger the ensemble change and generate the new ensemble: 15=[bk1:3181, bk2:3181, bk4:3181]. However, entry 15 write failed, such as timeout or bookie server rejected the write.

For now, entry 14 is written succeed.

2. How the lastAddConfirm was generated

Due to the ledger being in the OPEN state, the ledger handle will send a readLAC request according to the last ensemble to get the ledger's lastAddConfirm.

For the above case, the readLAC request will send to bk1, bk2, and bk4.

For the V2 protocol (Pulsar uses the V2 protocol to interact with the BookKeeper cluster), the bookie client put the lastAddConfirm EntryId in the next Entry's metadata.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java

Line 134 in df44920

buf.writeLong(lastAddConfirmed);

When we use the V2 protocol to open an OPEN state ledger to read, it will send a readLastAddConfirm request to the bookie server, and the bookie server gets the last entry of this ledger and return to the client.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java

Line 108 in df44920

    
           RecoveryData recoveryData = digestManager.verifyDigestAndReturnLastConfirmed(buffer);

However, the bookie client will parse the response entry and get the lastAddConfirm from the entry's metadata. Due to the entry just recording the previous EntryId as the lastAddConfirm, the LedgerHandle got the lastAddConfirm will be the penultimate EntryId of the ledger.

For the above case, the bk1 holds the max entry 14, bk2 holds the max entry 14, and bk4 returns NoSuchEntryException, LedgerHandle gets lastAddConfirm will be 14 - 1 = 13, not 14.

When the replicator tries to recover the first ensemble 0=[bk1:3181, bk2:3181, bk3:3181] with entry range [0, 14], reading entry 14 will throw a ReadEntryException due to the lastAddConfirm is 13.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java

Lines 685 to 690 in df44920

    
           if (lastEntry > lastAddConfirmed) { 
        
               LOG.error("ReadEntries exception on ledgerId:{} firstEntry:{} lastEntry:{} lastAddConfirmed:{}", 
        
                       ledgerId, firstEntry, lastEntry, lastAddConfirmed); 
        
               cb.readComplete(BKException.Code.ReadException, this, null, ctx); 
        
               return; 
        
           }

Solution

When encountered that case that

The ledger is OPEN
The ledger has multiple ensembles
The ledger's last ensemble doesn't have any entries, which means lastAddConfirm < last ensemble key - 1

We should treat the penultimate segment/ensemble of the ledger as an OPEN state instead of a closed state.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java

Lines 56 to 57 in df44920

    
           this.isLedgerClosed = lh.getLedgerMetadata().isClosed() 
        
                   || !ensemble.equals(ensembles.get(ensembles.lastKey()));

After we treat the segment/ensemble as OPEN state, the replicator will close the ledger first and replicate it.

eolivelli

I wonder of we risk to lose one entry in case of AQ < WQ.

I have to think more

bookkeeper-server/src/test/java/org/apache/bookkeeper/client/BookieWriteLedgerTest.java

horizonzy · 2023-04-13T16:51:47Z

How the lastAddConfirm was generated
Due to the ledger being in the OPEN state, the ledger handle will send a readLAC request to get the ledger's lastAddConfirm.
For the above case, if bk1 holds the max entry 14, bk2 holds the max entry 13 and bk3 holds the max entry 14 but it is lost, the LedgerHandle get lastAddConfirm will be 13, not 14.

I have a question about this.

public void checkLedger(final LedgerHandle lh,
                            final GenericCallback<Set<LedgerFragment>> cb,
                            long percentageOfLedgerFragmentToBeVerified)

There are two places to invoke LedgerChecker#checkLedger.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/AuditorCheckAllLedgersTask.java

Lines 196 to 212 in 35e9da9

    
           localAdmin.asyncOpenLedgerNoRecovery(ledgerId, (rc, lh, ctx) -> { 
        
               openLedgerNoRecoverySemaphore.release(); 
        
               if (BKException.Code.OK == rc) { 
        
                   // BookKeeperClientWorker-OrderedExecutor threads should not execute LedgerChecker#checkLedger 
        
                   // as this can lead to deadlocks 
        
                   ledgerCheckerExecutor.execute(() -> { 
        
                       checker.checkLedger(lh, 
        
                               // the ledger handle will be closed after checkLedger is done. 
        
                               new ProcessLostFragmentsCb(lh, callback), 
        
                               conf.getAuditorLedgerVerificationPercentage()); 
        
                       // we collect the following stats to get a measure of the 
        
                       // distribution of a single ledger within the bk cluster 
        
                       // the higher the number of fragments/bookies, the more distributed it is 
        
                       auditorStats.getNumFragmentsPerLedger().registerSuccessfulValue(lh.getNumFragments()); 
        
                       auditorStats.getNumBookiesPerLedger().registerSuccessfulValue(lh.getNumBookies()); 
        
                       auditorStats.getNumLedgersChecked().inc(); 
        
                   });

2.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/ReplicationWorker.java

Lines 449 to 451 in 35e9da9

    
           try (LedgerHandle lh = admin.openLedgerNoRecovery(ledgerIdToReplicate)) { 
        
               Set<LedgerFragment> fragments = getUnderreplicatedFragments(lh, 
        
                       conf.getAuditorLedgerVerificationPercentage());

Both places use the openLedgerNoRecovery to open the LedgerHandle.

It will use lh.asyncReadLastConfirmed() to get the LAC.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerOpenOp.java

Lines 222 to 245 in 35e9da9

    
           lh.asyncReadLastConfirmed(new ReadLastConfirmedCallback() { 
        
               @Override 
        
               public void readLastConfirmedComplete(int rc, 
        
                       long lastConfirmed, Object ctx) { 
        
                   if (rc == BKException.Code.TimeoutException) { 
        
                       closeLedgerHandleAsync().whenComplete((r, ex) -> { 
        
                           if (ex != null) { 
        
                               LOG.error("Ledger {} close failed", ledgerId, ex); 
        
                           } 
        
                           openComplete(bk.getReturnRc(rc), null); 
        
                       }); 
        
                   } else if (rc != BKException.Code.OK) { 
        
                       closeLedgerHandleAsync().whenComplete((r, ex) -> { 
        
                           if (ex != null) { 
        
                               LOG.error("Ledger {} close failed", ledgerId, ex); 
        
                           } 
        
                           openComplete(bk.getReturnRc(BKException.Code.ReadException), null); 
        
                       }); 
        
                   } else { 
        
                       lh.lastAddConfirmed = lh.lastAddPushed = lastConfirmed; 
        
                       openComplete(BKException.Code.OK, lh); 
        
                   } 
        
               } 
        
           }, null);

And we use the V2 protocol, it invokes asyncReadPiggybackLastConfirmed.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java

Lines 1388 to 1395 in 35e9da9

    
           public void asyncReadLastConfirmed(final ReadLastConfirmedCallback cb, final Object ctx) { 
        
               if (clientCtx.getConf().useV2WireProtocol) { 
        
                   // in v2 protocol we don't support readLAC RPC 
        
                   asyncReadPiggybackLastConfirmed(cb, ctx); 
        
               } else { 
        
                   asyncReadExplicitLastConfirmed(cb, ctx); 
        
               } 
        
           }

In asyncReadPiggybackLastConfirmed, it invokes ReadLastConfirmedOp#initiate() to get the LAC.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java

Lines 1422 to 1428 in 35e9da9

    
           new ReadLastConfirmedOp(clientCtx.getBookieClient(), 
        
                                   distributionSchedule, 
        
                                   macManager, 
        
                                   ledgerId, 
        
                                   getCurrentEnsemble(), 
        
                                   ledgerKey, 
        
                                   innercb).initiate();

In the ReadLastConfirmedOp, it will read all bookies in the currentEnsemble. The currentEnsemble is 15=[bk1:3181, bk2:3181, bk4:3181]. Not the {0=[bk1:3181, bk2:3181, bk3:3181].
So it will send readLac RPC to bk1, bk2, and bk4. Then get response from bk1, bk2, and bk4.
When ReadLastConfirmedOp#readEntryComplete, it will compare the response from different bookies, then pick the max lac response to override maxRecoveredData. Then invoke the callback using the maxRecoveredData.

So if the bk1 last entry is 14, the lac is 14-1 = 13

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java

Lines 97 to 154 in 35e9da9

    
           public synchronized void readEntryComplete(final int rc, final long ledgerId, final long entryId, 
        
                   final ByteBuf buffer, final Object ctx) { 
        
               int bookieIndex = (Integer) ctx; 
        
               // add the response to coverage set 
        
               coverageSet.addBookie(bookieIndex, rc); 
        
               numResponsesPending--; 
        
               boolean heardValidResponse = false; 
        
               if (rc == BKException.Code.OK) { 
        
                   try { 
        
                       RecoveryData recoveryData = digestManager.verifyDigestAndReturnLastConfirmed(buffer); 
        
                       if (recoveryData.getLastAddConfirmed() > maxRecoveredData.getLastAddConfirmed()) { 
        
                           maxRecoveredData = recoveryData; 
        
                       } 
        
                       heardValidResponse = true; 
        
                   } catch (BKDigestMatchException e) { 
        
                       // Too bad, this bookie didn't give us a valid answer, we 
        
                       // still might be able to recover though so continue 
        
                       LOG.error("Mac mismatch for ledger: " + ledgerId + ", entry: " + entryId 
        
                                 + " while reading last entry from bookie: " 
        
                                 + currentEnsemble.get(bookieIndex)); 
        
                   } 
        
               } 
        
               if (rc == BKException.Code.NoSuchLedgerExistsException || rc == BKException.Code.NoSuchEntryException) { 
        
                   // this still counts as a valid response, e.g., if the client crashed without writing any entry 
        
                   heardValidResponse = true; 
        
               } 
        
               if (rc == BKException.Code.UnauthorizedAccessException  && !completed) { 
        
                   cb.readLastConfirmedDataComplete(rc, maxRecoveredData); 
        
                   completed = true; 
        
               } 
        
               if (!heardValidResponse && BKException.Code.OK != rc) { 
        
                   lastSeenError = rc; 
        
               } 
        
               // other return codes dont count as valid responses 
        
               if (heardValidResponse 
        
                   && coverageSet.checkCovered() 
        
                   && !completed) { 
        
                   completed = true; 
        
                   if (LOG.isDebugEnabled()) { 
        
                       LOG.debug("Read Complete with enough validResponses for ledger: {}, entry: {}", 
        
                               ledgerId, entryId); 
        
                   } 
        
                   cb.readLastConfirmedDataComplete(BKException.Code.OK, maxRecoveredData); 
        
                   return; 
        
               } 
        
               if (numResponsesPending == 0 && !completed) { 
        
                   LOG.error("While readLastConfirmed ledger: {} did not hear success responses from all quorums, {}", 
        
                             ledgerId, coverageSet); 
        
                   cb.readLastConfirmedDataComplete(lastSeenError, maxRecoveredData); 
        
               }

line_l08-line_112, it will pick the max lac to override maxRecoveredData.

line_137-line_148, use the maxRecoveredData to invoke callback.

hangc0276 · 2023-04-15T04:52:13Z

How the lastAddConfirm was generated
Due to the ledger being in the OPEN state, the ledger handle will send a readLAC request to get the ledger's lastAddConfirm.
For the above case, if bk1 holds the max entry 14, bk2 holds the max entry 13 and bk3 holds the max entry 14 but it is lost, the LedgerHandle get lastAddConfirm will be 13, not 14.

I have a question about this.
public void checkLedger(final LedgerHandle lh,
                            final GenericCallback<Set<LedgerFragment>> cb,
                            long percentageOfLedgerFragmentToBeVerified)
There are two places to invoke LedgerChecker#checkLedger.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/AuditorCheckAllLedgersTask.java

Lines 196 to 212 in 35e9da9

localAdmin.asyncOpenLedgerNoRecovery(ledgerId, (rc, lh, ctx) -> {

openLedgerNoRecoverySemaphore.release();

if (BKException.Code.OK == rc) {

// BookKeeperClientWorker-OrderedExecutor threads should not execute LedgerChecker#checkLedger

// as this can lead to deadlocks

ledgerCheckerExecutor.execute(() -> {

checker.checkLedger(lh,

// the ledger handle will be closed after checkLedger is done.

new ProcessLostFragmentsCb(lh, callback),

conf.getAuditorLedgerVerificationPercentage());

// we collect the following stats to get a measure of the

// distribution of a single ledger within the bk cluster

// the higher the number of fragments/bookies, the more distributed it is

auditorStats.getNumFragmentsPerLedger().registerSuccessfulValue(lh.getNumFragments());

auditorStats.getNumBookiesPerLedger().registerSuccessfulValue(lh.getNumBookies());

auditorStats.getNumLedgersChecked().inc();

});

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/ReplicationWorker.java

Lines 449 to 451 in 35e9da9

try (LedgerHandle lh = admin.openLedgerNoRecovery(ledgerIdToReplicate)) {

Set<LedgerFragment> fragments = getUnderreplicatedFragments(lh,

conf.getAuditorLedgerVerificationPercentage());

Both places use the openLedgerNoRecovery to open the LedgerHandle.

It will use lh.asyncReadLastConfirmed() to get the LAC.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerOpenOp.java

Lines 222 to 245 in 35e9da9

lh.asyncReadLastConfirmed(new ReadLastConfirmedCallback() {

@Override

public void readLastConfirmedComplete(int rc,

long lastConfirmed, Object ctx) {

if (rc == BKException.Code.TimeoutException) {

closeLedgerHandleAsync().whenComplete((r, ex) -> {

if (ex != null) {

LOG.error("Ledger {} close failed", ledgerId, ex);

}

openComplete(bk.getReturnRc(rc), null);

});

} else if (rc != BKException.Code.OK) {

closeLedgerHandleAsync().whenComplete((r, ex) -> {

if (ex != null) {

LOG.error("Ledger {} close failed", ledgerId, ex);

}

openComplete(bk.getReturnRc(BKException.Code.ReadException), null);

});

} else {

lh.lastAddConfirmed = lh.lastAddPushed = lastConfirmed;

openComplete(BKException.Code.OK, lh);

}

}

}, null);

And we use the V2 protocol, it invokes asyncReadPiggybackLastConfirmed.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java

Lines 1388 to 1395 in 35e9da9

public void asyncReadLastConfirmed(final ReadLastConfirmedCallback cb, final Object ctx) {

if (clientCtx.getConf().useV2WireProtocol) {

// in v2 protocol we don't support readLAC RPC

asyncReadPiggybackLastConfirmed(cb, ctx);

} else {

asyncReadExplicitLastConfirmed(cb, ctx);

}

}

In asyncReadPiggybackLastConfirmed, it invokes ReadLastConfirmedOp#initiate() to get the LAC.

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java

Lines 1422 to 1428 in 35e9da9

new ReadLastConfirmedOp(clientCtx.getBookieClient(),

distributionSchedule,

macManager,

ledgerId,

getCurrentEnsemble(),

ledgerKey,

innercb).initiate();

In the ReadLastConfirmedOp, it will read all bookies in the currentEnsemble. The currentEnsemble is 15=[bk1:3181, bk2:3181, bk4:3181]. Not the {0=[bk1:3181, bk2:3181, bk3:3181]. So it will send readLac RPC to bk1, bk2, and bk4. Then get response from bk1, bk2, and bk4. When ReadLastConfirmedOp#readEntryComplete, it will compare the response from different bookies, then pick the max lac response to override maxRecoveredData. Then invoke the callback using the maxRecoveredData.

So if the bk1 last entry is 14, the lac is 14-1 = 13

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java

Lines 97 to 154 in 35e9da9

public synchronized void readEntryComplete(final int rc, final long ledgerId, final long entryId,

final ByteBuf buffer, final Object ctx) {

int bookieIndex = (Integer) ctx;

// add the response to coverage set

coverageSet.addBookie(bookieIndex, rc);

numResponsesPending--;

boolean heardValidResponse = false;

if (rc == BKException.Code.OK) {

try {

RecoveryData recoveryData = digestManager.verifyDigestAndReturnLastConfirmed(buffer);

if (recoveryData.getLastAddConfirmed() > maxRecoveredData.getLastAddConfirmed()) {

maxRecoveredData = recoveryData;

}

heardValidResponse = true;

} catch (BKDigestMatchException e) {

// Too bad, this bookie didn't give us a valid answer, we

// still might be able to recover though so continue

LOG.error("Mac mismatch for ledger: " + ledgerId + ", entry: " + entryId

+ " while reading last entry from bookie: "

+ currentEnsemble.get(bookieIndex));

}

}

if (rc == BKException.Code.NoSuchLedgerExistsException || rc == BKException.Code.NoSuchEntryException) {

// this still counts as a valid response, e.g., if the client crashed without writing any entry

heardValidResponse = true;

}

if (rc == BKException.Code.UnauthorizedAccessException && !completed) {

cb.readLastConfirmedDataComplete(rc, maxRecoveredData);

completed = true;

}

if (!heardValidResponse && BKException.Code.OK != rc) {

lastSeenError = rc;

}

// other return codes dont count as valid responses

if (heardValidResponse

&& coverageSet.checkCovered()

&& !completed) {

completed = true;

if (LOG.isDebugEnabled()) {

LOG.debug("Read Complete with enough validResponses for ledger: {}, entry: {}",

ledgerId, entryId);

}

cb.readLastConfirmedDataComplete(BKException.Code.OK, maxRecoveredData);

return;

}

if (numResponsesPending == 0 && !completed) {

LOG.error("While readLastConfirmed ledger: {} did not hear success responses from all quorums, {}",

ledgerId, coverageSet);

cb.readLastConfirmedDataComplete(lastSeenError, maxRecoveredData);

}

line_l08-line_112, it will pick the max lac to override maxRecoveredData.

line_137-line_148, use the maxRecoveredData to invoke callback.

@horizonzy Yes, you are right. I updated the description, please help take a look, thanks.

hangc0276 · 2023-04-15T04:53:53Z

I wonder of we risk to lose one entry in case of AQ < WQ.

I have to think more

@eolivelli Sorry, I misunderstood the lastAddConfirm part. It doesn't have the risk of losing one entry. I updated the description, please help take a look, thanks.

horizonzy · 2023-04-16T16:24:11Z

bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java

        this.isLedgerClosed = lh.getLedgerMetadata().isClosed()
-                || !ensemble.equals(ensembles.get(ensembles.lastKey()));
+                || (!ensemble.equals(ensembles.get(ensembles.lastKey()))
+            && lh.getLastAddConfirmed() >= ensembles.lastKey() - 1);


Here should be `lh.getLastAddConfirmed() == ensembles.lastKey() - 2.
The lac maybe 100. The last ensemble key maybe 10.

Maybe not. Take the following case for example:
0: [bk1, bk2, bk3]
10: [bk1, bk2, bk4]
LAC = 100
Ledger is OPEN

For the first ensemble: isLedgerClosed = (false || (true && 100 >= 9)) => true
For the last ensemble: isLedgerClosed = (false || (false && 100 >= 9)) => false
The result is expected.

Yes, you are right.

horizonzy · 2023-04-17T04:05:21Z

bookkeeper-server/src/test/java/org/apache/bookkeeper/replication/TestReplicationWorker.java

+        LedgerHandle lh = bkc.createLedger(3, 3, 2, BookKeeper.DigestType.CRC32, TESTPASSWD);
+        assertFalse(lh.getLedgerMetadata().isClosed());
+
+        List<BookieId> firstEnsemble = lh.getLedgerMetadata().getAllEnsembles().firstEntry().getValue();


firstEnsemble is unused

horizonzy · 2023-04-17T04:06:33Z

bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/ReplicationWorker.java

+     * segment/ensemble has missing bookies.
     */
    private boolean isLastSegmentOpenAndMissingBookies(LedgerHandle lh) throws BKException {
        LedgerMetadata md = admin.getLedgerMetadata(lh);


md is unused.

horizonzy · 2023-04-17T04:07:49Z

bookkeeper-server/src/main/java/org/apache/bookkeeper/replication/ReplicationWorker.java

        SortedMap<Long, ? extends List<BookieId>> ensembles = admin.getLedgerMetadata(lh).getAllEnsembles();
        List<BookieId> finalEnsemble = ensembles.get(ensembles.lastKey());
+        if (ensembles.size() > 1 && lh.getLastAddConfirmed() < ensembles.lastKey() - 1) {
+            finalEnsemble = new ArrayList<>(finalEnsemble);


finalEnsemble = new ArrayList<>(finalEnsemble); may be meaningless.

The finalEnsemble is immutable in the ledger metadata.

Oh, get it.

hangc0276 · 2023-04-23T02:01:14Z

@merlimat @eolivelli @dlg99 Please help take a look, thanks.

eolivelli

+1 LGTM

wenbingshen

LGTM

### Motivation When I decommission one bookie (bk3), one ledger replicate failed and blocked decommission process. This is the auto-recovery log: ``` 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.client.LedgerHandle - ReadEntries exception on ledgerId:904368 firstEntry:14 lastEntry:14 lastAddConfirmed:13 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Received error: -1 while trying to read entry: 14 of ledger: 904368 in ReplicationWorker 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Failed to read faulty entries, so giving up replicating ledgerFragment Fragment(LedgerID: 904368, FirstEntryID: 0[0], LastKnownEntryID: 14[14], Host: [betausc1-bk-10.betausc1-bk-headless.o-vaxkx.svc.cluster.local:3181], Closed: true) 2023-03-29T06:29:22,644+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - ReplicationWorker failed to replicate Ledger : 904368 for 6 number of times, so deferring the ledger lock release by 300000 msecs ``` The ledger's metadata: ``` ledgerID: 904368 2023-03-29T06:47:56,511+0000 [main] INFO org.apache.bookkeeper.tools.cli.commands. client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=3, writeQuorumSize=3, ackQuorumSize=2, state=OPEN, digestType=CRC32C, password=base64:, ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]},...} ``` The ledger (904368) has two ensembles, `ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]}`. However, the replication worker got the ledger's LAC is 13, but it got the replication fragment entry range is [0, 14]. When reading entry 14, it failed. ### One question **Why the ensembles created a new ensemble starting with entryId = 15, but the ledger's lastAddConfirm is 13.** This question is related to two parts, one is how the new ensemble was created and the other is how the lastAddConfirm was generated. #### 1. How the new ensemble was created The ensemble change is controlled on the bookie client side. When one entry is ready to send to the bookie server, the bookie client will check whether need to do the ensemble change. https://github.com/apache/bookkeeper/blob/912896deb2e748389e15e74c37539b2ff36302c7/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/PendingAddOp.java#L254 For the above case, when writing entry 15, one bookie is lost, it will trigger the ensemble change and generate the new ensemble: 15=[bk1:3181, bk2:3181, bk4:3181]. However, entry 15 write failed, such as timeout or bookie server rejected the write. For now, entry 14 is written succeed. #### 2. How the lastAddConfirm was generated Due to the ledger being in the `OPEN` state, the ledger handle will send a readLAC request according to the last ensemble to get the ledger's lastAddConfirm. For the above case, the readLAC request will send to bk1, bk2, and bk4. For the `V2` protocol (Pulsar uses the V2 protocol to interact with the BookKeeper cluster), the bookie client put the lastAddConfirm EntryId in the next Entry's metadata. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java#L134 When we use the `V2` protocol to open an `OPEN` state ledger to read, it will send a readLastAddConfirm request to the bookie server, and the bookie server gets the last entry of this ledger and return to the client. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java#L108 However, the bookie client will parse the response entry and get the lastAddConfirm from the entry's metadata. Due to the entry just recording the previous EntryId as the lastAddConfirm, the LedgerHandle got the lastAddConfirm will be the penultimate EntryId of the ledger. For the above case, the bk1 holds the max entry 14, bk2 holds the max entry 14, and bk4 returns NoSuchEntryException, LedgerHandle gets lastAddConfirm will be `14 - 1 = 13`, not 14. When the replicator tries to recover the first ensemble 0=[bk1:3181, bk2:3181, bk3:3181] with entry range [0, 14], reading entry 14 will throw a ReadEntryException due to the lastAddConfirm is 13. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java#L685-L690 ### Solution When encountered that case that - The ledger is `OPEN` - The ledger has multiple ensembles - The ledger's last ensemble doesn't have any entries, which means `lastAddConfirm < last ensemble key - 1` We should treat the penultimate segment/ensemble of the ledger as an `OPEN` state instead of a closed state. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java#L56-L57 After we treat the segment/ensemble as `OPEN` state, the replicator will close the ledger first and replicate it. (cherry picked from commit eff38e4)

…he#3917) When I decommission one bookie (bk3), one ledger replicate failed and blocked decommission process. This is the auto-recovery log: ``` 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.client.LedgerHandle - ReadEntries exception on ledgerId:904368 firstEntry:14 lastEntry:14 lastAddConfirmed:13 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Received error: -1 while trying to read entry: 14 of ledger: 904368 in ReplicationWorker 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Failed to read faulty entries, so giving up replicating ledgerFragment Fragment(LedgerID: 904368, FirstEntryID: 0[0], LastKnownEntryID: 14[14], Host: [betausc1-bk-10.betausc1-bk-headless.o-vaxkx.svc.cluster.local:3181], Closed: true) 2023-03-29T06:29:22,644+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - ReplicationWorker failed to replicate Ledger : 904368 for 6 number of times, so deferring the ledger lock release by 300000 msecs ``` The ledger's metadata: ``` ledgerID: 904368 2023-03-29T06:47:56,511+0000 [main] INFO org.apache.bookkeeper.tools.cli.commands. client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=3, writeQuorumSize=3, ackQuorumSize=2, state=OPEN, digestType=CRC32C, password=base64:, ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]},...} ``` The ledger (904368) has two ensembles, `ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]}`. However, the replication worker got the ledger's LAC is 13, but it got the replication fragment entry range is [0, 14]. When reading entry 14, it failed. **Why the ensembles created a new ensemble starting with entryId = 15, but the ledger's lastAddConfirm is 13.** This question is related to two parts, one is how the new ensemble was created and the other is how the lastAddConfirm was generated. The ensemble change is controlled on the bookie client side. When one entry is ready to send to the bookie server, the bookie client will check whether need to do the ensemble change. https://github.com/apache/bookkeeper/blob/912896deb2e748389e15e74c37539b2ff36302c7/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/PendingAddOp.java#L254 For the above case, when writing entry 15, one bookie is lost, it will trigger the ensemble change and generate the new ensemble: 15=[bk1:3181, bk2:3181, bk4:3181]. However, entry 15 write failed, such as timeout or bookie server rejected the write. For now, entry 14 is written succeed. Due to the ledger being in the `OPEN` state, the ledger handle will send a readLAC request according to the last ensemble to get the ledger's lastAddConfirm. For the above case, the readLAC request will send to bk1, bk2, and bk4. For the `V2` protocol (Pulsar uses the V2 protocol to interact with the BookKeeper cluster), the bookie client put the lastAddConfirm EntryId in the next Entry's metadata. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java#L134 When we use the `V2` protocol to open an `OPEN` state ledger to read, it will send a readLastAddConfirm request to the bookie server, and the bookie server gets the last entry of this ledger and return to the client. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java#L108 However, the bookie client will parse the response entry and get the lastAddConfirm from the entry's metadata. Due to the entry just recording the previous EntryId as the lastAddConfirm, the LedgerHandle got the lastAddConfirm will be the penultimate EntryId of the ledger. For the above case, the bk1 holds the max entry 14, bk2 holds the max entry 14, and bk4 returns NoSuchEntryException, LedgerHandle gets lastAddConfirm will be `14 - 1 = 13`, not 14. When the replicator tries to recover the first ensemble 0=[bk1:3181, bk2:3181, bk3:3181] with entry range [0, 14], reading entry 14 will throw a ReadEntryException due to the lastAddConfirm is 13. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java#L685-L690 When encountered that case that - The ledger is `OPEN` - The ledger has multiple ensembles - The ledger's last ensemble doesn't have any entries, which means `lastAddConfirm < last ensemble key - 1` We should treat the penultimate segment/ensemble of the ledger as an `OPEN` state instead of a closed state. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java#L56-L57 After we treat the segment/ensemble as `OPEN` state, the replicator will close the ledger first and replicate it. (cherry picked from commit eff38e4)

### Motivation When I decommission one bookie (bk3), one ledger replicate failed and blocked decommission process. This is the auto-recovery log: ``` 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.client.LedgerHandle - ReadEntries exception on ledgerId:904368 firstEntry:14 lastEntry:14 lastAddConfirmed:13 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Received error: -1 while trying to read entry: 14 of ledger: 904368 in ReplicationWorker 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Failed to read faulty entries, so giving up replicating ledgerFragment Fragment(LedgerID: 904368, FirstEntryID: 0[0], LastKnownEntryID: 14[14], Host: [betausc1-bk-10.betausc1-bk-headless.o-vaxkx.svc.cluster.local:3181], Closed: true) 2023-03-29T06:29:22,644+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - ReplicationWorker failed to replicate Ledger : 904368 for 6 number of times, so deferring the ledger lock release by 300000 msecs ``` The ledger's metadata: ``` ledgerID: 904368 2023-03-29T06:47:56,511+0000 [main] INFO org.apache.bookkeeper.tools.cli.commands. client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=3, writeQuorumSize=3, ackQuorumSize=2, state=OPEN, digestType=CRC32C, password=base64:, ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]},...} ``` The ledger (904368) has two ensembles, `ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]}`. However, the replication worker got the ledger's LAC is 13, but it got the replication fragment entry range is [0, 14]. When reading entry 14, it failed. ### One question **Why the ensembles created a new ensemble starting with entryId = 15, but the ledger's lastAddConfirm is 13.** This question is related to two parts, one is how the new ensemble was created and the other is how the lastAddConfirm was generated. #### 1. How the new ensemble was created The ensemble change is controlled on the bookie client side. When one entry is ready to send to the bookie server, the bookie client will check whether need to do the ensemble change. https://github.com/apache/bookkeeper/blob/912896deb2e748389e15e74c37539b2ff36302c7/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/PendingAddOp.java#L254 For the above case, when writing entry 15, one bookie is lost, it will trigger the ensemble change and generate the new ensemble: 15=[bk1:3181, bk2:3181, bk4:3181]. However, entry 15 write failed, such as timeout or bookie server rejected the write. For now, entry 14 is written succeed. #### 2. How the lastAddConfirm was generated Due to the ledger being in the `OPEN` state, the ledger handle will send a readLAC request according to the last ensemble to get the ledger's lastAddConfirm. For the above case, the readLAC request will send to bk1, bk2, and bk4. For the `V2` protocol (Pulsar uses the V2 protocol to interact with the BookKeeper cluster), the bookie client put the lastAddConfirm EntryId in the next Entry's metadata. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java#L134 When we use the `V2` protocol to open an `OPEN` state ledger to read, it will send a readLastAddConfirm request to the bookie server, and the bookie server gets the last entry of this ledger and return to the client. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java#L108 However, the bookie client will parse the response entry and get the lastAddConfirm from the entry's metadata. Due to the entry just recording the previous EntryId as the lastAddConfirm, the LedgerHandle got the lastAddConfirm will be the penultimate EntryId of the ledger. For the above case, the bk1 holds the max entry 14, bk2 holds the max entry 14, and bk4 returns NoSuchEntryException, LedgerHandle gets lastAddConfirm will be `14 - 1 = 13`, not 14. When the replicator tries to recover the first ensemble 0=[bk1:3181, bk2:3181, bk3:3181] with entry range [0, 14], reading entry 14 will throw a ReadEntryException due to the lastAddConfirm is 13. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java#L685-L690 ### Solution When encountered that case that - The ledger is `OPEN` - The ledger has multiple ensembles - The ledger's last ensemble doesn't have any entries, which means `lastAddConfirm < last ensemble key - 1` We should treat the penultimate segment/ensemble of the ledger as an `OPEN` state instead of a closed state. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java#L56-L57 After we treat the segment/ensemble as `OPEN` state, the replicator will close the ledger first and replicate it. (cherry picked from commit eff38e4)

…he#3917) ### Motivation When I decommission one bookie (bk3), one ledger replicate failed and blocked decommission process. This is the auto-recovery log: ``` 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.client.LedgerHandle - ReadEntries exception on ledgerId:904368 firstEntry:14 lastEntry:14 lastAddConfirmed:13 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Received error: -1 while trying to read entry: 14 of ledger: 904368 in ReplicationWorker 2023-03-29T06:29:22,642+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - Failed to read faulty entries, so giving up replicating ledgerFragment Fragment(LedgerID: 904368, FirstEntryID: 0[0], LastKnownEntryID: 14[14], Host: [betausc1-bk-10.betausc1-bk-headless.o-vaxkx.svc.cluster.local:3181], Closed: true) 2023-03-29T06:29:22,644+0000 [ReplicationWorker] ERROR org.apache.bookkeeper.replication.ReplicationWorker - ReplicationWorker failed to replicate Ledger : 904368 for 6 number of times, so deferring the ledger lock release by 300000 msecs ``` The ledger's metadata: ``` ledgerID: 904368 2023-03-29T06:47:56,511+0000 [main] INFO org.apache.bookkeeper.tools.cli.commands. client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=3, writeQuorumSize=3, ackQuorumSize=2, state=OPEN, digestType=CRC32C, password=base64:, ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]},...} ``` The ledger (904368) has two ensembles, `ensembles={0=[bk1:3181, bk2:3181, bk3:3181], 15=[bk1:3181, bk2:3181, bk4:3181]}`. However, the replication worker got the ledger's LAC is 13, but it got the replication fragment entry range is [0, 14]. When reading entry 14, it failed. ### One question **Why the ensembles created a new ensemble starting with entryId = 15, but the ledger's lastAddConfirm is 13.** This question is related to two parts, one is how the new ensemble was created and the other is how the lastAddConfirm was generated. #### 1. How the new ensemble was created The ensemble change is controlled on the bookie client side. When one entry is ready to send to the bookie server, the bookie client will check whether need to do the ensemble change. https://github.com/apache/bookkeeper/blob/912896deb2e748389e15e74c37539b2ff36302c7/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/PendingAddOp.java#L254 For the above case, when writing entry 15, one bookie is lost, it will trigger the ensemble change and generate the new ensemble: 15=[bk1:3181, bk2:3181, bk4:3181]. However, entry 15 write failed, such as timeout or bookie server rejected the write. For now, entry 14 is written succeed. #### 2. How the lastAddConfirm was generated Due to the ledger being in the `OPEN` state, the ledger handle will send a readLAC request according to the last ensemble to get the ledger's lastAddConfirm. For the above case, the readLAC request will send to bk1, bk2, and bk4. For the `V2` protocol (Pulsar uses the V2 protocol to interact with the BookKeeper cluster), the bookie client put the lastAddConfirm EntryId in the next Entry's metadata. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java#L134 When we use the `V2` protocol to open an `OPEN` state ledger to read, it will send a readLastAddConfirm request to the bookie server, and the bookie server gets the last entry of this ledger and return to the client. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/ReadLastConfirmedOp.java#L108 However, the bookie client will parse the response entry and get the lastAddConfirm from the entry's metadata. Due to the entry just recording the previous EntryId as the lastAddConfirm, the LedgerHandle got the lastAddConfirm will be the penultimate EntryId of the ledger. For the above case, the bk1 holds the max entry 14, bk2 holds the max entry 14, and bk4 returns NoSuchEntryException, LedgerHandle gets lastAddConfirm will be `14 - 1 = 13`, not 14. When the replicator tries to recover the first ensemble 0=[bk1:3181, bk2:3181, bk3:3181] with entry range [0, 14], reading entry 14 will throw a ReadEntryException due to the lastAddConfirm is 13. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerHandle.java#L685-L690 ### Solution When encountered that case that - The ledger is `OPEN` - The ledger has multiple ensembles - The ledger's last ensemble doesn't have any entries, which means `lastAddConfirm < last ensemble key - 1` We should treat the penultimate segment/ensemble of the ledger as an `OPEN` state instead of a closed state. https://github.com/apache/bookkeeper/blob/df4492012cc03682534cbc8dd68dd81163b0c947/bookkeeper-server/src/main/java/org/apache/bookkeeper/client/LedgerFragment.java#L56-L57 After we treat the segment/ensemble as `OPEN` state, the replicator will close the ledger first and replicate it.

hangc0276 added 2 commits April 13, 2023 19:01

Fix ledger replicated failed blocks bookie decommission process

4c82417

format code

47ce4d3

hangc0276 requested review from StevenLuMT, dlg99, eolivelli, merlimat, rdhabalia and zymap April 13, 2023 11:53

hangc0276 self-assigned this Apr 13, 2023

hangc0276 added area/autorecovery release/4.15.0 release/4.14.8 release/4.16.1 type/bug labels Apr 13, 2023

hangc0276 added this to the 4.17.0 milestone Apr 13, 2023

eolivelli reviewed Apr 13, 2023

View reviewed changes

bookkeeper-server/src/test/java/org/apache/bookkeeper/client/BookieWriteLedgerTest.java Show resolved Hide resolved

address comments

4b8c980

hangc0276 added release/4.15.5 release/4.16.2 and removed release/4.16.1 release/4.15.0 labels Apr 14, 2023

hangc0276 added 2 commits April 14, 2023 22:40

fix a bug

c41fcfc

Fix this bug in another way

b91266f

horizonzy reviewed Apr 16, 2023

View reviewed changes

horizonzy reviewed Apr 17, 2023

View reviewed changes

eolivelli approved these changes May 15, 2023

View reviewed changes

wenbingshen approved these changes May 19, 2023

View reviewed changes

zymap merged commit eff38e4 into apache:master Jun 19, 2023

hangc0276 added cherry-picked/branch-4.14 cherry-picked/branch-4.16 labels Jul 14, 2023

zymap added the cherry-picked/branch-4.15 label Dec 6, 2023

	if (lastEntry > lastAddConfirmed) {
	LOG.error("ReadEntries exception on ledgerId:{} firstEntry:{} lastEntry:{} lastAddConfirmed:{}",
	ledgerId, firstEntry, lastEntry, lastAddConfirmed);
	cb.readComplete(BKException.Code.ReadException, this, null, ctx);
	return;
	}

	this.isLedgerClosed = lh.getLedgerMetadata().isClosed()
	\|\| !ensemble.equals(ensembles.get(ensembles.lastKey()));

Conversation

hangc0276 commented Apr 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

One question

1. How the new ensemble was created

2. How the lastAddConfirm was generated

Solution

Uh oh!

eolivelli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

horizonzy commented Apr 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hangc0276 commented Apr 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hangc0276 commented Apr 15, 2023

Uh oh!

horizonzy Apr 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hangc0276 Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

horizonzy Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

horizonzy Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

horizonzy Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

horizonzy Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

hangc0276 Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

horizonzy Apr 17, 2023

Choose a reason for hiding this comment

Uh oh!

hangc0276 commented Apr 23, 2023

Uh oh!

eolivelli left a comment

Choose a reason for hiding this comment

Uh oh!

wenbingshen left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Comments

hangc0276 commented Apr 13, 2023 •

edited

Loading

horizonzy commented Apr 13, 2023 •

edited

Loading

hangc0276 commented Apr 15, 2023 •

edited

Loading

horizonzy Apr 16, 2023 •

edited

Loading