
sync_flush should wait for is_completed, not is_active. #461

Closed
wants to merge 4 commits into from

Conversation

xiaoxichen
Collaborator

A log entry is added into the StreamTracker in write_async, through m_records.create(). Once created, its status is active.

Later, in HomeLogStore::on_write_completion, the tracker is updated, and in m_records.update, is_completed is set to true.

We should wait for is_completed == true to indicate that a log entry has been persisted.

@xiaoxichen
Collaborator Author

I don't know how much of this code will be removed by removing async. @JacksonYao287

@@ -405,7 +408,7 @@ void HomeLogStore::flush_sync(logstore_seq_num_t upto_seq_num) {
     if (upto_seq_num == invalid_lsn()) { upto_seq_num = m_records.active_upto(); }

     // if we have flushed already, we are done
-    if (!m_records.status(upto_seq_num).is_active) { return; }
+    if (m_records.status(upto_seq_num).is_completed) { return; }
Contributor

This status is a tri-state check.

  1. Inactive: We have not received this lsn at all
  2. Active: Received LSN, yet to flush
  3. Completed: Completed flush.

So we only need to flush if it is active. This change would flush when it is inactive as well, so I don't think we should make this change.

Collaborator Author

I get the point that we only need to flush active entries; that makes sense.
However, I am not seeing how is_active == false implies the log has been flushed. Do we reset the bit in m_active_slot_bits when doing the update? I didn't find that part of the code in SISL.

@@ -416,13 +419,13 @@ void HomeLogStore::flush_sync(logstore_seq_num_t upto_seq_num) {

     // Step 2: After marking this lsn, we again do a check, to avoid a race where completion checked for no lsn
     // and the lsn is stored in step 1 above.
-    if (!m_records.status(upto_seq_num).is_active) { return; }
+    if (m_records.status(upto_seq_num).is_completed) { return; }
Contributor

Same as above.


     // Step 3: Force a flush (with least threshold)
     m_logdev->flush_if_needed(1);

     // Step 4: Wait for completion
-    m_sync_flush_cv.wait(lk, [this, upto_seq_num] { return !m_records.status(upto_seq_num).is_active; });
+    m_sync_flush_cv.wait_for(lk, std::chrono::milliseconds(10), [this, upto_seq_num] { return m_records.status(upto_seq_num).is_completed; });
Contributor

I am assuming this is temporary, because the caller expects the data to be persisted by the end of it. On timeout, should we either return an error or assert? In any case, after the new flush changes we probably won't need this cv, so at that point it is moot.

Collaborator Author

Yes, this is temporary, to work around the race we had seen. It is not correct in general, but it ensures we don't get stuck on this CV indefinitely due to a concurrency issue.

@xiaoxichen xiaoxichen force-pushed the fix_log branch 2 times, most recently from 542fb2b to cab0a0f Compare July 15, 2024 08:15
The previous code would add `ndevices` simulated drives into the device list,
which made each replica run with one real drive plus `ndevices` simulated
drives.

Those simulated drives are identified as FAST, and meta/log were placed on them.
Due to the very limited size of the simulated drives, we can hit the size limit
in long-running tests.

Fix by honoring the input from hs_repl_test_common.

After this fix, if only one drive is passed in for a replica of
test_raft_repl_dev, that drive will be used as FAST. All services
will be started on that real drive.

Signed-off-by: Xiaoxi Chen <[email protected]>
@xiaoxichen xiaoxichen closed this Jul 17, 2024
@xiaoxichen xiaoxichen mentioned this pull request Jul 23, 2024