bug(rdb save): snapshot: on push data to channel check serializer len #532

adiholden · 2022-12-05T10:04:36Z

No description provided.

Signed-off-by: adi_holden <adi@dragonflydb.io>

adiholden · 2022-12-05T10:07:49Z

@dranikpg Now the sink in snapshot is just a temporary sink before pushing to channel, serializer holds all the data till calling FlushDefaultBuffer, therefore we compare the default_serializer_->SerializedLen() .
This was a change in my PR, which I got lost when you fetched my changes.

dranikpg · 2022-12-05T10:12:20Z

src/server/snapshot.cc

-    if (auto comp = zstd_serializer_->Compress(payload); comp) {
+    if (auto comp = zstd_serializer_->Compress(payload)) {


I made this on purpose, see #508

Not sure what side you will join 🙂

In this case, with declaring a new variable, it probably doesn't matter

Ok I will undo this change

dranikpg · 2022-12-05T10:23:28Z

src/server/snapshot.h

+
+  // TODO : drop default_buffer from this class, we dont realy need it.
  std::unique_ptr<io::StringFile> default_buffer_;  // filled by default_serializer_


That's true, because currently we always move out of the default buffer

However, in an optimized version, we shouldn't do this and should keep the default buffer.

With it, we can make use of the following:

If we do compression, then we already copy the compressed part into a new string, so there is no reason the steal the default buffer and then drop it

We just use it do to copy over data from the rdb_serializer, so why should we allocate a new buffer each time for this if we end up with case 1

Actually, instead of using FlushToSink at all, we should be able to get the io::Bytes/string_view from the serializer directly, so that we can do compression without an intermediate copy

But I guess that's another topic, off from the current discussion

I am going to move the compression under RdbSerilaizer, so the flow will change, you can see the PR created already.
I agree on the the optimization for not allocating new buffer for each copy, I believe the flow will change more till I get to this TODO but in the current flow I believe we can call it tmp_buffer_ and use it in the temporary Serializer as will, right?
Anyway I am not going to change this now

Yep, I just looked at this first, the other PR solves this issue

dranikpg · 2022-12-05T10:23:54Z

src/server/snapshot.cc

 bool SliceSnapshot::FlushDefaultBuffer(bool force) {
-  if (!force && default_buffer_->val.size() < 4096)
+  if (!force && default_serializer_->SerializedLen() < 4096)


Signed-off-by: adi_holden <adi@dragonflydb.io>

bug(rdb save): snapshot: on push data to channel check serializer len

e8504b0

Signed-off-by: adi_holden <adi@dragonflydb.io>

adiholden requested a review from dranikpg December 5, 2022 10:04

dranikpg reviewed Dec 5, 2022

View reviewed changes

dranikpg previously approved these changes Dec 5, 2022

View reviewed changes

bug(rdb save): undo change

9c419a4

Signed-off-by: adi_holden <adi@dragonflydb.io>

adiholden dismissed dranikpg’s stale review via 9c419a4 December 5, 2022 10:45

adiholden requested a review from dranikpg December 5, 2022 10:45

dranikpg approved these changes Dec 5, 2022

View reviewed changes

adiholden merged commit e803432 into main Dec 5, 2022

romange deleted the fix_flush_default_buffer branch December 27, 2022 16:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bug(rdb save): snapshot: on push data to channel check serializer len #532

bug(rdb save): snapshot: on push data to channel check serializer len #532

Uh oh!

adiholden commented Dec 5, 2022

Uh oh!

adiholden commented Dec 5, 2022

Uh oh!

dranikpg Dec 5, 2022

Uh oh!

adiholden Dec 5, 2022

Uh oh!

dranikpg Dec 5, 2022 •

edited

Loading

Uh oh!

dranikpg Dec 5, 2022

Uh oh!

adiholden Dec 5, 2022

Uh oh!

dranikpg Dec 5, 2022

Uh oh!

dranikpg Dec 5, 2022

Uh oh!

Uh oh!

		if (auto comp = zstd_serializer_->Compress(payload); comp) {
		if (auto comp = zstd_serializer_->Compress(payload)) {


		// TODO : drop default_buffer from this class, we dont realy need it.
		std::unique_ptr<io::StringFile> default_buffer_; // filled by default_serializer_

bug(rdb save): snapshot: on push data to channel check serializer len #532

bug(rdb save): snapshot: on push data to channel check serializer len #532

Uh oh!

Conversation

adiholden commented Dec 5, 2022

Uh oh!

adiholden commented Dec 5, 2022

Uh oh!

dranikpg Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

adiholden Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

dranikpg Dec 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dranikpg Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

adiholden Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

dranikpg Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

dranikpg Dec 5, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dranikpg Dec 5, 2022 •

edited

Loading