
[Enhancement]: improve ProtoBuf marshal and unmarshal with vtprotobuf #38847

Open

jaime0815 opened this issue Dec 30, 2024 · 4 comments

Labels: kind/enhancement (Issues or changes related to enhancement)

jaime0815 (Contributor) commented Dec 30, 2024
Is there an existing issue for this?

  • I have searched the existing issues

What would you like to be added?

See https://github.com/planetscale/vtprotobuf for details.
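For context, vtprotobuf is an extra protoc plugin that generates `MarshalVT`/`UnmarshalVT` fast paths alongside the standard Go bindings. A typical invocation looks roughly like the following (the `.proto` path is illustrative; see the vtprotobuf README for the authoritative flags):

```shell
# Install the code generator (illustrative; check the vtprotobuf README
# for the recommended version pinning).
go install github.com/planetscale/vtprotobuf/cmd/protoc-gen-go-vtproto@latest

# Generate the standard Go code plus the vtprotobuf fast paths.
# The pool feature mentioned below needs additional per-message options.
protoc \
  --go_out=. \
  --go-vtproto_out=. \
  --go-vtproto_opt=features=marshal+unmarshal+size \
  protos/insert_request.proto
```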

This benchmark for InsertRequest shows that UnmarshalVT is roughly 2.7x faster than the standard proto.Unmarshal (3.30 ms vs 8.88 ms per op), while MarshalVT is about 65% slower than proto.Marshal (2.87 ms vs 1.74 ms per op).

goos: darwin
goarch: amd64
pkg: vtprotobuf-bench
cpu: Intel(R) Core(TM) i7-1068NG7 CPU @ 2.30GHz
BenchmarkInsertRequest
BenchmarkInsertRequest/Marshal
BenchmarkInsertRequest/Marshal-8         	     703	   1739091 ns/op	   3078891 bytes
BenchmarkInsertRequest/MarshalVT
BenchmarkInsertRequest/MarshalVT-8       	     450	   2874546 ns/op	   3078891 bytes
BenchmarkInsertRequest/Unmarshal
BenchmarkInsertRequest/Unmarshal-8       	     180	   8882166 ns/op	   3078891 bytes
BenchmarkInsertRequest/UnmarshalVT
BenchmarkInsertRequest/UnmarshalVT-8     	     384	   3296314 ns/op	   3078891 bytes

Another key advantage is reduced CPU usage: in profiling, unmarshalling used roughly half the CPU time of the original and marshalling about 25% less. Using vtprotobuf's memory-pool feature would reduce CPU usage further.

[CPU profiling screenshots attached in the original issue]
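The memory-pool saving mentioned above can be sketched with a plain sync.Pool. Note that vtprotobuf generates pooling helpers for you when its pool feature is enabled; the type, field, and helper names below are illustrative, not vtprotobuf's actual generated API.

```go
package main

import (
	"fmt"
	"sync"
)

// InsertRequest stands in for the generated protobuf message; with
// vtprotobuf's pool feature enabled, equivalent helpers are generated
// for you (names and fields here are illustrative).
type InsertRequest struct {
	DbName   string
	HashKeys []uint32
}

var insertRequestPool = sync.Pool{
	New: func() any { return new(InsertRequest) },
}

// InsertRequestFromPool borrows a reset message from the pool.
func InsertRequestFromPool() *InsertRequest {
	return insertRequestPool.Get().(*InsertRequest)
}

// ReturnToPool clears the message while keeping slice capacity, then
// hands it back so the next unmarshal can reuse the allocation instead
// of allocating fresh backing arrays.
func (m *InsertRequest) ReturnToPool() {
	m.DbName = ""
	m.HashKeys = m.HashKeys[:0]
	insertRequestPool.Put(m)
}

func main() {
	req := InsertRequestFromPool()
	req.DbName = "db1"
	req.HashKeys = append(req.HashKeys, 1, 2, 3)
	fmt.Println(req.DbName, len(req.HashKeys))
	req.ReturnToPool() // slice capacity is reused on a later Get
}
```

The reset-before-Put step is what keeps pooled messages safe to reuse; the capacity retained in HashKeys is where the allocation saving comes from.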

Benchmark code snippet:

func getInsertReq() *protos.InsertRequest {
	return &protos.InsertRequest{
		DbName:         "db1",
		CollectionName: "col1",
		FieldsData:     []*protos.FieldData{NewFloatVectorFieldData("f1", 1000, 768)},
		HashKeys:       GenerateHashKeys(1000),
		NumRows:        uint32(1000),
	}
}

func BenchmarkInsertRequest(b *testing.B) {
	req := getInsertReq()
	bs, err := proto.Marshal(req)
	if err != nil {
		b.Fatal(err)
	}
	b.Run("Marshal", func(b *testing.B) {
		total := 0
		for i := 0; i < b.N; i++ {
			bs, err := proto.Marshal(req)
			total += len(bs)
			if err != nil {
				b.Fatal(err)
			}
		}
		b.ReportMetric(float64(total)/float64(b.N), "bytes")
	})

	b.Run("MarshalVT", func(b *testing.B) {
		total := 0
		for i := 0; i < b.N; i++ {
			bs, err := req.MarshalVT()
			total += len(bs)
			if err != nil {
				b.Fatal(err)
			}
		}
		b.ReportMetric(float64(total)/float64(b.N), "bytes")
	})

	b.Run("Unmarshal", func(b *testing.B) {
		total := 0
		for i := 0; i < b.N; i++ {
			var l protos.InsertRequest
			total += len(bs)
			if err := proto.Unmarshal(bs, &l); err != nil {
				b.Fatal(err)
			}
		}
		b.ReportMetric(float64(total)/float64(b.N), "bytes")
	})

	b.Run("UnmarshalVT", func(b *testing.B) {
		total := 0
		for i := 0; i < b.N; i++ {
			var l protos.InsertRequest
			total += len(bs)
			if err := l.UnmarshalVT(bs); err != nil {
				b.Fatal(err)
			}
		}
		b.ReportMetric(float64(total)/float64(b.N), "bytes")
	})
}

Why is this needed?

No response

Anything else?

No response

@jaime0815 jaime0815 added the kind/enhancement Issues or changes related to enhancement label Dec 30, 2024
@jaime0815 jaime0815 self-assigned this Dec 30, 2024
xiaofan-luan (Collaborator) commented:

https://github.com/planetscale/vtprotobuf

  1. How is it compatible with the current implementation?
  2. What is the size comparison between vtprotobuf and protobuf?

jaime0815 (Contributor, Author) commented:

> 1. How is it compatible with the current implementation?
> 2. What is the size comparison between vtprotobuf and protobuf?

  1. It is fully compatible with the current implementation, but the API requires changes.
  2. The serialized sizes remain the same after marshaling or unmarshaling.
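On the compatibility point, one low-risk migration path is to feature-detect the generated fast path at the call site and fall back to the standard marshaller when it is absent. A minimal sketch (the helper name is hypothetical, and JSON stands in for proto.Marshal so the example is self-contained):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// vtMarshaler matches the method set vtprotobuf-generated messages expose.
type vtMarshaler interface {
	MarshalVT() ([]byte, error)
}

// marshalAny prefers the generated fast path when present and falls back
// to a generic serializer otherwise (JSON stands in for proto.Marshal
// here), so existing callers need no per-message code changes.
func marshalAny(msg any) ([]byte, error) {
	if m, ok := msg.(vtMarshaler); ok {
		return m.MarshalVT()
	}
	return json.Marshal(msg)
}

// fastMsg simulates a message with vtprotobuf-generated methods.
type fastMsg struct{ Payload string }

func (m *fastMsg) MarshalVT() ([]byte, error) { return []byte(m.Payload), nil }

// plainMsg simulates a message without generated fast paths.
type plainMsg struct{ Payload string }

func main() {
	fast, _ := marshalAny(&fastMsg{Payload: "vt"})
	plain, _ := marshalAny(plainMsg{Payload: "std"})
	fmt.Printf("fast=%q plain=%q\n", fast, plain)
}
```

The type assertion costs a few nanoseconds per call, so the wrapper keeps one code path for both generated and non-generated messages during a gradual rollout.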

alexanderguzhva (Contributor) commented:

@jaime0815

  1. The results may be misleading because of the CPU type (cpu: Intel(R) Core(TM) i7-1068NG7 CPU @ 2.30GHz).
  2. It needs to be ensured that the most recent versions of both the baseline and candidate libraries are used. Is that requirement satisfied?

xiaofan-luan (Collaborator) commented:

> @jaime0815
> 1. The results may be misleading because of the CPU type (cpu: Intel(R) Core(TM) i7-1068NG7 CPU @ 2.30GHz).
> 2. It needs to be ensured that the most recent versions of both the baseline and candidate libraries are used. Is that requirement satisfied?

Good suggestion. We can test it on an R7gd instance. Ideally this won't make much difference, because the optimization is more on the data-structure side.

3 participants