HDDS-10338. Implement a Client Datanode API to stream a block #6613

Draft
wants to merge 90 commits into base: master
Conversation

@chungen0126 (Contributor) commented Apr 30, 2024

What changes were proposed in this pull request?

To reduce round trips between the Client and Datanode when reading a block, we need a new read API.

Client -> block(offset, length) -> Datanode
Client <- chunkN <- Datanode
Client <- chunkN+1 <- Datanode
...
Client <- chunkLast <- Datanode

This uses gRPC's ability to send bidirectional traffic, so the server can pipeline the chunks to the client without waiting for ReadChunk API calls. It also saves the client from creating multiple chunk stream clients and should simplify the read path on the client side a bit.
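As a rough illustration of the request/response shape, here is a minimal client-side sketch assuming a gRPC bidirectional-streaming call. The message and stub names (ReadBlockRequest, ChunkResponse, BlockStreamStub) are placeholders, not the PR's actual protos or generated stubs:

import io.grpc.stub.StreamObserver;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CountDownLatch;

final class StreamBlockSketch {

  // Placeholder message types standing in for the real protobuf messages.
  record ReadBlockRequest(long containerId, long localId, long offset, long length) { }
  record ChunkResponse(byte[] data) { }

  // Placeholder for the generated async stub's bidirectional streaming call.
  interface BlockStreamStub {
    StreamObserver<ReadBlockRequest> streamBlock(StreamObserver<ChunkResponse> responses);
  }

  static List<byte[]> readBlock(BlockStreamStub stub, ReadBlockRequest request)
      throws InterruptedException {
    List<byte[]> chunks = new ArrayList<>();
    CountDownLatch done = new CountDownLatch(1);
    StreamObserver<ReadBlockRequest> requests =
        stub.streamBlock(new StreamObserver<ChunkResponse>() {
          @Override public void onNext(ChunkResponse chunk) {
            chunks.add(chunk.data());   // chunkN, chunkN+1, ... arrive without extra calls
          }
          @Override public void onError(Throwable t) {
            done.countDown();           // the caller could fall back to the ReadChunk path
          }
          @Override public void onCompleted() {
            done.countDown();           // chunkLast received
          }
        });
    requests.onNext(request);           // single request: block ID + offset + length
    requests.onCompleted();
    done.await();                       // wait until the server finishes streaming
    return chunks;
  }
}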

Please describe your PR in detail:

  • Add new logic on both the client and server side to read a block as a stream of chunks.
  • Add a new StreamBlockInput on the client side, called from KeyInputStream, to read a block from the container.
  • Add unit tests and integration tests for `StreamBlockInput`.
  • Add a new datanode version for compatibility: when a new client reads blocks from an old server, it falls back to reading the blocks through BlockInputStream (see the sketch below).
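A minimal sketch of the version-based fallback described in the last bullet, assuming a simple integer version check; the class and method names below are placeholders rather than the PR's actual code, which uses StreamBlockInput and BlockInputStream:

import java.io.IOException;
import java.io.InputStream;

final class BlockInputSelector {

  /** Choose the streaming read path when the datanode is new enough, else fall back. */
  static InputStream open(long containerId, long localId,
                          int datanodeVersion, int minStreamBlockVersion) throws IOException {
    if (datanodeVersion >= minStreamBlockVersion) {
      // New datanode: one StreamBlock request, chunks are pipelined back.
      return openStreamBlockInput(containerId, localId);
    }
    // Old datanode: fall back to the existing per-chunk ReadChunk path.
    return openBlockInputStream(containerId, localId);
  }

  // Placeholders standing in for constructing the real input streams.
  private static InputStream openStreamBlockInput(long containerId, long localId) {
    return InputStream.nullInputStream();
  }

  private static InputStream openBlockInputStream(long containerId, long localId) {
    return InputStream.nullInputStream();
  }
}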

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-10338

How was this patch tested?

There are existing tests for reading data.

@@ -458,4 +485,77 @@ private static ContainerCommandRequestProto createContainerRequest(
.setContainerID(containerID).setPipelineID(UUID.randomUUID().toString())
.build();
}

@Test
public void testReadBlock() throws IOException {
@ChenSammi (Contributor) commented Dec 3, 2024

Could you add an end-to-end case to test reading an empty file with an empty block?

@chungen0126 (Contributor, Author) replied:
Added testReadEmptyBlock in TestStreamBlockInputStream to test reading an empty block.
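For illustration only, a sketch of the general shape such an empty-block test could take, assuming JUnit 5; the stream construction is a stand-in, since the real testReadEmptyBlock builds a StreamBlockInputStream over a zero-length block:

import static org.junit.jupiter.api.Assertions.assertEquals;

import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import org.junit.jupiter.api.Test;

class EmptyBlockReadSketchTest {

  // Placeholder: the real test opens a StreamBlockInputStream for an empty block.
  private InputStream openEmptyBlock() {
    return new ByteArrayInputStream(new byte[0]);
  }

  @Test
  void readEmptyBlockReturnsEof() throws IOException {
    try (InputStream in = openEmptyBlock()) {
      byte[] buf = new byte[16];
      // An empty block should report EOF immediately and return no data.
      assertEquals(-1, in.read(buf, 0, buf.length));
    }
  }
}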

@ChenSammi (Contributor) commented Dec 4, 2024

@chungen0126, thanks for the quick patch update. I will try to finish the review this week.

@ptlrs (Contributor) left a comment:

Thanks for the PR @chungen0126. I have a few comments.

Comment on lines +56 to +57
private static final int CHUNK_SIZE = 100;
private static final int BYTES_PER_CHECKSUM = 20;

Can we add some tests which cover the conditions where BYTES_PER_CHECKSUM is equal to or greater than CHUNK_SIZE?

@chungen0126 (Contributor, Author) replied:

I don't think we need to add this test case. Logically, BYTES_PER_CHECKSUM should be smaller than CHUNK_SIZE. Otherwise, a single chunk file cannot be verified.
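For context, a small worked example using the constants quoted above; this is illustrative arithmetic only, not code from the PR:

public class ChecksumLayoutExample {
  public static void main(String[] args) {
    int chunkSize = 100;         // CHUNK_SIZE in the test above
    int bytesPerChecksum = 20;   // BYTES_PER_CHECKSUM in the test above
    // Each chunk is covered by ceil(chunkSize / bytesPerChecksum) checksum entries.
    int checksumsPerChunk = (chunkSize + bytesPerChecksum - 1) / bytesPerChecksum;
    System.out.println(checksumsPerChunk);  // 5: five 20-byte slices per 100-byte chunk
  }
}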

@ptlrs (Contributor) commented Dec 21, 2024

> Test conducted on our cluster (3DN / HDD / 10 Gigabit network) shows this improvement can boost read speed by at least 30%.
>
> It can be seen that for 1GB file reading, streaming reading can reduce the reading time by about 30%.

Hi @chungen0126 @guohao-rosicky, do we know how much of an improvement is being seen for files smaller than 1GB?

@fenixjin commented Jan 7, 2025

> Test conducted on our cluster (3DN / HDD / 10 Gigabit network) shows this improvement can boost read speed by at least 30%.
>
> It can be seen that for 1GB file reading, streaming reading can reduce the reading time by about 30%.
>
> Hi @chungen0126 @guohao-rosicky, do we know how much of an improvement is being seen for files smaller than 1GB?

@ptlrs
I conducted a test using freon on the latest version of this PR. The hardware is 3DN / HDD / 10 Gigabit network, and the tested file sizes are 1GB, 128MB, 16MB, 2MB, and 256KB. There is an index bug when verifying checksums; the tests were conducted after fixing it.
Here are the reductions in mean read time reported by freon:

  • 1GB: 7%
  • 128MB: 10%
  • 16MB: 8%
  • 2MB: 10%
  • 256KB: 8%

Further tests were conducted with checksum verification skipped; it turns out checksums have a substantial impact on read time.
When testing a 1GB file with freon, turning on checksums increased the read time from around 3700 ms to 5000 ms.
With checksums off, the read-time reduction of streaming block reads compared with the current read method (checksums also off) is:

  • 1GB: 30%
  • 128MB: 21%
  • 16MB: 11%
  • 2MB: -5.6% (chunk size is 4MB)

@guohao-rosicky (Contributor) commented:
Can you resolve the code conflict? @chungen0126

@adoroszlai self-assigned this Feb 14, 2025
@adoroszlai marked this pull request as draft February 14, 2025 10:13
@adoroszlai (Contributor) commented:
Temporarily converted to draft and assigned to myself, to resolve conflicts.

@chungen0126 (Contributor, Author) commented:
Thanks @adoroszlai for fixing the conflicts. I was just about to address it.

@adoroszlai removed their assignment Feb 15, 2025