Remove extra GET for tarball uncompressed size by charlespierce · Pull Request #1812 · volta-cli/volta

charlespierce · 2024-07-30T04:57:31Z

Info

Currently, we make two overlapping GET requests when fetching a tarball: First, the main request to get the headers and the data stream, and then a second request to attempt to fetch the ISIZE field which holds the uncompressed size. Getting the uncompressed size lets us use the stream data after uncompressing to populate our progress bar for tarballs.

However, the extra request causes issues for some users, where the second request for the ISIZE field hangs during the TLS handshake. By removing the uncompressed size entirely and connecting the progress bar to the compressed read data, we can avoid the extra request entirely.

Changes

Removed the uncompressed_size method from the Archive trait.
Removed the extra request for the ISIZE field from the Tarball implementation of Archive
Updated Tarball to emit progress off the compressed reader, rather than the output of the decoder (so that progress values will be based on the compressed size).

Tested

Ran locally and verified that fetching a Tarball still works as expected.
Visual test didn't show any imperfections with the progress bar—it seemed to work the same as the previous approach using the uncompressed size.

Notes

I didn't have a chance to test the progress bar on a Unix machine yet, I'd be interested to see if there is any noticeable delay or jitteriness from the change in how the progress bar is filled in those cases.

Currently, we make two overlapping GET requests when fetching a tarball: First, the main request to get the headers and the data stream, and then a second request to attempt to fetch the ISIZE field which holds the uncompressed size. Getting the uncompressed size lets us use the stream data after uncompressing to populate our progress bar for tarballs. However, the extra request causes issues for some users, where the second request for the ISIZE field hangs during the TLS handshake. By removing the uncompressed size entirely and connecting the progress bar to the compressed read data, we can avoid the extra request entirely.

chriskrycho · 2024-07-30T15:21:27Z

crates/archive/src/tarball.rs

+        let decoded = GzDecoder::new(ProgressRead::new(self.data, (), progress));
+        let mut tarball = tar::Archive::new(decoded);


Just for my own clarity: what motivated changing up where we put the ProgressRead? It seems like it should be the same behavior either way, since ultimately in both cases it’s just wrapping the Read stream?

Yeah, I can definitely explain! For some background:

ProgressRead is a thin wrapper around a reader that calls the provided FnMut every time data is read, passing in the amount of data that was read from the wrapped reader.

GzDecoder is a Read wrapper around a stream of compressed bytes, which produces a stream of uncompressed bytes.

Given that, the motivation behind the change is that previously we were wrapping the decoded reader with ProgressRead, meaning the number of bytes it would report would be from the stream of uncompressed bytes—they would add up to the total uncompressed size. Since we're no longer fetching / relying on the uncompressed size, we instead move the ProgressRead to wrap the stream of compressed bytes, which allows the total amount of data recorded to match the compressed size.

chriskrycho

This looks eminently reasonable, and local testing on my Mac seems to be working well with no visual hiccups.

(Intuitively, this makes a lot of sense to me: decoding the file should more or less always be dwarfed by the download time, and the download time is going to be a function of the compressed, not the uncompressed, size.)

I had one question, but it’s absolutely not blocking.

charlespierce mentioned this pull request Jul 30, 2024

When trying to install new versions volta hangs indefinitely and blocks all other network requests from resolving #1744

Closed

chriskrycho reviewed Jul 30, 2024

View reviewed changes

chriskrycho approved these changes Jul 30, 2024

View reviewed changes

chriskrycho merged commit bc471dc into volta-cli:main Jul 30, 2024

charlespierce deleted the tarball_compressed_size branch August 1, 2024 03:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove extra GET for tarball uncompressed size#1812

Remove extra GET for tarball uncompressed size#1812
chriskrycho merged 1 commit intovolta-cli:mainfrom
charlespierce:tarball_compressed_size

charlespierce commented Jul 30, 2024

Uh oh!

chriskrycho Jul 30, 2024

Uh oh!

charlespierce Jul 30, 2024

Uh oh!

chriskrycho left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		let decoded = GzDecoder::new(ProgressRead::new(self.data, (), progress));
		let mut tarball = tar::Archive::new(decoded);

Conversation

charlespierce commented Jul 30, 2024

Info

Changes

Tested

Notes

Uh oh!

chriskrycho Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

charlespierce Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

chriskrycho left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants