Closed
Description
The default implementation of `read_to_end` grows its buffer by at most `DEFAULT_BUF_SIZE` bytes (8KB) at a time. When the input is large, this takes O(n²) time due to repeated reallocation and copying. Calling `read_to_end` on a 100MB file with an empty buffer will resize and copy the buffer over 10,000 times.
When the input size is known ahead of time, callers can avoid this problem by passing in a pre-allocated buffer. It would be nice if this were not necessary in cases where the standard library could do this automatically.
- `read_to_end` could be specialized to pre-reserve space for types that implement `Seek`.
- Types like `File` that have other ways to determine the input size could provide their own implementations of `read_to_end`.
This still leaves potentially quadratic behavior for cases where the input size can't be determined up front. This could be fixed using exponential growth, at the cost of wasting more memory due to unused buffer space.
Activity
mbrubeck commented on Nov 7, 2017
Sorry, the description here is incorrect. Since the `read_to_end` implementation calls `reserve`, which grows exponentially, the time is not quadratic. There are performance issues with doing even a small number of reallocations and copies of very large buffers, but I'll close this and file that as a new issue to avoid confusion.

SimonSapin commented on Nov 11, 2017
I measured `File::read_to_end` with a vector created with `Vec::with_capacity(file.metadata().len() as usize)`, compared to `Vec::new()`. On my Linux desktop with an SSD (though everything is probably in the filesystem cache here), pre-allocating + reading takes 3% more time to 43% less time depending on file size.

SimonSapin commented on Nov 11, 2017
PR with a prototype: #45928