takeRight and dropRight of views eagerly traverse the underlying collection elements #11275

julienrf · 2018-11-26T15:46:52Z

I’m not sure we can do better, but I’d like to draw attention to the current behaviour of takeRight and dropRight applied to views, at least to agree on what we should expect.

This REPL session shows the current behaviour:

Welcome to Scala 2.13.0-20181120-204541-343c2d4 (OpenJDK 64-Bit Server VM, Java 1.8.0_181).
Type in expressions for evaluation. Or try :help.

scala> List.from(1 to 10)
res0: List[Int] = List(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)

scala> res0.view
res1: scala.collection.SeqView[Int] = View(?)

scala> res1.map { x => println(x); x }
res2: scala.collection.SeqView[Int] = View(?)

scala> res2.take(3) // With `take`, no traversal happens
res3: scala.collection.SeqView[Int] = View(?)

scala> res2.takeRight(3)
1
2
3
4
1
5
2
6
3
7
4
8
5
9
6
10
7
8
9
10
res4: scala.collection.View[Int] = View(?)

takeRight returns a View whose elements are eagerly forced, and if we look at the implementation, we see that the elements are collected into an ArrayBuffer, and then a view of that ArrayBuffer is returned. I see two problems with that: (1) the fact that the returned view elements are eagerly evaluated breaks the property of transformation operations applied to views being lazy, and (2) an intermediate ArrayBuffer is used, although the purpose of using views is to not create intermediate data structures.

That being said, unlike take, we can not implement takeRight on views in a fully lazy way because we have to reach the end of the underlying collection to know that we can start emitting elements. So, the best we could do would be to delay the time we traverse the underlying collection to the time when at least one element of the view is effectively accessed. Doing that would fix the two mentioned issues.

However, that’s not the end of the story because this solution would break another (maybe informal?) property of views: accessing view elements should have a predictable complexity, and more specifically, it should have the same complexity of accessing elements of the underlying collection. Said otherwise, head should be O(1) on views, but that wouldn’t be the case here because it would require a complete traversal of the underlying collection to get the first element resulting from the takeRight operation.

So, it seems that the two properties (transformation operations of views should be lazy, and accessing view elements should have the same complexity as accessing the underlying collection elements) can not be satisfied at the same time. Which one should we pick?

My suggestion would be to keep the second property and give up the first one, because we already have some other operations that are not lazy (e.g. groupBy, permutations, …). These operations are not really transformation operations because they don’t return the same collection type but a collection of sub-collections (ie groupBy returns a Map[K, View[A]]). Another similar case is sorted, which eagerly evaluates its elements as well (and uses an intermediate ArrayBuffer).

If we agree on that, we can still at least improve the current implementation by not creating an intermediate ArrayBuffer, like LazyList does here: #11083 (comment)

The text was updated successfully, but these errors were encountered:

som-snytt · 2018-11-26T21:14:58Z

Just to point out leading-dot in REPL, in case the feature is not well-known:

scala> (1 to 10).toList
res0: List[Int] = List(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)

scala> .view
res1: scala.collection.SeqView[Int] = View(?)

scala> .map { x => println(x) ; x }
res2: scala.collection.SeqView[Int] = View(?)

szeiger · 2018-12-05T12:24:10Z

My suggestion would be to keep the second property and give up the first one, because we already have some other operations that are not lazy (e.g. groupBy, permutations, …). These operations are not really transformation operations because they don’t return the same collection type but a collection of sub-collections

But that's an important difference. These methods cannot be lazy because they return strict collections. There is no indication in the types that takeRight or dropRight shouldn't be lazy. They are symmetrical to take and drop, they have the same type and they should be equally lazy.

accessing view elements should have a predictable complexity, and more specifically, it should have the same complexity of accessing elements of the underlying collection. Said otherwise, head should be O(1) on views

I don't think that's true. A View represents a reified operation on an Iterator (or on indexed access in case of an IndexedView). Traversing is done with an Iterator, values are computed based on an Iterator, so the complexity is that of accessing the unerlying collection through an Iterator. head won't be O(1) if the base collection doesn't have O(1) head, it won't be O(1) if you put it after filter(_ => false), etc.

Any kind of caching in Views is a problem because it goes against the simple model of Views as reified Iterator operations. I don't we should do it unless it's clear from the types (like computing a strict collection or building a View from an Iterator). What we need for takeRight and dropRight are View-based implementations that already exist for take and drop. StrictOptimizedIterableOps can override them with more efficient strict implementations.

julienrf · 2018-12-05T16:44:55Z

head won't be O(1) if the base collection doesn't have O(1) head, it won't be O(1) if you put it after filter(_ => false), etc.

Yeah, good point.

What we need for takeRight and dropRight are View-based implementations that already exist for take and drop. StrictOptimizedIterableOps can override them with more efficient strict implementations.

I agree.

szeiger · 2018-12-13T19:02:35Z

Small catch: Iterator doesn't even have takeRight and dropRight and these are non-trivial to implement efficiently. I'll give it a try.

NthPortal · 2018-12-13T20:46:36Z

Possibly not the best option, but you could convert it to a LazyList first, which has decent takeRight and dropRight implementations

- Add these methods to IterableOnceOps - Implement them in Iterator (with the necessary amount of caching but no more than that) - Move the existing strict implementations from IterableOps to StrictOptimizedIterableOps - Add View.(TakeRight|DropRight) based on Iterator - Move IndexedSeqView implementations of TakeRight, Drop and DropRight up to SeqView - Add new overrides in IndexedSeqView Fixes scala/bug#11275

szeiger · 2018-12-14T14:39:33Z

I took the direct route with dedicated Iterator implementations with array-based ring buffers. This should be much faster than going through LazyList.

- Move the existing strict implementations from IterableOps to StrictOptimizedIterableOps - Add View.(TakeRight|DropRight) with private Iterator-based implementations that cache as little and as late as possible - Move IndexedSeqView implementations of TakeRight, Drop and DropRight up to SeqView - Add new overrides in IndexedSeqView Fixes scala/bug#11275

julienrf added the library:collections label Nov 26, 2018

SethTisue added this to the 2.13.0-RC1 milestone Nov 26, 2018

julienrf added the scala spree label Dec 11, 2018

szeiger mentioned this issue Dec 13, 2018

Lazy implementations of takeRight and dropRight scala/scala#7524

Merged

szeiger added the has PR label Dec 13, 2018

julienrf removed the scala spree label Dec 15, 2018

adriaanm assigned szeiger Jan 15, 2019

szeiger closed this as completed in scala/scala#7524 Jan 28, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

takeRight and dropRight of views eagerly traverse the underlying collection elements #11275

takeRight and dropRight of views eagerly traverse the underlying collection elements #11275

julienrf commented Nov 26, 2018 •

edited

Loading

som-snytt commented Nov 26, 2018

Uh oh!

szeiger commented Dec 5, 2018

Uh oh!

julienrf commented Dec 5, 2018

Uh oh!

szeiger commented Dec 13, 2018

Uh oh!

NthPortal commented Dec 13, 2018

Uh oh!

szeiger commented Dec 14, 2018

Uh oh!

takeRight and dropRight of views eagerly traverse the underlying collection elements #11275

takeRight and dropRight of views eagerly traverse the underlying collection elements #11275

Comments

julienrf commented Nov 26, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

som-snytt commented Nov 26, 2018

Uh oh!

szeiger commented Dec 5, 2018

Uh oh!

julienrf commented Dec 5, 2018

Uh oh!

szeiger commented Dec 13, 2018

Uh oh!

NthPortal commented Dec 13, 2018

Uh oh!

szeiger commented Dec 14, 2018

Uh oh!

julienrf commented Nov 26, 2018 •

edited

Loading