Implement Applicative based API #522

Shimuuar · 2025-01-28T09:16:08Z

Implementation follows plan described in #477 with generateA :: Applicative f => Int -> (Int -> f a) -> f (v a) as primitive. It is not subject to stream fusion.

Writing generateA is not difficult. Problem is doing it efficiently. Current benchmarks are (suggestions for more are very welcome):

Generate vector of random numbers using state monad
Generate vector using IO action
Compute sum of vector using lens
Map vector using lens

First and naive implementation uses list as intermediate data structure. Sum benchmark performs well in this case. Using STA instead brings map benchmark on par with explicit loop and produces slight (5-10%) improvements in state and IO benchmark)

newtype STA v a = STA {  _runSTA :: forall s. Mutable v s a -> ST s (v a) }

Currently sum and map perform on par with explicit loop. State gives 7x slowdown and 8x allocations, IO benchmark 4x slowdown and 8x allocations. We obviously can add rewrite rules for IO/ST but maybe there're more general optimizations.

Fixes #477, #69, #132, #144

generateA is used as primitive and all other functions are expressed in it terms. First version goes through intermediate list. This is simplest implementation possible and would serve as baseline for further optimizations

We establish implementation which goes through list as baseline and the we can try to optimize it. Note definition of foldlOf'. It's different from definition in lens<=5.3.3 but it's absolutely necessary to get good perfomance in folds

Does wonders for traversals using Identity

konsumlamm

What about for/for_/traverse_ etc?

vector/src/Data/Vector/Generic.hs

konsumlamm · 2025-01-28T14:16:39Z

vector/src/Data/Vector/Generic.hs

+    go !i | i >= n    = pure $ STA unsafeFreeze
+          | otherwise =  (\a (STA m) -> STA $ \mv -> M.unsafeWrite mv i a >> m mv)
+                     <$> f i
+                     <*> go (i + 1)


It's probably better to use liftA2 here.

Probably. But I'm not sure that STA will survive maybe some New-like variant will perform better

Co-authored-by: konsumlamm <44230978+konsumlamm@users.noreply.github.com>

Shimuuar · 2025-01-28T14:28:50Z

What about for/for_/traverse_ etc?

Good point! I forgot about them. But they're simpler since in this case one isn't in business of constructing vector so it simply reduces to fold.

Shimuuar added 3 commits January 27, 2025 19:56

Implement functions which use Applicatives

3764a73

generateA is used as primitive and all other functions are expressed in it terms. First version goes through intermediate list. This is simplest implementation possible and would serve as baseline for further optimizations

Implement STA optimization trick as an optimization

bb38bf5

Does wonders for traversals using Identity

konsumlamm reviewed Jan 28, 2025

View reviewed changes

Update vector/src/Data/Vector/Generic.hs

6ad3458

Co-authored-by: konsumlamm <44230978+konsumlamm@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Applicative based API #522

Implement Applicative based API #522

Shimuuar commented Jan 28, 2025

konsumlamm left a comment

konsumlamm Jan 28, 2025

Shimuuar Jan 28, 2025

Shimuuar commented Jan 28, 2025

Implement Applicative based API #522

Are you sure you want to change the base?

Implement Applicative based API #522

Conversation

Shimuuar commented Jan 28, 2025

konsumlamm left a comment

Choose a reason for hiding this comment

konsumlamm Jan 28, 2025

Choose a reason for hiding this comment

Shimuuar Jan 28, 2025

Choose a reason for hiding this comment

Shimuuar commented Jan 28, 2025