jyp · September 16, 2015 09:21
diff --git a/Organ.lhs b/Organ.lhs
 ---
 title: On the Duality of Streams
 subtitle: How Can Linear Types Help to Solve the Lazy IO Problem?
 author:
 - name: Jean-Philippe Bernardy
 - name: Josef Svenningsson
 ...

 <!--

 > {-# LANGUAGE ScopedTypeVariables, TypeOperators, RankNTypes, LiberalTypeSynonyms, BangPatterns, TypeSynonymInstances, FlexibleInstances, FlexibleContexts  #-}
 > module Organ where
 > import System.IO
 > import Control.Exception
 > import Control.Concurrent (forkIO, readChan, writeChan, Chan, newChan, QSem, newQSem, waitQSem, signalQSem)
 > import Control.Applicative hiding (empty)
 > import Data.IORef
 > import Data.Monoid
 > import Prelude hiding (tail)
 > import Control.Monad (ap)

 -->

 \begin{abstract}
 In this paper, we present a novel stream-programming library for
 Haskell.  Like other coroutine-based stream libraries, our library
 allows synchronous execution, which implies that effects are run in
 lockstep and no buffering occurs.

 A novelty of our implementation is that it allows buffering or
 re-scheduling of effects to be introduced locally. The buffering requirements (or
 re-scheduling opportunities) are indicated by the type system.

 Our library is based on a number of design principles, adapted from
 the theory of Girard's Linear Logic. These principles are applicable
 to the design of any Haskell structure where resource management
 (memory, IO, ...) is critical.
 \end{abstract}

 \category{D.1.1}{Applicative (Functional) Programming}{}
 \category{D.3.3}{Language Constructs and Features}{Coroutines}

 \keywords
 Streams, Continuations, Linear Types

 Introduction
 ============

 As \citet{hughes_functional_1989} famously noted, the strength of
 functional programming languages resides in the composition mechanisms
 that they provide. That is, simple components can be built and
 understood in isolation; one does not need to worry about interference
 effects when composing them. In particular, lazy evaluation makes it
 possible to construct complex programs by pipelining simple list
 transformation functions. Indeed, while strict evaluation forces one to
 fully reify each intermediate result between computational steps, lazy
 evaluation allows all the computations to run concurrently, often
 without ever allocating more than a single intermediate element at a time.

 Unfortunately, lazy evaluation suffers from two drawbacks.  First, it
 has unpredictable memory behavior. Consider the following function
 composition:

 < f :: [a] -> [b]
 < g :: [b] -> [c]
 < h = g . f

 One hopes that, at run-time, the intermediate list ($[b]$)
 will only be allocated element-wise, as outlined above. Unfortunately,
 this desired behavior does not always happen. Indeed, a
 necessary condition is that the production pattern of $f$ matches the
 consumption pattern of $g$; otherwise buffering occurs. In practice,
 this means that a seemingly innocuous change in either of the function
 definitions may drastically change the memory behavior of the
 composition, without warning. If one cares about memory behavior,
 this means that the compositionality principle touted by Hughes breaks
 down.

 Second, lazy evaluation does not extend nicely to effectful
 processing. That is, if (say) an input list is produced by reading a
 file lazily, one is exposed to losing referential transparency (as
 \citet{kiselyov_lazy_2013} has shown). For example, one may rightfully
 expect\footnote{This expectation is expressed in a
 Stack Overflow question, accessible at this URL:
 http://stackoverflow.com/questions/296792/haskell-io-and-closing-files
 } that the following two programs have the same behavior:

 < main = do  inFile <- openFile "foo" ReadMode
 <            contents <- hGetContents inFile
 <            putStr contents
 <            hClose inFile
 < 
 < main = do  inFile <- openFile "foo" ReadMode
 <            contents <- hGetContents inFile
 <            hClose inFile
 <            putStr contents

 Indeed, the \var{putStr} and \var{hClose} commands act on unrelated
 resources, and thus swapping them should have no observable effect.
 However, while the first program prints the `foo` file, the second one
 prints nothing.  Indeed, because \var{hGetContents} reads the file
 lazily, the \var{hClose} operation has the effect of truncating the
 list. In the first program, printing the contents forces reading the
 file. One may argue that \var{hClose} should not be called in the
 first place --- but then, closing the handle happens only when the
 \var{contents} list can be garbage collected (in full), and relying on
 garbage collection for cleaning resources is brittle; furthermore
 this effect compounds badly with the first issue discussed above.  If
 one wants to use lazy effectful computations, again, the
 compositionality principle is lost.

 In this paper, we propose to tackle both of these issues by mimicking
 the computational behavior of Girard's linear logic
 \cite{girard_linear_1987} in Haskell. In fact, one way to read this
 paper is as an advocacy for linear types support in Haskell. While
 Kiselyov's *iteratees* (\citeyear{kiselyov_iteratees_2012}) already
 solve the issues described above, our grounding in linear logic
 yields a rich structure for types for data streams, capturing
 various production and consumption patterns.

 First, the type corresponding to on-demand production of elements is called a
 source (\var{Src}). An adaptation of the first code example above to
 use sources would look as follows, and give the guarantee that the
 composition does not allocate more memory than the sum of its
 components.

 < f :: Src a -> Src b
 < g :: Src b -> Src c
 < h = g . f

 Second, the type driving the consumption of elements is called a sink
 (\var{Snk}).  For example, the standard output is naturally given a
 sink type:

 < stdoutSnk :: Snk String

 Using it, we can implement the printing of a file as follows, and
 guarantee the timely release of resources, even in the presence of
 exceptions:

 > main = fileSrc "foo" `fwd` stdoutSnk

 In the above, \var{fileSrc} provides the contents of a file, and
 \var{fwd} forwards data from a source to a sink.  The types are as
 follows:

 < fileSrc :: FilePath -> Src String
 < fwd :: Src a -> Snk a -> IO ()

 Sources provide data on-demand, while sinks decide when they are ready
 to consume data. This is an instance of the push/pull duality.  In
 general, push-streams control the flow of computation, while
 pull-streams respond to it. We will see that this polarization does
 not need to match the flow of data. We support in particular data
 sources with push-flavor, called co-sources (\var{CoSrc}).
 Co-sources are useful for example when a data stream needs precise
 control over the execution of effects it embeds (see
 Sec. \ref{async}). For example, sources cannot be demultiplexed, but
 co-sources can.

 In a program which uses both sources and co-sources, the need might
 arise to compose a function which returns a co-source with a function
 which takes a source as input: this is the situation where list-based
 programs would silently cause memory allocation. In our approach, this
 mismatch is caught by the type system, and the user must explicitly
 conjure a buffer to be able to write the composition:

 < f :: Src a -> CoSrc b
 < g :: Src b -> Src c
 < h = g . buffer . f

 The contributions of this paper are:

 * The formulation of principles for compositional resource-aware
 programming in Haskell (resources include memory and files). The
 principles are linearity, duality, and polarization. While borrowed
 from linear logic, as far as we know they have not been applied to
 Haskell programming before.

 * An embodiment of the above principles, in the form of a Haskell
 library for streaming `IO`. Besides supporting compositionality as
 outlined above, our library features two concrete novel aspects:

  1. A more lightweight design than state-of-the-art coroutine-based
    libraries.

  2. Support for explicit buffering and control structures, while
    still respecting compositionality (Sec. \ref{async}).

 \paragraph{Outline} The rest of the paper is structured as follows.
 In Sec. \ref{negations}, we recall the notions of continuations in the presence of effects.
 In Sec. \ref{streams}, we present our design for streams, and justify it by appealing to linearity principles.
 In Sec. \ref{effect-free-streams}, we give an API to program with streams, and analyze their algebraic structure.
 In Sec. \ref{effectful-streams}, we show how to embed IO into streams.
 In Sec. \ref{async}, we discuss polarity mismatch.
 Related work and future work are discussed in Sec. \ref{related-work} and Sec. \ref{future-work}, respectively.
 We conclude in Sec. \ref{conclusion}.



 Preliminary: negation and continuations
 =======================================
 \label{negations}

 In this section we recall the basics of continuation-based
 programming. We introduce our notation, and justify effectful
 continuations.

 We begin by assuming a type of effects \var{Eff}, which we keep
 abstract for now. We can then define negation as follows:

 > type N a = a -> Eff

 A shortcut for double negations is also convenient.

 > type NN a = N (N a)

 The basic idea (imported from classical logic) pervading this paper
 is that producing a result of type α is equivalent to consuming an
 argument of type $N α$. Dually, consuming an argument of type α is
 equivalent to producing a result of type $N α$. In this paper we call
 these equivalences the duality principle.

 In classical logic, negation is involutive; that is: $\var{NN}\,α = α$.
 However, because we work within Haskell, we do not have this
 equality\footnote{Even though
 \citet{munch-maccagnoni_formulae-as-types_2014} achieves an involutive
 negation in an intuitionistic language, he does so by stack
 manipulation, which is not available in Haskell.}.  We can come close
 enough though.  First, double negations can always be introduced,
 using the \var{shift} operator:

 > shift :: a -> NN a
 > shift x k = k x

 Second, it is possible to remove double negations, but only if an
 effect can be output.  Equivalently, triple negations can be
 collapsed to a single one:

 > unshift :: N (NN a) -> N a
 > unshift k x = k (shift x)

 The above two functions are the \var{return} and \var{join} of the
 double negation monad\footnote{for \var{join}, substitute $N\,a$ for
 $a$}; indeed adding a double negation in the type corresponds to
 sending the return value to its consumer. However, we will not be
 using this monadic structure anywhere in the following. Indeed, single
 negations play a central role in our approach, and the monadic
 structure is a mere diversion.
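
 To make the two operators concrete, the following standalone sketch
 runs them on a small example. It assumes $\var{Eff} = \var{IO} ()$
 (the choice made later in this paper); the names `record`, `feed` and
 `demo` are hypothetical and serve only as illustration.

 ```haskell
 import Data.IORef

 type Eff  = IO ()          -- assumption: effects are plain IO actions
 type N a  = a -> Eff
 type NN a = N (N a)

 shift :: a -> NN a
 shift x k = k x

 unshift :: N (NN a) -> N a
 unshift k x = k (shift x)

 -- Hypothetical demo: returns the values seen by the consumer, in order.
 demo :: IO [Int]
 demo = do
   seen <- newIORef []
   let record :: N Int               -- a consumer of Int
       record n = modifyIORef seen (n :)
       feed :: N (NN Int)            -- a consumer of doubly negated Int
       feed nn = nn record
   shift 42 record                   -- the value is handed to its consumer
   unshift feed 7                    -- feed now acts as a consumer of Int
   reverse <$> readIORef seen

 main :: IO ()
 main = demo >>= print
 ```

 Running `main` prints `[42,7]`: both calls end up sending their
 argument to `record`.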

 Structure of Effects
 --------------------

 When dealing with purely functional programs, continuations have no
 effects. In this case, one can let \var{Eff} remain abstract, or
 define it to be the empty type: $\var{Eff} = \bot$. This is also the
 natural choice when interpreting the original linear logic of
 \citet{girard_linear_1987}.

 The pure logic makes no requirement on effects, but interpretations
 may choose to impose a richer structure on them. Such interpretations
 would then not be complete with respect to the logic --- but they
 would remain sound.
 In our case, we first require \var{Eff} to be a monoid. Its unit
 (\var{mempty}) corresponds to program termination, while the operator
 (\var{mappend}) corresponds to sequential composition of effects.
 (This structure is standard to interpret the \smallcaps{halt} and
 \smallcaps{mix} rules in linear logic
 \citep{bernardy_composable_2015,mellis_resource_2010}.)

 For users of the stream library, \var{Eff} will remain an abstract
 monoid. However in this paper we will develop concrete effectful
 streams, and therefore we greatly extend the structure of effects. In
 fact, because we will provide streams interacting with files and other
 operating-system resources, and write the whole code in standard
 Haskell, we must pick $\var{Eff} = \var{IO} ()$, and ensure that
 \var{Eff} can be treated as a monoid. 

 > type Eff = IO ()

 > instance Monoid Eff where
 >   mempty = return ()
 >   mappend = (>>)

 The parts of the code which know about $\var{Eff} = \var{IO} ()$ must
 be carefully written. The type system provides no particular
 guarantees about such code. These IO-interacting functions do not
 interpret any standard fragment of linear logic: they are non-standard
 extensions of its model.
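
 As a small illustration of the monoid structure, the sketch below
 sequences two effects around the unit. It assumes a recent GHC (8.4 or
 later, to our knowledge) whose `base` already provides `Semigroup` and
 `Monoid` instances for `IO a`, so that no instance has to be declared
 in the example; `say` and `demo` are hypothetical names.

 ```haskell
 import Data.IORef

 type Eff = IO ()

 -- Hypothetical demo: returns the messages in the order they were run.
 demo :: IO [String]
 demo = do
   trace <- newIORef []
   let say :: String -> Eff
       say msg = modifyIORef trace (msg :)
       program :: Eff
       program = say "open" <> mempty <> say "close"  -- mempty does nothing
   program
   reverse <$> readIORef trace

 main :: IO ()
 main = demo >>= print
 ```

 Running `main` prints `["open","close"]`: composition runs effects
 from left to right, and the unit contributes no effect.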

 Streams
 =======

 Our guiding design principle is duality. This principle is reflected in
 the design of the streaming library: we not only have a type for
 sources of data but also a type for sinks. For example, a simple
 stream processor reading from a single source and writing to a single
 sink will be given the following type:

 < simple :: Src a -> Snk a -> Eff

 We will make sure that \var{Snk} is the negation of a source (and vice
 versa), and thus the type of the above program may equivalently have
 been written as follows:

 < simple :: Src a -> Src a

 However, having explicit access to sinks allows us to (for example)
 dispatch a single source to multiple sinks, as in the following type signature:

 < forkSrc :: Src (a,b) -> Snk a -> Snk b -> Eff

 Familiarity with duality will be crucial in the later sections of this paper.

 We define sources and sinks by mutual recursion. Producing a
 source means selecting whether more data is available (\var{Cons}) or not
 (\var{Nil}). If there is data, one must then produce a data item and
 *consume* a sink.

 > data Source  a   = Nil   | Cons a  (N (Sink    a))
 > data Sink    a   = Full  | Cont    (N (Source  a))

 Producing a sink means selecting whether one can accept more elements
 (\var{Cont}) or not (\var{Full}). In the former case, one must then be
 able to consume a source. The \var{Full} case is useful when the sink
 bails out early, for example when it encounters an exception.

 Note that, in order to produce (or consume) the next element, the
 source (or sink) must handle the effects generated by the other side
 of the stream before proceeding. This means that each production is
 matched by a consumption, and \textit{vice versa}.
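
 The lockstep behavior can be observed on a minimal standalone sketch,
 assuming $\var{Eff} = \var{IO} ()$. The producer `listSource` and the
 consumer `collector` are hypothetical helpers written for this
 illustration; they are not part of the library.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff

 data Source a = Nil  | Cons a (N (Sink a))
 data Sink   a = Full | Cont   (N (Source a))

 -- Hypothetical producer: offers the elements of a list one by one.
 listSource :: [a] -> Source a
 listSource []       = Nil
 listSource (x : xs) = Cons x $ \sink -> case sink of
   Full   -> return ()             -- the consumer stopped early
   Cont k -> k (listSource xs)     -- the consumer asks for more

 -- Hypothetical consumer: stores every element it receives.
 collector :: IORef [a] -> Sink a
 collector ref = Cont $ \source -> case source of
   Nil         -> return ()        -- the producer is exhausted
   Cons x rest -> do
     modifyIORef ref (x :)
     rest (collector ref)          -- hand control back to the producer

 demo :: IO [Int]
 demo = do
   ref <- newIORef []
   case collector ref of
     Cont k -> k (listSource [1, 2, 3])
     Full   -> return ()
   reverse <$> readIORef ref

 main :: IO ()
 main = demo >>= print
 ```

 Running `main` prints `[1,2,3]`. Each element is stored before the
 producer is asked for the next one, so no intermediate list is built
 between the two ends.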

 Linearity
 ---------

 For streams to be used safely, one can neither discard nor duplicate them,
 for otherwise effects may be discarded and duplicated, which is
 dangerous.  For example, the same file could be closed twice, or not
 at all.  Indeed, the last action of a sink will typically be closing
 the file. Timely closing of the sink can only be guaranteed if the
 actions are run until reaching the end of the pipe (either \var{Full}
 or \var{Nil}). In the rest of the section we precisely define the condition
 that programs need to respect in order to safely use our streams.

 The first notion that we need to define is that of an effectful type:

 * The type \var{Eff} is effectful
 * A function type is effectful if the co-domain is effectful
 * A product type is effectful if any of its operands is effectful
 * A sum type is effectful if any of its operands is effectful
 * A type variable is not effectful

 Further, we say that a variable with an effectful type is itself effectful.

 The linearity convention is then respected iff:

 1. No effectful variable may be duplicated or shared. In particular,
 if passed as an argument to a function it may not be used again.

 2. Every effectful variable must be consumed (or passed to a function, which
 will be in charge of consuming it).

 3. A type variable α cannot be instantiated to an effectful type.

 In this paper, the linearity convention is enforced by manual
 inspection. Manual inspection is unreliable, but whether the linearity
 convention is respected can be algorithmically decided. (See
 Sec. \ref{future-work}.)

 The third restriction (instantiation of type-variables) means that
 effectful types cannot be used in standard polymorphic Haskell
 functions. This is a severe restriction, but it gives enough leeway to
 implement a full-fledged stream library, as we do below. (Some
 approaches to lift this limitation have been proposed, e.g. by
 \citet{mazurak_lightweight_2010}.)


 One might think that the above restriction fails to take into account
 captured environments in functions. Indeed, one can write the following
 function, which may be duplicated, but runs linear effects.

 > oops :: (() -> Eff) -> IO Bool
 > oops k = do ignore <- k ()
 >             return True

 However, writing such a function requires knowing that $\var{Eff} =
 \var{IO} ()$, and is therefore disallowed by the Haskell type system
 in user code, where \var{Eff} is kept abstract. (The \var{mappend}
 function may combine two effects in one, but not discard or duplicate
 them.)


 Basics
 ------

 We begin by presenting three basic functions to manipulate
 \var{Source} and \var{Sink}: one to read from sources, one to write
 to sinks, and one to connect sources and sinks.

 \paragraph{Reading}
 One may want to provide the following function, waiting for data to be
 produced by a source. The second argument is the effect to run if no
 data is produced, and the third is the effect to run given the data
 and the remaining source.

 > await :: Source a -> Eff -> (a -> Source a -> Eff) -> Eff
 > await Nil eof _ = eof
 > await (Cons x cs) _ k = cs $ Cont $ \xs -> k x xs

 However, the above function breaks the linearity invariant, so we will
 refrain from using it as such. The pattern that it defines is still
 useful: it is valid when the second and third arguments consume the
 same set of variables.  Indeed, this condition is often satisfied.

 \paragraph{Writing}
 One can send data to a sink. If the sink is full, the data is ignored.
 The third argument is a continuation getting the "new" sink, that
 obtained after the "old" sink has consumed the data.

 > yield :: a -> Sink a -> (Sink a -> Eff) -> Eff
 > yield x (Cont c) k = c (Cons x k)
 > yield _ Full k = k Full

 \paragraph{Forwarding}
 One can forward the data from a source to a sink, as follows. The
 effect generated by this operation is the combined effect of all
 productions and consumptions on the stream.

 > forward :: Source a -> Sink a -> Eff
 > forward s (Cont s') = s' s
 > forward Nil Full = mempty
 > forward (Cons _ xs) Full = xs Full
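
 As an illustration of how writing and forwarding combine, the
 following standalone sketch sends two characters to a sink and then
 closes the stream by forwarding the empty source. It assumes
 $\var{Eff} = \var{IO} ()$; the sink `collector` and the function
 `demo` are hypothetical helpers.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff

 data Source a = Nil  | Cons a (N (Sink a))
 data Sink   a = Full | Cont   (N (Source a))

 yield :: a -> Sink a -> (Sink a -> Eff) -> Eff
 yield x (Cont c) k = c (Cons x k)
 yield _ Full     k = k Full

 forward :: Source a -> Sink a -> Eff
 forward s (Cont s')      = s' s
 forward Nil Full         = return ()   -- mempty, for Eff = IO ()
 forward (Cons _ xs) Full = xs Full

 -- Hypothetical sink that stores what it receives.
 collector :: IORef [a] -> Sink a
 collector ref = Cont $ \source -> case source of
   Nil         -> return ()
   Cons x rest -> modifyIORef ref (x :) >> rest (collector ref)

 demo :: IO String
 demo = do
   ref <- newIORef []
   yield 'a' (collector ref) $ \s1 ->
     yield 'b' s1 $ \s2 ->
       forward Nil s2                   -- close the stream
   reverse <$> readIORef ref

 main :: IO ()
 main = demo >>= putStrLn
 ```

 Running `main` prints `ab`. Each sink (`s1`, `s2`) is used exactly
 once, in accordance with the linearity convention.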




 Baking in negations: exercise in duality
 ----------------------------------------

 Programming with \var{Source} and \var{Sink} explicitly is
 inherently continuation-heavy: negations must be explicitly added in
 many places. This style is somewhat inconvenient; therefore, we will
 use instead pre-negated versions of sources and sinks:

 > type Src   a = N  (Sink a)
 > type Snk   a = N  (Source a)

 These definitions have the added advantage of perfecting the duality
 between sources and sinks, while not restricting the programs one can
 write.
 Indeed, one can access the underlying structure as follows:

 > onSource   :: (Src  a -> t) -> Source   a -> t
 > onSink     :: (Snk  a -> t) -> Sink     a -> t

 > onSource  f   s = f   (\t -> forward s t)
 > onSink    f   t = f   (\s -> forward s t)

 And, while a negated \var{Sink} cannot be converted to a
 \var{Source}, all the following conversions are implementable:

 > unshiftSnk :: N (Src a) -> Snk a
 > unshiftSrc :: N (Snk a) -> Src a
 > shiftSnk :: Snk a -> N (Src a)
 > shiftSrc :: Src a -> N (Snk a)

 > unshiftSnk = onSource
 > unshiftSrc = onSink
 > shiftSnk k kk = kk (Cont k)
 > shiftSrc k kk = k (Cont kk)

 A different reading of the type of \var{shiftSrc} reveals that it implements
 forwarding of data from \var{Src} to \var{Snk}:

 > fwd :: Src a -> Snk a -> Eff
 > fwd = shiftSrc

 In particular, one can flip sink transformers to obtain source transformers,
 and vice versa.

 > flipSnk :: (Snk a -> Snk b) -> Src b -> Src a
 > flipSnk f s = shiftSrc s . onSink f

 > flipSrc :: (Src a -> Src b) -> Snk b -> Snk a
 > flipSrc f t = shiftSnk t . onSource f


 Flipping allows one to choose the most convenient direction to
 implement, and get the other one for free. Consider as an example the
 implementation of the mapping functions:

 > mapSrc  :: (a -> b) -> Src  a -> Src  b
 > mapSnk  :: (b -> a) -> Snk  a -> Snk  b

 Mapping sources is defined by flipping mapping of sinks:

 > mapSrc f = flipSnk (mapSnk f)

 Sink mapping is defined by case analysis on the concrete
 source, and the recursive case conveniently calls \var{mapSrc}.

 > mapSnk _ snk Nil = snk Nil
 > mapSnk f snk (Cons a s)
 >   = snk (Cons (f a) (mapSrc f s))
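
 The mutual recursion between \var{mapSrc} and \var{mapSnk} can be
 exercised on the following standalone sketch, which repeats the
 definitions above with $\var{Eff} = \var{IO} ()$. The endpoints
 `listSrc` and `collectSnk`, as well as `demo`, are hypothetical helpers
 introduced only for this illustration.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil  | Cons a (N (Sink a))
 data Sink   a = Full | Cont   (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 forward :: Source a -> Sink a -> Eff
 forward s (Cont s')      = s' s
 forward Nil Full         = return ()
 forward (Cons _ xs) Full = xs Full

 onSink :: (Snk a -> t) -> Sink a -> t
 onSink f t = f (\s -> forward s t)

 shiftSrc :: Src a -> N (Snk a)
 shiftSrc k kk = k (Cont kk)

 flipSnk :: (Snk a -> Snk b) -> Src b -> Src a
 flipSnk f s = shiftSrc s . onSink f

 mapSrc :: (a -> b) -> Src a -> Src b
 mapSrc f = flipSnk (mapSnk f)

 mapSnk :: (b -> a) -> Snk a -> Snk b
 mapSnk _ snk Nil        = snk Nil
 mapSnk f snk (Cons a s) = snk (Cons (f a) (mapSrc f s))

 -- Hypothetical source producing the elements of a list.
 listSrc :: [a] -> Src a
 listSrc _        Full     = return ()
 listSrc []       (Cont k) = k Nil
 listSrc (x : xs) (Cont k) = k (Cons x (listSrc xs))

 -- Hypothetical sink storing what it receives.
 collectSnk :: IORef [a] -> Snk a
 collectSnk _   Nil           = return ()
 collectSnk ref (Cons x rest) =
   modifyIORef ref (x :) >> rest (Cont (collectSnk ref))

 demo :: IO [Int]
 demo = do
   ref <- newIORef []
   mapSrc (* 2) (listSrc [1, 2, 3]) (Cont (collectSnk ref))
   reverse <$> readIORef ref

 main :: IO ()
 main = demo >>= print
 ```

 Running `main` prints `[2,4,6]`. Each element is transformed and
 delivered before the next one is requested from `listSrc`.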


 When using double negations, it is sometimes useful to insert or
 remove them inside type constructors. For sources and sinks, one proceeds
 as follows. Introduction of double negation in sources and its elimination
 in sinks is a special case of mapping.

 > nnIntro :: Src a -> Src (NN a)
 > nnIntro = mapSrc shift

 > nnElim' :: Snk (NN a) -> Snk a
 > nnElim' = mapSnk shift

 The duals are easily implemented by case analysis, following the mutual
 recursion pattern introduced above.

 > nnElim :: Src (NN a) -> Src a
 > nnIntro' :: Snk a -> Snk (NN a)

 > nnElim = flipSnk nnIntro'
 > nnIntro' k Nil = k Nil
 > nnIntro' k (Cons x xs) = x $ \x' -> k (Cons x' $ nnElim xs)


 Effect-Free Streams
 ===================

 The functions seen so far make no use of the fact that \var{Eff} can
 embed IO actions. In fact, a large number of useful functions over
 streams can be implemented without relying on IO. We give an overview
 of effect-free streams in this section.

 List-Like API
 -------------

 To begin, we show that one can implement a list-like API for
 sources, as follows:

 > empty :: Src a
 > empty sink' = forward Nil sink'

 > cons :: a -> Src a -> Src a
 > cons a s s' = yield a s' s

 > tail :: Src a -> Src a
 > tail = flipSnk $ \t s -> case s of
 >   Nil -> t Nil
 >   Cons _ xs -> fwd xs t

 (Taking just the head is not meaningful due to the linearity
 constraint.)

 Dually, the full sink is simply

 > plug :: Snk a
 > plug source' = forward source' Full


 Another useful function is the equivalent of \var{take} on lists.
 Given a source, we can create a new source which ignores all but its
 first $n$ elements. Conversely, we can prune a sink to consume only
 the first $n$ elements of a source.

 > takeSrc  :: Int -> Src  a -> Src  a
 > takeSnk  :: Int -> Snk  a -> Snk  a

 The natural implementation is again by mutual recursion. The main
 subtlety is that, when reaching the $n$th element, both ends of the
 stream must be notified of its closing. Note the use of the monoidal
 structure of \var{Eff} in this case.

 > takeSrc i = flipSnk (takeSnk i)

 > takeSnk _ s Nil = s Nil
 > takeSnk 0 s (Cons _ s') = s Nil <> s' Full
 > takeSnk i s (Cons a s') = s (Cons a (takeSrc (i-1) s'))
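
 To observe that both ends are notified when the stream is cut, the
 standalone sketch below instruments a source and a sink with a shared
 log. It assumes $\var{Eff} = \var{IO} ()$, writes the monoid operation
 as `>>`, and inlines \var{shiftSrc} and \var{onSink} into
 \var{flipSnk}. The names `loggedSrc`, `loggedSnk` and `demo` are
 hypothetical.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil  | Cons a (N (Sink a))
 data Sink   a = Full | Cont   (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 forward :: Source a -> Sink a -> Eff
 forward s (Cont s')      = s' s
 forward Nil Full         = return ()
 forward (Cons _ xs) Full = xs Full

 -- flipSnk with shiftSrc and onSink inlined
 flipSnk :: (Snk a -> Snk b) -> Src b -> Src a
 flipSnk f s t = s (Cont (f (\s' -> forward s' t)))

 takeSrc :: Int -> Src a -> Src a
 takeSrc i = flipSnk (takeSnk i)

 takeSnk :: Int -> Snk a -> Snk a
 takeSnk _ s Nil         = s Nil
 takeSnk 0 s (Cons _ s') = s Nil >> s' Full   -- notify both ends
 takeSnk i s (Cons a s') = s (Cons a (takeSrc (i - 1) s'))

 -- Hypothetical source that logs when it is closed.
 loggedSrc :: IORef [String] -> [a] -> Src a
 loggedSrc lg _        Full     = modifyIORef lg ("source closed" :)
 loggedSrc lg []       (Cont k) = modifyIORef lg ("source closed" :) >> k Nil
 loggedSrc lg (x : xs) (Cont k) = k (Cons x (loggedSrc lg xs))

 -- Hypothetical sink that logs what it receives and when it is closed.
 loggedSnk :: Show a => IORef [String] -> Snk a
 loggedSnk lg Nil           = modifyIORef lg ("sink closed" :)
 loggedSnk lg (Cons x rest) = do
   modifyIORef lg (("got " ++ show x) :)
   rest (Cont (loggedSnk lg))

 demo :: IO [String]
 demo = do
   lg <- newIORef []
   takeSrc 2 (loggedSrc lg ([1 ..] :: [Int])) (Cont (loggedSnk lg))
   reverse <$> readIORef lg

 main :: IO ()
 main = demo >>= mapM_ putStrLn
 ```

 Running `main` prints `got 1`, `got 2`, `sink closed` and
 `source closed`, one per line. The underlying list is infinite, yet
 the pipeline terminates, and the third element is requested but
 discarded when the limit is reached.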


 Algebraic structure
 -------------------

 Sources and sinks each form a monoid under concatenation:

 > instance Monoid (Src a) where
 >   mappend = appendSrc
 >   mempty = empty

 > instance Monoid (Snk a) where
 >   mappend = appendSnk
 >   mempty = plug

 We have already encountered the units (\var{empty} and \var{plug});
 the appending operations are defined below.  Intuitively,
 \var{appendSrc} first gives control to the first source until it runs
 out of elements and then turns control over to the second source. This
 behavior is implemented in the helper function \var{forwardThenSnk}.

 > appendSrc :: Src a -> Src a -> Src a
 > appendSrc s1 s2 Full = s1 Full <> s2 Full
 > appendSrc s1 s2 (Cont s)
 >   = s1 (Cont (forwardThenSnk s s2))

 > forwardThenSnk :: Snk a -> Src a -> Snk a
 > forwardThenSnk snk src Nil = fwd src snk
 > forwardThenSnk snk src (Cons a s)
 >   = snk (Cons a (appendSrc s src))

 Sinks can be appended in a similar fashion.

 > appendSnk :: Snk a -> Snk a -> Snk a
 > appendSnk s1 s2 Nil = s1 Nil <> s2 Nil
 > appendSnk s1 s2 (Cons a s)
 >   = s1 (Cons a (forwardThenSrc s2 s))

 > forwardThenSrc :: Snk a -> Src a -> Src a
 > forwardThenSrc s2 = flipSnk (appendSnk s2)

 The operations \var{forwardThenSnk} and \var{forwardThenSrc} are akin
 to taking the difference of sources and sinks, thus we find it
 convenient to give them the following aliases:

 > (-?) :: Snk a -> Src a -> Snk a
 > t -? s = forwardThenSnk t s

 > (-!) :: Snk a -> Src a -> Src a
 > t -! s = forwardThenSrc t s

 > infixr -!
 > infixl -?

 Appending and differences interact in the expected way: the following
 observational equalities hold:

 < t -? (s1 <> s2) == t -? s2 -? s1
 < (t1 <> t2) -! s == t1 -! t2 -! s

 <!--

 Not sure if these are true or what

 > prop_diff3 t1 t2 s = (t1 <> t2) -? s == t1 -? (t2 -! s)
 > prop_diff4 t s1 s2 = t -! (s1 <> s2) == (t -? s1) -! s2

 -->

 The proofs for the above laws can be found in Appendix \ref{proof}.

 \paragraph{Functor}
 We have already seen the mapping functions for sources and sinks:
 sources are functors and sinks are contravariant functors. (Given the
 implementation of the morphism actions it is straightforward to check
 the functor laws.)



 Table of effect-free functions
 ------------------------------

 The above already gives an extensive API for sources and sinks, but many
 more useful effect-free functions can be implemented on this basis. We
 give here a menu of functions that we have implemented, and whose
 implementation is available in the appendix.

 Zip two sources, and the dual.

 > zipSrc :: Src a -> Src b -> Src (a,b)
 > forkSnk :: Snk (a,b) -> Src a -> Snk b

 <!--
 or: forkSnk :: Snk (a,b) -> Snk a ⅋ Snk b
 -->

 Zip two sinks, and the dual.

 > forkSrc :: Src (a,b) -> Snk a -> Src b
 > zipSnk :: Snk a -> Snk b -> Snk (a,b)

 Equivalent of \var{scanl'} for sources, and the dual.

 > scanSrc :: (b -> a -> b) -> b -> Src a -> Src b
 > scanSnk :: (b -> a -> b) -> b -> Snk b -> Snk a

 Equivalent of \var{foldl'} for sources, and the dual.

 > foldSrc' :: (b -> a -> b) -> b -> Src a -> NN b
 > foldSnk' :: (b -> a -> b) -> b -> N b -> Snk a

 Drop some elements from a source, and the dual.

 > dropSrc :: Int -> Src a -> Src a
 > dropSnk :: Int -> Snk a -> Snk a

 Convert a list to a source, and vice versa.

 > fromList :: [a] -> Src a
 > toList :: Src a -> NN [a]

 Split a source in lines, and the dual.

 > linesSrc :: Src Char -> Src String
 > unlinesSnk :: Snk String -> Snk Char


 Consume elements until one satisfies the predicate; then the sink is
 closed.

 > untilSnk :: (a -> Bool) -> Snk a

 Interleave two sources, and the dual.

 > interleave :: Src a -> Src a -> Src a
 > interleaveSnk :: Snk a -> Src a -> Snk a

 Forward data coming from the input source to the result source and to
 the second argument sink.

 > tee :: Src a -> Snk a -> Src a

 Filter a source, and the dual.

 > filterSrc :: (a -> Bool) -> Src a -> Src a
 > filterSnk :: (a -> Bool) -> Snk a -> Snk a

 Turn a source of chunks of data into a single source; and the dual.

 > unchunk :: Src [a] -> Src a
 > chunkSnk :: Snk a -> Snk [a]


 App: Stream-Based Parsing
 -------------------------

 To finish with effect-free functions, we give an example of a complex
 stream processor, which turns a source of unstructured data into a
 source of structured data, given a parser.  This conversion is useful
 for example to turn an XML file, provided as a stream of characters,
 into a stream of (opening and closing) tags.

 We begin by defining a pure parsing structure, modeled after the
 parallel parsing processes of \citet{claessen_parallel_2004}.  The
 parser is continuation based, but the effects being accumulated are
 parsing processes, defined as follows. The \var{Sym} constructor parses \var{Just}
 a symbol, or \var{Nothing} if the end of stream is reached. A process may
 also \var{Fail} or return a \var{Result}.

 > data P s res  =  Sym (Maybe s -> P s res)
 >               |  Fail
 >               |  Result res

 A parser produces the double negation of $a$:

 > newtype Parser s a = P (forall res. (a -> P s res) -> P s res)

 The monadic interface can then be built in the standard way:

 > instance Monad (Parser s) where
 >   return x  = P $ \fut -> fut x
 >   P f >>= k = P (\fut -> f (\a -> let P g = k a in g fut))
 > instance Applicative (Parser s) where
 >   pure = return
 >   (<*>) = ap
 > instance Functor (Parser s) where
 >   fmap f p = p >>= (return . f)

 The essential parsing ingredient, choice, rests on the
 ability to weave processes together; picking that which
 succeeds first, and that which fails as last resort:

 > weave :: P s a -> P s a -> P s a
 > weave Fail x = x
 > weave x Fail = x
 > weave (Result res) y = Result res
 > weave x (Result res) = Result res
 > weave (Sym k1) (Sym k2)
 >     = Sym (\s -> weave (k1 s) (k2 s))

 > (<|>) :: Parser s a -> Parser s a -> Parser s a
 > P p <|> P q = P (\fut -> weave (p fut) (q fut))
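
 To see weaving at work independently of streams, the standalone
 sketch below repeats the definitions above and runs a choice between
 two parsers over plain lists. The helpers `unP`, `symbol`,
 `runParser`, `abOrAc` and `demo` are hypothetical additions made for
 this illustration, and the \var{Functor} instance is defined through
 `liftM`.

 ```haskell
 {-# LANGUAGE RankNTypes #-}
 import Control.Monad (ap, liftM)

 data P s res = Sym (Maybe s -> P s res) | Fail | Result res

 newtype Parser s a = P (forall res. (a -> P s res) -> P s res)

 -- Hypothetical helper: unwrap a parser.
 unP :: Parser s a -> (a -> P s res) -> P s res
 unP (P p) = p

 instance Functor (Parser s) where
   fmap = liftM
 instance Applicative (Parser s) where
   pure x = P (\fut -> fut x)
   (<*>)  = ap
 instance Monad (Parser s) where
   P f >>= k = P (\fut -> f (\a -> unP (k a) fut))

 weave :: P s a -> P s a -> P s a
 weave Fail x = x
 weave x Fail = x
 weave (Result res) _ = Result res
 weave _ (Result res) = Result res
 weave (Sym k1) (Sym k2) = Sym (\s -> weave (k1 s) (k2 s))

 (<|>) :: Parser s a -> Parser s a -> Parser s a
 P p <|> P q = P (\fut -> weave (p fut) (q fut))

 -- Hypothetical primitive: accept exactly the given symbol.
 symbol :: Eq s => s -> Parser s s
 symbol c = P (\fut -> Sym (\ms ->
   case ms of
     Just x | x == c -> fut x
     _               -> Fail))

 -- Hypothetical driver: feed a list of symbols to the parsing process.
 runParser :: Parser s a -> [s] -> Maybe a
 runParser (P p) = go (p Result)
   where
     go (Result r) _        = Just r
     go Fail       _        = Nothing
     go (Sym k)    []       = go (k Nothing) []
     go (Sym k)    (c : cs) = go (k (Just c)) cs

 abOrAc :: Parser Char Char
 abOrAc = (symbol 'a' >> symbol 'b') <|> (symbol 'a' >> symbol 'c')

 demo :: [Maybe Char]
 demo = map (runParser abOrAc) ["ab", "ac", "ad"]

 main :: IO ()
 main = print demo
 ```

 Running `main` prints `[Just 'b',Just 'c',Nothing]`. Both alternatives
 consume the leading `'a'` in parallel, so no backtracking over the
 input is needed.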


 Parsing then reconciles the execution of the process with the
 traversal of the source. In particular, whenever a result is
 encountered, it is fed to the sink. If the parser fails, both ends of
 the stream are closed.

 > parse :: forall s a. Parser s a -> Src s -> Src a
 > parse q@(P p0) = flipSnk $ scan $ p0 $ \x -> Result x
 >  where
 >   scan :: P s a -> Snk a -> Snk s
 >   scan (Result res  )  ret        xs     = ret
 >        (Cons res $ parse q $ forward xs)
 >   scan Fail            ret        xs     = ret Nil <> forward xs Full
 >   scan (Sym f)         mres       xs     = case xs of
 >     Nil        -> scan (f Nothing) mres Nil
 >     Cons x cs  -> fwd cs (scan (f $ Just x) mres)




 Effectful streams
 =================

 So far, we have constructed only effect-free streams. That is, effects
 could be any monoid, and in particular the unit type.  In this
 section we bridge this gap and provide some useful sources and sinks
 performing IO effects, namely reading and writing to files.

 We first define the following helper function, which sends data to a
 handle, thereby constructing a sink.

 > hFileSnk :: Handle -> Snk String
 > hFileSnk h Nil = hClose h
 > hFileSnk h (Cons c s) = do
 >   hPutStrLn h c
 >   s (Cont (hFileSnk h))

 A file sink is then simply:

 > fileSnk :: FilePath -> Snk String
 > fileSnk file s = do
 >   h <- openFile file WriteMode
 >   hFileSnk h s

 And the sink for standard output is:

 > stdoutSnk :: Snk String
 > stdoutSnk = hFileSnk stdout

 (For ease of experimenting with our functions, the data items are
 lines of text --- but an industrial variant would provide chunks of
 raw binary data, to be further parsed.)

 Conversely, a file source reads data from a file, as follows:

 > hFileSrc :: Handle -> Src String
 > hFileSrc h Full = hClose h
 > hFileSrc h (Cont c) = do
 >   e <- hIsEOF h
 >   if e   then   do  hClose h
 >                     c Nil
 >          else   do  x <- hGetLine h
 >                     c (Cons x $ hFileSrc h)

 > fileSrc :: FilePath -> Src String
 > fileSrc file sink = do
 >   h <- openFile file ReadMode
 >   hFileSrc h sink

 Combining the above primitives, we can then implement file copy as
 follows:

 > copyFile :: FilePath -> FilePath -> Eff
 > copyFile source target = fwd  (fileSrc source)
 >                               (fileSnk target)

 It should be emphasized at this point that, when running \var{copyFile}, reading and writing will be
 interleaved: in order to produce the next line in the source (in this
 case by reading from the file), the current line must first be
 consumed in the sink (in this case by writing it to disk).  The stream
 behaves fully synchronously, and no intermediate data is buffered.

 Whenever a sink is full, the source connected to it should be finalized.
 The next example shows what happens when a sink closes the stream
 early. Instead of connecting the source to a bottomless sink, we
 connect it to one which stops receiving input after three lines.

 > read3Lines :: Eff
 > read3Lines = fwd  (hFileSrc stdin)
 >                   (takeSnk 3 $ fileSnk "text.txt")

 Indeed, testing the above program reveals that it properly closes
 \var{stdin} after reading three lines. This early closing of sinks
 allows modular stream programming. In particular, it is easy to
 support proper finalization in the presence of exceptions, as the next
 section shows.
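
 Early closing can also be checked in memory. The sketch below is
 self-contained; its \var{takeSnk} and \var{takeSrc} are simplified
 stand-ins written for this example (they are not claimed to be
 identical to the library's versions), and \var{flagSrc},
 \var{collectSnk} and \var{firstThree} are hypothetical helpers. The
 source raises a flag when it is finalized.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 fwd :: Src a -> Snk a -> Eff
 fwd src snk = src (Cont snk)

 -- Source over a list; sets the flag when finalized or exhausted.
 flagSrc :: IORef Bool -> [a] -> Src a
 flagSrc closed _ Full = writeIORef closed True
 flagSrc closed [] (Cont c) = writeIORef closed True >> c Nil
 flagSrc closed (x : xs) (Cont c) = c (Cons x (flagSrc closed xs))

 -- Sink accumulating every element it receives (in reverse order).
 collectSnk :: IORef [a] -> Snk a
 collectSnk _ Nil = return ()
 collectSnk out (Cons x rest) = do
   modifyIORef out (x :)
   rest (Cont (collectSnk out))

 -- Simplified stand-ins: let at most n elements through.
 takeSnk :: Int -> Snk a -> Snk a
 takeSnk _ snk Nil = snk Nil
 takeSnk 0 snk (Cons _ rest) = rest Full >> snk Nil
 takeSnk n snk (Cons x rest) = snk (Cons x (takeSrc (n - 1) rest))

 takeSrc :: Int -> Src a -> Src a
 takeSrc _ src Full = src Full
 takeSrc 0 src (Cont k) = src Full >> k Nil -- quota reached: finalize upstream
 takeSrc n src (Cont k) = src (Cont (takeSnk n k))

 firstThree :: IO ([Int], Bool)
 firstThree = do
   closed <- newIORef False
   out <- newIORef []
   fwd (flagSrc closed [1 .. 10]) (takeSnk 3 (collectSnk out))
   xs <- readIORef out
   done <- readIORef closed
   return (reverse xs, done)
 ```

 Here \var{firstThree} returns `([1,2,3],True)`: the source is told to
 finalize as soon as the third element has been delivered, although it
 still had seven elements to offer.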

 Exception Handling
 ------------------

 While the above implementations of file source and sink are fine for
 illustrative purposes, their production-strength versions should
 handle exceptions. Doing so is straightforward: as shown above, our
 sinks and sources readily support early closing of the stream.

 The following code fragment shows how to handle an exception when
 reading a line in a file source.

 > hFileSrcSafe :: Handle -> Src String
 > hFileSrcSafe h Full = hClose h
 > hFileSrcSafe h (Cont c) = do
 >   e <- hIsEOF h
 >   if e then do
 >          hClose h
 >          c Nil
 >        else do
 >          mx <- catch  (Just <$> hGetLine h)
 >                       (\(_ :: IOException) -> return Nothing)
 >          case mx of
 >            Nothing -> do  hClose h
 >                           c Nil
 >            Just x -> c (Cons x $ hFileSrcSafe h)

 Exceptions raised in \var{hIsEOF} should be handled in the same
 way. The file sink is responsible for handling its own exceptions so
 there is no need to insert a handler around the invocation of the
 continuation \var{c}.  One would probably have a field in both the
 \var{Nil} and \var{Full} constructors indicating the nature of the
 exception encountered, if any, but we will not bother in the proof of
 concept implementation presented in this paper.
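
 The same pattern can be exercised without a file. In the following
 self-contained sketch, \var{safeSrc} is a hypothetical generalization
 of the source above: it takes a finalizer and an action fetching the
 next element, and ends the stream when the action reports end of input
 or raises an \var{IOException}. The names \var{tryIO}, \var{flaky},
 \var{collectSnk} and \var{runFlaky} are likewise illustrative.

 ```haskell
 import Control.Exception (IOException, try)
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 -- 'try' specialized to IOException, so that no extension is needed.
 tryIO :: IO a -> IO (Either IOException a)
 tryIO = try

 -- Finalizer, action producing the next element (Nothing = end of input).
 safeSrc :: IO () -> IO (Maybe a) -> Src a
 safeSrc close _ Full = close
 safeSrc close next (Cont c) = do
   r <- tryIO next
   case r of
     Right (Just x) -> c (Cons x (safeSrc close next))
     _ -> close >> c Nil -- end of input, or a failed read

 -- Yields 0 and 1, then fails like a broken device would.
 flaky :: IORef Int -> IO (Maybe Int)
 flaky ref = do
   n <- readIORef ref
   writeIORef ref (n + 1)
   if n < 2 then return (Just n) else ioError (userError "simulated failure")

 collectSnk :: IORef [a] -> Snk a
 collectSnk _ Nil = return ()
 collectSnk out (Cons x rest) = do
   modifyIORef out (x :)
   rest (Cont (collectSnk out))

 runFlaky :: IO ([Int], Bool)
 runFlaky = do
   counter <- newIORef 0
   closed <- newIORef False
   out <- newIORef []
   safeSrc (writeIORef closed True) (flaky counter) (Cont (collectSnk out))
   xs <- readIORef out
   done <- readIORef closed
   return (reverse xs, done)
 ```

 The sink sees an ordinary end of stream: \var{runFlaky} returns
 `([0,1],True)`, that is, the two elements read before the failure,
 with the finalizer run.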


 Dealing with exceptions is done once and for all when implementing the
 library of streams. The programmer using the library does not have to
 be concerned with exceptions as they are caught and communicated
 properly under the hood.

 Using exception handlers, as in the above snippet, will secure the
 library from synchronous exceptions arising from accessing the file,
 but not from asynchronous exceptions which may come from other
 sources. Asynchronous exception-safety requires more machinery. The
 region library presented in \citet{kiselyov2008lightweight} can be
 used for this purpose, as outlined in
 \citet{kiselyov12:region_iteratees}.


 Synchronicity and Asynchronicity
 ================================
 \label{async}

 One of the main benefits of streams as defined here is that the
 programming interface is (or appears to be) asynchronous, while the
 run-time behavior is synchronous.
 That is, one can build a data source regardless of how the data is consumed,
 or dually one can build a sink regardless of how the data is produced;
 but, despite the independence of definitions, all the code can be (and
 is) executed synchronously: composing a source and a sink requires no
 concurrency (nor any external control structure).

 As discussed above, a consequence of synchronicity is that the
 programmer cannot implicitly buffer data when connecting a source
 to a sink: every production must be matched by a consumption (and vice
 versa).  In sum, synchronicity restricts the kind of operations one
 can construct, in exchange for two guarantees:

 1. Execution of connected sources and sinks is synchronous
 2. No implicit memory allocation happens

 While the guarantees have been discussed so far, it may be unclear how
 synchronicity actually restricts the programs one can write. In the
 rest of the section we show by example how the restriction plays out.

 Example: demultiplexing
 -----------------------

 One operation supported by synchronous behavior is the demultiplexing
 of a source, by connecting it to two sinks.

 > dmux' :: Src (Either a b) -> Snk a -> Snk b -> Eff

 We can implement this demultiplexing operation as follows:

 > dmux :: Source (Either a b) -> Sink a -> Sink b -> Eff
 > dmux Nil ta tb = forward Nil ta <> forward Nil tb
 > dmux (Cons ab c) ta tb = case ab of
 >   Left a -> c $ Cont $ \src' -> case ta of
 >     Full -> forward Nil tb <> plug src'
 >     Cont k -> k (Cons a $ \ta' -> dmux src' ta' tb)
 >   Right b -> c $ Cont $ \src' -> case tb of
 >     Full -> forward Nil ta <> plug src'
 >     Cont k -> k (Cons b $ \tb' -> dmux src' ta tb')

 > dmux' sab' ta' tb' =
 >   shiftSnk ta' $ \ta ->
 >   shiftSnk tb' $ \tb ->
 >   shiftSrc sab' $ \sab ->
 >   dmux sab ta tb

 The key ingredient is that demultiplexing starts by reading the next
 value available on the source. Depending on its value, we feed the
 data to either of the sinks and proceed. Besides, as soon as any of
 the three parties closes the stream, the other two are notified.
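
 To see demultiplexing at work, the following self-contained sketch
 restates \var{dmux} (with `>>` in place of `<>`, and with
 \var{forward} and \var{plug} written out as we assume the library
 defines them) and runs it on an in-memory source. The helpers
 \var{listSrc}, \var{collectSnk} and \var{splitDemo} are hypothetical.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 -- Assumed definitions of the library helpers used by dmux.
 forward :: Source a -> Sink a -> Eff
 forward s (Cont k) = k s
 forward Nil Full = return ()
 forward (Cons _ rest) Full = rest Full

 plug :: Snk a
 plug s = forward s Full

 dmux :: Source (Either a b) -> Sink a -> Sink b -> Eff
 dmux Nil ta tb = forward Nil ta >> forward Nil tb
 dmux (Cons ab c) ta tb = case ab of
   Left a -> c $ Cont $ \src' -> case ta of
     Full -> forward Nil tb >> plug src'
     Cont k -> k (Cons a $ \ta' -> dmux src' ta' tb)
   Right b -> c $ Cont $ \src' -> case tb of
     Full -> forward Nil ta >> plug src'
     Cont k -> k (Cons b $ \tb' -> dmux src' ta tb')

 listSrc :: [a] -> Src a
 listSrc _ Full = return ()
 listSrc [] (Cont c) = c Nil
 listSrc (x : xs) (Cont c) = c (Cons x (listSrc xs))

 collectSnk :: IORef [a] -> Snk a
 collectSnk _ Nil = return ()
 collectSnk out (Cons x rest) = do
   modifyIORef out (x :)
   rest (Cont (collectSnk out))

 -- Route Left values to one sink and Right values to the other.
 splitDemo :: IO ([Int], String)
 splitDemo = do
   ints <- newIORef []
   chars <- newIORef []
   let input = [Left 1, Right 'a', Left 2, Right 'b']
   listSrc input $ Cont $ \s ->
     dmux s (Cont (collectSnk ints)) (Cont (collectSnk chars))
   is <- readIORef ints
   cs <- readIORef chars
   return (reverse is, reverse cs)
 ```

 \var{splitDemo} returns `([1,2],"ab")`; no element is stored anywhere
 but in the sink that finally receives it.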

 However, multiplexing sources cannot be implemented while respecting
 synchronicity. To see why, let us attempt anyway, using the following
 type signature:

 < mux :: Src a -> Src b -> Src (Either a b)
 < mux sa sb = ?

 We can try to fill the hole by reading on a source. However, if we do
 this, it falls to the multiplexer to choose which source to
 run first. We may pick \var{sa}; however, it may block while
 \var{sb} is ready with data. This is not really multiplexing: at best,
 this approach would give us interleaving of data sources, by taking
 turns.

 In order to make any progress, we can let the choice of which source
 to pick fall on the consumer of the stream. The type that we need for
 output data in this case is a so-called additive conjunction. It is
 the dual of the \var{Either} type: there is a choice, but this choice
 falls on the consumer rather than the producer of the data. Additive
 conjunction, written &, can be encoded by sandwiching \var{Either}
 between two inversions of the control flow, thus switching the party
 who makes the choice:

 > type a & b = N (Either (N a) (N b))

 (One will recognize the similarity between this definition and
 De Morgan's laws.)
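
 A small self-contained sketch may help to see who chooses. The
 producer (\var{offer}) must be ready to deliver either component; the
 consumer (\var{pickLeft} or \var{pickRight}) selects one by passing
 the corresponding continuation. These three names and \var{choiceDemo}
 are illustrative and not part of the library.

 ```haskell
 {-# LANGUAGE TypeOperators #-}
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 type a & b = N (Either (N a) (N b))

 -- The producer offers both alternatives.
 offer :: a -> b -> a & b
 offer a _ (Left ka) = ka a
 offer _ b (Right kb) = kb b

 -- The consumer makes the choice.
 pickLeft :: a & b -> N a -> Eff
 pickLeft w ka = w (Left ka)

 pickRight :: a & b -> N b -> Eff
 pickRight w kb = w (Right kb)

 choiceDemo :: IO (Int, String)
 choiceDemo = do
   l <- newIORef 0
   r <- newIORef ""
   -- Each offer is consumed exactly once, following the linearity convention.
   pickLeft (offer (42 :: Int) "hello") (writeIORef l)
   pickRight (offer (42 :: Int) "hello") (writeIORef r)
   (,) <$> readIORef l <*> readIORef r
 ```

 \var{choiceDemo} returns `(42,"hello")`.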

 We can then amend the type of multiplexing:

 > mux :: Src a -> Src b -> Src (a & b)

 Unfortunately, we still cannot implement multiplexing typed as
 above. Consider the following attempt, where we begin by asking the
 consumer if it desires $a$ or $b$. If the answer is $a$,
 we can extract a value from \var{sa} and yield it; and
 symmetrically for $b$.

 > mux sa sb (Cont tab) = tab $ Cons
 >                         (\ab -> case ab of
 >                                  Left   ka -> sa $ Cont $ \(Cons a resta) -> ka a
 >                                  Right  kb -> sb $ Cont $ \(Cons b restb) -> kb b)
 >                         (error "oops")

 However, there is no way to then make a recursive call (`oops`) to
 continue processing.  Indeed the recursive call to make must depend on
 the choice made by the consumer (in one case we should be using
 \var{resta}, in the other \var{restb}). However the type of \var{Cons}
 forces us to produce its arguments independently.

 What we need to do is to reverse the control fully: we need a data
 source which is in control of the flow of execution.

 Co-Sources, Co-Sinks
 --------------------

 We call the structure that we are looking for a
 *co-source*. Co-sources are the subject of this section.  Recall
 that producing $N a$ is equivalent to consuming $a$; thus a sink of $N
 a$ is a (different kind of) source of $a$. We define:

 > type CoSrc a = Snk (N a)
 > type CoSnk a = Src (N a)

 Implementing multiplexing on co-sources is then straightforward, by
 leveraging \var{dmux'}:

 > mux' :: CoSrc a -> CoSrc b -> CoSrc (a & b)
 > mux' sa sb = unshiftSnk $ \tab -> dmux' (nnElim tab) sa sb


 We use the rest of the section to study the properties of co-sources and
 co-sinks.

 \var{CoSrc} is a functor, and \var{CoSnk} is a contravariant functor.

 > mapCoSrc :: (a -> b) -> CoSrc a -> CoSrc b
 > mapCoSrc f = mapSnk (\b' -> \a -> b' (f a))

 > mapCoSnk :: (b -> a) -> CoSnk a -> CoSnk b
 > mapCoSnk f = mapSrc (\b' -> \a -> b' (f a))

 Elements of a co-source are accessed only "one at a time". That is, one
 cannot extract the contents of a co-source as a list. Attempting to
 implement this extraction looks as follows.

 > coToList :: CoSrc a -> NN [a]
 > coToList k1 k2 = k1 $ Cons (\a -> k2 [a]) (error "rest")
 > coToList k1 k2 = k2 $ (error "a?") : (error "rest")

 If one tries to begin by eliminating the co-source (first equation), then there is no
 way to produce subsequent elements of the list. If one tries to begin
 by constructing the list (second equation), then no data is available.

 Yet it is possible to define useful and effectful co-sources and
 co-sinks. The first example shows how to provide a file as a co-source:

 > coFileSrc :: Handle -> CoSrc String
 > coFileSrc h Nil = hClose h
 > coFileSrc h (Cons x xs) = do
 >   e <- hIsEOF h
 >   if e then do
 >          hClose h
 >          xs Full
 >        else do
 >          x' <- hGetLine h
 >          x x'                     -- (1)
 >          xs $ Cont $ coFileSrc h  -- (2)


 Compared to \var{fileSrc}, the difference is that this function can
 decide the ordering of effects run in a co-sink connected to it. That is,
 the lines (1) and (2) have no data dependency. Therefore they may be
 run in any order. (Blindly doing so is a bad idea though, as the
 \var{Full} action on the sink will be run before all other actions.)
 We will see in the next section how this situation generalizes.

 The second example is an infinite co-sink that sends data to a file.

 > coFileSink :: Handle -> CoSnk String
 > coFileSink h Full = hClose h
 > coFileSink h (Cont c) = c (Cons  (hPutStrLn h)
 >                                  (coFileSink h))

 Compared to \var{fileSnk}, the difference is that one does not control
 the order of execution of effects. The effect of writing the current
 line is put in a data structure, and its execution is up to the
 co-source which eventually connects to the co-sink. Thus, the
 order of writing lines in the file depends on the order of effects chosen
 in the co-source connected to this co-sink.
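
 The following self-contained sketch connects a co-source to a co-sink
 in memory. It assumes the stream types restated below; \var{listCoSrc},
 \var{collectCoSnk} and \var{pushDemo} are hypothetical helpers. The
 co-sink only hands out a writing action; the co-source decides when to
 run it.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)
 type CoSrc a = Snk (N a)
 type CoSnk a = Src (N a)

 fwd :: Src a -> Snk a -> Eff
 fwd src snk = src (Cont snk)

 -- Co-source over a list: it runs the consumer's action on each element.
 listCoSrc :: [a] -> CoSrc a
 listCoSrc _ Nil = return ()
 listCoSrc [] (Cons _ rest) = rest Full
 listCoSrc (x : xs) (Cons k rest) = do
   k x
   rest (Cont (listCoSrc xs))

 -- Co-sink: offers a writing action, without running anything itself.
 collectCoSnk :: IORef [a] -> CoSnk a
 collectCoSnk _ Full = return ()
 collectCoSnk out (Cont c) =
   c (Cons (\x -> modifyIORef out (x :)) (collectCoSnk out))

 pushDemo :: IO [Int]
 pushDemo = do
   out <- newIORef []
   fwd (collectCoSnk out) (listCoSrc [1, 2, 3])
   reverse <$> readIORef out
 ```

 \var{pushDemo} returns `[1,2,3]`. Note that the co-sink sits in the
 source position of \var{fwd}, and the co-source in the sink position.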

 In sum, using co-sources and co-sinks shifts the flow of control from
 the sink to the source. It should be stressed that, in the programs
 which use the functions defined so far (even those that use IO),
 synchronicity is preserved: no data is buffered implicitly, and
 reading and writing are interleaved.

 Asynchronicity
 --------------

 We have seen so far that synchronicity gives useful guarantees, but
 restricts the kind of programs one can write. In this section, we
 provide primitives which allow forms of asynchronous programming within
 our framework.
 The main benefit of sticking to our framework in this case is that
 asynchronous behavior is confined to the explicit uses of these
 primitives. That is, the benefits of synchronous programming still
 hold locally.

 \paragraph{Scheduling}

 When converting a \var{Src} to a \var{CoSrc} (or dually \var{CoSnk} to
 a \var{Snk}), we have two streams which are ready to respond to
 pulling of data from them.  This means that effects must be scheduled
 explicitly, as we saw above when manually converting
 the file source to a file co-source.

 In general, given a \var{Schedule}, we can implement the above two
 conversions:

 > srcToCoSrc :: Schedule a -> Src a -> CoSrc a
 > coSnkToSnk :: Schedule a -> CoSnk a -> Snk a

 We define a \var{Schedule} as the reconciliation between a source and a
 co-sink:

 > type Schedule a = Source a -> Source (N a) -> Eff

 Implementing the conversions is then straightforward:

 > srcToCoSrc strat s s0 = shiftSrc s $ \ s1 -> strat s1 s0
 > coSnkToSnk strat s s0 = shiftSrc s $ \ s1 -> strat s0 s1

 What are possible scheduling strategies? The simplest, and most
 natural one is sequential execution: looping through both sources and
 matching the consumptions/productions element-wise, as follows.

 > sequentially :: Schedule a
 > sequentially Nil Nil = mempty
 > sequentially Nil (Cons _ xs) = xs Full
 > sequentially (Cons _ xs) Nil = xs Full
 > sequentially (Cons x xs) (Cons x' xs') =
 >   x' x <>   (shiftSrc xs  $ \sa ->
 >              shiftSrc xs' $ \sna ->
 >              sequentially sa sna)

 When effects are arbitrary IO actions, sequential execution is the
 only sensible schedule: indeed, the sources and sinks expect their
 effects to be run in the order prescribed by the stream. Swapping the
 arguments to `<>` in the above means that \var{Full} effects will be
 run first, spelling disaster.
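
 As an illustration, the following self-contained sketch uses the
 sequential schedule to connect two pull streams: a source and a
 co-sink. The definitions of \var{sequentially} and \var{coSnkToSnk}
 are restated (with `>>` in place of `<>`, an explicit case for two
 empty streams, and \var{shiftSrc} as we assume the library defines
 it); \var{listSrc}, \var{collectCoSnk} and \var{schedDemo} are
 hypothetical helpers.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)
 type CoSnk a = Src (N a)
 type Schedule a = Source a -> Source (N a) -> Eff

 fwd :: Src a -> Snk a -> Eff
 fwd src snk = src (Cont snk)

 shiftSrc :: Src a -> N (Source a) -> Eff
 shiftSrc src k = src (Cont k)

 sequentially :: Schedule a
 sequentially Nil Nil = return ()
 sequentially Nil (Cons _ xs) = xs Full
 sequentially (Cons _ xs) Nil = xs Full
 sequentially (Cons x xs) (Cons x' xs') =
   x' x >> (shiftSrc xs $ \sa ->
            shiftSrc xs' $ \sna ->
            sequentially sa sna)

 coSnkToSnk :: Schedule a -> CoSnk a -> Snk a
 coSnkToSnk strat s s0 = shiftSrc s $ \s1 -> strat s0 s1

 listSrc :: [a] -> Src a
 listSrc _ Full = return ()
 listSrc [] (Cont c) = c Nil
 listSrc (x : xs) (Cont c) = c (Cons x (listSrc xs))

 collectCoSnk :: IORef [a] -> CoSnk a
 collectCoSnk _ Full = return ()
 collectCoSnk out (Cont c) =
   c (Cons (\x -> modifyIORef out (x :)) (collectCoSnk out))

 -- Both ends are pull streams; the schedule supplies the driving loop.
 schedDemo :: IO [Int]
 schedDemo = do
   out <- newIORef []
   fwd (listSrc [1, 2, 3]) (coSnkToSnk sequentially (collectCoSnk out))
   reverse <$> readIORef out
 ```

 \var{schedDemo} returns `[1,2,3]`: the elements are written in stream
 order, and the co-sink is finalized once the source is exhausted.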

 However, in certain cases running effects out of order may make
 sense. If the monoid of effects is commutative (or if the programmer
 is confident that execution order does not matter), one can shuffle
 the order of execution of effects. This re-ordering can be taken
 advantage of to run effects concurrently, as follows:

 > concurrently :: Schedule a
 > concurrently Nil Nil = mempty
 > concurrently Nil (Cons _ xs) = xs Full
 > concurrently (Cons _ xs) Nil = xs Full
 > concurrently (Cons x xs) (Cons x' xs') = do
 >   forkIO $ x' x
 >   (shiftSrc xs  $ \sa ->
 >    shiftSrc xs' $ \sna ->
 >    concurrently sa sna)

 The above strategy is useful if the production or consumption
 of elements is expensive and distributable over computation units.
 While the above implementation naively spawns a thread for every
 element, in reality one will most likely want to divide the stream
 into chunks before spawning threads. Because strategies are separate
 components, a bad choice is easily remedied by swapping one
 strategy for another.

 \paragraph{Buffering}

 Consider now the situation where one needs to convert from a
 \var{CoSrc} to a \var{Src} (or from a \var{Snk} to a \var{CoSnk}).
 Here, we have two streams, both of which want to control the execution
 flow. The conversion can only be implemented by running both streams
 in concurrent threads, and having them communicate via some form of
 buffer. A form of buffer that we have seen before is the file. Using
 it yields the following buffering implementation:

 > fileBuffer :: String -> CoSrc String -> Src String
 > fileBuffer tmpFile f g = do
 >   h' <- openFile  tmpFile WriteMode
 >   forkIO $ fwd (coFileSink h') f
 >   h <- openFile tmpFile ReadMode
 >   hFileSrc h g

 If the temporary file is a regular file, the above implementation is
 likely to fail. For example the reader may be faster than the writer
 and reach an end of file prematurely. Thus the temporary file should
 be a UNIX pipe. One then faces the issue that UNIX pipes are of fixed
 maximum size, and if the writer overshoots the capacity of the pipe, a
 deadlock will occur.

 Thus, one may prefer to use Concurrent Haskell channels as a buffering
 means, as they are bounded only by the size of the memory and do not
 rely on any special feature of the operating system:

 > chanCoSnk :: Chan a -> CoSnk a
 > chanCoSnk _ Full = return ()
 > chanCoSnk h (Cont c) = c (Cons  (writeChan h)
 >                                 (chanCoSnk h))

 > chanSrc :: Chan a -> Src a
 > chanSrc _ Full = return ()
 > chanSrc h (Cont c) = do  x <- readChan h
 >                          c (Cons x $ chanSrc h)

 > chanBuffer :: CoSrc a -> Src a
 > chanBuffer f g = do
 >   c <- newChan
 >   forkIO $ fwd (chanCoSnk c) f
 >   chanSrc c g
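
 A usage sketch of the channel buffer follows. It is self-contained:
 the three channel functions are restated from above, the stream types
 are assumed to be as restated, and \var{listCoSrc}, \var{takeN} and
 \var{bufferDemo} are hypothetical helpers. The consumer closes the
 stream after three elements, so it never blocks on an empty channel.

 ```haskell
 import Control.Concurrent (forkIO)
 import Control.Concurrent.Chan (Chan, newChan, readChan, writeChan)
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)
 type CoSrc a = Snk (N a)
 type CoSnk a = Src (N a)

 fwd :: Src a -> Snk a -> Eff
 fwd src snk = src (Cont snk)

 chanCoSnk :: Chan a -> CoSnk a
 chanCoSnk _ Full = return ()
 chanCoSnk h (Cont c) = c (Cons (writeChan h) (chanCoSnk h))

 chanSrc :: Chan a -> Src a
 chanSrc _ Full = return ()
 chanSrc h (Cont c) = do
   x <- readChan h
   c (Cons x (chanSrc h))

 chanBuffer :: CoSrc a -> Src a
 chanBuffer f g = do
   c <- newChan
   _ <- forkIO (fwd (chanCoSnk c) f) -- the producer runs in its own thread
   chanSrc c g

 listCoSrc :: [a] -> CoSrc a
 listCoSrc _ Nil = return ()
 listCoSrc [] (Cons _ rest) = rest Full
 listCoSrc (x : xs) (Cons k rest) = k x >> rest (Cont (listCoSrc xs))

 -- A sink (in 'Sink' form) that accepts n elements and is then full.
 takeN :: Int -> IORef [a] -> Sink a
 takeN 0 _ = Full
 takeN n out = Cont $ \s -> case s of
   Nil -> return ()
   Cons x rest -> modifyIORef out (x :) >> rest (takeN (n - 1) out)

 bufferDemo :: IO [Int]
 bufferDemo = do
   out <- newIORef []
   chanBuffer (listCoSrc [1 .. 5]) (takeN 3 out)
   reverse <$> readIORef out
 ```

 \var{bufferDemo} returns `[1,2,3]`. The producer thread may still be
 writing the remaining elements to the channel when the consumer
 returns; with an unbounded channel this is harmless.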


 Note that it is easy to create a bounded buffer, by guarding the
 writes with a semaphore. In general there is no issue with blocking
 reads or writes. The implementation follows.

 > chanCoSnk' :: Chan a -> QSem -> CoSnk a
 > chanCoSnk' _ _ Full = return ()
 > chanCoSnk' h s (Cont c) = c (Cons  write
 >                                    (chanCoSnk' h s))
 >  where write x = do  waitQSem s
 >                      writeChan h x

 > chanSrc' :: Chan a -> QSem -> Src a
 > chanSrc' _ _ Full = return ()
 > chanSrc' h s (Cont c) = do  x <- readChan h
 >                             signalQSem s
 >                             c (Cons x $ chanSrc' h s)

 > boundedChanBuffer :: Int -> CoSrc a -> Src a
 > boundedChanBuffer n f g = do
 >   c <- newChan
 >   s <- newQSem n
 >   forkIO $ fwd (chanCoSnk' c s) f
 >   chanSrc' c s g


 In certain situations (for example for a stream yielding a status
 whose history does not matter, like mouse positions) one may want to
 ignore all but the latest datum. In this case a single memory cell can
 serve as buffer:

 > varCoSnk :: IORef a -> CoSnk a
 > varCoSnk _ Full      = return ()
 > varCoSnk h (Cont c)  = c (Cons  (writeIORef h)
 >                                 (varCoSnk h))

 > varSrc :: IORef a -> Src a
 > varSrc _ Full = return ()
 > varSrc h (Cont c) = do  x <- readIORef h
 >                         c (Cons x $ varSrc h)

 > varBuffer :: a -> CoSrc a -> Src a
 > varBuffer a f g = do
 >   c <- newIORef a
 >   forkIO $ fwd (varCoSnk c) f
 >   varSrc c g

 All the above buffering operations work on sources, but they can be generically
 inverted to work on sinks, as follows.

 > flipBuffer :: (forall a. CoSrc a -> Src a) -> Snk b -> CoSnk b
 > flipBuffer f s = f (nnIntro' s)


 Summary
 -------

 In sum, we can classify streams according to polarity:

 - Pull: sources and co-sinks
 - Push: sinks and co-sources

 We then have three situations when composing stream processors:

 1. Matching polarities (one pull, one push). In this case behavior is
 synchronous; no concurrency appears.

 2. Two pull streams. In this case an explicit loop must process the
 streams.  If effects commute, the programmer may run effects out of
 order, potentially concurrently.

 3. Two push streams. In this case the streams must run in independent
 threads, and the programmer needs to make a choice for the communication
 buffer. One needs to be careful: if the buffer is too small, a deadlock
 may occur.

 Therefore, when programming with streams, one should consume push
 types when one can, and pull ones when one must. Conversely, one
 should produce pull types when one can, and push ones when one
 must.

 App: idealized echo server
 --------------------------

 We finish the exposition of asynchronous behavior with a small program
 sketching the skeleton of a client-server application. This is a small
 server with two clients, which echoes the requests of each client to
 both of them.

 The server communicates with each client via two streams, one for
 inbound messages, one for outbound ones. We want each client to be
 able to send and receive messages in the order that they like. That
 is, from their point of view, they are in control of the message
 processing order. Hence a client should have a co-sink for sending
 messages to the server, and a source for receiving them.  On the
 server side, types are dualized and thus, a client is represented by a
 pair of a co-source and a sink:

 > type Client a = (CoSrc a, Snk a)

 For simplicity we implement an echo server handling exactly two
 clients.

 The first problem is to multiplex the inputs of the clients. In the
 server, we do not actually want any client to be controlling the
 processing order. Hence we have to multiplex the messages in real time,
 using a channel (note the similarity with \var{chanBuffer}):

 > bufferedDmux :: CoSrc a -> CoSrc a -> Src a
 > bufferedDmux s1 s2 t = do
 >   c <- newChan
 >   forkIO $ fwd (chanCoSnk c) s1
 >   forkIO $ fwd (chanCoSnk c) s2
 >   chanSrc c t

 We then have to send each message to both clients. This may be done
 using the following effect-free function, which forwards everything
 sent to a sink to its two argument sinks.

 > collapseSnk :: Snk a -> Snk a -> Snk a
 > collapseSnk t1 t2 Nil = t1 Nil <> t2 Nil
 > collapseSnk t1 t2 (Cons x xs)
 >   =  t1  (Cons x $ \c1 ->
 >      t2  (Cons x $ \c2 ->
 >          shiftSrc xs (collapseSnk  (flip forward c1)
 >                                    (flip forward c2))))


 The server can then be defined by composing the above two functions.

 > server :: Client a -> Client a -> Eff
 > server (i1,o1) (i2,o2) = fwd  (bufferedDmux i1 i2)
 >                               (collapseSnk o1 o2)
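
 The broadcasting half of the server can be tried in isolation. The
 following self-contained sketch restates \var{collapseSnk} (with `>>`
 in place of `<>`, and with \var{forward} and \var{shiftSrc} as we
 assume the library defines them) and feeds it from an in-memory
 source; \var{listSrc}, \var{collectSnk} and \var{broadcastDemo} are
 hypothetical helpers standing in for the clients.

 ```haskell
 import Data.IORef

 type Eff = IO ()
 type N a = a -> Eff
 data Source a = Nil | Cons a (N (Sink a))
 data Sink a = Full | Cont (N (Source a))
 type Src a = N (Sink a)
 type Snk a = N (Source a)

 fwd :: Src a -> Snk a -> Eff
 fwd src snk = src (Cont snk)

 forward :: Source a -> Sink a -> Eff
 forward s (Cont k) = k s
 forward Nil Full = return ()
 forward (Cons _ rest) Full = rest Full

 shiftSrc :: Src a -> N (Source a) -> Eff
 shiftSrc src k = src (Cont k)

 collapseSnk :: Snk a -> Snk a -> Snk a
 collapseSnk t1 t2 Nil = t1 Nil >> t2 Nil
 collapseSnk t1 t2 (Cons x xs) =
   t1 (Cons x $ \c1 ->
   t2 (Cons x $ \c2 ->
       shiftSrc xs (collapseSnk (flip forward c1) (flip forward c2))))

 listSrc :: [a] -> Src a
 listSrc _ Full = return ()
 listSrc [] (Cont c) = c Nil
 listSrc (x : xs) (Cont c) = c (Cons x (listSrc xs))

 collectSnk :: IORef [a] -> Snk a
 collectSnk _ Nil = return ()
 collectSnk out (Cons x rest) = do
   modifyIORef out (x :)
   rest (Cont (collectSnk out))

 -- Every message reaches both (simulated) clients.
 broadcastDemo :: IO ([Int], [Int])
 broadcastDemo = do
   o1 <- newIORef []
   o2 <- newIORef []
   fwd (listSrc [1, 2, 3]) (collapseSnk (collectSnk o1) (collectSnk o2))
   xs <- readIORef o1
   ys <- readIORef o2
   return (reverse xs, reverse ys)
 ```

 \var{broadcastDemo} returns `([1,2,3],[1,2,3])`.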



 Related Work
 ============



 Polarities, data structures and control
 ---------------------------------------

 One of the key ideas formalized in this paper is to classify streams by
 polarity. The push polarity (sinks, co-sources) controls the execution
 thread, whereas the pull one (sources, co-sinks) provides
 data. This idea has recently been taken advantage of to
 bring efficient array programming facilities to functional programming
 \citep{bernardy_composable_2015,claessen2012expressive,ankner_edsl_2013}.

 This concept is central in the literature on Girard's linear logic
 \citep{laurent_etude_2002,zeilberger_logical_2009}. However, in the
 case of streams, this idea dates back at least to
 \citet{jackson_principles_1975} (\citet{kay_you_2008} gives a good
 summary of Jackson's insight).

 Our contribution is to bring this idea to stream programming in
 Haskell. (While duality was used for Haskell array programming, it has
 not been taken advantage of for stream programming.) We believe that our
 implementation brings together the practical applications that Jackson
 intended, while being faithful to the theoretical foundations in
 logic, via the double-negation embedding.


 Iteratees
 ---------

 We consider that the state of the art in Haskell stream processing is
 embodied by Kiselyov's iteratees \citeyear{kiselyov_iteratees_2012}.

 The type for iteratees can be given the following definition:

 > data I s m a = Done a | GetC (Maybe s -> m (I s m a))

 An iteratee $I\,s\,m\,a$ roughly corresponds to a sink of $s$ which also
 returns an $a$ --- but it uses a monad $m$ rather than a monoid
 \var{Eff} for effects.
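
 For concreteness, here is a small self-contained sketch of an iteratee
 of this type, together with the plumbing needed to run it. The names
 \var{sumI}, \var{enumList} and \var{run} are ours, chosen for
 illustration; they are not taken from any iteratee library.

 ```haskell
 data I s m a = Done a | GetC (Maybe s -> m (I s m a))

 -- An iteratee summing its input; Nothing signals the end of the input.
 sumI :: Monad m => Int -> I Int m Int
 sumI acc = GetC step
   where
     step Nothing = return (Done acc)
     step (Just x) = return (sumI (acc + x))

 -- Feed the elements of a list to an iteratee, without ending the input.
 enumList :: Monad m => [s] -> I s m a -> m (I s m a)
 enumList _ (Done a) = return (Done a)
 enumList [] it = return it
 enumList (x : xs) (GetC k) = k (Just x) >>= enumList xs

 -- Signal the end of the input and extract the result, if any.
 run :: Monad m => I s m a -> m (Maybe a)
 run (Done a) = return (Just a)
 run (GetC k) = do
   r <- k Nothing
   case r of
     Done a -> return (Just a)
     GetC _ -> return Nothing
 ```

 In the \var{IO} monad, `enumList [1,2,3] (sumI 0) >>= run` returns
 `Just 6`. Nothing in the types prevents feeding the same intermediate
 iteratee twice, which is the non-linear usage discussed next.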

 The above type contains a continuation in the \var{GetC}
 constructor. Therefore, one must be careful about discarding or
 duplicating iteratees. Hence, such libraries typically provide
 higher-level interfaces to discourage non-linear usages.

 A first advantage of our approach is the formulation and emphasis on
 the linearity constraint, which is central to correct use of effectful
 continuations. It appears that variants of iteratees (including the
 *pipes* library) make the representation abstract, but at the cost of
 a complex interface for programming them. By stating the linearity
 requirement, no complex abstract API is necessary to guarantee safety.

 A second advantage of our library is that effects are not required to
 be monads. Indeed, the use of continuations already provides the
 necessary structure to combine computations (recall in particular
 that double negation is already a monad). We believe that having a
 single way to bind intermediate results (continuations vs. both
 continuations and monads) is a simplification in design, which may make
 our library more approachable.

 The presence of sources and sinks also clarifies how to build complex
 programs from basic blocks. Indeed, iteratee-based libraries
 make heavy use of the following types:

 > type Enumerator el m a = I el m a -> m (I el m a)
 > type Enumeratee elo eli m a =
 >         I eli m a -> I elo m (I eli m a)

 It is our understanding that these types make up for the lack of explicit
 sources by putting iteratees (sinks) on the left-hand-side of an
 arrow. Enumerators are advantageously replaced by sources, and
 enumeratees by simple functions from source to source (or sink to
 sink).

 A third advantage of our approach is that the need for buffering (or
 the scheduling opportunity) is clearly indicated by the type
 system, as mismatching polarities.



 In more recent work, \citet{kiselyov_lazy_2012} present a
 continuation-based pretty printer, which fosters a more stylized use
 of continuations, closer to what we advocate here. Producers and
 consumers (sources and sinks) are defined more simply, using types
 which correspond more directly to negations:

 < type GenT e m = ReaderT (e -> m ()) m
 < type Producer m e = GenT e m ()
 < type Consumer m e = e -> m ()
 < type Transducer m1 m2 e1 e2 =
 <   Producer m1 e1 -> Producer m2 e2

 Yet, in that work, linearity is only briefly mentioned; the use of a
 monad rather than monoid persists; and mismatching polarities are not
 discussed, let alone taken advantage of.

 Several production-strength libraries have been built upon the concept
 of iteratees, including *pipes* \citep{gonzalez_pipes_2015},
 *conduits* \citep{snoyman_conduit_2015} and *machines*
 \citep{kmett_machines_2015}.  While we focus our comparison on
 iteratees, most of our analysis carries over to the production libraries.
 There is additionally a large body of non peer-reviewed literature
 discussing and analyzing either iteratees or its variants. The
 proliferation of libraries for IO streaming in Haskell indicates that
 a unifying foundation for them is needed, and we hope that the present
 paper provides a basis for such a foundation.


 Feldspar monadic streams
 ------------------------

 Feldspar, a DSL for digital signal processing, has a notion of streams
 built on monads \citep{axelsson_feldspar_2010,svenningsson15:monadic_streams}. In Haskell
 the stream type can be written as follows:

 < type Stream a = IO (IO a)

 Intuitively the outer monad can be understood as performing
 initialization which creates the inner monadic computation. The inner
 computation is called iteratively to produce the elements of the
 stream.
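
 As an illustration of this reading, the following self-contained
 sketch builds a stream of consecutive integers and takes a prefix of
 it. The names \var{countFrom} and \var{takeStream} are hypothetical
 and not taken from Feldspar.

 ```haskell
 import Control.Monad (replicateM)
 import Data.IORef

 type Stream a = IO (IO a)

 -- Outer computation: allocate the state. Inner computation: next element.
 countFrom :: Int -> Stream Int
 countFrom n = do
   ref <- newIORef n
   return $ do
     x <- readIORef ref
     writeIORef ref (x + 1)
     return x

 -- Initialize the stream once, then pull n elements from it.
 takeStream :: Int -> Stream a -> IO [a]
 takeStream n s = do
   next <- s
   replicateM n next
 ```

 Here `takeStream 3 (countFrom 5)` returns `[5,6,7]`. The stream has no
 way to signal its end, nor to be told that no more elements are needed.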

 Compared to the representation in the present paper, monadic
 streams have only one form of stream, corresponding to a source. Also,
 there is no support for timely release of resources; such things need
 to be dealt with outside of the stream framework. Additionally, even
 conceptually effect-free streams rely on running IO effects.

 Session Types
 -------------

 In essence our pair of types for streams is an encoding of a protocol
 for data transmission. This protocol is readily expressible using
 linear types, following the ideas of \citet{wadler_propositions_2012}
 and \citet{caires_concurrent_2012}:

 < Source a = 1 ⊕ (a ⊗ N (Sink a))
 < Sink a = 1 ⊕ N (Source a)

 For the translation to Haskell, we have chosen to use a lightweight
 encoding, assuming linearity of effectful variables; arguing at the
 same time for support of linearity in future Haskell versions.  Yet,
 other encodings could be chosen. For example, we could have used the
 technique of Pucella and Tov (Haskell session types with almost no
 class), which does not require abiding by linearity.


 Future Work
 ===========

 As we see it, a natural next step for the present work is to show that
 intermediate sources and sinks can be deforested. As it stands, we
 believe that a standard approach
 \cite{gill_short_1993,svenningsson_shortcut_2002,coutts_stream_2007}
 should work: 1. encode sources (and sinks) as non-recursive data types
 2. show that standard evaluation rules remove the intermediate
 occurrences of the encoded types. However, this work has not been
 carried out yet.

 The duality principle exposed here has already been taken advantage of
 to support fusible array types \cite{bernardy_composable_2015,ankner_edsl_2013}. The
 present paper has shown how to support effectful stream
 computations. One would naturally think that the same principle can be
 applied to other lazily-evaluated data structures, such as the game
 trees discussed by \citet{hughes_functional_1989}: as far as we know
 this remains to be investigated.

 Another line of development would be to implement language support for
 the linearity convention, directly in Haskell. There have been many
 proposals to extend functional languages with linear types (see for
 example \cite[Ch. 9]{tov_practical_2012} for a survey). These
 proposals are often rather involved, because they typically support
 advanced forms of polymorphism that allow abstracting over the linearity
 of an argument. The linearity convention that we use here calls for no
 such complexity, therefore we hope it may be enough of a motivation to
 add simple linear type support in research-grade Haskell compilers.


 Conclusion
 ==========

 We have cast a new light on the current state of coroutine-based
 computation in Haskell, drawing inspiration
 from classical linear logic. We have further shown that the concepts
 of duality and polarity provide design principles to structure
 continuation-based code. In particular, we have shown that mismatches
 in polarity correspond to buffers and control structures, depending on
 the kind of mismatch.

 Using effectful continuations is not a new idea; in fact it was the
 standard way of writing effectful programs in Haskell 1.2. Later
 versions of Haskell switched to a monadic approach. However, given the
 issues outlined in the introduction, and especially the error-prone
 character of lazy monadic IO, many libraries have reverted to explicitly
 using co-routines.

 A possible reason for selecting monads over co-routines is that monads
 are rooted in solid theory (categories). However, we hope to have
 shown that co-routines are also rooted in solid theory, namely
 linear logic. If Haskell had support for linear types, co-routines
 could be used safely, without the quirks of lazy IO.



 \acks

 We gratefully thank Koen Claessen, Atze van der Ploeg and Nicolas
 Pouillard for feedback on drafts of this paper.  The source code for
 this paper is a literate Haskell file, whose latest version is
 available at this url:
 https://gist.github.com/jyp/fadd6e8a2a0aa98ae94d

 The paper is typeset
 using pandoc, lhs2TeX and latex.



 \bibliographystyle{abbrvnat}
 \bibliography{PaperTools/bibtex/jp,js}


 \appendix

 Table of Functions: implementations
 ===================================

 > zipSrc s1 s2 t3 = shiftSrc s2 $ \s ->
 >                   unshiftSrc (\t -> forkSnk t s1 s) t3

 > forkSnk sab ta tb =
 >   shiftSrc ta $ \ta' ->
 >   case ta' of
 >     Nil -> (forward tb) Full <> sab Nil
 >     Cons a as ->  case tb of
 >       Nil -> as Full <> sab Nil
 >       Cons b bs -> fwd (cons (a,b) $ zipSrc as bs) sab

 > forkSrc sab ta tb
 >   = shiftSnk (zipSnk ta (flip forward tb)) sab

 > zipSnk sa sb Nil = sa Nil <> sb Nil
 > zipSnk sa sb (Cons (a,b) tab) = sa $ Cons a $ \sa' ->
 >                                 sb $ Cons b $ \sb' ->
 >                                 forkSrc tab (flip forward sa') sb'

 > scanSrc f !z = flipSnk (scanSnk f z)

 > scanSnk _ _ snk Nil          = snk Nil
 > scanSnk f z snk (Cons a s)   = snk $  Cons next $
 >                                       scanSrc f next s
 >   where next = f z a

 > foldSrc' f !z s nb = s (Cont (foldSnk' f z nb))

 > foldSnk' _ z nb Nil = nb z
 > foldSnk' f z nb (Cons a s) = foldSrc' f (f z a) s nb

 Return the last element of the source, or the first argument if the
 source is empty.

 > lastSrc :: a -> Src a -> NN a
 > lastSrc x s k = shiftSrc s $ \s' -> case s' of
 >   Nil -> k x
 >   Cons x' cs -> lastSrc x' cs k

 > dropSrc i = flipSnk (dropSnk i)

 > dropSnk 0 s s' = s s'
 > dropSnk _ s Nil = s Nil
 > dropSnk i s (Cons _ s') = shiftSrc (dropSrc (i-1) s') s

 > fromList = foldr cons empty

 > enumFromToSrc :: Int -> Int -> Src Int
 > enumFromToSrc _ _ Full = mempty
 > enumFromToSrc b e (Cont s)
 >   | b > e     = s Nil
 >   | otherwise = s (Cons b (enumFromToSrc (b+1) e))

 > enumFromToSrc' :: Int -> Int -> CoSrc Int
 > enumFromToSrc' _ _ Nil = mempty
 > enumFromToSrc' from to (Cons x xs) = do
 >   x from
 >   let !from' = from+1
 >   shiftSnk (enumFromToSrc' from' to) xs

 > linesSrc = flipSnk unlinesSnk

 > unlinesSnk = unlinesSnk' []

 > unlinesSnk' :: String -> Snk String -> Snk Char
 > unlinesSnk' acc s Nil = s (Cons acc empty)
 > unlinesSnk' acc s (Cons '\n' s') = s (Cons   (reverse acc)
 >                                              (linesSrc s'))
 > unlinesSnk' acc s (Cons c s')
 >   = s' (Cont $ unlinesSnk' (c:acc) s)

 > untilSnk _ Nil = mempty
 > untilSnk p (Cons a s)
 >   | p a  = s Full
 >   | True = s (Cont (untilSnk p))

 > interleave s1 s2 Full = s1 Full <> s2 Full
 > interleave s1 s2 (Cont s) = s1 (Cont (interleaveSnk s s2))

 > interleaveSnk snk src Nil = fwd src snk
 > interleaveSnk snk src (Cons a s)
 >   = snk (Cons a (interleave s src))

 > tee s1 t1 = flipSnk (collapseSnk t1) s1

 > filterSrc p = flipSnk (filterSnk p)

 > filterSnk _ snk Nil = snk Nil
 > filterSnk p snk (Cons a s)
 >   | p a       = snk (Cons a (filterSrc p s))
 >   | otherwise = s (Cont (filterSnk p snk))

 > unchunk = flipSnk chunkSnk

 > chunkSnk s Nil = s Nil
 > chunkSnk s (Cons x xs)
 >   = fwd (fromList x `appendSrc` unchunk xs) s

 > toList s k = shiftSrc s (toListSnk k)

 > toListSnk :: N [a] -> Snk a
 > toListSnk k Nil = k []
 > toListSnk k (Cons x xs) = toList xs $ \xs' -> k (x:xs')
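
To see these combinators working together, here is a small standalone sketch that filters an enumeration and collects the result with \var{toList}. The type definitions, \var{shiftSrc} and \var{flipSnk} are restated as we assume they appear earlier (with \var{Eff} fixed to \var{IO ()}), and \var{collect} is a hypothetical helper introduced only for this illustration.

```haskell
import Data.IORef

-- Assumed restatement of the stream types, with effects fixed to IO ().
type Eff = IO ()
type N a = a -> Eff
data Source' a = Nil | Cons a (N (Sink' a))
data Sink' a = Full | Cont (N (Source' a))
type Src a = N (Sink' a)
type Snk a = N (Source' a)

-- Assumed definitions of the two plumbing combinators.
shiftSrc :: Src a -> Snk a -> Eff
shiftSrc s k = s (Cont k)

flipSnk :: (Snk a -> Snk b) -> Src b -> Src a
flipSnk _ s Full     = s Full
flipSnk f s (Cont k) = shiftSrc s (f k)

enumFromToSrc :: Int -> Int -> Src Int
enumFromToSrc _ _ Full = return ()
enumFromToSrc b e (Cont s)
  | b > e     = s Nil
  | otherwise = s (Cons b (enumFromToSrc (b + 1) e))

filterSrc :: (a -> Bool) -> Src a -> Src a
filterSrc p = flipSnk (filterSnk p)

filterSnk :: (a -> Bool) -> Snk a -> Snk a
filterSnk _ snk Nil = snk Nil
filterSnk p snk (Cons a s)
  | p a       = snk (Cons a (filterSrc p s))
  | otherwise = s (Cont (filterSnk p snk))

toList :: Src a -> N [a] -> Eff
toList s k = shiftSrc s (toListSnk k)

toListSnk :: N [a] -> Snk a
toListSnk k Nil         = k []
toListSnk k (Cons x xs) = toList xs $ \xs' -> k (x : xs')

-- Hypothetical helper: run a finite source to completion and return its elements.
collect :: Src a -> IO [a]
collect s = do
  ref <- newIORef []
  toList s (writeIORef ref)
  readIORef ref
```

With these definitions, \var{collect (filterSrc even (enumFromToSrc 1 10))} returns \var{[2,4,6,8,10]}. The continuation passed to \var{toList} only runs once the source sends \var{Nil}, so \var{collect} is meaningful for finite sources only.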


 Proofs
 ======

 The laws can be proved by induction for finite streams. For infinite
 streams, the reasoning below is only fast and loose, but morally
 correct \citep{danielsson_fast_2006}.

 Associativity
 -------------

 \var{Nil} case.

 <     ((t1 <> t2) <> t3) Nil
 < == -- by def
 <     (t1 <> t2) Nil <> t3 Nil
 < == -- by def
 <     (t1 Nil <> t2 Nil) <> t3 Nil
 < == -- by associativity of effect composition
 <     t1 Nil <> (t2 Nil <> t3 Nil)
 < == -- by def
 <     t1 Nil <> ((t2 <> t3) Nil)
 < == -- by def
 <     (t1 <> (t2 <> t3)) Nil

 \var{Cons} case.

 <     ((t1 <> t2) <> t3) (Cons a s0)
 < == -- by def
 <     (t1 <> t2) (Cons a (t3 -! s0))
 < == -- by def
 <     t1 (Cons a (t2 -! (t3 -! s0)))
 < == -- by IH
 <     t1 (Cons a ((t2 <> t3) -! s0))
 < == -- by def
 <     (t1 <> (t2 <> t3)) (Cons a s0)

 \var{Full} case.

 <   ((t1 <> t2) -! s) Full
 < == -- by def
 <   s (Cont (t1 <> t2))
 < == -- by def
 <   (t2 -! s) (Cont t1)
 < == -- by def
 <   (t1 -! (t2 -! s)) Full

 \var{Cont} case.

 <   ((t1 <> t2) -! s) (Cont t0)
 < == -- by def
 <   s (Cont (t0 <> (t1 <> t2)))
 < == -- by IH
 <   s (Cont ((t0 <> t1) <> t2))
 < == -- by def
 <   (t2 -! s) (Cont (t0 <> t1))
 < == -- by def
 <   (t1 -! (t2 -! s)) (Cont t0)
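
The "by def" steps above can be read off as the defining equations of sink composition and sink difference. The sketch below transcribes them into runnable Haskell and checks associativity observationally on one finite example. The names \var{appendSnk} and \var{diffSnk} stand for \var{<>} and \var{-!}; the type definitions (with \var{Eff} fixed to \var{IO ()}), \var{fromList} as written here, the logging sink \var{capped} and the driver \var{run} are assumptions made for the illustration, and a single test run is of course no substitute for the proof.

```haskell
import Data.IORef

-- Assumed restatement of the stream types, with effects fixed to IO ().
type Eff = IO ()
type N a = a -> Eff
data Source' a = Nil | Cons a (N (Sink' a))
data Sink' a = Full | Cont (N (Source' a))
type Src a = N (Sink' a)
type Snk a = N (Source' a)

-- (t1 <> t2): equations taken from the Nil and Cons cases above.
appendSnk :: Snk a -> Snk a -> Snk a
appendSnk t1 t2 Nil         = t1 Nil >> t2 Nil
appendSnk t1 t2 (Cons a s0) = t1 (Cons a (diffSnk t2 s0))

-- (t -! s): equations taken from the Full and Cont cases above.
diffSnk :: Snk a -> Src a -> Src a
diffSnk t s Full      = s (Cont t)
diffSnk t s (Cont t0) = s (Cont (appendSnk t0 t))

-- A finite source built from a list (direct definition, for self-containment).
fromList :: [a] -> Src a
fromList _        Full     = return ()
fromList []       (Cont k) = k Nil
fromList (x : xs) (Cont k) = k (Cons x (fromList xs))

-- Hypothetical sink: logs up to n elements (n >= 1) under a tag, then reports Full.
capped :: IORef [String] -> String -> Int -> Snk Int
capped _   _   _ Nil        = return ()
capped ref tag n (Cons a s) = do
  modifyIORef ref ((tag ++ show a) :)
  if n <= 1 then s Full else s (Cont (capped ref tag (n - 1)))

-- Feed [1..5] to three capped sinks combined by the given nesting.
run :: (Snk Int -> Snk Int -> Snk Int -> Snk Int) -> IO [String]
run nest = do
  ref <- newIORef []
  fromList [1 .. 5] (Cont (nest (capped ref "A" 2) (capped ref "B" 2) (capped ref "C" 2)))
  reverse <$> readIORef ref
```

Both \var{run (\\a b c -> appendSnk (appendSnk a b) c)} and \var{run (\\a b c -> appendSnk a (appendSnk b c))} produce the log \var{["A1","A2","B3","B4","C5"]}: each sink receives elements until it reports \var{Full}, after which the remaining source is forwarded to the next sink.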

 Difference laws
 ---------------

 Let us show only the case for sources, the case for sinks being
 similar.


 The \var{Full} case relies on the monoidal structure of effects:

 <    ((s1 <> s2) <> s3) Full
 < ==  -- by def
 <    (s1 <> s2) Full <> s3 Full
 < ==  -- by def
 <    (s1 Full <> s2 Full) <> s3 Full
 < ==  -- \var{Eff} is a monoid
 <    s1 Full <> (s2 Full <> s3 Full)
 < ==  -- by def
 <    s1 Full <> (s2 <> s3) Full
 < ==  -- by def
 <    (s1 <> (s2 <> s3)) Full

 The \var{Cont} case uses mutual induction:

 <   ((s1 <> s2) <> s3) (Cont k)
 < == -- by def
 <   (s1 <> s2) (Cont (k -? s3))
 < == -- by def
 <   s1 (Cont ((k -? s3) -? s2))
 < == -- mutual IH
 <   s1 (Cont (k -? (s2 <> s3)))
 < == -- by def
 <   (s1 <> (s2 <> s3)) (Cont k)

 The \var{Cons} case uses mutual induction:

 <   ((k -? s2) -? s1) (Cons a s0)
 < == -- by def
 <   (k -? s2) (Cons a (s0 <> s1))
 < == -- by def
 <   k (Cons a ((s0 <> s1) <> s2))
 < == -- mutual IH
 <   k (Cons a (s0 <> (s1 <> s2)))
 < == -- by def
 <   (k -? (s1 <> s2)) (Cons a s0)

 (We omit the \var{Nil} case; it is similar to the \var{Full} case.)
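
The source-side equations used in these steps can likewise be transcribed and exercised on a finite example. In the sketch below, \var{appendSrc} and \var{forwardThenSnk} stand for \var{<>} on sources and for \var{-?}. The type definitions (with \var{Eff} fixed to \var{IO ()}), \var{shiftSrc}, \var{fromList} as written here and the helper \var{collect} are assumptions made for the illustration. The \var{Nil} equation of \var{forwardThenSnk} does not appear in the proof steps; we assume that it forwards the second source to the sink.

```haskell
import Data.IORef

-- Assumed restatement of the stream types, with effects fixed to IO ().
type Eff = IO ()
type N a = a -> Eff
data Source' a = Nil | Cons a (N (Sink' a))
data Sink' a = Full | Cont (N (Source' a))
type Src a = N (Sink' a)
type Snk a = N (Source' a)

shiftSrc :: Src a -> Snk a -> Eff
shiftSrc s k = s (Cont k)

-- (s1 <> s2): equations taken from the Full and Cont cases above.
appendSrc :: Src a -> Src a -> Src a
appendSrc s1 s2 Full     = s1 Full >> s2 Full
appendSrc s1 s2 (Cont k) = s1 (Cont (forwardThenSnk k s2))

-- (k -? s): the Cons equation is taken from the proof; the Nil equation is assumed.
forwardThenSnk :: Snk a -> Src a -> Snk a
forwardThenSnk k s Nil         = s (Cont k)
forwardThenSnk k s (Cons a s0) = k (Cons a (appendSrc s0 s))

-- A finite source built from a list (direct definition, for self-containment).
fromList :: [a] -> Src a
fromList _        Full     = return ()
fromList []       (Cont k) = k Nil
fromList (x : xs) (Cont k) = k (Cons x (fromList xs))

toList :: Src a -> N [a] -> Eff
toList s k = shiftSrc s (toListSnk k)

toListSnk :: N [a] -> Snk a
toListSnk k Nil         = k []
toListSnk k (Cons x xs) = toList xs $ \xs' -> k (x : xs')

-- Hypothetical helper: run a finite source to completion and return its elements.
collect :: Src a -> IO [a]
collect s = do
  ref <- newIORef []
  toList s (writeIORef ref)
  readIORef ref
```

Taking \var{a = fromList [1,2]}, \var{b = fromList [3]} and \var{c = fromList [4,5,6]}, both \var{collect (appendSrc (appendSrc a b) c)} and \var{collect (appendSrc a (appendSrc b c))} return \var{[1,2,3,4,5,6]}, as the associativity law predicts.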

 <!--


 --  LocalWords:  forkIO readChan writeChan newChan Applicative IORef
 --  LocalWords:  coroutine Coroutines hughes compositionality inFile
 --  LocalWords:  effectful kiselyov openFile ReadMode hGetContents NN
 --  LocalWords:  putStr hClose Girard fileSrc stdoutSnk stdout Src ap
 --  LocalWords:  compositional mempty mappend Dually involutive Snk
 --  LocalWords:  unshift monadic versa forkSrc textit iff rw eof pre
 --  LocalWords:  consumptions onSource onSink unshiftSnk unshiftSrc
 --  LocalWords:  shiftSnk shiftSrc kk flipSnk flipSrc mapSrc mapSnk
 --  LocalWords:  snk formers dnintro dndel duals takeSrc takeSnk th
 --  LocalWords:  monoidal appendSrc appendSnk forwardThenSnk src IH
 --  LocalWords:  forwardThenSrc infixr infixl equalities morphism ss
 --  LocalWords:  contravariant concatSrcSrc concatSnkSrc concatAux mx
 --  LocalWords:  TODO ssrc monads comonads comonad counit contramap
 --  LocalWords:  sinkToSnk superclasses josef subclasses zipWith Sym
 --  LocalWords:  zipSrc forkSnk zipSnk scanl scanSrc scanSnk foldl yy
 --  LocalWords:  foldSrc foldSnk dropSrc dropSnk fromList toList ret
 --  LocalWords:  linesSrc unlinesSnk untilSnk interleaveSnk filterSrc
 --  LocalWords:  filterSnk unchunk chunkSnk claessen newtype forall
 --  LocalWords:  longestResultSnk mres hFileSnk hPutStrLn fileSnk txt
 --  LocalWords:  FilePath WriteMode hFileSrc hIsEOF hGetLine copyFile
 --  LocalWords:  stdin hFileSrcSafe IOException Asynchronicity dually
 --  LocalWords:  demultiplexing dmux tb sab mux sa sb De ka resta kb
 --  LocalWords:  restb CoSrc CoSnk mapCoSrc mapCoSnk coToList strat
 --  LocalWords:  coFileSrc coFileSink srcToCoSrc coSnkToSnk sna tmp
 --  LocalWords:  distributable fileBuffer chanCoSnk chanSrc varCoSnk
 --  LocalWords:  chanBuffer writeIORef varSrc readIORef varBuffer kay
 --  LocalWords:  newIORef flipBuffer dualized bufferedDmux laurent el
 --  LocalWords:  collapseSnk zeilberger jackson Jacksons FeldSpar DSL
 --  LocalWords:  svenningsson iteratively Iteratees Kiselyov's GetC
 --  LocalWords:  iteratees citeyear iteratee Enumeratee elo eli GenT
 --  LocalWords:  enumeratees ReaderT caires Pucella Tov coutts tov js
 --  LocalWords:  acks url pandoc lhs bibliographystyle abbrvnat bs nb
 --  LocalWords:  PaperTools lastSrc foldr Kiselov's natively runtime
 --  LocalWords:  async algorithmically tmpFile gonzalez snoyman kmett
 --  LocalWords:  Atze der Ploeg enumFromToSrc ScopedTypeVariables acc
 --  LocalWords:  TypeOperators RankNTypes LiberalTypeSynonyms reify
 --  LocalWords:  BangPatterns TypeSynonymInstances FlexibleInstances
 --  LocalWords:  pipelining demultiplexed nnIntro nnElim QSem newQSem
 --  LocalWords:  ankner edsl axelsson toListSnk waitQSem signalQSem

 -->

 -- > f :: forall a. Src a -> N [a] -> ()
 -- > f x ret = do toList x ret
 -- >              return True

 -- > yy :: Src a
 -- > yy = undefined

 -- > zz :: N [a] -> IO Bool
 -- > zz ret = f yy ret

 -- > f :: forall a. Src a -> () -> IO Bool
 -- > f x () = do _ <- toList x _
 -- >             return True

 -- > yy :: Src a
 -- > yy = undefined

 -- > zz :: () -> IO Bool
 -- > zz = f yy
 --  LocalWords:  maccagnoni formulae smallcaps mellis mazurak CIH zz
 --  LocalWords:  boundedChanBuffer danielsson