Comprehending Ringads Jeremy Gibbons WadlerFest, April 2016

Comprehending Ringads 2 1. Comprehensions • ZF axiom schema of specification: { x 2 | x ∈ Nat ∧ x < 10 ∧ x is even } • SETL set-formers: { x ∗ x : x in { 0 . . 9 } | x mod 2 = 0 } • Eindhoven Quantifier Notation: ( x : 0 � x < 10 ∧ x is even : x 2 ) • Haskell (NPL, Python, . . . ) list comprehensions: [ x ∧ 2 | x ← [ 0 . . 9 ], even x ]

Comprehending Ringads 3 2. Comprehending monads (Wadler, 1992) Structure is that of a monad with zero ( T , return , mult , ∅ ) : mult :: T ( T a ) → T a mult ( mult xss ) = mult ( fmap mult xss ) return :: a → T a mult ( return x ) = x mzero :: T a mult ( fmap return x ) = x mult ∅ = ∅ Comprehensions can then be used for any monad-with-zero: D [ e | ] = return e = e ′ > D [ e | p ← e ′ , Q ] > = λ p → D [ e | Q ] = guard e ′ > D [ e | e ′ , Q ] > = λ() → D [ e | Q ] D [ e | let d , Q ] = let d in D [ e | Q ] (where x > > = k = mult ( fmap k x ) and guard b = if b then return () else ∅ ). Hence monad comprehensions for sets , bags , (sub-)distributions , exceptions . . .

Comprehending Ringads 4 3. Collection monads Finite collection types are monads. But the operations of a monad-with-zero cannot introduce multiplicity; need also ( ⊎ ) :: T a → T a → T a such that mult ( xs ⊎ ys ) ( mult xs ) ⊎ ( mult ys ) = x ⊎ ∅ x = ∅ ⊎ y = y Sets , bags , sub-distributions are collection monads; but exceptions are not. Eg the Boom Hierarchy : trees lists ⊎ is associative bags . . . and commutative sets . . . and idempotent

Comprehending Ringads 5 4. Aggregations Well-behaved operations h over collections: count , sum , some , . . . h ( return a ) = a h ( mult xs ) = h ( fmap h xs ) —the algebras for the monad T . Define ⊕ and ε by a ⊕ b = h ( return a ⊎ return b ) ε = h ∅ Then h ( x ⊎ y ) = h x ⊕ h y Moreover, ⊕ and ε satisfy whatever laws ⊎ and ∅ do: ε is the unit of ⊕ ; ⊕ is associative if ⊎ is; etc (at least on the range of h ).

Comprehending Ringads 6 5. Comprehending queries Wadler & Trinder (1991) argued for comprehensions as a query notation: Given input tables customers :: Bag ( CID , Name , Address ) invoices :: Bag ( IID , CID , Amount , Date ) then overdueInvoices = [ ( c . name , c . address , i . amount ) | c ← customers , i ← invoices , i . due < today , c . cid == i . customer ] Works similarly in any collection monad, not just bags. An influential observation in the DBPL community: basis of languages such as Buneman’s Kleisli , Microsoft LINQ , Wadler’s Links , as well as querying for objects ( OQL ) and XML ( XQuery ).

Comprehending Ringads 7 6. The problem with joins The comprehension yields a terrible query plan! Constructs entire cartesian product, then discards most of it: fmap (λ( c , i ) → ( c . name , c . address , i . amount ) ( filter (λ( c , i ) → c . cid == i . customer ) ( filter (λ( c , i ) → i . due < today ) ( cp customers invoices ))) Better to group by customer identifier, then handle groups separately: fmap ( fmap (λ c → ( c . name , c . address )) × fmap (λ i → i . amount )) ( fmap ( id × filter (λ i → i . due < today )) ( merge ( indexBy cid customers , indexBy customer invoices ))) (where indexBy partitions, and merge pairs on common index). But this doesn’t correspond to anything expressible in comprehensions.

Comprehending Ringads 8 7. Comprehensive comprehensions Parallel (‘zip’) comprehensions (Clean 1.0, 1995): [( x , y ) | x ← [ 1 , 2 , 3 ] | y ← [ 4 , 5 , 6 ]] = [( 1 , 4 ), ( 2 , 5 ), ( 3 , 6 )] ‘Order by’ and ‘group by’ (Wadler & Peyton Jones, 2007): [ ( the dept , sum salary ) | ( name , dept , salary ) ← employees , then group by dept using groupWith , then sortWith by sum salary ] (NB group by rebinds the variables salary etc bound earlier!) Initially just for lists, but also generalizable (Giorgidze et al., 2011): mzip T :: T a → T b → T ( a , b ) mgroupWith T , U , F :: Eq b ⇒ ( a → b ) → T a → U ( F a ) (Note heterogeneous type: T , U should be monads, F a functor.)

Comprehending Ringads 9 8. Solving the problem with (equi-)joins Maps-to-bags form a monad-with-zero—roughly: type Map k v = k → v type Table k v = Map k ( Bag v ) Now define merge :: ( Table k v , Table k w ) → Table k ( v , w ) merge ( f , g ) = λ k → cp ( f k ) ( g k ) indexBy :: Eq k ⇒ ( v → k ) → Bag v → Table k v indexBy f xs k = filter (λ v → f v == k ) xs and use merge for zipping, indexBy for grouping. With care, indexBy can be evaluated in linear time.

Comprehending Ringads 10 Now represent query as: overdueInvoices :: Map Int ( Name , Address , Bag Amount ) overdueInvoices = [ ( the name , the addr , amount ) | ( cid , name , addr ) ← customers , then group by cid using indexBy | ( iid , customer , amount , due ) ← invoices , due < today , then group by customer using indexBy ] Avoids expanding the whole cartesian product.

Comprehending Ringads 11 9. Finite maps A catch: • need monads , for comprehensions • need Map s, for indexing • need finite collections , for aggregation • but finite maps don’t form a monad (no return ) Solution?

Comprehending Ringads 12 10. Graded monads (Katsumata et al, 2016) Monad ( T , return , mult ) has endofunctor T : C → C , polymorphic functions return :: a → T a mult :: T ( T a ) → T a satisfying certain laws. M-graded monad ( T , return , mult ) for monoid ( M , ε, · ) has (non-endo-)functor T : M → [ C , C ] and return :: a → T ε a mult :: T m ( T n a ) → T ( m · n ) a with same laws. (Eg for collecting effects ; think also of vectors .) We use T = Table , with monoid ( K , � � , + + ) of finite type sequences. Not an endofunctor, but there is still a story involving adjunctions.

Comprehending Ringads 13 11. Ringads (Wadler, 1990) Wadler called collection monads ringads . Notes on monads and ringads Ringads are (roughly) Philip Wadler University of Glasgow ∗ right near-semirings in the 17 September 1990 right near-semiringy category These notes are in four parts. Section 1 summarises the usual definition of monads, as found, for instance, in Mac Lane’s text. Section 2 summarises a second, equivalent definition of monads, due to Kleisli. Section 3 extends Kleisli’s definition of monad to include a zero. Section 4 further extends this definition to include an associative operator of endofunctors under which has the zero of Section 3 as a unit: this is a ringad . 1 Monads a la Mac Lane composition and product. According to Mac Lane, a monad is a triple ( M, η, µ ), where M is a functor and η x : x ! M x and µ x : M 2 x ! M x are natural transformations satisfying: (1-1) µ x · η M x = id M x , Wadler’s note cited in (1-2) µ x · M η x = id M x , (1-3) µ x · µ M x = µ x · M µ x . Roughly speaking, the first two equations correspond to left and right identity laws, while numerous papers from the the third corresponds to associativity. In addition to the three explicit laws, we can also extract four “hidden” equations. Since M is a functor: 1990s (with varying degrees (1-4) M id x = id M x , (1-5) M g · M f = M ( g · f ) , of accuracy), but long where f : x ! y and g : y ! z . And, since η and µ are natural transformations: (1-6) = M f · η x η y · M f, µ y · M 2 f, (1-7) = M f · µ x thought lost. . . where f : x ! y . In the context of the lambda calculus, comprehensions can be viewed as a syntactic sugar for monads. Let x range over variables, t, u range over terms, and p, q, r range over ∗ Author’s address: Department of Computing Science, University of Glasgow, G12 8QQ, Scotland. Electronic mail: wadler@cs.glasgow.ac.uk. 1

Comprehending Ringads 14 12. Conclusions: Comprehending Wadler • list comprehensions • monads for functional programming • monad comprehensions and do notation • comprehensions for queries • comprehensive comprehensions • graded monads for the marriage with effects • thank you, Phil!

Comprehending Ringads Jeremy Gibbons WadlerFest, April 2016 - PowerPoint PPT Presentation

Comprehending Ringads Jeremy Gibbons WadlerFest, April 2016 Comprehending Ringads 2 1. Comprehensions ZF axiom schema of specification: { x 2 | x Nat x < 10 x is even } SETL set-formers: { x x : x in { 0 . . 9 } | x

Predicting and Comprehending Predicting and Comprehending Asteroid Impacts Asteroid Impacts

Comprehending Monadic Queries Jeremy Gibbons (joint work with Fritz Henglein, Ralf Hinze, Nicolas

CANADIAN IMMIGRATION LAW EVENT Comprehending the Incomprehensible September

Comprehending Compensation John P. Griffin, J.D., LL.M., Principal, ASC Institute, LLC John

Multiple Alignments and Phylogenies Mark Voorhies 3/29/2012 Mark Voorhies Multiple Alignments

1 Software maintenance: Relative Cost 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% 1970s

Possible Models Computation and Revision A Practical Approach Peter Baumgartner Data61 |

Under PDPM Specifics of the PT, OT, and SLP Classifications Kim Barrows, RN, BSN President

mask misalignment due to Double Patterning Arvind NV, Ajoy Mandal Texas Instruments India 1

Minimum Caterpillar Trees and Ring-Stars: a branch-and-cut algorithm Luidi G. Simonetti Yuri A.

Local System Security via SSHD Instrumenta5on Sco6 Campbell

Vitess, k8s & sharding Sugu Sougoumarane, Co-creator CTO, PlanetScale @vitessio What is

Translating Web Data Ronald Fagin, Mauricio A. Hernandez, Lucian Popa, IBM Almaden Research

Introduction to Database Systems Datalog & Deductive Databases Textbook Reference Database

CS5460: Operating Systems Lecture 5: Processes and Threads (Chapters 3-4) CS 5460: Operating

Towards a Learning Optimizer for Shared Clouds* Chenggang Wu, Alekh Jindal, Saeed Amizadeh, Hiren

1 and customer name for customers who order it. Display product name and category for products

YI R alice bob Dha What nodes processeachpacket O to What nodes need to 1 each packet a

Identity internetworking with eduGAIN Diego R. Lopez - RedIRIS Confederations Federate

Querying Relational Data: Algebra Gerome Miklau UMass Amherst CMPSCI 645 Database Systems

ROHC: compress your VoIP traffic Didier Barvaux didier.barvaux@toulouse.viveris.com

Relational Database Design Theory Part II CPS 196.3 Introduction to Database Systems 2

Crash Testing and Coverity The Numbers Caoln McNamara, Red Hat 2015-09-25 1 Caoln McNamara

Regularity and Grbner bases of the Rees algebra of edge ideals of bipartite graphs Yairon Cid