[Git][ghc/ghc][wip/romes/25170-idea4] Rewrite Notes
Simon Peyton Jones (@simonpj)
gitlab at gitlab.haskell.org
Fri Mar 14 23:04:18 UTC 2025
Simon Peyton Jones pushed to branch wip/romes/25170-idea4 at Glasgow Haskell Compiler / GHC
Commits:
927f71d1 by Simon Peyton Jones at 2025-03-14T23:03:18+00:00
Rewrite Notes
- - - - -
3 changed files:
- compiler/GHC/Core/Opt/Simplify/Iteration.hs
- compiler/GHC/Core/Opt/Simplify/Utils.hs
- compiler/GHC/Core/Opt/Specialise.hs
Changes:
=====================================
compiler/GHC/Core/Opt/Simplify/Iteration.hs
=====================================
@@ -2008,8 +2008,8 @@ So we go to some effort to avoid repeatedly simplifying the same thing:
* We go to some efforts to avoid unnecessarily simplifying ApplyToVal,
in at least two places
- In simplCast/addCoerce, where we check for isReflCo
- - In rebuildCall we avoid simplifying arguments before we have to
- (see Note [Trying rewrite rules])
+ - We sometimes try rewrite RULES befoe simplifying arguments;
+ see Note [Plan (BEFORE)]
All that said /postInlineUnconditionally/ (called in `completeBind`) does
fire in the above (f BIG) situation. See Note [Post-inline for single-use
@@ -2325,7 +2325,7 @@ simplOutId env fun cont
-- Normal case for (f e1 .. en)
simplOutId env fun cont
- = -- Try rewrite rules
+ = -- Try rewrite rules: Plan (BEFORE) in Note [When to apply rewrite rules]
do { rule_base <- getSimplRules
; let rules_for_me = getRules rule_base fun
out_args = contOutArgs env cont :: [OutExpr]
@@ -2422,7 +2422,7 @@ rebuildCall env fun_info
rebuildCall env (ArgInfo { ai_fun = fun, ai_args = rev_args, ai_rules = rules }) cont
| null rules
= rebuild env (argInfoExpr fun rev_args) cont
- | otherwise -- Try rules again
+ | otherwise -- Try rules again: Plan (AFTER) in Note [When to apply rewrite rules]
= do { let args = reverse rev_args
; mb_match <- tryRules env rules fun (map argSpecArg args)
; case mb_match of
@@ -2462,83 +2462,90 @@ tryInlining env logger var cont
text "Cont: " <+> ppr cont])]
-{- Note [Trying rewrite rules]
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-Consider an application (f e1 e2 e3) where the e1,e2,e3 are not yet
-simplified. We want to simplify enough arguments to allow the rules
-to apply, but it's more efficient to avoid simplifying e2,e3 if e1 alone
-is sufficient. Example: class ops
- (+) dNumInt e2 e3
-If we rewrite ((+) dNumInt) to plusInt, we can take advantage of the
-latter's strictness when simplifying e2, e3. Moreover, suppose we have
- RULE f Int = \x. x True
-
-Then given (f Int e1) we rewrite to
- (\x. x True) e1
-without simplifying e1. Now we can inline x into its unique call site,
-and absorb the True into it all in the same pass. If we simplified
-e1 first, we couldn't do that; see Note [Avoiding simplifying repeatedly].
-
-So we try to apply rules if either
- (a) no_more_args: we've run out of argument that the rules can "see"
- (b) nr_wanted: none of the rules wants any more arguments
-
-
-Note [RULES apply to simplified arguments]
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-It's very desirable to try RULES once the arguments have been simplified, because
-doing so ensures that rule cascades work in one pass. Consider
- {-# RULES g (h x) = k x
- f (k x) = x #-}
- ...f (g (h x))...
-Then we want to rewrite (g (h x)) to (k x) and only then try f's rules. If
-we match f's rules against the un-simplified RHS, it won't match. This
-makes a particularly big difference when superclass selectors are involved:
- op ($p1 ($p2 (df d)))
-We want all this to unravel in one sweep.
+{- Note [When to apply rewrite rules]
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+Should we apply rewrite rules before simplifying the arguments, or after?
+Both are plausible: see
+ - Note [Plan (BEFORE): try RULES /before/ simplifying arguments]
+ - Note [Plan (AFTER): try RULES /after/ simplifying arguments]
-Note [Rewrite rules and inlining]
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-In general we try to arrange that inlining is disabled (via a pragma) if
-a rewrite rule should apply, so that the rule has a decent chance to fire
-before we inline the function.
+So we do both!
+ - Plan (BEFORE) selectively, in `simplOutId`
+ - Plan (AFTER) always, in the finishing-up case of `rebuildCall`
+
+The "selectively" in Plan (BEFORE) is a bit ad-hoc:
+
+* We want Plan (BEFORE) for class ops (see Note [Plan (BEFORE)]).
+
+* We do NOT want Plan (BEFORE) for primops, because the constant-folding rules
+ are quite complicated and expeensive, and we don't want to try them twice.
+ Moreover the beneifts of Plan (BEFORE), described in the Note, don't apply to
+ primops.
-But it turns out that (especially when type-class specialisation or
-SpecConstr is involved) it is very helpful for the the rewrite rule to
-"win" over inlining when both are active at once: see #21851, #22097.
-The simplifier arranges to do this, as follows. In effect, the ai_rewrite
-field of the ArgInfo record is the state of a little state-machine:
+Note [Plan (BEFORE): try RULES /before/ simplifying arguments]
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+It is sometimes desirable to apply RULES before simplifying the function
+arguments. Two particuar cases:
+
+* Class ops
+ (+) dNumInt e2 e3
+ If we rewrite ((+) dNumInt) to plusInt, we can take advantage of the
+ latter's strictness when simplifying e2, e3. Moreover, if
+ (+) dNumInt e2 e3 --> (\x y -> ....) e2 e3
+ Frequently `x` is used just once in the body of the (\x y -> ...).
+ If `e2` is un-simplified we can preInlineUnconditinally and that saves
+ simplifying `e2` twice. See Note [Avoiding simplifying repeatedly].
+
+* Specialisation RULES. In general we try to arrange that inlining is disabled
+ (via a pragma) if a rewrite rule should apply, so that the rule has a decent
+ chance to fire before we inline the function.
+
+ But it turns out that (especially when type-class specialisation or
+ SpecConstr is involved) it is very helpful for the the rewrite rule to
+ "win" over inlining when both are active at once: see #21851, #22097.
+
+ So we want to try RULES before we try inlining.
+
+Wrinkles:
-* mkArgInfo sets the ai_rewrite field to TryRules if there are any rewrite
- rules avaialable for that function.
+(BF1) Each un-simplified argument has its own static environment, stored
+ in its `ApplyToVal` nodes. So we can't just match on the un-simplified
+ arguments: we have to apply that static environment as a substitution
+ first! This is done lazily in `contOutArgs`, so it'll be done just enough
+ to allow the rule to match, or not.
-* rebuildCall simplifies arguments until enough are simplified to match the
- rule with greatest arity. See Note [RULES apply to simplified arguments]
- and the first field of `TryRules`.
+Note [Plan (AFTER): try RULES /after/ simplifying arguments]
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+It's very desirable to try RULES once the arguments have been simplified,
+because doing so ensures that rule cascades work in one pass. Consider
- But no more! As soon as we have simplified enough arguments to satisfy the
- maximum-arity rules, we try the rules; see Note [Trying rewrite rules].
+ {-# RULES g (h x) = k x
+ f (k x) = x #-}
+ ...f (g (h x))...
+Then we want to rewrite (g (h x)) to (k x) and only then try f's rules. If
+we match f's rules against the un-simplified RHS, it won't match. This
+makes a particularly big difference for
-* Once we have tried rules (or immediately if there are no rules) set
- ai_rewrite to TryInlining, and the Simplifier will try to inline the
- function. We want to try this immediately (before simplifying any (more)
- arguments). Why? Consider
- f BIG where f = \x{OneOcc}. ...x...
- If we inline `f` before simplifying `BIG` well use preInlineUnconditionally,
- and we'll simplify BIG once, at x's occurrence, rather than twice.
+* Superclass selectors
+ op ($p1 ($p2 (df d)))
+ We want all this to unravel in one sweep
-* GHC.Core.Opt.Simplify.Utils. mkRewriteCall: if there are no rules, and no
- unfolding, we can skip both TryRules and TryInlining, which saves work.
+* Constant folding
+ +# 3# (+# 4# 5#)
+ We want this to happen in one pass
Note [Avoid redundant simplification]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-Because RULES apply to simplified arguments, there's a danger of repeatedly
-simplifying already-simplified arguments. An important example is that of
- (>>=) d e1 e2
-Here e1, e2 are simplified before the rule is applied, but don't really
-participate in the rule firing. So we mark them as Simplified to avoid
-re-simplifying them.
+Because RULES often apply to simplified arguments (see Note [Plan (AFTER)]),
+there's a danger of simplifying already-simplified arguments. For example,
+suppose we have
+ RULE f (x,y) = $sf x y
+and the expression
+ f (p,q) e1 e2
+With Plan (AFTER) by the time the rule fires, we will have already simplified e1, e2,
+and we want to avoid doing so a second time. So ApplyToVal records if the argument
+is already Simplified.
Note [Shadowing in the Simplifier]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
=====================================
compiler/GHC/Core/Opt/Simplify/Utils.hs
=====================================
@@ -562,8 +562,11 @@ contOutArgs env cont
go (ApplyToVal { sc_dup = dup, sc_arg = arg, sc_env = env, sc_cont = cont })
| isSimplified dup = arg : go cont
| otherwise = GHC.Core.Subst.substExpr (getFullSubst in_scope env) arg : go cont
- -- NOT substExprSC: we want to get the benefit of knowing what is
- -- evaluated etc, via the in-scope set
+ -- Make sure we apply the static environment `sc_env` as a substitution
+ -- to get an OutExpr. See (BF1) in Note [Plan (BEFORE)]
+ -- in GHC.Core.Opt.Simplify.Iteration
+ -- NB: we use substExpr, not substExprSC: we want to get the benefit of
+ -- knowing what is evaluated etc, via the in-scope set
-- No more arguments
go _ = []
=====================================
compiler/GHC/Core/Opt/Specialise.hs
=====================================
@@ -1379,7 +1379,7 @@ version of `g` will contain the call `f @Int`; but in the subsequent run of
the Simplifier, there will be a competition between:
* The user-supplied SPECIALISE rule for `f`
* The inlining of the wrapper for `f`
-In fact, the latter wins -- see Note [Rewrite rules and inlining] in
+In fact, the latter wins -- see Note [Plan (BEFORE): try RULES /before/ simplifying arguments]
GHC.Core.Opt.Simplify.Iteration. However, it a bit fragile.
Moreover consider (test T21851_2):
@@ -1409,10 +1409,10 @@ we load it up just once, in `initRuleEnv`, called at the beginning of
`specProgram`.
NB: you might wonder if running rules in the specialiser (this Note)
-renders Note [Rewrite rules and inlining] in the Simplifier redundant.
-That is, if we run rules in the specialiser, does it matter if we make
-rules "win" over inlining in the Simplifier? Yes, it does! See the
-discussion in #21851.
+renders Note [Plan (BEFORE): try RULES /before/ simplifying arguments]
+in the Simplifier (partly) redundant. That is, if we run rules in the
+specialiser, does it matter if we make rules "win" over inlining in
+the Simplifier? Yes, it does! See the discussion in #21851.
Note [Floating dictionaries out of cases]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
View it on GitLab: https://gitlab.haskell.org/ghc/ghc/-/commit/927f71d11af90cc1a8b373e9fe642b602dd95260
--
View it on GitLab: https://gitlab.haskell.org/ghc/ghc/-/commit/927f71d11af90cc1a8b373e9fe642b602dd95260
You're receiving this email because of your account on gitlab.haskell.org.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.haskell.org/pipermail/ghc-commits/attachments/20250314/f9e1155a/attachment-0001.html>
More information about the ghc-commits
mailing list