From Gergo.Erdi at sc.com Tue Apr 1 04:51:17 2025
From: Gergo.Erdi at sc.com (Erdi, Gergo)
Date: Tue, 1 Apr 2025 04:51:17 +0000
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

PUBLIC

This sounds extremely interesting, but I don’t understand where you are getting this number from! How do you see in the eventlog HTMLs that I’ve included that there are ~2000 ModuleGraphs? I’ve now tried using ghc-debug to find all ModuleGraph constructors at two points in the run: just before typechecking the first module (after all the extendMG calls) and just after typechecking the last module, and even in the cold case I only see 1 ModuleGraph before and 13 ModuleGraphs after.

Also, what do you mean by “precisely one loaded per interface loaded into the EPS”? Since my repro has 2294 modules, wouldn’t that mean 2294 ModuleGraphs by that metric?

From: Matthew Pickering
Sent: Saturday, March 29, 2025 1:53 AM
To: Erdi, Gergo
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi Gergo,

I quickly tried building `Cabal` with the master branch. There is precisely 1 ModuleGraph allocated for the home session, and precisely one loaded per interface loaded into the EPS. No leaky behaviour like you can see in your eventlogs.

It seems there are about 2000 live module graphs in your program; are you doing something with the API to create this many?

Cheers,

Matt

On Fri, Mar 28, 2025 at 12:40 PM Matthew Pickering wrote:

Hi Gergo,

Do you have a (synthetic?) reproducer? You have probably identified some memory leak. However, without any means to reproduce it, it becomes very difficult to investigate. I feel like we are getting into very precise details now, where speculating is not going to be so useful.
It seems like this is an important thing for you and your company. Is there any budget to pay for some investigation? If that were the case, then some effort could be made to create a synthetic reproducer and make the situation more robust going into the future, if your requirements were precisely understood.

Cheers,

Matt

On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote:

PUBLIC

Just to add that I get the same "equalizing" behaviour (but in a more "natural" way) if, instead of deepseq-ing the ModuleGraph upfront, I just call `hugInstancesBelow` before processing each module. So that's definitely one source of extra memory usage. I wonder if it would be possible to rebuild the ModuleGraph periodically (similar to the ModDetails dehydration), or if there are references to it stored all over the place from `HscEnv`s scattered around in closures etc. (basically the same problem the HPT had before it was made into a mutable reference).

-----Original Message-----
From: ghc-devs On Behalf Of Erdi, Gergo via ghc-devs
Sent: Friday, March 28, 2025 4:49 PM
To: Matthew Pickering; GHC Devs
Cc: ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi,

Unfortunately, I am forced to return to this problem. Everything below is now in the context of GHC 9.12 plus the mutable HPT patch backported.

My test case is typechecking a tree of 2294 modules that form the transitive closure of a single module's dependencies, all in a single process. I have done this typechecking three times; here's what `+RTS -s -RTS` reports for max residency:

* "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB
* "cold-top": With all `ModIface`s already on disk, except for the single top-level module: 302 MB
* "warm": With all `ModIface`s already on disk: 211 MB

So my stupidly naive question is, why is the "cold" case also not 302 MB?
In earlier discussion, `ModDetails` unfolding has come up. Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". So I tried a stupidly simple experiment where, after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far:

```
dehydrateHpt :: HscEnv -> [ModuleName] -> IO ()
dehydrateHpt hsc_env mods = do
    let HPT{ table = hptr } = hsc_HPT hsc_env
    hpt <- readIORef hptr
    for_ mods \mod ->
        for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do
            !details <- initModDetails hsc_env iface
            pure ()
```

Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same.

Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`. I could see this happening if typechecking from scratch involves more `modulesUnder` calls, which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph`, just like I remake the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, the idea being that this should "equalize" the three scenarios if this really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...).

So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep any more in memory than if I had loaded them from `ModIface` files.
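The `deepseq`'ing described above amounts to something like the following (a minimal sketch: it assumes an `NFData` instance is available for the value being forced, which for `ModuleGraph` may require extra instances in practice; `forceUpfront` is an illustrative name, not a GHC API function):

```haskell
import Control.DeepSeq (NFData, force)
import Control.Exception (evaluate)

-- Evaluate a value all the way to normal form, so that later traversals
-- cannot force (and thereby grow) any remaining thunks inside it.
forceUpfront :: NFData a => a -> IO a
forceUpfront = evaluate . force
```

Doing this once, before the profiled region begins, is what makes the three scenarios comparable: any heap growth seen afterwards cannot be attributed to lazily forced parts of the forced structure.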
Thanks,
Gergo

p.s.: I couldn't find a way in the EventLog output HTML to turn event markers on/off or filter them, so to avoid covering the whole graph with gray lines, I mark only every 100th module.

From: Matthew Pickering
Sent: Wednesday, February 12, 2025 7:08 PM
To: ÉRDI Gergő
Cc: Erdi, Gergo; Zubin Duggal; Montelatici, Raphael Laurent; GHC Devs
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

You do also raise a good point about rehydration costs.

In oneshot mode, you are basically rehydrating the entire transitive closure of each module when you compile it, which obviously results in a large amount of repeated work. This is why people are investigating ideas of a persistent worker, to at least avoid rehydrating all external dependencies as well.

On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering wrote:

Sure, you can remove them once you are sure they are not used anymore.

For clients like `GHCi` that obviously doesn't work, as they can be used at any point in the future, but for a batch compiler it would be fine.

On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote:

On Mon, 10 Feb 2025, Matthew Pickering wrote:

> I wonder if you have got your condition the wrong way around.
>
> The only "safe" time to perform rehydration is AFTER the point it can
> never be used again.
>
> If you rehydrate it just before it is used then you will repeat work
> which has already been done. If you do this, you will always have a
> trade-off between space used and runtime.

Oops. Yes, I have misunderstood the idea. I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it is traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails.
But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question of why even do that, instead of outright removing the HomeModInfos from the HPT?

----------------------------------------------------------------------
This email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please delete all copies and notify the sender immediately. You may wish to refer to the incorporation details of Standard Chartered PLC, Standard Chartered Bank and their subsidiaries together with Standard Chartered Bank’s Privacy Policy via our public website.
----------------------------------------------------------------------

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Gergo.Erdi at sc.com Tue Apr 1 06:04:46 2025
From: Gergo.Erdi at sc.com (Erdi, Gergo)
Date: Tue, 1 Apr 2025 06:04:46 +0000
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

PUBLIC

OK, scratch that, I was looking at the wrong ghc-debug output. Indeed there are 2301 ModuleGraphs in the heap at the end of typechecking :O

From: Erdi, Gergo
Sent: Tuesday, April 1, 2025 12:51 PM
To: Matthew Pickering
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: Re: GHC memory usage when typechecking from source vs.
loading ModIfaces This sounds extremely interesting, but I don’t understand where you are getting this number from! How do you see in the eventlog HTMLs that I’ve included that there are ~2000 ModuleGraphs? I’ve now tried using ghc-debug to find all ModuleGraph constructors at two points in the run: just before typechecking the first module (after all the extendMG calls) and just after typechecking the last module, and even in the cold case I only see 1 ModuleGraph before and 13 ModuleGraphs after. Also, what do you mean by “precisely one loaded per interface loaded into the EPS”? Since my repro has 2294 modules, wouldn’t that mean 2294 ModuleGraphs by that metric? From: Matthew Pickering > Sent: Saturday, March 29, 2025 1:53 AM To: Erdi, Gergo > Cc: GHC Devs >; ÉRDI Gergő >; Montelatici, Raphael Laurent >; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces Hi Gergo, I quickly tried building `Cabal` with the master branch. There is precisely 1 ModuleGraph allocated for the home session, and precisely one loaded per interface loaded into the EPS. No leaky behaviour like you can see in your eventlogs. It seems there are about 2000 live module graphs in your program, are you doing something with the API to create this many? Cheers, Matt On Fri, Mar 28, 2025 at 12:40 PM Matthew Pickering > wrote: HI Gergo, Do you have a (synthetic?) reproducer? You have probably identified some memory leak. However, without any means to reproduce it becomes very difficult to investigate. I feel like we are getting into very precise details now, where speculating is not going to be so useful. It seems like this is an important thing for you and your company. Is there any budget to pay for some investigation? If that was the case then some effort could be made to create a synthetic producer and make the situation more robust going into the future if your requirements were precisely understood. 
Cheers, Matt On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo > wrote: PUBLIC Just to add that I get the same "equalizing" behaviour (but in a more "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just call `hugInstancesBelow` before processing each module. So that's definitely one source of extra memory usage. I wonder if it would be possible to rebuild the ModuleGraph periodically (similar to the ModDetails dehydration), or if there are references to it stored all over the place from `HscEnv`s scattered around in closures etc. (basically the same problem the HPT had before it was made into a mutable reference). -----Original Message----- From: ghc-devs > On Behalf Of Erdi, Gergo via ghc-devs Sent: Friday, March 28, 2025 4:49 PM To: Matthew Pickering >; GHC Devs > Cc: ÉRDI Gergő >; Montelatici, Raphael Laurent >; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces Hi, Unfortunately, I am forced to return to this problem. Everything below is now in the context of GHC 9.12 plus the mutable HPT patch backported. My test case is typechecking a tree of 2294 modules that form the transitive closure of a single module's dependencies, all in a single process. I have done this typechecking three times, here's what `+RTS -s -RTS` reports for max residency: * "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB * "cold-top": With all `ModIface`s already on disk, except for the single top-level module: 302 MB * "warm": With all `ModIface`s already on disk: 211 MB So my stupidly naive question is, why is the "cold" case also not 302 MB? In earlier discussion, `ModDetails` unfolding has come up. Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". 
So I tried a stupid simple example where after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far: ``` dehydrateHpt :: HscEnv -> [ModuleName] -> IO () dehydrateHpt hsc_env mods = do let HPT{ table = hptr } = hsc_HPT hsc_env hpt <- readIORef hptr for_ mods \mod -> for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do !details <- initModDetails hsc_env iface pure () ``` Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same. Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`. I could see this happening if typechecking from scratch involves more `modulesUnder` calls which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph` just like I remake the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, with the idea being that this should "equalize" the three scenarios if this really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...). So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep anything more in memory around than if I loaded them from `ModIface` files. Thanks, Gergo p.s.: I couldn't find a way in the EventLog output HTML to turn event markers on/off or filter them, so to avoid covering the whole graph with gray lines, I mark only every 100th module. 
From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM To: ÉRDI Gergő > Cc: Erdi, Gergo >; Zubin Duggal >; Montelatici, Raphael Laurent >; GHC Devs > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces You do also raise a good point about rehydration costs. In oneshot mode, you are basically rehydrating the entire transitive closure of each module when you compile it, which obviously results in a large amount of repeated work. This is why people are investigating ideas of a persistent worker to at least avoid rehydrating all external dependencies as well. On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering > wrote: Sure, you can remove them once you are sure they are not used anymore. For clients like `GHCi` that doesn't work obviously as they can be used at any point in the future but for a batch compiler it would be fine. On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő > wrote: On Mon, 10 Feb 2025, Matthew Pickering wrote: > I wonder if you have got your condition the wrong way around. > > The only "safe" time to perform rehydration is AFTER the point it can > never be used again. > > If you rehydrate it just before it is used then you will repeat work > which has already been done. If you do this, you will always have a > trade-off between space used and runtime. Oops. Yes, I have misunderstood the idea. I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it are traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails. But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question why even do that, instead of straight removing the HomeModInfos from the HPT? 
From matthewtpickering at gmail.com Tue Apr 1 09:44:16 2025
From: matthewtpickering at gmail.com (Matthew Pickering)
Date: Tue, 1 Apr 2025 10:44:16 +0100
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

Hi Gergo,

I looked in the detailed pane, searched for ModuleGraph, hovered my mouse over the `ModuleGraph` constructor, recorded the number of live bytes, and divided that by 32.

Matt

On Tue, Apr 1, 2025 at 7:04 AM Erdi, Gergo wrote:

> PUBLIC
>
> OK, scratch that, I was looking at the wrong ghc-debug output.
Indeed there are > 2301 ModuleGraphs in the heap at the end of typechecking :O > > > > *From:* Erdi, Gergo > *Sent:* Tuesday, April 1, 2025 12:51 PM > *To:* Matthew Pickering > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* Re: GHC memory usage when typechecking from source vs. loading > ModIfaces > > > > This sounds extremely interesting, but I don’t understand where you are > getting this number from! How do you see in the eventlog HTMLs that I’ve > included that there are ~2000 ModuleGraphs? I’ve now tried using ghc-debug > to find all ModuleGraph constructors at two points in the run: just before > typechecking the first module (after all the extendMG calls) and just after > typechecking the last module, and even in the cold case I only see 1 > ModuleGraph before and 13 ModuleGraphs after. > > > > Also, what do you mean by “precisely one loaded per interface loaded into > the EPS”? Since my repro has 2294 modules, wouldn’t that mean 2294 > ModuleGraphs by that metric? > > > > *From:* Matthew Pickering > *Sent:* Saturday, March 29, 2025 1:53 AM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. loading ModIfaces > > > > > > Hi Gergo, > > > > I quickly tried building `Cabal` with the master branch. There is > precisely 1 ModuleGraph allocated for the home session, and precisely one > loaded per interface loaded into the EPS. No leaky behaviour like you can > see in your eventlogs. > > > > It seems there are about 2000 live module graphs in your program, are you > doing something with the API to create this many? > > > > Cheers, > > > > Matt > > > > On Fri, Mar 28, 2025 at 12:40 PM Matthew Pickering < > matthewtpickering at gmail.com> wrote: > > HI Gergo, > > > > Do you have a (synthetic?) reproducer? You have probably identified some > memory leak. 
However, without any means to reproduce it becomes very > difficult to investigate. I feel like we are getting into very precise > details now, where speculating is not going to be so useful. > > > > It seems like this is an important thing for you and your company. Is > there any budget to pay for some investigation? If that was the case then > some effort could be made to create a synthetic producer and make the > situation more robust going into the future if your requirements were > precisely understood. > > > > Cheers, > > > > Matt > > > > On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote: > > PUBLIC > > Just to add that I get the same "equalizing" behaviour (but in a more > "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just > call `hugInstancesBelow` before processing each module. So that's > definitely one source of extra memory usage. I wonder if it would be > possible to rebuild the ModuleGraph periodically (similar to the ModDetails > dehydration), or if there are references to it stored all over the place > from `HscEnv`s scattered around in closures etc. (basically the same > problem the HPT had before it was made into a mutable reference). > > -----Original Message----- > From: ghc-devs On Behalf Of Erdi, Gergo > via ghc-devs > Sent: Friday, March 28, 2025 4:49 PM > To: Matthew Pickering ; GHC Devs < > ghc-devs at haskell.org> > Cc: ÉRDI Gergő ; Montelatici, Raphael Laurent < > Raphael.Montelatici at sc.com>; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > Hi, > > Unfortunately, I am forced to return to this problem. Everything below is > now in the context of GHC 9.12 plus the mutable HPT patch backported. > > My test case is typechecking a tree of 2294 modules that form the > transitive closure of a single module's dependencies, all in a single > process. 
I have done this typechecking three times, here's what `+RTS -s > -RTS` reports for max residency: > > * "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB > > * "cold-top": With all `ModIface`s already on disk, except for the > single top-level module: 302 MB > > * "warm": With all `ModIface`s already on disk: 211 MB > > So my stupidly naive question is, why is the "cold" case also not 302 MB? > > In earlier discussion, `ModDetails` unfolding has come up. Dehydrating > `ModDetails` in the HPT all the time is disastrous for runtime, but based > on this model I would expect to see improvements from dehydrating "every > now and then". So I tried a stupid simple example where after every 100th > typechecked module, I run this function on the topologically sorted list of > modules processed so far: > > > ``` > dehydrateHpt :: HscEnv -> [ModuleName] -> IO () dehydrateHpt hsc_env mods > = do > let HPT{ table = hptr } = hsc_HPT hsc_env > hpt <- readIORef hptr > for_ mods \mod -> for_ (lookupUDFM hpt mod) \(HomeModInfo iface > _details _linkable) -> do > !details <- initModDetails hsc_env iface > pure () > ``` > > Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, > the profile looks exactly the same. > > Speaking of the profile, in the "cold" case I see a lot of steadily > increasing heap usage from the `ModuleGraph`. I could see this happening if > typechecking from scratch involves more `modulesUnder` calls which in turn > force more and more of the `ModuleGraph`. If so, then maybe this could be > worked around by repeatedly remaking the `ModuleGraph` just like I remake > the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing > the `ModuleGraph` at the start, with the idea being that this should > "equalize" the three scenarios if this really is a substantial source of > extra memory usage. 
This pushes up the warm case's memory usage to 381 MB, > which is promising, but I still see a `Word64Map` that is steadily > increasing in the "cold-force-modulegraph" case and contributes a lot to > the memory usage. Unfortunately, I don't know where that `Word64Map` is (it > could be any `Unique`-keyed environment...). > > So I am now stuck at this point. To spell out my goal explicitly, I would > like to typecheck one module after another and not keep anything more in > memory around than if I loaded them from `ModIface` files. > > Thanks, > Gergo > > p.s.: I couldn't find a way in the EventLog output HTML to turn event > markers on/off or filter them, so to avoid covering the whole graph with > gray lines, I mark only every 100th module. > > > > > From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM > To: ÉRDI Gergő > Cc: Erdi, Gergo ; Zubin Duggal ; > Montelatici, Raphael Laurent ; GHC Devs < > ghc-devs at haskell.org> > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > You do also raise a good point about rehydration costs. > > In oneshot mode, you are basically rehydrating the entire transitive > closure of each module when you compile it, which obviously results in a > large amount of repeated work. This is why people are investigating ideas > of a persistent worker to at least avoid rehydrating all external > dependencies as well. > > On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering matthewtpickering at gmail.com> wrote: > Sure, you can remove them once you are sure they are not used anymore. > > For clients like `GHCi` that doesn't work obviously as they can be used at > any point in the future but for a batch compiler it would be fine. > > On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote: > On Mon, 10 Feb 2025, Matthew Pickering wrote: > > > I wonder if you have got your condition the wrong way around. 
> > > > The only "safe" time to perform rehydration is AFTER the point it can > > never be used again. > > > > If you rehydrate it just before it is used then you will repeat work > > which has already been done. If you do this, you will always have a > > trade-off between space used and runtime. > > Oops. Yes, I have misunderstood the idea. I thought the idea was that > after loading a given module into the HPT, its ModDetails would start out > small (because of laziness) and then keep growing in size as more and more > of it are traversed, and thus forced, during the typechecking of its > dependees, so at some point we would want to reset that into the small > initial representation as created by initModDetails. > > But if the idea is that I should rehydrate modules when they can't be used > anymore, then that brings up the question why even do that, instead of > straight removing the HomeModInfos from the HPT? > > ---------------------------------------------------------------------- > > ------------------------------ > This email and any attachments are confidential and may also be > privileged. If you are not the intended recipient, please delete all copies > and notify the sender immediately. You may wish to refer to the > incorporation details of Standard Chartered PLC, Standard Chartered Bank > and their subsidiaries together with Standard Chartered Bank’s Privacy > Policy via our public website. > ------------------------------ > This email and any attachments are confidential and may also be > privileged. If you are not the intended recipient, please delete all copies > and notify the sender immediately. You may wish to refer to the > incorporation details of Standard Chartered PLC, Standard Chartered Bank > and their subsidiaries together with Standard Chartered Bank’s Privacy > Policy via our main Standard Chartered PLC (UK) website at sc. com > -------------- next part -------------- An HTML attachment was scrubbed... 
URL:

From Gergo.Erdi at sc.com Tue Apr 1 12:24:14 2025
From: Gergo.Erdi at sc.com (Erdi, Gergo)
Date: Tue, 1 Apr 2025 12:24:14 +0000
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

PUBLIC

OK, so this `ModuleGraph` leak has the potential to be a huge explainer, because of the large number of ModuleNodes and NodeKey_Modules in the heap profile.

It turns out the reason we had one ModuleGraph per module was that I was populating the ModuleGraph by repeated calls to `extendMG`, where the new edges were created via `mgModSummaries` on the old `ModuleGraph`, and so the new edge list was a thunk pointing to the old `ModuleGraph`. I’ve now changed it to a single `mkModuleGraph` call, which gets rid of the extra ModuleGraphs (see attached profile), but unfortunately, the ModuleNodes / edge NodeKey_Modules are still not shared. Here’s how I build the graph, trying hard with a Data.Map.Map to ensure I use only one NodeKey_Module per target module:

```
registerModulesToGraph :: GhcMonad m => [(UnitId, ModuleName, [ModuleName])] -> m ()
registerModulesToGraph entries = modifySessionM \hsc_env -> do
    mod_deps <- for entries \(unit, mod_name, deps) -> do
        mod <- liftIO $ registerToModuleFinder hsc_env unit mod_name
        let ms = dummyModSummary (hsc_dflags hsc_env) mod deps fingerprint0
        pure (ms, deps)
    let nodes = [ModuleNode (edges deps) ms | (ms, deps) <- mod_deps]
    pure hsc_env{ hsc_mod_graph = mkModuleGraph nodes }
  where
    edge_map :: Map.Map ModuleName NodeKey
    edge_map = Map.fromList
        [ (mod_name, NodeKey_Module $ ModNodeKeyWithUid (notBoot mod_name) uid)
        | (uid, mod_name, _) <- entries
        ]

    edges deps = map (edge_map Map.!) deps

registerToModuleFinder :: HscEnv -> UnitId -> ModuleName -> IO Module
registerToModuleFinder hsc_env unitId modName = do
    addHomeModuleToFinder (hsc_FC hsc_env) homeUnit (notBoot modName) location
  where
    homeUnit = DefiniteHomeUnit unitId Nothing
    location = dummyModLocation modName
```

But, as you can see in the attached profile, I still get way more than 2k ModuleNodes and NodeKey_Modules…

From: Matthew Pickering
Sent: Tuesday, April 1, 2025 5:44 PM
To: Erdi, Gergo
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi Gergo,

I looked in the detailed pane, searched for ModuleGraph, hovered my mouse over the `ModuleGraph` constructor, recorded the number of live bytes, and divided that by 32.

Matt

On Tue, Apr 1, 2025 at 7:04 AM Erdi, Gergo wrote:

PUBLIC

OK, scratch that, I was looking at the wrong ghc-debug output. Indeed there are 2301 ModuleGraphs in the heap at the end of typechecking :O

From: Erdi, Gergo
Sent: Tuesday, April 1, 2025 12:51 PM
To: Matthew Pickering
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: Re: GHC memory usage when typechecking from source vs. loading ModIfaces

This sounds extremely interesting, but I don’t understand where you are getting this number from! How do you see in the eventlog HTMLs that I’ve included that there are ~2000 ModuleGraphs? I’ve now tried using ghc-debug to find all ModuleGraph constructors at two points in the run: just before typechecking the first module (after all the extendMG calls) and just after typechecking the last module, and even in the cold case I only see 1 ModuleGraph before and 13 ModuleGraphs after.

Also, what do you mean by “precisely one loaded per interface loaded into the EPS”? Since my repro has 2294 modules, wouldn’t that mean 2294 ModuleGraphs by that metric?
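The retention pattern described at the top of this message — each `extendMG` call building its new edge list as a thunk over the previous `ModuleGraph` — can be modelled with a toy type. This is a sketch only: `Graph`, `extendLeaky`, `extendStrict`, and `mkGraph` are illustrative names, not GHC API.

```haskell
-- Toy stand-in for ModuleGraph: just a list of module "summaries".
newtype Graph = Graph { summaries :: [Int] }

-- Leaky shape (cf. extendMG + mgModSummaries): the new summary list is
-- an unevaluated thunk that captures the whole previous Graph, so a
-- chain of extensions keeps every intermediate Graph alive on the heap.
extendLeaky :: Graph -> Int -> Graph
extendLeaky g n = Graph (n : summaries g)

-- Non-leaky shape: force the projection first, so the new Graph holds a
-- reference to the list only, never to the old Graph constructor.
extendStrict :: Graph -> Int -> Graph
extendStrict g n = let ss = summaries g in ss `seq` Graph (n : ss)

-- Building the node list up front and constructing once (cf.
-- mkModuleGraph) sidesteps the chain of intermediate graphs entirely.
mkGraph :: [Int] -> Graph
mkGraph = Graph
```

Both shapes compute the same graph; they differ only in what the final value keeps reachable, which is why the leak shows up in a heap profile rather than in the program's results.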
From: Matthew Pickering
Sent: Saturday, March 29, 2025 1:53 AM
To: Erdi, Gergo
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi Gergo,

I quickly tried building `Cabal` with the master branch. There is precisely 1 ModuleGraph allocated for the home session, and precisely one loaded per interface loaded into the EPS. No leaky behaviour like you can see in your eventlogs.

It seems there are about 2000 live module graphs in your program; are you doing something with the API to create this many?

Cheers,

Matt

On Fri, Mar 28, 2025 at 12:40 PM Matthew Pickering wrote:

Hi Gergo,

Do you have a (synthetic?) reproducer? You have probably identified some memory leak. However, without any means to reproduce it, it becomes very difficult to investigate. I feel like we are getting into very precise details now, where speculating is not going to be so useful.

It seems like this is an important thing for you and your company. Is there any budget to pay for some investigation? If that was the case, then some effort could be made to create a synthetic reproducer and make the situation more robust going into the future, if your requirements were precisely understood.

Cheers,

Matt

On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote:

PUBLIC

Just to add that I get the same "equalizing" behaviour (but in a more "natural" way) if, instead of deepseq-ing the ModuleGraph upfront, I just call `hugInstancesBelow` before processing each module. So that's definitely one source of extra memory usage. I wonder if it would be possible to rebuild the ModuleGraph periodically (similar to the ModDetails dehydration), or if there are references to it stored all over the place from `HscEnv`s scattered around in closures etc. (basically the same problem the HPT had before it was made into a mutable reference).
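Matt's counting method from earlier in the thread ("recorded the number of live bytes, divided that by 32") assumes each `ModuleGraph` closure occupies a fixed 32 bytes, i.e. four machine words on a 64-bit platform. A hedged sketch of that arithmetic (the 32-byte figure is taken from the thread, not independently verified, and real closure sizes vary with field count):

```haskell
-- Estimate a constructor count from a heap profile's live-byte figure,
-- assuming a fixed closure size of 32 bytes (4 words of 8 bytes each).
bytesPerClosure :: Int
bytesPerClosure = 32

closureCount :: Int -> Int
closureCount liveBytes = liveBytes `div` bytesPerClosure
```

For example, a live-byte figure of 73632 for the `ModuleGraph` constructor would correspond to the 2301 graphs reported later in the thread.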
-----Original Message-----
From: ghc-devs On Behalf Of Erdi, Gergo via ghc-devs
Sent: Friday, March 28, 2025 4:49 PM
To: Matthew Pickering; GHC Devs
Cc: ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi,

Unfortunately, I am forced to return to this problem. Everything below is now in the context of GHC 9.12 plus the mutable HPT patch backported.

My test case is typechecking a tree of 2294 modules that form the transitive closure of a single module's dependencies, all in a single process. I have done this typechecking three times; here's what `+RTS -s -RTS` reports for max residency:

* "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB
* "cold-top": With all `ModIface`s already on disk, except for the single top-level module: 302 MB
* "warm": With all `ModIface`s already on disk: 211 MB

So my stupidly naive question is, why is the "cold" case also not 302 MB?

In earlier discussion, `ModDetails` unfolding has come up. Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". So I tried a stupidly simple example where, after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far:

```
dehydrateHpt :: HscEnv -> [ModuleName] -> IO ()
dehydrateHpt hsc_env mods = do
  let HPT{ table = hptr } = hsc_HPT hsc_env
  hpt <- readIORef hptr
  for_ mods \mod ->
    for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do
      !details <- initModDetails hsc_env iface
      pure ()
```

Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same.

Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`.
I could see this happening if typechecking from scratch involves more `modulesUnder` calls, which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph`, just like I remake the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, the idea being that this should "equalize" the three scenarios if this really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...).

So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep anything more in memory than if I loaded them from `ModIface` files.

Thanks,
Gergo

p.s.: I couldn't find a way in the EventLog output HTML to turn event markers on/off or filter them, so to avoid covering the whole graph with gray lines, I mark only every 100th module.

From: Matthew Pickering
Sent: Wednesday, February 12, 2025 7:08 PM
To: ÉRDI Gergő
Cc: Erdi, Gergo; Zubin Duggal; Montelatici, Raphael Laurent; GHC Devs
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

You do also raise a good point about rehydration costs.

In oneshot mode, you are basically rehydrating the entire transitive closure of each module when you compile it, which obviously results in a large amount of repeated work. This is why people are investigating ideas of a persistent worker, to at least avoid rehydrating all external dependencies as well.

On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering wrote:

Sure, you can remove them once you are sure they are not used anymore.
For clients like `GHCi` that doesn't work, obviously, as they can be used at any point in the future, but for a batch compiler it would be fine.

On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote:

On Mon, 10 Feb 2025, Matthew Pickering wrote:

> I wonder if you have got your condition the wrong way around.
>
> The only "safe" time to perform rehydration is AFTER the point it can
> never be used again.
>
> If you rehydrate it just before it is used then you will repeat work
> which has already been done. If you do this, you will always have a
> trade-off between space used and runtime.

Oops. Yes, I have misunderstood the idea. I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it is traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails.

But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question why even do that, instead of straight removing the HomeModInfos from the HPT?

----------------------------------------------------------------------
This email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please delete all copies and notify the sender immediately. You may wish to refer to the incorporation details of Standard Chartered PLC, Standard Chartered Bank and their subsidiaries together with Standard Chartered Bank’s Privacy Policy via our public website.
----------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
-------------- next part --------------
A non-text attachment was scrubbed...
Name: cold-no-modulegraph-leak.zip
Type: application/x-zip-compressed
Size: 1337463 bytes
Desc: cold-no-modulegraph-leak.zip
URL:

From george.colpitts at gmail.com Tue Apr 1 14:48:34 2025
From: george.colpitts at gmail.com (George Colpitts)
Date: Tue, 1 Apr 2025 11:48:34 -0300
Subject: can't move to latest version of llvm on ghc 9.12.2
Message-ID:

LLVM 20 is out, but unlike with earlier versions of GHC, moving to it means you can no longer use LLVM.

Compiling:

ghc -fllvm hello.hs
Loaded package environment from /Users/avie/.ghc/aarch64-darwin-9.12.2/environments/default
[1 of 2] Compiling Main ( hello.hs, hello.o )
: error: [GHC-66599]
    GHC was not configured with a supported LLVM toolchain
    Make sure you have installed LLVM between [13 and 20) and reinstall GHC to make -fllvm work

From configure:

configure: We only support llvm 13 upto 20 (non-inclusive) (found 20.1.1).

Can we move to llvm 20 on HEAD, and can we revert to the old behavior on ghc 9.12.3?

Should I file an ER?

Thanks

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From allbery.b at gmail.com Tue Apr 1 16:27:41 2025
From: allbery.b at gmail.com (Brandon Allbery)
Date: Tue, 1 Apr 2025 12:27:41 -0400
Subject: can't move to latest version of llvm on ghc 9.12.2
In-Reply-To:
References:
Message-ID:

Have you demonstrated that it works? IIRC the current behavior is because some LLVM version (16, IIRC) didn't work with GHC (threw errors from opt, I think).
On Tue, Apr 1, 2025 at 10:49 AM George Colpitts wrote:

> llvm 20 is out but unlike in earlier versions of ghc moving to it means
> you can no longer use llvm: [...]

--
brandon s allbery kf8nh
allbery.b at gmail.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From george.colpitts at gmail.com Tue Apr 1 17:05:59 2025
From: george.colpitts at gmail.com (George Colpitts)
Date: Tue, 1 Apr 2025 14:05:59 -0300
Subject: can't move to latest version of llvm on ghc 9.12.2
In-Reply-To:
References:
Message-ID:

I was trying to find out if it works, but I ran into the problem I described. Yes, there were a few years where the latest version didn't work, but in many years it did. GHC used to give a warning — "this is an unsupported version but we'll try it anyway". I would like to return to that.

In any case, I believe GHC devs should test to see if it works and, if so, use it in HEAD. If it doesn't work, then we should file a bug and fix it. In the past we got a few versions behind LLVM. I think we want to be on the latest version available for each new release if possible. The earlier we look into this, the more chance we have to succeed at that.
On Tue, Apr 1, 2025 at 1:27 PM Brandon Allbery wrote:

> Have you demonstrated that it works? IIRC the current behavior is because
> some LLVM version (16, IIRC) didn't work with GHC (threw errors from opt,
> I think). [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From klebinger.andreas at gmx.at Tue Apr 1 17:11:24 2025
From: klebinger.andreas at gmx.at (Andreas Klebinger)
Date: Tue, 1 Apr 2025 19:11:24 +0200
Subject: can't move to latest version of llvm on ghc 9.12.2
In-Reply-To:
References:
Message-ID: <4db7eafd-69b0-476c-bffd-e2a8827d7346@gmx.at>

Looking at https://gitlab.haskell.org/ghc/ghc/-/issues/25011, it seems the desire was to fail with a proper error when LLVM was not found at all.

To me, failing when the LLVM version is too new seems like an unintended side effect of fixing the former.

I agree that simply warning and trying anyway is the better default. A ticket would be appreciated!
Am 01/04/2025 um 19:05 schrieb George Colpitts:

> I was trying to find out if it works but I ran into the problem I
> described. [...]

_______________________________________________
ghc-devs mailing list
ghc-devs at haskell.org
http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From george.colpitts at gmail.com Tue Apr 1 20:15:46 2025
From: george.colpitts at gmail.com (George Colpitts)
Date: Tue, 1 Apr 2025 17:15:46 -0300
Subject: can't move to latest version of llvm on ghc 9.12.2
In-Reply-To: <4db7eafd-69b0-476c-bffd-e2a8827d7346@gmx.at>
References: <4db7eafd-69b0-476c-bffd-e2a8827d7346@gmx.at>
Message-ID:

Thanks, I'll file a ticket today or tomorrow.

On Tue, Apr 1, 2025 at 2:11 PM Andreas Klebinger via ghc-devs wrote:

> Looking at https://gitlab.haskell.org/ghc/ghc/-/issues/25011 it seems the
> desire was to fail with a proper error when LLVM was not found at all. [...]

-------------- next part --------------
An HTML attachment was scrubbed...
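The version window that configure enforces in the thread above ("We only support llvm 13 upto 20 (non-inclusive)") is simple to model. A sketch of the bound check, with the bounds copied from the error message (the helper names are illustrative, not GHC's actual configure logic):

```haskell
-- GHC 9.12's configure message reports the supported window [13, 20).
supportedLlvm :: Int -> Bool
supportedLlvm major = major >= 13 && major < 20

-- Extract the major version from a string like "20.1.1".
majorOf :: String -> Int
majorOf = read . takeWhile (/= '.')
```

By this check, the 20.1.1 toolchain from the report falls just outside the window, which is exactly the failure George hit.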
URL:

From matthewtpickering at gmail.com Wed Apr 2 09:14:44 2025
From: matthewtpickering at gmail.com (Matthew Pickering)
Date: Wed, 2 Apr 2025 10:14:44 +0100
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

Hi Gergo,

Since this is a non-reproducible issue with your custom code, I think that we've reached the limit of free debugging via email. There are several consultancies which can be employed to investigate complicated issues like this and perform the necessary changes to GHC to make your use-case easier.

If you have a reproducer on a distributed GHC release then it could be investigated.

Cheers,

Matt

On Tue, Apr 1, 2025 at 1:24 PM Erdi, Gergo wrote:

> PUBLIC
>
> OK so this `ModuleGraph` leak has the potential to be a huge explainer
> because of the large number of ModuleNodes and NodeKey_Modules in the heap
> profile. It turns out the reason we had one ModuleGraph per module was
> because I was populating the ModuleGraph by repeated calls to `extendMG`,
> where the new edges were created via `mgModSummaries` on the old
> `ModuleGraph`, and so the new edge list was a thunk pointing to the old
> `ModuleGraph`.
>
> I’ve now changed it to a single `mkModuleGraph` call, which gets rid of
> the extra ModuleGraphs (see attached profile), but unfortunately, the
> ModuleNodes/edge NodeKey_Modules are still not shared.
URL:

From Gergo.Erdi at sc.com Wed Apr 2 09:25:56 2025
From: Gergo.Erdi at sc.com (Erdi, Gergo)
Date: Wed, 2 Apr 2025 09:25:56 +0000
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

PUBLIC

Hi Matt,

I think I have something that demonstrates that GHC (at least GHC 9.12.1) may have a similar problem!

With the attached vacuous module hierarchy, I tried compiling M2294 from scratch, and then with `.hi` files for everything except the toplevel module. I did the same with our GHC-API-using compiler as well. As you can see from the attached event logs, while the details differ, the overall shape of the memory used by ModuleGraph edges (750k of GWIB and NodeKey_Module constructors for the 2321 ModuleNodes and ~60k direct dependency edges) is pretty much the same between our compiler and GHC 9.12, suggesting to me that GHC is duplicating ModuleGraph node information in the dependency edges when building the transitive closure.

Based on these measurements, do you agree that this is a GHC-side problem of memory usage scaling quadratically with the number of dependency edges?

Thanks,
Gergo

p.s.: Sorry for including the reproducer module tree in this weird format as a patch file, but I am behind a mail server that won’t let me send mails with too many individual files in attached archives…

From: Matthew Pickering
Sent: Friday, March 28, 2025 8:40 PM
To: Erdi, Gergo
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi Gergo,

Do you have a (synthetic?) reproducer? You have probably identified some memory leak. However, without any means to reproduce it, it becomes very difficult to investigate. I feel like we are getting into very precise details now, where speculating is not going to be so useful.
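A vacuous module hierarchy like the one described can be generated with a short script. The sketch below is an assumption about the reproducer's shape (linear numbering M1..M2294, with each module importing a bounded window of its predecessors); the actual attached tree may fan out differently:

```haskell
-- Hypothetical generator for a "vacuous" module tree: M<n> imports a
-- bounded window of lower-numbered modules and defines nothing. The
-- fan-out of up to 30 imports per module is an assumption, not the
-- actual shape of the attached reproducer.
moduleSource :: Int -> String
moduleSource n = unlines $
    ("module M" ++ show n ++ " where") : imports
  where
    imports = [ "import M" ++ show k | k <- deps ]
    deps    = takeWhile (> 0) [ n - d | d <- [1 .. 30] ]
```

Writing the tree out is then `mapM_ (\n -> writeFile ("M" ++ show n ++ ".hs") (moduleSource n)) [1 .. 2294]`, after which `ghc --make -fno-code M2294.hs` (or the GHC API equivalent) exercises the same dependency-edge bookkeeping.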
It seems like this is an important thing for you and your company. Is there any budget to pay for some investigation? If that were the case, some effort could be made to create a synthetic reproducer and make the situation more robust going into the future, provided your requirements were precisely understood.

Cheers,

Matt

On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote:

PUBLIC

Just to add that I get the same "equalizing" behaviour (but in a more "natural" way) if, instead of deepseq-ing the ModuleGraph upfront, I just call `hugInstancesBelow` before processing each module. So that's definitely one source of extra memory usage. I wonder if it would be possible to rebuild the ModuleGraph periodically (similar to the ModDetails dehydration), or if there are references to it stored all over the place from `HscEnv`s scattered around in closures etc. (basically the same problem the HPT had before it was made into a mutable reference).

-----Original Message-----
From: ghc-devs On Behalf Of Erdi, Gergo via ghc-devs
Sent: Friday, March 28, 2025 4:49 PM
To: Matthew Pickering; GHC Devs
Cc: ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

Hi,

Unfortunately, I am forced to return to this problem. Everything below is now in the context of GHC 9.12 plus the mutable HPT patch backported.

My test case is typechecking a tree of 2294 modules that form the transitive closure of a single module's dependencies, all in a single process. I have done this typechecking three times; here's what `+RTS -s -RTS` reports for max residency:

* "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB
* "cold-top": With all `ModIface`s already on disk, except for the single top-level module: 302 MB
* "warm": With all `ModIface`s already on disk: 211 MB

So my stupidly naive question is, why is the "cold" case also not 302 MB?
In earlier discussion, `ModDetails` unfolding has come up. Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". So I tried a simple experiment where, after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far:

```
dehydrateHpt :: HscEnv -> [ModuleName] -> IO ()
dehydrateHpt hsc_env mods = do
  let HPT{ table = hptr } = hsc_HPT hsc_env
  hpt <- readIORef hptr
  for_ mods \mod -> for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do
    !details <- initModDetails hsc_env iface
    pure ()
```

But the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same.

Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`. I could see this happening if typechecking from scratch involves more `modulesUnder` calls, which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph`, just like I remake the `ModDetails` above. I tried to get rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, the idea being that this should "equalize" the three scenarios if it really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...).

So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep anything more in memory than if I had loaded them from `ModIface` files.
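As a stand-alone illustration of the dehydration idea being discussed, here is the intended pattern with hypothetical `Iface`/`Details` types standing in for GHC's `ModIface`/`ModDetails`, and `elaborate` standing in for `initModDetails` (none of these are GHC's actual types):

```haskell
import qualified Data.Map as M

-- Hypothetical stand-ins: a small, compact seed and the large
-- elaborated form derived from it.
newtype Iface   = Iface Int
newtype Details = Details [Int]

-- Stands in for initModDetails: lazily rebuild the large form from the seed.
elaborate :: Iface -> Details
elaborate (Iface n) = Details [1 .. n]

-- "Dehydrate" a table: drop however much of each Details has been
-- forced so far and replace it with an unevaluated rebuild from the
-- seed, so residency shrinks back until the entry is demanded again.
dehydrate :: M.Map String (Iface, Details) -> M.Map String (Iface, Details)
dehydrate = M.map (\(iface, _old) -> (iface, elaborate iface))
```

Note that for the shrinking to take effect, the freshly built thunk has to be written back into the table; the `dehydrateHpt` experiment above computes `initModDetails` but does not appear to store the result back into the HPT, which may be one reason the profile was unchanged.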
Thanks, Gergo p.s.: I couldn't find a way in the EventLog output HTML to turn event markers on/off or filter them, so to avoid covering the whole graph with gray lines, I mark only every 100th module. From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM To: ÉRDI Gergő > Cc: Erdi, Gergo >; Zubin Duggal >; Montelatici, Raphael Laurent >; GHC Devs > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces You do also raise a good point about rehydration costs. In oneshot mode, you are basically rehydrating the entire transitive closure of each module when you compile it, which obviously results in a large amount of repeated work. This is why people are investigating ideas of a persistent worker to at least avoid rehydrating all external dependencies as well. On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering > wrote: Sure, you can remove them once you are sure they are not used anymore. For clients like `GHCi` that doesn't work obviously as they can be used at any point in the future but for a batch compiler it would be fine. On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő > wrote: On Mon, 10 Feb 2025, Matthew Pickering wrote: > I wonder if you have got your condition the wrong way around. > > The only "safe" time to perform rehydration is AFTER the point it can > never be used again. > > If you rehydrate it just before it is used then you will repeat work > which has already been done. If you do this, you will always have a > trade-off between space used and runtime. Oops. Yes, I have misunderstood the idea. I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it are traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails. 
But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question why even do that, instead of straight removing the HomeModInfos from the HPT?

----------------------------------------------------------------------
This email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please delete all copies and notify the sender immediately. You may wish to refer to the incorporation details of Standard Chartered PLC, Standard Chartered Bank and their subsidiaries together with Standard Chartered Bank’s Privacy Policy via our public website.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ghc-prof.zip
Type: application/x-zip-compressed
Size: 1722494 bytes
Desc: ghc-prof.zip
URL:
-------------- next part --------------
A non-text attachment was scrubbed...
Name: repro-hs.patch.gz
Type: application/x-gzip
Size: 80437 bytes
Desc: repro-hs.patch.gz
URL:

From matthewtpickering at gmail.com Wed Apr 2 09:38:34 2025
From: matthewtpickering at gmail.com (Matthew Pickering)
Date: Wed, 2 Apr 2025 10:38:34 +0100
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

What command do I run to generate the files from this patch file? Perhaps a link to a git repo would be a suitable way to share the reproducer?

On Wed, Apr 2, 2025 at 10:26 AM Erdi, Gergo wrote:

> PUBLIC
>
> Hi Matt,
>
> I think I have something that might demonstrate that GHC (at least GHC
> 9.12.1) might have a similar problem!
>
> With the attached vacuous module hierarchy, I tried compiling M2294 from
> scratch, and then with `.hi` files for everything except the toplevel
> module. I did the same with our GHC-API-using compiler as well.
As you can > see from the attached event logs, while the details differ, the overall > shape of the memory used by ModuleGraph edges (750k of GWIB and > NodeKey_Module constructors for the 2321 ModuleNodes and ~60k direct > dependency edges) is pretty much the same between our compiler and GHC > 9.12, suggesting to me that GHC is duplicating ModuleGraph node information > in the dependency edges when building the transitive closure. > > > > Based on these measurements, do you agree that this is a GHC-side problem > of memory usage scaling quadratically with the number of dependency edges? > > > > Thanks, > > Gergo > > > > p.s.: Sorry for including the reproducer module tree in this weird format > as a patch file, but I am behind a mail server that won’t let me send mails > with too many individual files in attached archives… > > > > *From:* Matthew Pickering > *Sent:* Friday, March 28, 2025 8:40 PM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. loading ModIfaces > > > > HI Gergo, > > > > Do you have a (synthetic?) reproducer? You have probably identified some > memory leak. However, without any means to reproduce it becomes very > difficult to investigate. I feel like we are getting into very precise > details now, where speculating is not going to be so useful. > > > > It seems like this is an important thing for you and your company. Is > there any budget to pay for some investigation? If that was the case then > some effort could be made to create a synthetic producer and make the > situation more robust going into the future if your requirements were > precisely understood. 
> > > > Cheers, > > > > Matt > > > > On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote: > > PUBLIC > > Just to add that I get the same "equalizing" behaviour (but in a more > "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just > call `hugInstancesBelow` before processing each module. So that's > definitely one source of extra memory usage. I wonder if it would be > possible to rebuild the ModuleGraph periodically (similar to the ModDetails > dehydration), or if there are references to it stored all over the place > from `HscEnv`s scattered around in closures etc. (basically the same > problem the HPT had before it was made into a mutable reference). > > -----Original Message----- > From: ghc-devs On Behalf Of Erdi, Gergo > via ghc-devs > Sent: Friday, March 28, 2025 4:49 PM > To: Matthew Pickering ; GHC Devs < > ghc-devs at haskell.org> > Cc: ÉRDI Gergő ; Montelatici, Raphael Laurent < > Raphael.Montelatici at sc.com>; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > Hi, > > Unfortunately, I am forced to return to this problem. Everything below is > now in the context of GHC 9.12 plus the mutable HPT patch backported. > > My test case is typechecking a tree of 2294 modules that form the > transitive closure of a single module's dependencies, all in a single > process. I have done this typechecking three times, here's what `+RTS -s > -RTS` reports for max residency: > > * "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB > > * "cold-top": With all `ModIface`s already on disk, except for the > single top-level module: 302 MB > > * "warm": With all `ModIface`s already on disk: 211 MB > > So my stupidly naive question is, why is the "cold" case also not 302 MB? > > In earlier discussion, `ModDetails` unfolding has come up. 
Dehydrating > `ModDetails` in the HPT all the time is disastrous for runtime, but based > on this model I would expect to see improvements from dehydrating "every > now and then". So I tried a stupid simple example where after every 100th > typechecked module, I run this function on the topologically sorted list of > modules processed so far: > > > ``` > dehydrateHpt :: HscEnv -> [ModuleName] -> IO () dehydrateHpt hsc_env mods > = do > let HPT{ table = hptr } = hsc_HPT hsc_env > hpt <- readIORef hptr > for_ mods \mod -> for_ (lookupUDFM hpt mod) \(HomeModInfo iface > _details _linkable) -> do > !details <- initModDetails hsc_env iface > pure () > ``` > > Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, > the profile looks exactly the same. > > Speaking of the profile, in the "cold" case I see a lot of steadily > increasing heap usage from the `ModuleGraph`. I could see this happening if > typechecking from scratch involves more `modulesUnder` calls which in turn > force more and more of the `ModuleGraph`. If so, then maybe this could be > worked around by repeatedly remaking the `ModuleGraph` just like I remake > the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing > the `ModuleGraph` at the start, with the idea being that this should > "equalize" the three scenarios if this really is a substantial source of > extra memory usage. This pushes up the warm case's memory usage to 381 MB, > which is promising, but I still see a `Word64Map` that is steadily > increasing in the "cold-force-modulegraph" case and contributes a lot to > the memory usage. Unfortunately, I don't know where that `Word64Map` is (it > could be any `Unique`-keyed environment...). > > So I am now stuck at this point. To spell out my goal explicitly, I would > like to typecheck one module after another and not keep anything more in > memory around than if I loaded them from `ModIface` files. 
> > Thanks, > Gergo > > p.s.: I couldn't find a way in the EventLog output HTML to turn event > markers on/off or filter them, so to avoid covering the whole graph with > gray lines, I mark only every 100th module. > > > > > From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM > To: ÉRDI Gergő > Cc: Erdi, Gergo ; Zubin Duggal ; > Montelatici, Raphael Laurent ; GHC Devs < > ghc-devs at haskell.org> > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > You do also raise a good point about rehydration costs. > > In oneshot mode, you are basically rehydrating the entire transitive > closure of each module when you compile it, which obviously results in a > large amount of repeated work. This is why people are investigating ideas > of a persistent worker to at least avoid rehydrating all external > dependencies as well. > > On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering matthewtpickering at gmail.com> wrote: > Sure, you can remove them once you are sure they are not used anymore. > > For clients like `GHCi` that doesn't work obviously as they can be used at > any point in the future but for a batch compiler it would be fine. > > On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote: > On Mon, 10 Feb 2025, Matthew Pickering wrote: > > > I wonder if you have got your condition the wrong way around. > > > > The only "safe" time to perform rehydration is AFTER the point it can > > never be used again. > > > > If you rehydrate it just before it is used then you will repeat work > > which has already been done. If you do this, you will always have a > > trade-off between space used and runtime. > > Oops. Yes, I have misunderstood the idea. 
I thought the idea was that
> after loading a given module into the HPT, its ModDetails would start out
> small (because of laziness) and then keep growing in size as more and more
> of it are traversed, and thus forced, during the typechecking of its
> dependees, so at some point we would want to reset that into the small
> initial representation as created by initModDetails.
>
> But if the idea is that I should rehydrate modules when they can't be used
> anymore, then that brings up the question why even do that, instead of
> straight removing the HomeModInfos from the HPT?
>
> ----------------------------------------------------------------------
> This email and any attachments are confidential and may also be
> privileged. If you are not the intended recipient, please delete all copies
> and notify the sender immediately. You may wish to refer to the
> incorporation details of Standard Chartered PLC, Standard Chartered Bank
> and their subsidiaries together with Standard Chartered Bank’s Privacy
> Policy via our public website.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Gergo.Erdi at sc.com Wed Apr 2 09:47:30 2025
From: Gergo.Erdi at sc.com (Erdi, Gergo)
Date: Wed, 2 Apr 2025 09:47:30 +0000
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

PUBLIC

zcat ../repro-hs.patch.gz | patch -p0

From: Matthew Pickering
Sent: Wednesday, April 2, 2025 5:39 PM
To: Erdi, Gergo
Cc: GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces

What command do I run to generate the files from this patch file? Perhaps a link to a git repo would be a suitable way to share the reproducer?

On Wed, Apr 2, 2025 at 10:26 AM Erdi, Gergo wrote:

PUBLIC

Hi Matt,

I think I have something that might demonstrate that GHC (at least GHC 9.12.1) might have a similar problem!

With the attached vacuous module hierarchy, I tried compiling M2294 from scratch, and then with `.hi` files for everything except the toplevel module. I did the same with our GHC-API-using compiler as well.
As you can see from the attached event logs, while the details differ, the overall shape of the memory used by ModuleGraph edges (750k of GWIB and NodeKey_Module constructors for the 2321 ModuleNodes and ~60k direct dependency edges) is pretty much the same between our compiler and GHC 9.12, suggesting to me that GHC is duplicating ModuleGraph node information in the dependency edges when building the transitive closure. Based on these measurements, do you agree that this is a GHC-side problem of memory usage scaling quadratically with the number of dependency edges? Thanks, Gergo p.s.: Sorry for including the reproducer module tree in this weird format as a patch file, but I am behind a mail server that won’t let me send mails with too many individual files in attached archives… From: Matthew Pickering > Sent: Friday, March 28, 2025 8:40 PM To: Erdi, Gergo > Cc: GHC Devs >; ÉRDI Gergő >; Montelatici, Raphael Laurent >; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces HI Gergo, Do you have a (synthetic?) reproducer? You have probably identified some memory leak. However, without any means to reproduce it becomes very difficult to investigate. I feel like we are getting into very precise details now, where speculating is not going to be so useful. It seems like this is an important thing for you and your company. Is there any budget to pay for some investigation? If that was the case then some effort could be made to create a synthetic producer and make the situation more robust going into the future if your requirements were precisely understood. Cheers, Matt On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo > wrote: PUBLIC Just to add that I get the same "equalizing" behaviour (but in a more "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just call `hugInstancesBelow` before processing each module. So that's definitely one source of extra memory usage. 
I wonder if it would be possible to rebuild the ModuleGraph periodically (similar to the ModDetails dehydration), or if there are references to it stored all over the place from `HscEnv`s scattered around in closures etc. (basically the same problem the HPT had before it was made into a mutable reference). -----Original Message----- From: ghc-devs > On Behalf Of Erdi, Gergo via ghc-devs Sent: Friday, March 28, 2025 4:49 PM To: Matthew Pickering >; GHC Devs > Cc: ÉRDI Gergő >; Montelatici, Raphael Laurent >; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces Hi, Unfortunately, I am forced to return to this problem. Everything below is now in the context of GHC 9.12 plus the mutable HPT patch backported. My test case is typechecking a tree of 2294 modules that form the transitive closure of a single module's dependencies, all in a single process. I have done this typechecking three times, here's what `+RTS -s -RTS` reports for max residency: * "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB * "cold-top": With all `ModIface`s already on disk, except for the single top-level module: 302 MB * "warm": With all `ModIface`s already on disk: 211 MB So my stupidly naive question is, why is the "cold" case also not 302 MB? In earlier discussion, `ModDetails` unfolding has come up. Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". 
So I tried a stupid simple example where after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far: ``` dehydrateHpt :: HscEnv -> [ModuleName] -> IO () dehydrateHpt hsc_env mods = do let HPT{ table = hptr } = hsc_HPT hsc_env hpt <- readIORef hptr for_ mods \mod -> for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do !details <- initModDetails hsc_env iface pure () ``` Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same. Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`. I could see this happening if typechecking from scratch involves more `modulesUnder` calls which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph` just like I remake the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, with the idea being that this should "equalize" the three scenarios if this really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...). So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep anything more in memory around than if I loaded them from `ModIface` files. Thanks, Gergo p.s.: I couldn't find a way in the EventLog output HTML to turn event markers on/off or filter them, so to avoid covering the whole graph with gray lines, I mark only every 100th module. 
From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM To: ÉRDI Gergő > Cc: Erdi, Gergo >; Zubin Duggal >; Montelatici, Raphael Laurent >; GHC Devs > Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces You do also raise a good point about rehydration costs. In oneshot mode, you are basically rehydrating the entire transitive closure of each module when you compile it, which obviously results in a large amount of repeated work. This is why people are investigating ideas of a persistent worker to at least avoid rehydrating all external dependencies as well. On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering > wrote: Sure, you can remove them once you are sure they are not used anymore. For clients like `GHCi` that doesn't work obviously as they can be used at any point in the future but for a batch compiler it would be fine. On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő > wrote: On Mon, 10 Feb 2025, Matthew Pickering wrote: > I wonder if you have got your condition the wrong way around. > > The only "safe" time to perform rehydration is AFTER the point it can > never be used again. > > If you rehydrate it just before it is used then you will repeat work > which has already been done. If you do this, you will always have a > trade-off between space used and runtime. Oops. Yes, I have misunderstood the idea. I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it are traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails. But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question why even do that, instead of straight removing the HomeModInfos from the HPT? 
----------------------------------------------------------------------
This email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please delete all copies and notify the sender immediately. You may wish to refer to the incorporation details of Standard Chartered PLC, Standard Chartered Bank and their subsidiaries together with Standard Chartered Bank’s Privacy Policy via our public website.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From matthewtpickering at gmail.com Wed Apr 2 10:02:42 2025
From: matthewtpickering at gmail.com (Matthew Pickering)
Date: Wed, 2 Apr 2025 11:02:42 +0100
Subject: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To:
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID:

I think you are missing https://gitlab.haskell.org/ghc/ghc/-/merge_requests/13593

On HEAD I get maximum residency of about 200M; NodeKey usage is constant. On 9.10.1, I get maximum residency of 400M; NodeKey usage looks quadratic.

On Wed, Apr 2, 2025 at 10:47 AM Erdi, Gergo wrote:

> PUBLIC
>
> zcat ../repro-hs.patch.gz | patch -p0
>
> *From:* Matthew Pickering
> *Sent:* Wednesday, April 2, 2025 5:39 PM
> *To:* Erdi, Gergo
> *Cc:* GHC Devs; ÉRDI Gergő; Montelatici, Raphael Laurent; Dijkstra, Atze
> *Subject:* [External] Re: GHC memory usage when typechecking from source
> vs. loading ModIfaces
>
> What command do I run to generate the files from this patch file? Perhaps
> a link to a git repo would be a suitable way to share the reproducer?
>
> On Wed, Apr 2, 2025 at 10:26 AM Erdi, Gergo wrote:
>
> PUBLIC
>
> Hi Matt,
>
> I think I have something that might demonstrate that GHC (at least GHC
> 9.12.1) might have a similar problem!
> > > > With the attached vacuous module hierarchy, I tried compiling M2294 from > scratch, and then with `.hi` files for everything except the toplevel > module. I did the same with our GHC-API-using compiler as well. As you can > see from the attached event logs, while the details differ, the overall > shape of the memory used by ModuleGraph edges (750k of GWIB and > NodeKey_Module constructors for the 2321 ModuleNodes and ~60k direct > dependency edges) is pretty much the same between our compiler and GHC > 9.12, suggesting to me that GHC is duplicating ModuleGraph node information > in the dependency edges when building the transitive closure. > > > > Based on these measurements, do you agree that this is a GHC-side problem > of memory usage scaling quadratically with the number of dependency edges? > > > > Thanks, > > Gergo > > > > p.s.: Sorry for including the reproducer module tree in this weird format > as a patch file, but I am behind a mail server that won’t let me send mails > with too many individual files in attached archives… > > > > *From:* Matthew Pickering > *Sent:* Friday, March 28, 2025 8:40 PM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. loading ModIfaces > > > > Hi Gergo, > > > > Do you have a (synthetic?) reproducer? You have probably identified some > memory leak. However, without any means to reproduce it becomes very > difficult to investigate. I feel like we are getting into very precise > details now, where speculating is not going to be so useful. > > > > It seems like this is an important thing for you and your company. Is > there any budget to pay for some investigation? If that was the case then > some effort could be made to create a synthetic reproducer and make the > situation more robust going into the future if your requirements were > precisely understood. 
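[Editorial aside: the quadratic scaling Gergo suspects can be reproduced with a tiny self-contained Haskell sketch. The `Int` node keys and `Data.Map`/`Data.Set` types below are toy stand-ins, not GHC's actual ModuleGraph/NodeKey representation. Materialising each node's transitive dependency set keeps O(n²) keys alive for a deep dependency chain, while the direct-edge graph stays linear.]

```haskell
import qualified Data.Map.Strict as M
import qualified Data.Set as S

-- A linear chain of n modules, M1 <- M2 <- ... <- Mn, standing in for a
-- deep dependency tree. The direct-edge graph has n-1 entries in total.
directDeps :: Int -> M.Map Int (S.Set Int)
directDeps n = M.fromList [(i, S.singleton (i - 1)) | i <- [2 .. n]]

-- Storing the *transitive* closure per node retains n*(n-1)/2 keys:
-- node i keeps a set of all i-1 modules below it.
transDeps :: Int -> M.Map Int (S.Set Int)
transDeps n = M.fromList [(i, S.fromList [1 .. i - 1]) | i <- [2 .. n]]

-- Total number of dependency entries kept alive by a representation.
totalKeys :: M.Map Int (S.Set Int) -> Int
totalKeys = sum . map S.size . M.elems

main :: IO ()
main = do
  print (totalKeys (directDeps 2294)) -- 2293: linear in the module count
  print (totalKeys (transDeps 2294))  -- 2630071: quadratic in the chain depth
```

[A chain is the worst case and real module graphs fall somewhere in between, but it shows how per-node transitive closures over ~60k direct edges can come to dominate the heap with duplicated node keys.]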
> > > > Cheers, > > > > Matt > > > > On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote: > > PUBLIC > > Just to add that I get the same "equalizing" behaviour (but in a more > "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just > call `hugInstancesBelow` before processing each module. So that's > definitely one source of extra memory usage. I wonder if it would be > possible to rebuild the ModuleGraph periodically (similar to the ModDetails > dehydration), or if there are references to it stored all over the place > from `HscEnv`s scattered around in closures etc. (basically the same > problem the HPT had before it was made into a mutable reference). > > -----Original Message----- > From: ghc-devs On Behalf Of Erdi, Gergo > via ghc-devs > Sent: Friday, March 28, 2025 4:49 PM > To: Matthew Pickering ; GHC Devs < > ghc-devs at haskell.org> > Cc: ÉRDI Gergő ; Montelatici, Raphael Laurent < > Raphael.Montelatici at sc.com>; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > Hi, > > Unfortunately, I am forced to return to this problem. Everything below is > now in the context of GHC 9.12 plus the mutable HPT patch backported. > > My test case is typechecking a tree of 2294 modules that form the > transitive closure of a single module's dependencies, all in a single > process. I have done this typechecking three times, here's what `+RTS -s > -RTS` reports for max residency: > > * "cold": With no on-disk `ModIface` files, i.e. from scratch: 537 MB > > * "cold-top": With all `ModIface`s already on disk, except for the > single top-level module: 302 MB > > * "warm": With all `ModIface`s already on disk: 211 MB > > So my stupidly naive question is, why is the "cold" case also not 302 MB? > > In earlier discussion, `ModDetails` unfolding has come up. 
Dehydrating `ModDetails` in the HPT all the time is disastrous for runtime, but based on this model I would expect to see improvements from dehydrating "every now and then". So I tried a stupid simple example where after every 100th typechecked module, I run this function on the topologically sorted list of modules processed so far:

```
dehydrateHpt :: HscEnv -> [ModuleName] -> IO ()
dehydrateHpt hsc_env mods = do
    let HPT{ table = hptr } = hsc_HPT hsc_env
    hpt <- readIORef hptr
    for_ mods \mod ->
      for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do
        !details <- initModDetails hsc_env iface
        pure ()
```

Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact, the profile looks exactly the same.

Speaking of the profile, in the "cold" case I see a lot of steadily increasing heap usage from the `ModuleGraph`. I could see this happening if typechecking from scratch involves more `modulesUnder` calls which in turn force more and more of the `ModuleGraph`. If so, then maybe this could be worked around by repeatedly remaking the `ModuleGraph` just like I remake the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing the `ModuleGraph` at the start, with the idea being that this should "equalize" the three scenarios if this really is a substantial source of extra memory usage. This pushes up the warm case's memory usage to 381 MB, which is promising, but I still see a `Word64Map` that is steadily increasing in the "cold-force-modulegraph" case and contributes a lot to the memory usage. Unfortunately, I don't know where that `Word64Map` is (it could be any `Unique`-keyed environment...).

So I am now stuck at this point. To spell out my goal explicitly, I would like to typecheck one module after another and not keep anything more in memory around than if I loaded them from `ModIface` files. 
> > Thanks, > Gergo > > p.s.: I couldn't find a way in the EventLog output HTML to turn event > markers on/off or filter them, so to avoid covering the whole graph with > gray lines, I mark only every 100th module. > > > > > From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM > To: ÉRDI Gergő > Cc: Erdi, Gergo ; Zubin Duggal ; > Montelatici, Raphael Laurent ; GHC Devs < > ghc-devs at haskell.org> > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > You do also raise a good point about rehydration costs. > > In oneshot mode, you are basically rehydrating the entire transitive > closure of each module when you compile it, which obviously results in a > large amount of repeated work. This is why people are investigating ideas > of a persistent worker to at least avoid rehydrating all external > dependencies as well. > > On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering matthewtpickering at gmail.com> wrote: > Sure, you can remove them once you are sure they are not used anymore. > > For clients like `GHCi` that doesn't work obviously as they can be used at > any point in the future but for a batch compiler it would be fine. > > On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote: > On Mon, 10 Feb 2025, Matthew Pickering wrote: > > > I wonder if you have got your condition the wrong way around. > > > > The only "safe" time to perform rehydration is AFTER the point it can > > never be used again. > > > > If you rehydrate it just before it is used then you will repeat work > > which has already been done. If you do this, you will always have a > > trade-off between space used and runtime. > > Oops. Yes, I have misunderstood the idea. 
I thought the idea was that after loading a given module into the HPT, its ModDetails would start out small (because of laziness) and then keep growing in size as more and more of it are traversed, and thus forced, during the typechecking of its dependees, so at some point we would want to reset that into the small initial representation as created by initModDetails.

But if the idea is that I should rehydrate modules when they can't be used anymore, then that brings up the question why even do that, instead of straight removing the HomeModInfos from the HPT?

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 2025-04-02-110039_1254x783.png
Type: image/png
Size: 138132 bytes
Desc: not available
URL: 
-------------- next part --------------
A non-text attachment was scrubbed... 
Name: 2025-04-02-110049_1676x558.png
Type: image/png
Size: 95386 bytes
Desc: not available
URL: 

From george.colpitts at gmail.com Wed Apr 2 12:06:05 2025
From: george.colpitts at gmail.com (George Colpitts)
Date: Wed, 2 Apr 2025 09:06:05 -0300
Subject: can't move to latest version of llvm on ghc 9.12.2
In-Reply-To: <4db7eafd-69b0-476c-bffd-e2a8827d7346@gmx.at>
References: <4db7eafd-69b0-476c-bffd-e2a8827d7346@gmx.at>
Message-ID: 

Filed https://gitlab.haskell.org/ghc/ghc/-/issues/25915

On Tue, Apr 1, 2025 at 2:11 PM Andreas Klebinger via ghc-devs <ghc-devs at haskell.org> wrote:
> Looking at https://gitlab.haskell.org/ghc/ghc/-/issues/25011 it seems the desire was to fail with a proper error when LLVM was not found at all.
>
> To me, failing when the llvm version is too new seems like an unintended side effect of fixing the former.
>
> I agree that simply warning and trying anyway is the better default. A ticket would be appreciated!
>
> On 01/04/2025 at 19:05, George Colpitts wrote:
> > I was trying to find out if it works but I ran into the problem I described. Yes, there were a few years where the latest version didn't work, but in many years it did. ghc used to give a warning that this is an unsupported version but we'll try it anyway. I would like to return to that.
> >
> > In any case, I believe GHC devs should test to see if it works and if so use it in HEAD. If it doesn't work then we should file a bug and fix it. In the past we got a few versions behind llvm. I think we want to be on the latest version available for each new release if possible. The earlier we look into this, the more chance we have to succeed at that.
> >
> > On Tue, Apr 1, 2025 at 1:27 PM Brandon Allbery wrote:
> >> Have you demonstrated that it works? IIRC the current behavior is because some LLVM version (16, IIRC) didn't work with GHC (threw errors from opt, I think). 
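[Editorial aside: the two policies discussed above (the hard error that 9.12.2 now produces, vs. the older warn-and-try behaviour) differ only in what happens when the detected version falls outside the supported range. A minimal sketch, using a hypothetical `checkLlvm` helper that looks only at the major version; this is not GHC's actual configure logic.]

```haskell
-- Supported LLVM major versions: lower bound inclusive, upper exclusive,
-- mirroring configure's "llvm 13 upto 20 (non-inclusive)".
supportedLlvm :: (Int, Int)
supportedLlvm = (13, 20)

data Policy = FailHard | WarnAndTry deriving (Eq, Show)

-- Left: fatal error. Right Nothing: accepted silently.
-- Right (Just w): accepted with warning w.
checkLlvm :: Policy -> Int -> Either String (Maybe String)
checkLlvm pol major
  | major >= lo && major < hi = Right Nothing
  | pol == FailHard = Left ("unsupported LLVM version " ++ show major)
  | otherwise = Right (Just ("LLVM " ++ show major ++ " is untested; trying anyway"))
  where
    (lo, hi) = supportedLlvm

main :: IO ()
main = do
  print (checkLlvm FailHard 20)   -- Left "unsupported LLVM version 20"
  print (checkLlvm WarnAndTry 20) -- Right (Just "LLVM 20 is untested; trying anyway")
```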
>> On Tue, Apr 1, 2025 at 10:49 AM George Colpitts <george.colpitts at gmail.com> wrote:
>>> llvm 20 is out, but unlike in earlier versions of ghc, moving to it means you can no longer use llvm:
>>>
>>> compiling:
>>>
>>> ghc -fllvm hello.hs
>>> Loaded package environment from /Users/avie/.ghc/aarch64-darwin-9.12.2/environments/default
>>> [1 of 2] Compiling Main ( hello.hs, hello.o )
>>> : error: [GHC-66599]
>>>     GHC was not configured with a supported LLVM toolchain
>>>     Make sure you have installed LLVM between [13 and 20) and reinstall GHC to make -fllvm work
>>>
>>> from configure:
>>>
>>> configure: We only support llvm 13 upto 20 (non-inclusive) (found 20.1.1).
>>>
>>> Can we move to llvm 20 on HEAD, and can we revert to the old behavior on ghc 9.12.3?
>>>
>>> Should I file an ER?
>>>
>>> Thanks
>>>
>>> _______________________________________________
>>> ghc-devs mailing list
>>> ghc-devs at haskell.org
>>> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs
>>
>> --
>> brandon s allbery kf8nh
>> allbery.b at gmail.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From teofilcamarasu at gmail.com Wed Apr 2 17:26:02 2025
From: teofilcamarasu at gmail.com (Teo Camarasu)
Date: Wed, 2 Apr 2025 18:26:02 +0100
Subject: Next steps forward for a stable Template Haskell
Message-ID: 

Hi folks,

At Tuesday's GHC devs meeting, we talked about having a discussion about template-haskell stability. I've jotted down some notes to help focus our future discussion. It's mostly just a list of what I take to be the next most important issues to tackle, and some details about each. 
https://docs.google.com/document/d/1gui1YXOvrNihR3NpXuMVCKV3lTkKPL8y_HDRjbZyRxk/edit?usp=sharing Cheers, Teo -------------- next part -------------- An HTML attachment was scrubbed... URL: From Gergo.Erdi at sc.com Thu Apr 3 10:54:11 2025 From: Gergo.Erdi at sc.com (Erdi, Gergo) Date: Thu, 3 Apr 2025 10:54:11 +0000 Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces In-Reply-To: References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu> Message-ID: PUBLIC Yes I was, since I am using 9.12.1. This is great stuff -- with !13593 backported, I don’t see any ModuleGraph-related stuff showing up in top spots of the heap profile! Thank you for pointing me towards this! I’ll have to do some more digging to see if there is anything else concretely blameable now. But in theory, morally, abstractly, from the 10,000 feet view, would it be correct to say that there should be no memory usage difference between typechecking a large number of modules from scratch vs. loading the `.hi` files for those same modules? (Modulo, of course, the memory usage during any single module’s typechecking). From: Matthew Pickering Sent: Wednesday, April 2, 2025 6:03 PM To: Erdi, Gergo Cc: GHC Devs ; ÉRDI Gergő ; Montelatici, Raphael Laurent ; Dijkstra, Atze Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces I think you are missing https://gitlab.haskell.org/ghc/ghc/-/merge_requests/13593 On HEAD I get maximum residency of about 200M, NodeKey usage is constant. On 9.10.1, I get maximum residency of 400M, NodeKey usage looks quadratic. On Wed, Apr 2, 2025 at 10:47 AM Erdi, Gergo > wrote: PUBLIC zcat ../repro-hs.patch.gz |patch -p0 From: Matthew Pickering > Sent: Wednesday, April 2, 2025 5:39 PM To: Erdi, Gergo > Cc: GHC Devs >; ÉRDI Gergő >; Montelatici, Raphael Laurent >; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. 
loading ModIfaces

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From matthewtpickering at gmail.com Thu Apr 3 11:01:33 2025
From: matthewtpickering at gmail.com (Matthew Pickering)
Date: Thu, 3 Apr 2025 12:01:33 +0100
Subject: [External] Re: GHC memory usage when typechecking from source vs. loading ModIfaces
In-Reply-To: 
References: <4f980efd-58bc-2814-b4cf-b5742c040b97@erdi.hu> <3a5dd981-7fdc-a814-88c9-6768395cae39@erdi.hu>
Message-ID: 

Yes, but

* With all the caveats around forcing ModDetails and using extra memory.
* When you typecheck from scratch you may load additional interfaces for external dependencies into the EPS.

After a module is typechecked, everything apart from the interface is discarded and everything is regenerated from that interface. 
It's possible there are some leaks in forcing the interface, since the deep forcing was incomplete (see https://gitlab.haskell.org/ghc/ghc/-/merge_requests/14078) Anyway, best thing is to profile with these invariants in mind, and fix situations where the invariants are violated. Cheers, Matt On Thu, Apr 3, 2025 at 11:54 AM Erdi, Gergo wrote: > PUBLIC > > Yes I was, since I am using 9.12.1. > > > > This is great stuff -- with !13593 backported, I don’t see any > ModuleGraph-related stuff showing up in top spots of the heap profile! > Thank you for pointing me towards this! > > I’ll have to do some more digging to see if there is anything else > concretely blameable now. But in theory, morally, abstractly, from the > 10,000 feet view, would it be correct to say that there should be no memory > usage difference between typechecking a large number of modules from > scratch vs. loading the `.hi` files for those same modules? (Modulo, of > course, the memory usage during any single module’s typechecking). > > > > *From:* Matthew Pickering > *Sent:* Wednesday, April 2, 2025 6:03 PM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. loading ModIfaces > > > > I think you are missing > https://gitlab.haskell.org/ghc/ghc/-/merge_requests/13593 > > > > > On HEAD I get maximum residency of about 200M, NodeKey usage is constant. > > > > On 9.10.1, I get maximum residency of 400M, NodeKey usage looks quadratic. > > > > > > > > On Wed, Apr 2, 2025 at 10:47 AM Erdi, Gergo wrote: > > PUBLIC > > > > zcat ../repro-hs.patch.gz |patch -p0 > > > > *From:* Matthew Pickering > *Sent:* Wednesday, April 2, 2025 5:39 PM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. 
loading ModIfaces > > > > > > What command do I run to generate the files from this patch file? Perhaps > a link to a git repo would be a suitable way to share the reproducer? > > > > On Wed, Apr 2, 2025 at 10:26 AM Erdi, Gergo wrote: > > PUBLIC > > > > Hi Matt, > > > > I think I have something that might demonstrate that GHC (at least GHC > 9.12.1) might have a similar problem! > > > > With the attached vacuous module hierarchy, I tried compiling M2294 from > scratch, and then with `.hi` files for everything except the toplevel > module. I did the same with our GHC-API-using compiler as well. As you can > see from the attached event logs, while the details differ, the overall > shape of the memory used by ModuleGraph edges (750k of GWIB and > NodeKey_Module constructors for the 2321 ModuleNodes and ~60k direct > dependency edges) is pretty much the same between our compiler and GHC > 9.12, suggesting to me that GHC is duplicating ModuleGraph node information > in the dependency edges when building the transitive closure. > > > > Based on these measurements, do you agree that this is a GHC-side problem > of memory usage scaling quadratically with the number of dependency edges? > > > > Thanks, > > Gergo > > > > p.s.: Sorry for including the reproducer module tree in this weird format > as a patch file, but I am behind a mail server that won’t let me send mails > with too many individual files in attached archives… > > > > *From:* Matthew Pickering > *Sent:* Friday, March 28, 2025 8:40 PM > *To:* Erdi, Gergo > *Cc:* GHC Devs ; ÉRDI Gergő ; > Montelatici, Raphael Laurent ; Dijkstra, Atze > > *Subject:* [External] Re: GHC memory usage when typechecking from source > vs. loading ModIfaces > > > > HI Gergo, > > > > Do you have a (synthetic?) reproducer? You have probably identified some > memory leak. However, without any means to reproduce it becomes very > difficult to investigate. 
I feel like we are getting into very precise > details now, where speculating is not going to be so useful. > > > > It seems like this is an important thing for you and your company. Is > there any budget to pay for some investigation? If that was the case then > some effort could be made to create a synthetic producer and make the > situation more robust going into the future if your requirements were > precisely understood. > > > > Cheers, > > > > Matt > > > > On Fri, Mar 28, 2025 at 10:12 AM Erdi, Gergo wrote: > > PUBLIC > > Just to add that I get the same "equalizing" behaviour (but in a more > "natural" way) if instead of deepseq-ing the ModuleGraph upfront, I just > call `hugInstancesBelow` before processing each module. So that's > definitely one source of extra memory usage. I wonder if it would be > possible to rebuild the ModuleGraph periodically (similar to the ModDetails > dehydration), or if there are references to it stored all over the place > from `HscEnv`s scattered around in closures etc. (basically the same > problem the HPT had before it was made into a mutable reference). > > -----Original Message----- > From: ghc-devs On Behalf Of Erdi, Gergo > via ghc-devs > Sent: Friday, March 28, 2025 4:49 PM > To: Matthew Pickering ; GHC Devs < > ghc-devs at haskell.org> > Cc: ÉRDI Gergő ; Montelatici, Raphael Laurent < > Raphael.Montelatici at sc.com>; Dijkstra, Atze > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > Hi, > > Unfortunately, I am forced to return to this problem. Everything below is > now in the context of GHC 9.12 plus the mutable HPT patch backported. > > My test case is typechecking a tree of 2294 modules that form the > transitive closure of a single module's dependencies, all in a single > process. I have done this typechecking three times, here's what `+RTS -s > -RTS` reports for max residency: > > * "cold": With no on-disk `ModIface` files, i.e. 
from scratch: 537 MB
>
> * "cold-top": With all `ModIface`s already on disk, except for the
> single top-level module: 302 MB
>
> * "warm": With all `ModIface`s already on disk: 211 MB
>
> So my stupidly naive question is, why is the "cold" case also not 302 MB?
>
> In earlier discussion, `ModDetails` unfolding has come up. Dehydrating
> `ModDetails` in the HPT all the time is disastrous for runtime, but based
> on this model I would expect to see improvements from dehydrating "every
> now and then". So I tried a stupid simple example where after every 100th
> typechecked module, I run this function on the topologically sorted list of
> modules processed so far:
>
> ```
> dehydrateHpt :: HscEnv -> [ModuleName] -> IO ()
> dehydrateHpt hsc_env mods = do
>   let HPT{ table = hptr } = hsc_HPT hsc_env
>   hpt <- readIORef hptr
>   for_ mods \mod ->
>     for_ (lookupUDFM hpt mod) \(HomeModInfo iface _details _linkable) -> do
>       !details <- initModDetails hsc_env iface
>       pure ()
> ```
>
> Buuut the max residency is still 534 MB (see "cold-dehydrate"); in fact,
> the profile looks exactly the same.
>
> Speaking of the profile, in the "cold" case I see a lot of steadily
> increasing heap usage from the `ModuleGraph`. I could see this happening if
> typechecking from scratch involves more `modulesUnder` calls which in turn
> force more and more of the `ModuleGraph`. If so, then maybe this could be
> worked around by repeatedly remaking the `ModuleGraph` just like I remake
> the `ModDetails` above. I tried getting rid of this effect by `deepseq`'ing
> the `ModuleGraph` at the start, with the idea being that this should
> "equalize" the three scenarios if this really is a substantial source of
> extra memory usage. This pushes up the warm case's memory usage to 381 MB,
> which is promising, but I still see a `Word64Map` that is steadily
> increasing in the "cold-force-modulegraph" case and contributes a lot to
> the memory usage.
Unfortunately, I don't know where that `Word64Map` is (it > could be any `Unique`-keyed environment...). > > So I am now stuck at this point. To spell out my goal explicitly, I would > like to typecheck one module after another and not keep anything more in > memory around than if I loaded them from `ModIface` files. > > Thanks, > Gergo > > p.s.: I couldn't find a way in the EventLog output HTML to turn event > markers on/off or filter them, so to avoid covering the whole graph with > gray lines, I mark only every 100th module. > > > > > From: Matthew Pickering > Sent: Wednesday, February 12, 2025 7:08 PM > To: ÉRDI Gergő > Cc: Erdi, Gergo ; Zubin Duggal ; > Montelatici, Raphael Laurent ; GHC Devs < > ghc-devs at haskell.org> > Subject: [External] Re: GHC memory usage when typechecking from source vs. > loading ModIfaces > > You do also raise a good point about rehydration costs. > > In oneshot mode, you are basically rehydrating the entire transitive > closure of each module when you compile it, which obviously results in a > large amount of repeated work. This is why people are investigating ideas > of a persistent worker to at least avoid rehydrating all external > dependencies as well. > > On Mon, Feb 10, 2025 at 12:13 PM Matthew Pickering matthewtpickering at gmail.com> wrote: > Sure, you can remove them once you are sure they are not used anymore. > > For clients like `GHCi` that doesn't work obviously as they can be used at > any point in the future but for a batch compiler it would be fine. > > On Mon, Feb 10, 2025 at 11:56 AM ÉRDI Gergő wrote: > On Mon, 10 Feb 2025, Matthew Pickering wrote: > > > I wonder if you have got your condition the wrong way around. > > > > The only "safe" time to perform rehydration is AFTER the point it can > > never be used again. > > > > If you rehydrate it just before it is used then you will repeat work > > which has already been done. If you do this, you will always have a > > trade-off between space used and runtime. 
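The "only safe after last use" rule above, together with the follow-up question of simply removing `HomeModInfo`s, amounts to reference-counting each module by its not-yet-compiled dependees and acting once the count hits zero. A toy sketch of that bookkeeping, with hypothetical names — this is not the GHC HPT API:

```haskell
import Data.List (foldl')
import qualified Data.Map.Strict as M

-- For each module, how many not-yet-compiled modules still depend on it.
type PendingDependees = M.Map String Int

-- After compiling a module, decrement the counts of its dependencies and
-- report which of them can never be used again, i.e. are now safe to drop
-- (or rehydrate into a small initial ModDetails) from the home package table.
afterCompile :: [String] -> PendingDependees -> (PendingDependees, [String])
afterCompile deps rc = foldl' step (rc, []) deps
  where
    step (m, droppable) d =
      let m' = M.adjust (subtract 1) d m
      in case M.lookup d m' of
           Just 0 -> (M.delete d m', d : droppable)
           _      -> (m', droppable)
```

In batch compilation the topological order makes the counts easy to precompute; the trade-off Matt mentions shows up in that anything dropped too early has to be re-derived later.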
> > Oops. Yes, I have misunderstood the idea. I thought the idea was that
> after loading a given module into the HPT, its ModDetails would start out
> small (because of laziness) and then keep growing in size as more and more
> of it are traversed, and thus forced, during the typechecking of its
> dependees, so at some point we would want to reset that into the small
> initial representation as created by initModDetails.
>
> But if the idea is that I should rehydrate modules when they can't be used
> anymore, then that brings up the question why even do that, instead of
> straight removing the HomeModInfos from the HPT?
>
> ----------------------------------------------------------------------
> This email and any attachments are confidential and may also be
> privileged. If you are not the intended recipient, please delete all copies
> and notify the sender immediately. You may wish to refer to the
> incorporation details of Standard Chartered PLC, Standard Chartered Bank
> and their subsidiaries together with Standard Chartered Bank’s Privacy
> Policy via our public website.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From simon.peytonjones at gmail.com Fri Apr 4 21:29:33 2025
From: simon.peytonjones at gmail.com (Simon Peyton Jones)
Date: Fri, 4 Apr 2025 22:29:33 +0100
Subject: Testsuite tests
Message-ID:

Friends

In CI for !10479 I'm getting test failures for some call-stack tests e.g.
`callstack001`.

But if I try to run this in my build tree

hadrian/build -j8 test --skip=_build/stage1/rts/** --skip=_build/stage1/**
--skip=_build/stage0/** --docs=none --only=callstack001

I get

SUMMARY for test run started at Fri Apr 4 22:22:47 2025
 0:00:02.413942 spent to go through
    1 total tests, which gave rise to
    8 test cases, of which
    8 were skipped
    0 had missing libraries

    0 expected passes
    0 expected failures

    0 caused framework failures
    0 caused framework warnings
    0 unexpected passes
    0 unexpected failures
    0 unexpected stat failures
    0 fragile tests

So I can't reproduce the failure. What am I doing wrong? Can anyone help?

Thanks

Simon
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From simon.peytonjones at gmail.com Fri Apr 4 21:43:52 2025
From: simon.peytonjones at gmail.com (Simon Peyton Jones)
Date: Fri, 4 Apr 2025 22:43:52 +0100
Subject: Late CC profiling
Message-ID:

Hi Matthew, Andreas, Ben

In GHC today there is a strange inconsistency:

- Bindings for data constructor wrappers and class method selectors are
  added at the end of CoreTidy.
- Bindings for data constructor workers are added at the start of CorePrep

This is deeply strange. In !10479 (unary class patch) I had to make some
changes in this area and after being baffled I moved all these implicit
bindings to one place at the start of CorePrep. That is simpler and
neater. There is no reason for them to be separate.

BUT I find that one thing happens between Tidy and Prep, namely
late-cost-centre injection. That means we will no longer get
late-cost-centres for data constructor wrappers and class method
selectors. I think that this is what is causing diffs in the output for
callstack001 and friends in !10479.

I think that this is perfectly fine, and I propose to accept the changes.
Another alternative would be to inject all of these bindings at the end of Tidy (instead of the start of Prep), But it's much nicer at the start of Prep -- we don't want any of these bindings to appear in interface files. Can you confirm that the change is ok? Thanks Simon -------------- next part -------------- An HTML attachment was scrubbed... URL: From simon.peytonjones at gmail.com Fri Apr 4 22:00:36 2025 From: simon.peytonjones at gmail.com (Simon Peyton Jones) Date: Fri, 4 Apr 2025 23:00:36 +0100 Subject: Next steps forward for a stable Template Haskell In-Reply-To: References: Message-ID: Good work! I have added some comments. Before long it would be good to update the Wiki roadmap doc, so there is one source of truth. Simon On Wed, 2 Apr 2025 at 18:26, Teo Camarasu wrote: > Hi folks, > > At Tuesday's GHC devs meeting, we talked about having a discussion about > template-haskell stability. I've jotted down some notes to help focus our > future discussion. It's mostly just a list of what I take to be the next > most important issues to tackle, and some details about each. > > > https://docs.google.com/document/d/1gui1YXOvrNihR3NpXuMVCKV3lTkKPL8y_HDRjbZyRxk/edit?usp=sharing > > Cheers, > Teo > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthewtpickering at gmail.com Mon Apr 7 08:56:49 2025 From: matthewtpickering at gmail.com (Matthew Pickering) Date: Mon, 7 Apr 2025 09:56:49 +0100 Subject: Testsuite tests In-Reply-To: References: Message-ID: It seems that you have not built profiled libraries? It's impossible to say without knowing your custom local configuration. Could you share that? 
On Fri, Apr 4, 2025 at 10:30 PM Simon Peyton Jones < simon.peytonjones at gmail.com> wrote: > Friends > > In CI for !10479 I'm getting test failures for some call-stack tests e.g. > `callstack001`. > > But if I try to run this in my build tree > > hadrian/build -j8 test --skip=_build/stage1/rts/** --skip=_build/stage1/** > --skip=_build/stage0/** --docs=none --only=callstack001 > > I get > > SUMMARY for test run started at Fri Apr 4 22:22:47 2025 > 0:00:02.413942 spent to go through > 1 total tests, which gave rise to > 8 test cases, of which > 8 were skipped > 0 had missing libraries > > 0 expected passes > 0 expected failures > > 0 caused framework failures > 0 caused framework warnings > 0 unexpected passes > 0 unexpected failures > 0 unexpected stat failures > 0 fragile tests > > So I can't reproduce the failure. What am i doing wrong? Can anyone help? > > Thanks > > Simon > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthewtpickering at gmail.com Mon Apr 7 09:08:08 2025 From: matthewtpickering at gmail.com (Matthew Pickering) Date: Mon, 7 Apr 2025 10:08:08 +0100 Subject: Late CC profiling In-Reply-To: References: Message-ID: Hi Simon, It does seem strange (at first glance) that these implicit things happen in two different places. However, I think you should do this refactoring in a separate MR to !10479, it seems unrelated to the purpose of that patch and affects other things like you have mentioned. Why do you think the change in callstack is fine? It seems the difference is `callstack001` is whether there an entry for `>>=` or not, which seems like quite an important thing to place a cost centre on? Andreas has a better intuition for this than me. I would quite like to know if there is an unoptimised call to >>= in my program. 
You have probably thought
about this a little, so interested to hear your opinion.

Cheers,

Matt


On Fri, Apr 4, 2025 at 10:44 PM Simon Peyton Jones <
simon.peytonjones at gmail.com> wrote:

> Hi Matthew, Andreas, Ben
>
> In GHC today there is a strange inconsistency:
>
>    - Bindings for data constructor wrappers and class method selectors
>    are added at the end of CoreTidy.
>    - Bindings for data constructor workers are added at the start of
>    CorePrep
>
> This is deeply strange. In !10479 (unary class patch) I had to make
> some changes in this area and after being baffled I moved all these
> implicit bindings to one place at the start of CorePrep. That is simpler
> and neater. There is no reason for them to be separate.
>
> BUT I find that one thing happens between Tidy and Prep, namely
> late-cost-centre injection. That means we will no longer get
> late-cost-centres for data constructor wrappers and class method
> selectors. I think that this is what is causing diffs in the output for
> callstack001 and friends in !10479.
>
> I think that this is perfectly fine, and I propose to accept the changes.
>
> Another alternative would be to inject all of these bindings at the end of
> Tidy (instead of the start of Prep), But it's much nicer at the start of
> Prep -- we don't want any of these bindings to appear in interface files.
>
> Can you confirm that the change is ok? Thanks
>
> Simon
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From klebinger.andreas at gmx.at Mon Apr 7 09:36:55 2025
From: klebinger.andreas at gmx.at (Andreas Klebinger)
Date: Mon, 7 Apr 2025 11:36:55 +0200
Subject: Late CC profiling
In-Reply-To:
References:
Message-ID: <5552bd3f-427d-490c-b024-9e12cc3dce90@gmx.at>

An interesting wrinkle!

I agree with Matt that having class methods show up in the profile can be
very helpful at times.

Constructors and their wrappers on the other hand should be of no concern.
We explicitly try to avoid wrapping them in cost centres as they generally
add no useful information to profiles.

I've briefly looked over the code and adding class methods seems to be the
*first* thing tidy does, not at its end. (in tidyProgram)
That would suggest to me that there is a need to tidy these implicit
bindings so moving them might have further unintended consequences.

But then Note [Injecting implicit bindings] claims we do it at the end of
tidy, but seems outdated. This seems like a proper mess.

If you are sure there is no need to tidy the class method selectors then
it would be clearest to have an explicit "materializeImplicitBinds" step
in the pipeline after tidy and before lateCC runs.
That seems like it would be clearer and more coherent overall. And it
would sidestep this issue completely.

Cheers
Andreas

On 07/04/2025 at 11:08, Matthew Pickering wrote:
> Hi Simon,
>
> It does seem strange (at first glance) that these implicit things
> happen in two different places.
>
> However, I think you should do this refactoring in a separate MR to
> !10479, it seems unrelated to the purpose of that patch and affects
> other things like you have mentioned.
>
> Why do you think the change in callstack is fine? It seems the
> difference is `callstack001` is whether there an entry for `>>=` or
> not, which seems like quite an important thing to place a cost centre
> on? Andreas has a
> better intuition for this than me. I would quite like to know if there
> is an unoptimised call to >>= in my program. You have probably thought
> about this a little, so interested to hear your opinion.
>
> Cheers,
>
> Matt
>
>
> On Fri, Apr 4, 2025 at 10:44 PM Simon Peyton Jones
> wrote:
>
> Hi Matthew, Andreas, Ben
>
> In GHC today there is a strange inconsistency:
>
> * Bindings for data constructor wrappers and class method
> selectors are added at the end of CoreTidy.
> * Bindings for data constructor workers are added at the start
> of CorePrep
>
> This is deeply strange.
In !10479 (unary class patch) > I had > to make some changes in this area and after being baffled I moved > all these implicit bindings to one place at the start of > CorePrep.  That is simpler and neater.  There is no reason for > them to be separate. > > BUT I find that one thing happens between Tidy and Prep, namely > late-cost-centre injection.  That means we will no longer get > late-cost-centres for data constructor wrappers and class method > selectors.  I think that this is what is causing diffs in the > output for callstack001 and friends in !10479. > > I think that this is perfectly fine, and I propose to accept the > changes. > > Another alternative would be to inject all of these bindings at > the end of Tidy (instead of the start of Prep),  But it's much > nicer at the start of Prep -- we don't want any of these bindings > to appear  in interface files. > > Can you confirm that the change is ok?  Thanks > > Simon > > > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs -------------- next part -------------- An HTML attachment was scrubbed... URL: From simon.peytonjones at gmail.com Mon Apr 7 11:37:42 2025 From: simon.peytonjones at gmail.com (Simon Peyton Jones) Date: Mon, 7 Apr 2025 12:37:42 +0100 Subject: Late CC profiling In-Reply-To: <5552bd3f-427d-490c-b024-9e12cc3dce90@gmx.at> References: <5552bd3f-427d-490c-b024-9e12cc3dce90@gmx.at> Message-ID: If you are sure there is no need to tidy the class method selectors then it would be clearest to have a explicit "materializeImplicitBinds" step in the pipeline after tidy and before lateCC runs. Yes, I want to put all the materialisation in one place. 
I think you are suggesting Plan A

- Tidy
- Materialise implicit bindings (class op selectors, data constructor
  workers, data constructor wrappers)
- Add late cost centres
- Run late plugins
- Prep
- CoreToStg

But then we'll add cost centres to data constructor workers and wrappers.

Or Plan B

- Tidy
- Add late cost centres
- Materialise implicit bindings (class op selectors, data constructor
  workers, data constructor wrappers)
- Run late plugins
- Prep
- CoreToStg

Now we won't get cost centres on class-op selectors, which Matthew says he
wants.

I suppose we could do Plan A, but have the "add cost centre" pass skip
over data constructor workers and wrappers.

Do you have any views?

I'm a bit inclined to combine the late-cc/materialise/prep combo into the
main function of GHC.CoreToStg.Prep. But I wonder if the "late plugins"
need to see (a) the materialised implicit bindings and (b) late cost
centres. I have no idea how late plugins are used.

Matthew asks

I would quite like to know if there is an unoptimised call to >>= in my
program. You have probably thought about this a little, so interested to
hear your opinion.

I was thinking that the class-op selector is a small, cheap function, so I
don't mind not profiling it. But perhaps your point is that calling it is
symptomatic of running overloaded code, and we might want to know that.

Simon

On Mon, 7 Apr 2025 at 10:37, Andreas Klebinger wrote:

> A interesting wrinkle!
>
> I agree with matt that having class methods show up in the profile can be
> very helpful at times.
>
> Constructors and their wrappers on the other hand should be of no concern.
> We explicitly try to avoid wrapping them in cost centres as they generally
> add no useful information to profiles.
>
> I've briefly looked over the code and adding class methods seems to be the
> *first* thing tidy does, not at it's end.
(in tidyProgram) > That would suggest to me that there is a need to tidy these implicit > bindings so moving them might have further unintended consequences. > > But then Note [Injecting implicit bindings] claims we do it at the end of > tidy, but seems outdated. This seems like a proper mess. > > If you are sure there is no need to tidy the class method selectors then > it would be clearest to have a explicit "materializeImplicitBinds" step in > the pipeline after tidy and before lateCC runs. > That seems like it would be clearer and more coherent overall. And it > would sidestep this issue completely. > > Cheers > Andreas > > > Am 07/04/2025 um 11:08 schrieb Matthew Pickering: > > Hi Simon, > > It does seem strange (at first glance) that these implicit things happen > in two different places. > > However, I think you should do this refactoring in a separate MR to > !10479, it seems unrelated to the purpose of that patch and affects other > things like you have mentioned. > > Why do you think the change in callstack is fine? It seems the difference > is `callstack001` is whether there an entry for `>>=` or not, which seems > like quite an important thing to place a cost centre on? Andreas has a > better intuition for this than me. I would quite like to know if there is > an unoptimised call to >>= in my program. You have probably thought about > this a little, so interested to hear your opinion. > > Cheers, > > Matt > > > On Fri, Apr 4, 2025 at 10:44 PM Simon Peyton Jones < > simon.peytonjones at gmail.com> wrote: > >> Hi Matthew, Andreas, Ben >> >> In GHC today there is a strange inconsistency: >> >> - Bindings for data constructor wrappers and class method selectors >> are added at the end of CoreTidy. >> - Bindings for data constructor workers are added at the start of >> CorePrep >> >> This is deeply strange. 
In !10479 (unary class patch) >> I had to >> make some changes in this area and after being baffled I moved all these >> implicit bindings to one place at the start of CorePrep. That is simpler >> and neater. There is no reason for them to be separate. >> >> BUT I find that one thing happens between Tidy and Prep, namely >> late-cost-centre injection. That means we will no longer get >> late-cost-centres for data constructor wrappers and class method >> selectors. I think that this is what is causing diffs in the output for >> callstack001 and friends in !10479. >> >> I think that this is perfectly fine, and I propose to accept the changes. >> >> Another alternative would be to inject all of these bindings at the end >> of Tidy (instead of the start of Prep), But it's much nicer at the start >> of Prep -- we don't want any of these bindings to appear in interface >> files. >> >> Can you confirm that the change is ok? Thanks >> >> Simon >> > > _______________________________________________ > ghc-devs mailing listghc-devs at haskell.orghttp://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From klebinger.andreas at gmx.at Mon Apr 7 12:01:12 2025 From: klebinger.andreas at gmx.at (Andreas Klebinger) Date: Mon, 7 Apr 2025 14:01:12 +0200 Subject: Late CC profiling In-Reply-To: References: <5552bd3f-427d-490c-b024-9e12cc3dce90@gmx.at> Message-ID: > But then we'll add cost centres to data constructor workers and wrappers. ... but have the "add cost centre" pass skip over data constructor workers and wrappers. We already do this. See compiler/GHC/Core/LateCC/TopLevelBinds.hs:112 where we already check for constructor bindings. I've added this to avoid adding ccs to workers but it should work for wrappers just the same. That means I think your plan A should work fine, assuming there is no need to tidy those binds. 
> I'm a bit inclined to combine the late-cc/materialise/prep combo into the main function of GHC.CoreToStg.Prep. In general I always found it easier to understand passes when they are narrow in scope. Sometimes combining them is worth it for performance reasons but I suspect this is not so here. And their concerns are clearly separate so combining them seems odd to me from code structure point of view as well. For example LateCC must return a list of inserted cost centres, while CorePrep doesn't need to care about such a thing at all currently! So I would rather keep those clearly separated. > I was thinking that the class-op selector is a small, cheap function, so I don't mind not profiling it. But perhaps your point is that calling it symptomatic of running overloaded code, and we might want to know that. It has definitely been a helpful indicator that optimizations went wrong for me in the past, as one wouldn't expect (>>=) or similar to be called 5 million times in a optimized program. Beyond that one of the ideas behind the current late cost centre profiling is that it can tell you with fairly high precision which (top level) functions are actually executed at runtime. Removing method selectors muddies the water there a bit. Am 07/04/2025 um 13:37 schrieb Simon Peyton Jones: > > If you are sure there is no need to tidy the class method > selectors then it would be clearest to have a explicit > "materializeImplicitBinds" step in the pipeline after tidy and > before lateCC runs. > > > Yes, I want to put all the materialisation in one place.  I think you > are suggesting Plan A > > * Tidy > * Materialise implicit bindings (class op selectors, data > constructor workers, data constructor wrappers) > * Add late cost centres > * Run late plugins > * Prep > * CoreToStg > > But then we'll add cost centres to data constructor workers and wrappers. 
> > Or Plan B > > * Tidy > * Add late cost centres > * Materialise implicit bindings (class op selectors, data > constructor workers, data constructor wrappers > * Run late plugins > * Prep > * CoreToStg > > Now we won't get cost centres on class-op selectors, which Matthew > says he wants. > > I suppose we could have do Plan A, but have the "add cost centre" pass > skip over data constructor workers and wrappers. > > Do you have any views? > > I'm a bit inclined to combine the late-cc/materialise/prep combo into > the main function of GHC.CoreToStg.Prep.   But I wonder if the "late > plugins" need to see (a) the materialised implicit bindings and (b) > late cost centres.    I have no idea how late plugins are used. > > Matthew asks > > I would quite like to know if there is an unoptimised call to >>= > in my program. You have probably thought about this a little, so > interested to hear your opinion. > > > I was thinking that the class-op selector is a small, cheap function, > so I don't mind not profiling it.  But perhaps your point is that > calling it symptomatic of running overloaded code, and we might want > to know that. > > > > Simon > > > On Mon, 7 Apr 2025 at 10:37, Andreas Klebinger > wrote: > > A interesting wrinkle! > > I agree with matt that having class methods show up in the profile > can be very helpful at times. > > Constructors and their wrappers on the other hand should be of no > concern. > We explicitly try to avoid wrapping them in cost centres as they > generally add no useful information to profiles. > > I've briefly looked over the code and adding class methods seems > to be the *first* thing tidy does, not at it's end. (in tidyProgram) > That would suggest to me that there is a need to tidy these > implicit bindings so moving them might have further unintended > consequences. > > But then Note [Injecting implicit bindings] claims we do it at the > end of tidy, but seems outdated.This seems like a proper mess. 
> > If you are sure there is no need to tidy the class method > selectors then it would be clearest to have a explicit > "materializeImplicitBinds" step in the pipeline after tidy and > before lateCC runs. > That seems like it would be clearer and more coherent overall. And > it would sidestep this issue completely. > > Cheers > Andreas > > > Am 07/04/2025 um 11:08 schrieb Matthew Pickering: >> Hi Simon, >> >> It does seem strange (at first glance) that these implicit things >> happen in two different places. >> >> However, I think you should do this refactoring in a separate MR >> to !10479, it seems unrelated to the purpose of that patch and >> affects other things like you have mentioned. >> >> Why do you think the change in callstack is fine? It seems the >> difference is `callstack001` is whether there an entry for `>>=` >> or not, which seems like quite an important thing to place a cost >> centre on? Andreas has a >> better intuition for this than me. I would quite like to know if >> there is an unoptimised call to >>= in my program. You have >> probably thought about this a little, so interested to hear your >> opinion. >> >> Cheers, >> >> Matt >> >> >> On Fri, Apr 4, 2025 at 10:44 PM Simon Peyton Jones >> wrote: >> >> Hi Matthew, Andreas, Ben >> >> In GHC today there is a strange inconsistency: >> >> * Bindings for data constructor wrappers and class method >> selectors are added at the end of CoreTidy. >> * Bindings for data constructor workers are added at the >> start of CorePrep >> >> This is deeply strange.  In !10479 (unary class patch) >> I >> had to make some changes in this area and after being baffled >> I moved all these implicit bindings to one place at the start >> of CorePrep.  That is simpler and neater.  There is no reason >> for them to be separate. >> >> BUT I find that one thing happens between Tidy and Prep, >> namely late-cost-centre injection.  
That means we will no >> longer get late-cost-centres for data constructor wrappers >> and class method selectors.  I think that this is what is >> causing diffs in the output for callstack001 and friends in >> !10479. >> >> I think that this is perfectly fine, and I propose to accept >> the changes. >> >> Another alternative would be to inject all of these bindings >> at the end of Tidy (instead of the start of Prep),  But it's >> much nicer at the start of Prep -- we don't want any of these >> bindings to appear  in interface files. >> >> Can you confirm that the change is ok? Thanks >> >> Simon >> >> >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andreash87 at gmx.ch Mon Apr 7 12:28:16 2025 From: andreash87 at gmx.ch (Andreas Herrmann) Date: Mon, 7 Apr 2025 14:28:16 +0200 Subject: Extended deadline - CFP - Haskell Implementors' Workshop 2025 Message-ID: Extended deadline to April 18, 2025 Call for proposals for the Haskell Implementors' Workshop https://haskell.foundation/events/2025-haskell-implementors-workshop.html June 6, 2025 Organized by the Haskell Community Co-located with ZuriHac 2025 and Haskell Ecosystem Workshop 2025 Hosted by the Haskell Foundation at Eastern Switzerland University of Applied Sciences (OST) https://www.ost.ch/en/university-of-applied-sciences/campus/rapperswil-jona-campus ## Overview * Extended Deadline: April 18, 2025 * Notification: May 5, 2025 * Workshop: June 6, 2025 The 17th Haskell Implementors' Workshop is to be held alongside ZuriHac 2025 this year near Zurich. It is a forum for people involved in the design and development of Haskell implementations, tools, libraries, and supporting infrastructure to share their work and to discuss future directions and collaborations with others. 
Talks and/or demos are proposed by submitting an abstract, and selected by a small program committee. There will be no published proceedings. The workshop will be informal and interactive, with open spaces in the timetable and room for ad-hoc discussion, demos, and lightning talks. In the past the Haskell Implementors’ Workshop was co-located with ICFP (International Conference on Functional Programming). However, in recent years it has become more and more challenging to attract a large enough audience and sufficiently many speakers for an appealing program. ZuriHac and the Haskell Ecosystem Workshop have become an important annual gathering of a large part of the Haskell community. This year the Haskell Implementors’ Workshop will be co-located with these events to be accessible to a broader audience. ## Scope and Target Audience The Haskell Implementors' Workshop is an ideal place to describe a Haskell extension, describe works-in-progress, demo a new Haskell-related tool, or even propose future lines of Haskell development. Members of the wider Haskell community are encouraged to attend the workshop - we need your feedback to keep the Haskell ecosystem thriving. Students working with Haskell are especially encouraged to share their work. The scope covers any of the following topics. There may be some topics that people feel we've missed, so by all means submit a proposal even if it doesn't fit exactly into one of these buckets: * Compilation techniques * Language features and extensions * Type system implementation * Concurrency and parallelism: language design and implementation * Performance, optimization and benchmarking * Virtual machines and run-time systems * Libraries and tools for development or deployment ## Talks We invite proposals from potential speakers for talks and demonstrations. We are aiming for 20-minute talks with 5 minutes for questions and changeovers. 
We want to hear from people writing compilers, tools, or libraries, people with cool ideas for directions in which we should take the platform, proposals for new features to be implemented, and half-baked crazy ideas. Submissions can be made via the form linked below until April 18, 2025 (anywhere on earth). https://docs.google.com/forms/d/e/1FAIpQLSdczGbxJYGc4eusvPrxwBbZl561PnKeYnoZ2hYsdw_ZpSfupQ/viewform?usp=header We will also have a lightning talks session. Lightning talks should be ~7mins and are scheduled on the day of the workshop. Suggested topics for lightning talks are to present a single idea, a work-in-progress project, a problem to intrigue and perplex Haskell implementors, or simply to ask for feedback and collaborators. ## Program Committee * Luite Stegeman * Jaro Reinders * Emily Pillmore * Rodrigo Mesquita * Ian-Woo Kim * Andreas Herrmann (chair) ## Contact * Andreas Herrmann -------------- next part -------------- An HTML attachment was scrubbed... URL: From simon.peytonjones at gmail.com Tue Apr 8 09:09:48 2025 From: simon.peytonjones at gmail.com (Simon Peyton Jones) Date: Tue, 8 Apr 2025 10:09:48 +0100 Subject: Testsuite tests In-Reply-To: References: Message-ID: > > It seems that you have not built profiled libraries? It's impossible to > say without knowing your custom local configuration. Could you share that? Sure! UserSettings.hs is attached. Funnily enough the build system takes ages (many minutes) building profied version of the runtime system, despite my attempting to switch that off. Any advice there? Separately, it would be great if the testsuite driver explained why it discards tests. Eg. "Skipping X, Y, Z because they require profiled libraries". Otherwise it's hard to know why something is skipped. Thanks for helping! Simon On Mon, 7 Apr 2025 at 09:57, Matthew Pickering wrote: > It seems that you have not built profiled libraries? It's impossible to > say without knowing your custom local configuration. Could you share that? 
> > On Fri, Apr 4, 2025 at 10:30 PM Simon Peyton Jones < > simon.peytonjones at gmail.com> wrote: > >> Friends >> >> In CI for !10479 I'm getting test failures for some call-stack tests e.g. >> `callstack001`. >> >> But if I try to run this in my build tree >> >> hadrian/build -j8 test --skip=_build/stage1/rts/** >> --skip=_build/stage1/** --skip=_build/stage0/** --docs=none >> --only=callstack001 >> >> I get >> >> SUMMARY for test run started at Fri Apr 4 22:22:47 2025 >> 0:00:02.413942 spent to go through >> 1 total tests, which gave rise to >> 8 test cases, of which >> 8 were skipped >> 0 had missing libraries >> >> 0 expected passes >> 0 expected failures >> >> 0 caused framework failures >> 0 caused framework warnings >> 0 unexpected passes >> 0 unexpected failures >> 0 unexpected stat failures >> 0 fragile tests >> >> So I can't reproduce the failure. What am i doing wrong? Can anyone >> help? >> >> Thanks >> >> Simon >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: UserSettings-spj.hs Type: application/octet-stream Size: 3746 bytes Desc: not available URL: From matthewtpickering at gmail.com Tue Apr 8 09:23:46 2025 From: matthewtpickering at gmail.com (Matthew Pickering) Date: Tue, 8 Apr 2025 10:23:46 +0100 Subject: Next steps forward for a stable Template Haskell In-Reply-To: References: Message-ID: Hi Teo, I went back and read https://gitlab.haskell.org/ghc/ghc/-/merge_requests/13297 I think my comment is a very good summary there. Which I will paste into this thread. ``` Why are these packages not re-installable? 1. 
If you are not using the internal interpreter (or dynamic loading at all), then you are free to reinstall any package as long as the resulting package database contains a unit for each of the wired in unit ids which exports the relevant wired-in definitions. 2. If you are using template-haskell and the internal interpreter, then your program must link against exactly the same version of template-haskell (and dependencies) that the 3. If you are using plugins, then your program must link against exactly the same version of ghc library (and dependencies) ** Cabal simplifies all of these situations by refusing to reinstall certain packages and always using the global package database ** 1. Cabal refuses to reinstall wired in units and always uses the global package database, therefore the relevant unit is always available with the correct ABI. 2. Cabal refuses to reinstall template-haskell, which forces it to be taken from the global package database, which ensures it is the same version as the compiler is linked against. 3. Cabal refuses to reinstall ghc library, which forces it to be taken from the global package database, which ensures the same version of the library is linked against. Therefore I don't think it's accurate for the compiler to advertise in particular which packages are "not reinstallable", as that is not a property of the compiler itself. There is no conceptual barrier to the situation where you want to use template-haskell or ghc library in a situation where you don't want or need to dynamically load code into the running process. In this situation, it would be perfectly permissible to reinstall it and use it in your program. The issue comes that Cabal doesn't allow you to express the constraint that the version of template-haskell you depend on needs to be fixed to the one the compiler is built against. As such, it has to be pessimistic and never reinstall template-haskell or ghc, but against, not an issue in particular with GHC. 
** Cabal can not express the constraint that a library must be the same version as the one the build compiler is built against ** ``` Your point is that * GHC should advertise to Cabal which (template-haskell, ghc) unit it links against. Then cabal should ensure that if TemplateHaskell/plugins are used then it should enforce in the build plan that this version of `template-haskell`/`ghc` is picked. My point was * GHC can only advertise a unit id, it wasn't clear to me about how to map this information into the cabal solver. I suppose we will have the assumption that if GHC advertises it is linked against a specific unit id, then that unit will be in a global package database. Cabal already reads the global package database. Perhaps the only thing we need to add to the cabal solver is the ability to express a constraint that you depend on a precise unit-id. Then when cabal-install starts the solver it can query GHC to find the unit-ids which need to be fixed, and feed those constraints into the solver. Then if users are feeling very brave, they can turn off this behaviour.. Thinking aloud here.. Matt On Wed, Apr 2, 2025 at 6:26 PM Teo Camarasu wrote: > Hi folks, > > At Tuesday's GHC devs meeting, we talked about having a discussion about > template-haskell stability. I've jotted down some notes to help focus our > future discussion. It's mostly just a list of what I take to be the next > most important issues to tackle, and some details about each. > > > https://docs.google.com/document/d/1gui1YXOvrNihR3NpXuMVCKV3lTkKPL8y_HDRjbZyRxk/edit?usp=sharing > > Cheers, > Teo > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From teofilcamarasu at gmail.com Tue Apr 8 10:11:11 2025 From: teofilcamarasu at gmail.com (Teo Camarasu) Date: Tue, 8 Apr 2025 11:11:11 +0100 Subject: Next steps forward for a stable Template Haskell In-Reply-To: References: Message-ID: Thanks both! I've updated the doc and replied to your comments Simon. I'll also update the wiki page soon. Thanks for summarising Matt! I've copied over your email into the doc. I'll also post your suggestion in the cabal ticket. Cheers, Teo On Tue, Apr 8, 2025 at 10:24 AM Matthew Pickering < matthewtpickering at gmail.com> wrote: > Hi Teo, I went back and read > https://gitlab.haskell.org/ghc/ghc/-/merge_requests/13297 > > I think my comment is a very good summary there. Which I will paste into > this thread. > > ``` > > Why are these packages not re-installable? > > 1. > > If you are not using the internal interpreter (or dynamic loading at > all), then you are free to reinstall any package as long as the resulting > package database contains a unit for each of the wired in unit ids which > exports the relevant wired-in definitions. > 2. > > If you are using template-haskell and the internal interpreter, then > your program must link against exactly the same version of template-haskell > (and dependencies) that the > 3. > > If you are using plugins, then your program must link against exactly > the same version of ghc library (and dependencies) > > ** Cabal simplifies all of these situations by refusing to reinstall > certain packages and always using the global package database ** > > 1. Cabal refuses to reinstall wired in units and always uses the > global package database, therefore the relevant unit is always available > with the correct ABI. > 2. Cabal refuses to reinstall template-haskell, which forces it to be > taken from the global package database, which ensures it is the same > version as the compiler is linked against. > 3. 
Cabal refuses to reinstall ghc library, which forces it to be taken > from the global package database, which ensures the same version of the > library is linked against. > > Therefore I don't think it's accurate for the compiler to advertise in > particular which packages are "not reinstallable", as that is not a > property of the compiler itself. > > There is no conceptual barrier to the situation where you want to use > template-haskell or ghc library in a situation where you don't want or > need to dynamically load code into the running process. In this situation, > it would be perfectly permissible to reinstall it and use it in your > program. The issue comes that Cabal doesn't allow you to express the > constraint that the version of template-haskell you depend on needs to be > fixed to the one the compiler is built against. As such, it has to be > pessimistic and never reinstall template-haskell or ghc, but against, not > an issue in particular with GHC. > > ** Cabal can not express the constraint that a library must be the same > version as the one the build compiler is built against ** > > ``` > > Your point is that > > * GHC should advertise to Cabal which (template-haskell, ghc) unit it > links against. Then cabal should ensure that if TemplateHaskell/plugins are > used then it should enforce in the build plan that this version of > `template-haskell`/`ghc` is picked. > > My point was > > * GHC can only advertise a unit id, it wasn't clear to me about how to map > this information into the cabal solver. > > I suppose we will have the assumption that if GHC advertises it is linked > against a specific unit id, then that unit will be in a global package > database. Cabal already reads the global package database. > > Perhaps the only thing we need to add to the cabal solver is the ability > to express a constraint that you depend on a precise unit-id. 
Then when > cabal-install starts the solver it can query GHC to find the unit-ids which > need to be fixed, and feed those constraints into the solver. Then if users > are feeling very brave, they can turn off this behaviour.. > > Thinking aloud here.. > > Matt > > > > > > > On Wed, Apr 2, 2025 at 6:26 PM Teo Camarasu > wrote: > >> Hi folks, >> >> At Tuesday's GHC devs meeting, we talked about having a discussion about >> template-haskell stability. I've jotted down some notes to help focus our >> future discussion. It's mostly just a list of what I take to be the next >> most important issues to tackle, and some details about each. >> >> >> https://docs.google.com/document/d/1gui1YXOvrNihR3NpXuMVCKV3lTkKPL8y_HDRjbZyRxk/edit?usp=sharing >> >> Cheers, >> Teo >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthewtpickering at gmail.com Tue Apr 8 12:51:24 2025 From: matthewtpickering at gmail.com (Matthew Pickering) Date: Tue, 8 Apr 2025 13:51:24 +0100 Subject: Testsuite tests In-Reply-To: References: Message-ID: Hi Simon, You have disabled the build of profiled libraries in your user settings, so the test doesn't run, because it requires profiled libraries. If you want to disabled profiled things (including the rts) from your build, then import `Flavour` and use the `disableProfiledLibs :: Flavour -> Flavour` function. ``` userFlavour = disableProfiledLibs $ defaultFlavour { ... } ``` If you want to run this test then you need profiled libraries. ``` ./hadrian/build --flavour=perf -o_perf_build test --only=callstack001 ``` Hope that helps. Matt On Tue, Apr 8, 2025 at 10:10 AM Simon Peyton Jones < simon.peytonjones at gmail.com> wrote: > It seems that you have not built profiled libraries? It's impossible to >> say without knowing your custom local configuration. 
Could you share that? > > > Sure! UserSettings.hs is attached. > > Funnily enough the build system takes ages (many minutes) building profied > version of the runtime system, despite my attempting to switch that off. > Any advice there? > > Separately, it would be great if the testsuite driver explained why it > discards tests. Eg. "Skipping X, Y, Z because they require profiled > libraries". Otherwise it's hard to know why something is skipped. > > Thanks for helping! > > Simon > > > > On Mon, 7 Apr 2025 at 09:57, Matthew Pickering < > matthewtpickering at gmail.com> wrote: > >> It seems that you have not built profiled libraries? It's impossible to >> say without knowing your custom local configuration. Could you share that? >> >> On Fri, Apr 4, 2025 at 10:30 PM Simon Peyton Jones < >> simon.peytonjones at gmail.com> wrote: >> >>> Friends >>> >>> In CI for !10479 I'm getting test failures for some call-stack tests >>> e.g. `callstack001`. >>> >>> But if I try to run this in my build tree >>> >>> hadrian/build -j8 test --skip=_build/stage1/rts/** >>> --skip=_build/stage1/** --skip=_build/stage0/** --docs=none >>> --only=callstack001 >>> >>> I get >>> >>> SUMMARY for test run started at Fri Apr 4 22:22:47 2025 >>> 0:00:02.413942 spent to go through >>> 1 total tests, which gave rise to >>> 8 test cases, of which >>> 8 were skipped >>> 0 had missing libraries >>> >>> 0 expected passes >>> 0 expected failures >>> >>> 0 caused framework failures >>> 0 caused framework warnings >>> 0 unexpected passes >>> 0 unexpected failures >>> 0 unexpected stat failures >>> 0 fragile tests >>> >>> So I can't reproduce the failure. What am i doing wrong? Can anyone >>> help? >>> >>> Thanks >>> >>> Simon >>> _______________________________________________ >>> ghc-devs mailing list >>> ghc-devs at haskell.org >>> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >>> >> -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ben at well-typed.com Mon Apr 14 19:01:33 2025 From: ben at well-typed.com (Ben Gamari) Date: Mon, 14 Apr 2025 15:01:33 -0400 Subject: GitLab maintenance this evening Message-ID: <87msci7ek2.fsf@smart-cactus.org> Hello all, Tonight GitLab will be down for maintenance between 17:00 EDT and 18:00 EDT as part of the on-going infrastructure migration. Cheers, - Ben From danihacker.viedma at gmail.com Tue Apr 15 16:40:03 2025 From: danihacker.viedma at gmail.com (Daniel Trujillo Viedma) Date: Tue, 15 Apr 2025 18:40:03 +0200 Subject: Questions regarding the RISCV native codegen & performance Message-ID: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> Hello, ghc-devs! My name is Daniel Trujillo, I'm a Haskell enthusiast from Spain and I'm trying to make my Master's thesis about accelerating Haskell programs with a custom ISA extension. Right now, my focus is in executing software written in Haskell within QEMU in order to get traces that tells me, basically, how many times each block (not exactly basic blocks, but sort of) of assembly code has been executed, with the hope of finding some patterns of RISCV instructions that I could implement together into 1 instruction. As you can see, my method is a bit crude, and I was wondering if the people involved with any of the different internal representations (STG, Cmm...) and/or native code generators (particularly RISCV) could provide me hints about assembly instructions that would have made the work easier, by removing the need of "massaging" the Cmm code to make CodeGen easier, or the need of particular optimizations, or in general, dirty tricks because of lacking of proper support of the standard RISCV ISA. And of course, I would also appreciate very much other hints from people involved in general performance (as oppossed to, for example, libraries for SIMD and parallel execution, or Haskell wrappers to lower-level code for performance reasons). P.D. 
I'm sorry if I broke any netiquette rule, but I'm very new to the email list, and haven't received yet any email from it. Looking forward to hear from you! Cheers, Dani. From matthewtpickering at gmail.com Wed Apr 16 08:33:49 2025 From: matthewtpickering at gmail.com (Matthew Pickering) Date: Wed, 16 Apr 2025 09:33:49 +0100 Subject: Questions regarding the RISCV native codegen & performance In-Reply-To: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> References: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> Message-ID: Hi Daniel. I think Sven Tennie and the LoongArch contributors are the experts in NCG for these kinds of instruction sets. I have cced them. Cheers, Matt On Tue, Apr 15, 2025 at 5:40 PM Daniel Trujillo Viedma < danihacker.viedma at gmail.com> wrote: > Hello, ghc-devs! My name is Daniel Trujillo, I'm a Haskell enthusiast > from Spain and I'm trying to make my Master's thesis about accelerating > Haskell programs with a custom ISA extension. > > > Right now, my focus is in executing software written in Haskell within > QEMU in order to get traces that tells me, basically, how many times > each block (not exactly basic blocks, but sort of) of assembly code has > been executed, with the hope of finding some patterns of RISCV > instructions that I could implement together into 1 instruction. > > > As you can see, my method is a bit crude, and I was wondering if the > people involved with any of the different internal representations (STG, > Cmm...) and/or native code generators (particularly RISCV) could provide > me hints about assembly instructions that would have made the work > easier, by removing the need of "massaging" the Cmm code to make CodeGen > easier, or the need of particular optimizations, or in general, dirty > tricks because of lacking of proper support of the standard RISCV ISA. 
> > > And of course, I would also appreciate very much other hints from people > involved in general performance (as oppossed to, for example, libraries > for SIMD and parallel execution, or Haskell wrappers to lower-level code > for performance reasons). > > > P.D. I'm sorry if I broke any netiquette rule, but I'm very new to the > email list, and haven't received yet any email from it. > > > Looking forward to hear from you! > > Cheers, > > Dani. > > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... URL: From diegorosario2013 at gmail.com Wed Apr 16 20:40:58 2025 From: diegorosario2013 at gmail.com (Diego Antonio Rosario Palomino) Date: Wed, 16 Apr 2025 15:40:58 -0500 Subject: GSoC 2025 proposal : Documenting and Improving Cmm Message-ID: Hi all, I've posted a thread on Discourse outlining my GSoC 2025 proposal, which focuses on documenting and improving GHC's Cmm backend. It covers two possible directions (LLVM and SSA), along with some refinements I've been exploring. You can find the full details here: 👉 https://discourse.haskell.org/t/gsoc-2025-documenting-and-improving-cmm/11870 Feedback is very welcome! Best regards, Diego Antonio Rosario Palomino -------------- next part -------------- An HTML attachment was scrubbed... URL: From sven.tennie at gmail.com Thu Apr 17 07:19:44 2025 From: sven.tennie at gmail.com (Sven Tennie) Date: Thu, 17 Apr 2025 09:19:44 +0200 Subject: Questions regarding the RISCV native codegen & performance In-Reply-To: References: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> Message-ID: Hey Daniel 👋 That's really an interesting topic, because we never analyzed the emitted RISC-V assembly with statistical measures. 
So, if I may ask for a favour: If you spot anything that could be better expressed with the current ISA, please open a ticket and label it as RISC-V: https://gitlab.haskell.org/ghc/ghc/-/issues (We haven't decided which RISC-V profile to require. I.e. requiring the very latest extensions would frustrate people with older hardware... However, it's in anycase good to have possible improvements documented in tickets.) I'm wondering if you really have to go through QEMU. Or, if feeding assembly code to a parser and then doing the math on that wouldn't be sufficient? (Of course, tracing the execution is more accurate. However, it's much more complicated as well.) To account Assembly instructions to Cmm statements you may use the GHC parameters -ddump-cmm and -dppr-debug (and to stream this into files instead of stdout -ddump-to-file.) This will add comments for most Cmm statements into the dumped assembly code. At first, I thought that sign-extension / truncation might be a good candidate. However, it turned out that this is already covered by the RISC-V B-extension. Which led to this new ticket: https://gitlab.haskell.org/ghc/ghc/-/issues/25966 Skimming over the NCG code and watching out for longer or repeating instruction lists might be a good strategy to make educated guesses. >From a developer's perspective, I found the immediate sizes (usually 12bit) rather limiting. E.g. the Note [RISCV64 far jumps] ( https://gitlab.haskell.org/ghc/ghc/-/blob/395e0ad17c0d309637f079a05dbdc23e0d4188f6/compiler/GHC/CmmToAsm/RV64/CodeGen.hs?page=2#L1996) tells a story how we had to work around this limit for addresses in conditional jumps. So, you could raise the question if - analog to compressed expressions - it wouldn't make sense to have extended expressions that cover two words. Such that the first word is the instruction and the second it's immediate(s). (Hardware designers would probably hate that, because it means a bigger change to the instruction decoding unit. 
However, I got asked as a software developer ;) ) Other than that, I've unfortunately got no great ideas. Please feel free to keep us in the loop (especially regarding the results of your analyses.) And, if you've got any questions regarding the RISC-V NCG, please feel free to reach out either here or directly to me. There's also a #GHC "room" on Matrix where you can quickly drop smaller scoped questions. I hope that was of any help. Best regards, Sven Am Mi., 16. Apr. 2025 um 10:34 Uhr schrieb Matthew Pickering < matthewtpickering at gmail.com>: > Hi Daniel. I think Sven Tennie and the LoongArch contributors are the > experts in NCG for these kinds of instruction sets. I have cced them. > > Cheers, > > Matt > > On Tue, Apr 15, 2025 at 5:40 PM Daniel Trujillo Viedma < > danihacker.viedma at gmail.com> wrote: > >> Hello, ghc-devs! My name is Daniel Trujillo, I'm a Haskell enthusiast >> from Spain and I'm trying to make my Master's thesis about accelerating >> Haskell programs with a custom ISA extension. >> >> >> Right now, my focus is in executing software written in Haskell within >> QEMU in order to get traces that tells me, basically, how many times >> each block (not exactly basic blocks, but sort of) of assembly code has >> been executed, with the hope of finding some patterns of RISCV >> instructions that I could implement together into 1 instruction. >> >> >> As you can see, my method is a bit crude, and I was wondering if the >> people involved with any of the different internal representations (STG, >> Cmm...) and/or native code generators (particularly RISCV) could provide >> me hints about assembly instructions that would have made the work >> easier, by removing the need of "massaging" the Cmm code to make CodeGen >> easier, or the need of particular optimizations, or in general, dirty >> tricks because of lacking of proper support of the standard RISCV ISA. 
>> >> >> And of course, I would also appreciate very much other hints from people >> involved in general performance (as oppossed to, for example, libraries >> for SIMD and parallel execution, or Haskell wrappers to lower-level code >> for performance reasons). >> >> >> P.D. I'm sorry if I broke any netiquette rule, but I'm very new to the >> email list, and haven't received yet any email from it. >> >> >> Looking forward to hear from you! >> >> Cheers, >> >> Dani. >> >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From danihacker.viedma at gmail.com Fri Apr 18 10:13:44 2025 From: danihacker.viedma at gmail.com (Daniel Trujillo Viedma) Date: Fri, 18 Apr 2025 12:13:44 +0200 Subject: Questions regarding the RISCV native codegen & performance In-Reply-To: References: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> Message-ID: <22850366-f63a-4158-a2b4-9d01e78173b6@gmail.com> Thank you so much for all the information and the help. Seriously, this is much more than I was hoping to get, even the suggestion for generating commented assembly code (which is, I assume, the method that Compiler Explorer uses to relate the high-level Haskell code with the assembly output of the compiler, which is really nice). And four your RISC-V NGC, which I found easy to understand. I guess this is the kind of professionals that Haskell attracts, which is a big part of why I love it. I will send here an executive summary of my findings, including statistics about a couple of programs that I try. 
I don't know if I'll be able to do a very statistical significant analysis, because I'll still have a lot of things to do (to extend QEMU, maybe also gem5, and implement it in a Clash microprocessor design, probably VELDT), but maybe in the future I can automate more of it and running a more comprehensive analysis. FYI, I have found that the RISC-V specs mention Haskell among other languages in a still empty J extension section, which will be aimed at helping dynamically translated languages as well as garbage-collected, but I guess RISC-V people is still more focused on other things and it will take some time to start work on that extension. I find also very interesting your suggestion for far-jumping, but I'm afraid that will be very unpopular among hardware designers because it messes with their highly appreciated and scarce L1 cache. But funnily enough, I had the impresion before starting this project that some kind of simple mechanism for complex jumping would be a good idea. I will keep this in mind when looking for patterns in the assembly code. Once more, thank you so much for your work and the help, and I hope I can deliver soon information that you all could find interesting. Have a very nice weekend! Cheers, Dani. On 17/4/25 9:19, Sven Tennie wrote: > Hey Daniel 👋 > > That's really an interesting topic, because we never analyzed the > emitted RISC-V assembly with statistical measures. > > So, if I may ask for a favour: If you spot anything that could be > better expressed with the current ISA, please open a ticket and label > it as RISC-V: https://gitlab.haskell.org/ghc/ghc/-/issues > (We haven't decided which RISC-V profile to require. I.e. requiring > the very latest extensions would frustrate people with older > hardware... However, it's in anycase good to have possible > improvements documented in tickets.) > > I'm wondering if you really have to go through QEMU. 
Or, if feeding > assembly code to a parser and then doing the math on that wouldn't be > sufficient? (Of course, tracing the execution is more accurate. > However, it's much more complicated as well.) > > To account Assembly instructions to Cmm statements you may use the GHC > parameters -ddump-cmm and -dppr-debug (and to stream this into files > instead of stdout -ddump-to-file.) This will add comments for most Cmm > statements into the dumped assembly code. > > At first, I thought that sign-extension / truncation might be a good > candidate. However, it turned out that this is already covered by the > RISC-V B-extension. Which led to this new ticket: > https://gitlab.haskell.org/ghc/ghc/-/issues/25966 > > Skimming over the NCG code and watching out for longer or repeating > instruction lists might be a good strategy to make educated guesses. > > From a developer's perspective, I found the immediate sizes (usually > 12bit) rather limiting. E.g. the Note [RISCV64 far jumps] > (https://gitlab.haskell.org/ghc/ghc/-/blob/395e0ad17c0d309637f079a05dbdc23e0d4188f6/compiler/GHC/CmmToAsm/RV64/CodeGen.hs?page=2#L1996) > tells a story how we had to work around this limit for addresses in > conditional jumps. > > So, you could raise the question if - analog to compressed expressions > - it wouldn't make sense to have extended expressions that cover two > words. Such that the first word is the instruction and the second it's > immediate(s). (Hardware designers would probably hate that, because it > means a bigger change to the instruction decoding unit. However, I got > asked as a software developer ;) ) > > Other than that, I've unfortunately got no great ideas. > > Please feel free to keep us in the loop (especially regarding the > results of your analyses.) And, if you've got any questions regarding > the RISC-V NCG, please feel free to reach out either here or directly > to me. There's also a #GHC "room" on Matrix where you can quickly drop > smaller scoped questions. 
> > I hope that was of some help. Best regards, > > Sven > > Am Mi., 16. Apr. 2025 um 10:34 Uhr schrieb Matthew Pickering > : > > Hi Daniel. I think Sven Tennie and the LoongArch contributors are > the experts in NCG for these kinds of instruction sets. I have > cced them. > > Cheers, > > Matt > > On Tue, Apr 15, 2025 at 5:40 PM Daniel Trujillo Viedma > wrote: > > Hello, ghc-devs! My name is Daniel Trujillo, I'm a Haskell > enthusiast > from Spain, and I'm writing my Master's thesis on > accelerating > Haskell programs with a custom ISA extension. > > > Right now, my focus is on executing software written in > Haskell within > QEMU in order to get traces that tell me, basically, how many > times > each block (not exactly basic blocks, but sort of) of assembly > code has > been executed, in the hope of finding some patterns of RISC-V > instructions that I could implement together as a single instruction. > > > As you can see, my method is a bit crude, and I was wondering > if the > people involved with any of the different internal > representations (STG, > Cmm...) and/or native code generators (particularly RISC-V) > could provide > me with hints about assembly instructions that would have made the > work > easier, by removing the need to "massage" the Cmm code to > make CodeGen > easier, or the need for particular optimizations, or, in > general, dirty > tricks due to lacking proper support in the standard > RISC-V ISA. > > > And of course, I would also very much appreciate other hints > from people > involved in general performance (as opposed to, for example, > libraries > for SIMD and parallel execution, or Haskell wrappers for > lower-level code > for performance reasons). > > > P.S. I'm sorry if I broke any netiquette rule, but I'm very > new to the > mailing list and haven't yet received any email from it. > > > Looking forward to hearing from you! > > Cheers, > > Dani. 
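The statistics Daniel describes (counting how often short sequences of instructions occur in the emitted assembly) can be prototyped without QEMU, along the lines Sven suggests above: feed the listing to a parser and do the math on it. Below is a minimal Python sketch; the input format, the filtering heuristics, and the `mnemonic_bigrams` helper are illustrative assumptions, not anything GHC or QEMU actually emits:

```python
from collections import Counter

def mnemonic_bigrams(lines):
    """Count adjacent mnemonic pairs in a linear assembly listing.

    Hypothetical helper: assumes one instruction per line with the
    mnemonic first; blanks, directives, comments and labels are
    filtered out. Adapt the filter to the real dump/trace format.
    """
    mnemonics = []
    for line in lines:
        text = line.strip()
        if not text or text.startswith((".", "#")) or text.endswith(":"):
            continue  # skip blanks, directives, comments and labels
        mnemonics.append(text.split()[0])
    # Pair each mnemonic with its successor and tally the pairs.
    return Counter(zip(mnemonics, mnemonics[1:]))

# Toy listing: two equality comparisons, each lowered to xor+seqz.
listing = [
    "cmp_eq:",
    "  xor  t0, a0, a1",
    "  seqz t0, t0",
    "  xor  t1, a0, a2",
    "  seqz t1, t1",
    "  ret",
]
print(mnemonic_bigrams(listing).most_common(1))  # → [(('xor', 'seqz'), 2)]
```

A frequently occurring bigram (or longer n-gram, by extending the `zip`) is exactly the kind of pattern that could be a candidate for fusing into one custom instruction. Note that static counts over a listing weight every occurrence equally; weighting by execution counts from a trace would change the ranking, which is the accuracy/complexity trade-off Sven mentions.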
> > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sven.tennie at gmail.com Fri Apr 18 12:41:28 2025 From: sven.tennie at gmail.com (Sven Tennie) Date: Fri, 18 Apr 2025 14:41:28 +0200 Subject: Questions regarding the RISCV native codegen & performance In-Reply-To: <22850366-f63a-4158-a2b4-9d01e78173b6@gmail.com> References: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> <22850366-f63a-4158-a2b4-9d01e78173b6@gmail.com> Message-ID: Hey Daniel 👋 Thanks a lot for your kind words. The AArch64 ISA might also be a source of inspiration. AArch64 has some combined instructions which RISC-V lacks, e.g. ADD of two registers with a built-in shift. Though, we don't seem to use many of them, and I haven't found any usage that wouldn't be well covered in the RISC-V NCG. Probably that's because MachOp ( https://hackage.haskell.org/package/ghc-9.12.1/docs/GHC-Cmm-MachOp.html#t:MachOp) is pretty fine-grained. A good candidate for investigation could be the CSET pseudo-instruction. I stumbled over it while looking for pseudo-ops that lead to combined instructions in the AArch64 NCG. The CSET pseudo-op leads to two instructions in the RISC-V NCG and to one in the AArch64 NCG: - https://gitlab.haskell.org/ghc/ghc/-/blob/386f18548e3c66d04f648a9d34f167a086c1328b/compiler/GHC/CmmToAsm/AArch64/Ppr.hs#L443 - https://gitlab.haskell.org/ghc/ghc/-/blob/386f18548e3c66d04f648a9d34f167a086c1328b/compiler/GHC/CmmToAsm/RV64/Ppr.hs#L554 Though, this might be a sub-optimal implementation (in this case we'd be happy to get a ticket ;) ). As CSET is used for comparisons, it should appear pretty frequently. A bit off-topic, but for the sake of completeness: the Compiler Explorer seems to use DWARF symbols to map assembly instructions to Haskell code lines. At least, it compiles with -g. 
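To make the CSET point above concrete: AArch64 has condition flags, so after a compare one CSET instruction materializes the boolean result, whereas RISC-V has no flags register and builds the result arithmetically in two steps. A hedged, hand-written sketch of the two lowerings for an equality comparison (illustrative assembly, not the NCGs' verbatim output):

```asm
; AArch64: one instruction after the compare
cmp  x0, x1
cset x2, eq        ; x2 := 1 if x0 == x1, else 0

# RISC-V: no flags, so the boolean is computed in two instructions
xor  t0, a0, a1    # t0 == 0 iff a0 == a1
seqz t1, t0        # t1 := 1 if t0 == 0, else 0
```

Since comparisons are ubiquitous in generated code, this is the kind of two-instruction idiom a pattern-frequency analysis should surface quickly.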
VELDT's profile is stated as "RV32I (no FENCE, ECALL, EBREAK)" on their GitHub page. But we target RV64G with both the NCG and LLVM backends. (The main reason not to support simpler profiles is that all hardware on the market that is powerful enough to reasonably run Haskell supports at least RV64G.) Thanks for the hint about the J extension. I will take a look at it. Enjoy your weekend & best regards, Sven Am Fr., 18. Apr. 2025 um 12:13 Uhr schrieb Daniel Trujillo Viedma < danihacker.viedma at gmail.com>: > Thank you so much for all the information and the help. > > > Seriously, this is much more than I was hoping to get, even the suggestion > for generating commented assembly code (which is, I assume, the method that > Compiler Explorer uses to relate the high-level Haskell code with the > assembly output of the compiler, which is really nice). And thanks for your > RISC-V NCG, which I found easy to understand. > > > I guess this is the kind of professional that Haskell attracts, which is > a big part of why I love it. > > > I will send here an executive summary of my findings, including statistics > about a couple of programs that I'll try. I don't know if I'll be able to do a > very statistically significant analysis, because I'll still have a lot of > things to do (extend QEMU, maybe also gem5, and implement it in a Clash > microprocessor design, probably VELDT), but maybe in the future I can > automate more of it and run a more comprehensive analysis. FYI, I have > found that the RISC-V specs mention Haskell among other languages in a > still-empty J extension section, which will be aimed at helping dynamically > translated languages as well as garbage-collected ones, but I guess the RISC-V > people are still focused on other things and it will take some time to > start work on that extension. 
> > > I also find your suggestion about far jumps very interesting, but I'm > afraid it would be very unpopular among hardware designers because it > messes with their highly appreciated and scarce L1 cache. Funnily > enough, though, before starting this project I had the impression that some kind of > simple mechanism for complex jumping would be a good idea. I will keep this > in mind when looking for patterns in the assembly code. > > > Once more, thank you so much for your work and the help, and I hope I can > soon deliver information that you all find interesting. > > > Have a very nice weekend! > > Cheers, > > Dani. > > > On 17/4/25 9:19, Sven Tennie wrote: > > Hey Daniel 👋 > > That's really an interesting topic, because we have never analyzed the emitted > RISC-V assembly with statistical measures. > > So, if I may ask for a favour: if you spot anything that could be better > expressed with the current ISA, please open a ticket and label it as > RISC-V: https://gitlab.haskell.org/ghc/ghc/-/issues > (We haven't decided which RISC-V profile to require; requiring the > very latest extensions would frustrate people with older hardware... > However, it's in any case good to have possible improvements documented in > tickets.) > > I'm wondering if you really have to go through QEMU. Or, if feeding > assembly code to a parser and then doing the math on that wouldn't be > sufficient? (Of course, tracing the execution is more accurate. However, > it's much more complicated as well.) > > To attribute assembly instructions to Cmm statements, you may use the GHC > flags -ddump-cmm and -dppr-debug (and, to stream the output into files > instead of stdout, -ddump-to-file). This will add comments for most Cmm > statements to the dumped assembly code. > > At first, I thought that sign-extension / truncation might be a good > candidate. However, it turned out that this is already covered by the > RISC-V B-extension. 
Which led to this new ticket: > https://gitlab.haskell.org/ghc/ghc/-/issues/25966 > > Skimming over the NCG code and watching out for longer or repeating > instruction lists might be a good strategy for making educated guesses. > > From a developer's perspective, I found the immediate sizes (usually > 12 bit) rather limiting. E.g. the Note [RISCV64 far jumps] ( > https://gitlab.haskell.org/ghc/ghc/-/blob/395e0ad17c0d309637f079a05dbdc23e0d4188f6/compiler/GHC/CmmToAsm/RV64/CodeGen.hs?page=2#L1996) > tells the story of how we had to work around this limit for addresses in > conditional jumps. > > So, you could raise the question whether - analogous to compressed > instructions - it wouldn't make sense to have extended instructions that cover two words, > such that the first word is the instruction and the second its > immediate(s). (Hardware designers would probably hate that, because it > means a bigger change to the instruction decoding unit. However, I was > asked as a software developer ;) ) > > Other than that, I've unfortunately got no great ideas. > > Please feel free to keep us in the loop (especially regarding the results > of your analyses). And, if you've got any questions regarding the RISC-V > NCG, please feel free to reach out either here or directly to me. There's > also a #GHC "room" on Matrix where you can quickly drop smaller-scoped > questions. > > I hope that was of some help. Best regards, > > Sven > > Am Mi., 16. Apr. 2025 um 10:34 Uhr schrieb Matthew Pickering < > matthewtpickering at gmail.com>: > >> Hi Daniel. I think Sven Tennie and the LoongArch contributors are the >> experts in NCG for these kinds of instruction sets. I have cced them. >> >> Cheers, >> >> Matt >> >> On Tue, Apr 15, 2025 at 5:40 PM Daniel Trujillo Viedma < >> danihacker.viedma at gmail.com> wrote: >> >>> Hello, ghc-devs! 
My name is Daniel Trujillo, I'm a Haskell enthusiast >>> from Spain, and I'm writing my Master's thesis on accelerating >>> Haskell programs with a custom ISA extension. >>> >>> >>> Right now, my focus is on executing software written in Haskell within >>> QEMU in order to get traces that tell me, basically, how many times >>> each block (not exactly basic blocks, but sort of) of assembly code has >>> been executed, in the hope of finding some patterns of RISC-V >>> instructions that I could implement together as a single instruction. >>> >>> >>> As you can see, my method is a bit crude, and I was wondering if the >>> people involved with any of the different internal representations (STG, >>> Cmm...) and/or native code generators (particularly RISC-V) could provide >>> me with hints about assembly instructions that would have made the work >>> easier, by removing the need to "massage" the Cmm code to make CodeGen >>> easier, or the need for particular optimizations, or, in general, dirty >>> tricks due to lacking proper support in the standard RISC-V ISA. >>> >>> >>> And of course, I would also very much appreciate other hints from people >>> involved in general performance (as opposed to, for example, libraries >>> for SIMD and parallel execution, or Haskell wrappers for lower-level code >>> for performance reasons). >>> >>> >>> P.S. I'm sorry if I broke any netiquette rule, but I'm very new to the >>> mailing list and haven't yet received any email from it. >>> >>> >>> Looking forward to hearing from you! >>> >>> Cheers, >>> >>> Dani. >>> >>> _______________________________________________ >>> ghc-devs mailing list >>> ghc-devs at haskell.org >>> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >>> >> -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ben at well-typed.com Fri Apr 18 23:31:12 2025 From: ben at well-typed.com (Ben Gamari) Date: Fri, 18 Apr 2025 19:31:12 -0400 Subject: GHC 9.10.2-rc1 is now available Message-ID: <87sem5ujwi.fsf@smart-cactus.org> The GHC developers are very pleased to announce the availability of the release candidate for GHC 9.10.2. Binary distributions, source distributions, and documentation are available at [downloads.haskell.org][] and via [GHCup](https://www.haskell.org/ghcup/). GHC 9.10.2 is a bug-fix release fixing over 50 issues of a variety of severities and scopes. A full accounting of these fixes can be found in the [release notes][]. As always, GHC's release status, including planned future releases, can be found on the GHC Wiki's [status][] page. This release candidate will have a two-week testing period. If all goes well, the final release will be available the week of 1 May 2025. We would like to thank Well-Typed, Tweag I/O, Juspay, QBayLogic, Channable, Serokell, SimSpace, the Haskell Foundation, and other anonymous contributors whose ongoing financial and in-kind support has facilitated GHC maintenance and release management over the years. Finally, this release would not have been possible without the hundreds of open-source contributors whose work comprises this release. As always, do give this release a try and open a [ticket][] if you see anything amiss. Cheers, - Ben [release notes]: https://gitlab.haskell.org/ghc/ghc/-/blob/ghc-9.10/docs/users_guide/9.10.2-notes.rst?ref_type=heads&plain=1 [status]: https://gitlab.haskell.org/ghc/ghc/-/wikis/GHC-status [downloads.haskell.org]: https://downloads.haskell.org/ghc/9.10.2-rc1 [ticket]: https://gitlab.haskell.org/ghc/ghc/-/issues/new -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 255 bytes Desc: not available URL: From danihacker.viedma at gmail.com Sat Apr 19 19:23:52 2025 From: danihacker.viedma at gmail.com (Daniel Trujillo Viedma) Date: Sat, 19 Apr 2025 21:23:52 +0200 Subject: Questions regarding the RISCV native codegen & performance In-Reply-To: References: <896e1326-02cf-415c-bdab-7c5150911a1c@gmail.com> <22850366-f63a-4158-a2b4-9d01e78173b6@gmail.com> Message-ID: Thank you so much, again! I can't overstate how useful all these recommendations are, and I'm looking forward to seeing how these play out in simulations. So I guess it's time for me to work! Sincerely yours, Dani. On 18/4/25 14:41, Sven Tennie wrote: > Hey Daniel 👋 > > Thanks a lot for your kind words. > > The AArch64 ISA might also be a source of inspiration. AArch64 has > some combined instructions which RISC-V lacks, e.g. ADD of two > registers with a built-in shift. Though, we don't seem to use many of > them, and I haven't found any usage that wouldn't be well covered in > the RISC-V NCG. Probably that's because MachOp > (https://hackage.haskell.org/package/ghc-9.12.1/docs/GHC-Cmm-MachOp.html#t:MachOp) > is pretty fine-grained. > > A good candidate for investigation could be the CSET > pseudo-instruction. I stumbled over it while looking for pseudo-ops > that lead to combined instructions in the AArch64 NCG. The CSET pseudo-op > leads to two instructions in the RISC-V NCG and to one in the AArch64 NCG: > - > https://gitlab.haskell.org/ghc/ghc/-/blob/386f18548e3c66d04f648a9d34f167a086c1328b/compiler/GHC/CmmToAsm/AArch64/Ppr.hs#L443 > - > https://gitlab.haskell.org/ghc/ghc/-/blob/386f18548e3c66d04f648a9d34f167a086c1328b/compiler/GHC/CmmToAsm/RV64/Ppr.hs#L554 > > Though, this might be a sub-optimal implementation (in this case we'd > be happy to get a ticket ;) ). As CSET is used for comparisons, it > should appear pretty frequently. 
> > A bit off-topic, but for the sake of completeness: The Compiler > Explorer seems to use DWARF symbols to map assembly instructions to > Haskell code lines. At least, it compiles with -g. > > VELDT's profile is stated as "RV32I (no FENCE, ECALL, EBREAK)" on > their Github page. But, we target RV64G with both, the NCG and LLVM > backends. (The main reason to not support simpler profiles is that all > hardware on the market that is powerful enough to reasonably run > Haskell supports at least RV64G.) > > Thanks for the hint about the J-extension. I will take a look at it. > > Enjoy your weekend & best regards, > > Sven > > Am Fr., 18. Apr. 2025 um 12:13 Uhr schrieb Daniel Trujillo Viedma > : > > Thank you so much for all the information and the help. > > > Seriously, this is much more than I was hoping to get, even the > suggestion for generating commented assembly code (which is, I > assume, the method that Compiler Explorer uses to relate the > high-level Haskell code with the assembly output of the compiler, > which is really nice). And four your RISC-V NGC, which I found > easy to understand. > > > I guess this is the kind of professionals that Haskell attracts, > which is a big part of why I love it. > > > I will send here an executive summary of my findings, including > statistics about a couple of programs that I try. I don't know if > I'll be able to do a very statistical significant analysis, > because I'll still have a lot of things to do (to extend QEMU, > maybe also gem5, and implement it in a Clash microprocessor > design, probably VELDT), but maybe in the future I can automate > more of it and running a more comprehensive analysis. 
FYI, I have > found that the RISC-V specs mention Haskell among other languages > in a still empty J extension section, which will be aimed at > helping dynamically translated languages as well as > garbage-collected, but I guess RISC-V people is still more focused > on other things and it will take some time to start work on that > extension. > > > I find also very interesting your suggestion for far-jumping, but > I'm afraid that will be very unpopular among hardware designers > because it messes with their highly appreciated and scarce L1 > cache. But funnily enough, I had the impresion before starting > this project that some kind of simple mechanism for complex > jumping would be a good idea. I will keep this in mind when > looking for patterns in the assembly code. > > > Once more, thank you so much for your work and the help, and I > hope I can deliver soon information that you all could find > interesting. > > > Have a very nice weekend! > > Cheers, > > Dani. > > > On 17/4/25 9:19, Sven Tennie wrote: >> Hey Daniel 👋 >> >> That's really an interesting topic, because we never analyzed the >> emitted RISC-V assembly with statistical measures. >> >> So, if I may ask for a favour: If you spot anything that could be >> better expressed with the current ISA, please open a ticket and >> label it as RISC-V: https://gitlab.haskell.org/ghc/ghc/-/issues >> (We haven't decided which RISC-V profile to require. I.e. >> requiring the very latest extensions would frustrate people with >> older hardware... However, it's in anycase good to have possible >> improvements documented in tickets.) >> >> I'm wondering if you really have to go through QEMU. Or, if >> feeding assembly code to a parser and then doing the math on that >> wouldn't be sufficient? (Of course, tracing the execution is more >> accurate. However, it's much more complicated as well.) 
>> >> To account Assembly instructions to Cmm statements you may use >> the GHC parameters -ddump-cmm and -dppr-debug (and to stream this >> into files instead of stdout -ddump-to-file.) This will add >> comments for most Cmm statements into the dumped assembly code. >> >> At first, I thought that sign-extension / truncation might be a >> good candidate. However, it turned out that this is already >> covered by the RISC-V B-extension. Which led to this new ticket: >> https://gitlab.haskell.org/ghc/ghc/-/issues/25966 >> >> Skimming over the NCG code and watching out for longer or >> repeating instruction lists might be a good strategy to make >> educated guesses. >> >> From a developer's perspective, I found the immediate sizes >> (usually 12bit) rather limiting. E.g. the Note [RISCV64 far >> jumps] >> (https://gitlab.haskell.org/ghc/ghc/-/blob/395e0ad17c0d309637f079a05dbdc23e0d4188f6/compiler/GHC/CmmToAsm/RV64/CodeGen.hs?page=2#L1996) >> tells a story how we had to work around this limit for addresses >> in conditional jumps. >> >> So, you could raise the question if - analog to compressed >> expressions - it wouldn't make sense to have extended expressions >> that cover two words. Such that the first word is the instruction >> and the second it's immediate(s). (Hardware designers would >> probably hate that, because it means a bigger change to the >> instruction decoding unit. However, I got asked as a software >> developer ;) ) >> >> Other than that, I've unfortunately got no great ideas. >> >> Please feel free to keep us in the loop (especially regarding the >> results of your analyses.) And, if you've got any questions >> regarding the RISC-V NCG, please feel free to reach out either >> here or directly to me. There's also a #GHC "room" on Matrix >> where you can quickly drop smaller scoped questions. >> >> I hope that was of any help. Best regards, >> >> Sven >> >> Am Mi., 16. Apr. 2025 um 10:34 Uhr schrieb Matthew Pickering >> : >> >> Hi Daniel. 
I think Sven Tennie and the LoongArch contributors >> are the experts in NCG for these kinds of instruction sets. I >> have cced them. >> >> Cheers, >> >> Matt >> >> On Tue, Apr 15, 2025 at 5:40 PM Daniel Trujillo Viedma >> wrote: >> >> Hello, ghc-devs! My name is Daniel Trujillo, I'm a >> Haskell enthusiast >> from Spain and I'm trying to make my Master's thesis >> about accelerating >> Haskell programs with a custom ISA extension. >> >> >> Right now, my focus is in executing software written in >> Haskell within >> QEMU in order to get traces that tells me, basically, how >> many times >> each block (not exactly basic blocks, but sort of) of >> assembly code has >> been executed, with the hope of finding some patterns of >> RISCV >> instructions that I could implement together into 1 >> instruction. >> >> >> As you can see, my method is a bit crude, and I was >> wondering if the >> people involved with any of the different internal >> representations (STG, >> Cmm...) and/or native code generators (particularly >> RISCV) could provide >> me hints about assembly instructions that would have made >> the work >> easier, by removing the need of "massaging" the Cmm code >> to make CodeGen >> easier, or the need of particular optimizations, or in >> general, dirty >> tricks because of lacking of proper support of the >> standard RISCV ISA. >> >> >> And of course, I would also appreciate very much other >> hints from people >> involved in general performance (as oppossed to, for >> example, libraries >> for SIMD and parallel execution, or Haskell wrappers to >> lower-level code >> for performance reasons). >> >> >> P.D. I'm sorry if I broke any netiquette rule, but I'm >> very new to the >> email list, and haven't received yet any email from it. >> >> >> Looking forward to hear from you! >> >> Cheers, >> >> Dani. 
>> >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From simon.peytonjones at gmail.com Mon Apr 28 16:18:20 2025 From: simon.peytonjones at gmail.com (Simon Peyton Jones) Date: Mon, 28 Apr 2025 17:18:20 +0100 Subject: Marge Message-ID: Colleagues Marge has failed to pick up !14242 for nearly 24 hrs. Is something stuck? Thanks Simon -------------- next part -------------- An HTML attachment was scrubbed... URL: From allbery.b at gmail.com Mon Apr 28 16:24:39 2025 From: allbery.b at gmail.com (Brandon Allbery) Date: Mon, 28 Apr 2025 12:24:39 -0400 Subject: Marge In-Reply-To: References: Message-ID: The dashboard ( https://grafana.gitlab.haskell.org/d/iiCppweMz/marge-bot?orgId=2&from=now-24h&to=now&timezone=browser&refresh=30m) says it's the only mergeable MR, and Marge is configured to require at least two MRs. On Mon, Apr 28, 2025 at 12:18 PM Simon Peyton Jones < simon.peytonjones at gmail.com> wrote: > Colleagues > > Marge has failed to pick up !14242 for nearly 24 hrs. Is something stuck? > > > Thanks > > Simon > _______________________________________________ > ghc-devs mailing list > ghc-devs at haskell.org > http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs > -- brandon s allbery kf8nh allbery.b at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From simon.peytonjones at gmail.com Mon Apr 28 20:02:47 2025 From: simon.peytonjones at gmail.com (Simon Peyton Jones) Date: Mon, 28 Apr 2025 21:02:47 +0100 Subject: Marge In-Reply-To: References: Message-ID: > Marge is configured to require at least two MRs. Interesting, thank you. I didn't know that. Maybe after a while, if there is only one, it should do it anyway. 
Simon On Mon, 28 Apr 2025 at 17:24, Brandon Allbery wrote: > The dashboard ( > https://grafana.gitlab.haskell.org/d/iiCppweMz/marge-bot?orgId=2&from=now-24h&to=now&timezone=browser&refresh=30m) > says it's the only mergeable MR, and Marge is configured to require at > least two MRs. > > On Mon, Apr 28, 2025 at 12:18 PM Simon Peyton Jones < > simon.peytonjones at gmail.com> wrote: > >> Colleagues >> >> Marge has failed to pick up !14242 for nearly 24 hrs. Is something stuck? >> >> >> Thanks >> >> Simon >> _______________________________________________ >> ghc-devs mailing list >> ghc-devs at haskell.org >> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >> > > > -- > brandon s allbery kf8nh > allbery.b at gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From allbery.b at gmail.com Mon Apr 28 20:04:20 2025 From: allbery.b at gmail.com (Brandon Allbery) Date: Mon, 28 Apr 2025 16:04:20 -0400 Subject: Marge In-Reply-To: References: Message-ID: IIRC it's been discussed but that part of the bot is hard to change. On Mon, Apr 28, 2025 at 4:03 PM Simon Peyton Jones < simon.peytonjones at gmail.com> wrote: > > Marge is configured to require at least two MRs. > > Interesting, thank you. I didn't know that. Maybe after a while, if > there is only one, it should do it anyway. > > Simon > > On Mon, 28 Apr 2025 at 17:24, Brandon Allbery wrote: > >> The dashboard ( >> https://grafana.gitlab.haskell.org/d/iiCppweMz/marge-bot?orgId=2&from=now-24h&to=now&timezone=browser&refresh=30m) >> says it's the only mergeable MR, and Marge is configured to require at >> least two MRs. >> >> On Mon, Apr 28, 2025 at 12:18 PM Simon Peyton Jones < >> simon.peytonjones at gmail.com> wrote: >> >>> Colleagues >>> >>> Marge has failed to pick up !14242 for nearly 24 hrs. Is something >>> stuck? 
>>> >>> >>> Thanks >>> >>> Simon >>> _______________________________________________ >>> ghc-devs mailing list >>> ghc-devs at haskell.org >>> http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs >>> >> >> >> -- >> brandon s allbery kf8nh >> allbery.b at gmail.com >> > -- brandon s allbery kf8nh allbery.b at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: