
Conversation

@irbull (Contributor) commented Feb 10, 2024

There appears to be a known memory leak when using `MTLCommandBuffer`. Using `@autoreleasepool` is suggested in [1, 2].

[1] https://developer.apple.com/forums/thread/662721
[2] https://forums.developer.apple.com/forums/thread/120931

This change-set wraps `ggml_metal_graph_compute` in an `@autoreleasepool` block.

This commit addresses #5436
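
As a sketch of the shape of the fix (illustrative code, not the actual ggml-metal.m diff; the function name and signature here are simplified stand-ins):

```objc
#import <Metal/Metal.h>

// Simplified sketch: the real function takes ggml context/graph arguments.
void ggml_metal_graph_compute_sketch(id<MTLCommandQueue> queue) {
    @autoreleasepool {
        // commandBuffer returns an autoreleased object; the enclosing
        // pool guarantees it is released when each call returns instead
        // of accumulating in an outer (possibly never-drained) pool.
        id<MTLCommandBuffer> command_buffer = [queue commandBuffer];

        // ... encode the graph's kernels here ...

        [command_buffer commit];
        [command_buffer waitUntilCompleted];
    }
}
```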

@irbull (Contributor, Author) commented Feb 10, 2024

It seems the `@autoreleasepool` was removed recently [1], but my testing on an M1 indicates it is still needed, as do the referenced links.

[1] #5007

@irbull (Contributor, Author) commented Feb 10, 2024

To test this, I set up a test system that ran the same inference repeatedly over the course of a few hours (several thousand inference requests). The memory use of llama.cpp grew from 250 MB to over 1.5 GB. With this change-set applied, memory remained constant at 250 MB on my system (as measured with top).
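
A rough sketch of that kind of leak harness, assuming a hypothetical `run_inference()` stand-in for one llama.cpp request, and using Mach task info to report resident memory:

```objc
#include <mach/mach.h>
#include <stdio.h>

// Hypothetical stand-in for one inference request against llama.cpp.
static void run_inference(void) { /* ... */ }

// Resident set size of the current process, in bytes.
static size_t resident_bytes(void) {
    struct mach_task_basic_info info;
    mach_msg_type_number_t count = MACH_TASK_BASIC_INFO_COUNT;
    if (task_info(mach_task_self(), MACH_TASK_BASIC_INFO,
                  (task_info_t)&info, &count) != KERN_SUCCESS) {
        return 0;
    }
    return (size_t)info.resident_size;
}

int main(void) {
    for (int i = 0; i < 10000; i++) {
        run_inference();
        if (i % 1000 == 0) {
            printf("iter %d: %.1f MB resident\n", i, resident_bytes() / 1e6);
        }
    }
    return 0;
}
```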

@ggerganov (Member) left a comment:

Thanks for looking into this

@ggerganov merged commit f026f81 into ggml-org:master on Feb 10, 2024
@ptsochantaris (Contributor) commented:

Nice catch - sorry for not noticing this when I made the original PR. I tested for memory leaks after the change and didn't find any, BUT that was in a UI app. If the app doesn't have a main runloop (i.e. a CLI or other non-UI app) and no global autorelease pool is set up, or the code runs in a loop that never ends like in your test, the autoreleased command buffers will stay around waiting for a drain that will never come. That was a bit short-sighted of me, sorry! Very glad you spotted and fixed that issue.
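
For illustration, here is a minimal sketch of that drain behaviour (the `NSString` is just a stand-in for an autoreleased Metal object such as a command buffer):

```objc
#import <Foundation/Foundation.h>

int main(void) {
    // In a CLI tool there is no main runloop, so nothing ever drains the
    // thread's implicit autorelease pool between iterations.
    for (int i = 0; i < 1000000; i++) {
        @autoreleasepool {
            // stringWithFormat: returns an autoreleased object, just as
            // -[MTLCommandQueue commandBuffer] does under MRC.
            NSString *s = [NSString stringWithFormat:@"request %d", i];
            (void)s;
        } // the pool drains here, once per iteration; without it, every
          // object created above would accumulate until the process exits
    }
    return 0;
}
```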

BTW - I've tried making versions of this file that use ARC so I could propose discarding all the memory management from 2010 :trollface: but unfortunately ARC does not play well with struct fields that point to created objects (Clang in theory supports __strong references in structs, but somehow I never got it to work properly :)). The local structs could be replaced with ObjC objects, but I suspect that would hurt performance. Ideally this should all eventually become Swift, but that's a discussion for a totally different time and place :-P
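
For reference, the ARC limitation mentioned above looks roughly like this — a hypothetical sketch, not project code, and whether it compiles cleanly depends on the Clang version, since ARC-qualified object pointers in plain C structs are only accepted by newer compilers:

```objc
// Compile with -fobjc-arc. A hypothetical kernel struct in the style of
// ggml-metal.m; under ARC the object-pointer field must be qualified.
#import <Metal/Metal.h>

struct metal_kernel_sketch {
    __strong id<MTLComputePipelineState> pipeline; // ARC-managed field
};
```

Newer Clang releases treat such structs as "non-trivial" C structs and generate the retain/release traffic automatically, which is part of why mixing them with performance-sensitive plain-C code gets awkward.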

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024