
metal: bug-fix when enable ggml-alloc #2757


Merged: 2 commits merged into ggml-org:master from metal-concur-alloc-fix on Aug 24, 2023

Conversation

lshzh-ww (Contributor)

The first commit improves memory management when ggml-alloc is enabled: tensors should only be freed at memory barriers. Otherwise, memory released by one tensor might be immediately reused by another tensor that runs concurrently with it.

The second commit fixes a silent return in the allocate_node() function.

This PR also removes workarounds needed by Falcon.
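
For illustration, here is a minimal, self-contained C sketch of the first point. It is not the actual ggml-alloc/Metal code; the op structure and helper names are invented for this example. It only shows why a free has to be deferred to the next memory barrier when the ops in between may execute concurrently.

/*
 * Sketch only: with concurrent dispatch, the ops between two memory barriers
 * may run in parallel, so a tensor freed as soon as its last consumer is
 * *dispatched* could be handed to a sibling op that is still reading it.
 * Deferring the free until the next barrier avoids the race.
 */
#include <stdbool.h>
#include <stdio.h>

struct op {
    const char * name;
    bool         is_barrier;    /* all previously dispatched ops have completed here */
    int          frees_tensor;  /* id of a tensor whose last use is this op, or -1   */
};

static void run_graph(const struct op * ops, int n_ops) {
    int pending[16];
    int n_pending = 0;

    for (int i = 0; i < n_ops; i++) {
        printf("dispatch %s\n", ops[i].name);

        if (ops[i].frees_tensor >= 0) {
            /* do NOT free yet: a concurrently running op might still read this memory */
            pending[n_pending++] = ops[i].frees_tensor;
        }

        if (ops[i].is_barrier || i == n_ops - 1) {
            /* safe point: everything dispatched so far has finished */
            for (int j = 0; j < n_pending; j++) {
                printf("  free tensor %d (at barrier)\n", pending[j]);
            }
            n_pending = 0;
        }
    }
}

int main(void) {
    const struct op ops[] = {
        { "mul_mat_q", false, 0 },  /* may run concurrently with mul_mat_k */
        { "mul_mat_k", false, 1 },
        { "add",       true,  2 },  /* barrier before the next stage       */
    };
    run_graph(ops, 3);
    return 0;
}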

The ggml-alloc should only free tensors at memory barriers.
lshzh-ww requested a review from slaren on August 24, 2023 at 05:32
In certain cases, the allocate_node() function may silently return
without performing any memory allocation.
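
As an illustration of the second fix, the sketch below shows the general bug pattern of a silent early return. It is not the real allocate_node() from ggml-alloc.c; the struct and helper names are invented, and only the shape of the problem (an early return that skips the fallback allocation, leaving data == NULL) is meant to match.

/*
 * Sketch only: the buggy version effectively returned on the "cannot reuse a
 * parent in place" path instead of falling through to a normal allocation.
 */
#include <stddef.h>

struct node {
    void * data;             /* NULL until allocated                */
    int    can_reuse_parent; /* non-zero if the op may run in place */
    void * parent_data;      /* buffer of the parent it could reuse */
};

/* stand-in for the allocator's pool */
static void * pool_alloc(size_t size) {
    static char   pool[1 << 10];
    static size_t offset = 0;
    void * ptr = pool + offset;
    offset += size;
    return ptr;
}

static void allocate_node(struct node * n) {
    if (n->data != NULL) {
        return;                    /* already allocated */
    }
    if (n->can_reuse_parent && n->parent_data != NULL) {
        n->data = n->parent_data;  /* run in place on the parent's buffer */
        return;
    }
    /* the fix makes sure we always reach a real allocation here
       instead of returning with data still NULL */
    n->data = pool_alloc(64);
}

int main(void) {
    struct node n = { NULL, 0, NULL };
    allocate_node(&n);
    return n.data != NULL ? 0 : 1;  /* succeeds only if allocation happened */
}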
lshzh-ww force-pushed the metal-concur-alloc-fix branch from 8496a24 to 0c268a8 on August 24, 2023 at 05:35
lshzh-ww (Contributor, Author)

@ggerganov
Not sure if this will also fix the "hang" behavior reported in #2678 yesterday.

slaren (Member) left a comment


The changes to ggml-alloc look fine.

ggerganov merged commit 38b16df into ggml-org:master on Aug 24, 2023
colinc commented on Aug 25, 2023

This causes a segfault in ggml-alloc.c at line 508 with 70B-parameter models.

It happens right after:
exec: RESHAPE (tmpq (reshaped)) <= tmpq

lshzh-ww (Contributor, Author)

@colinc Fixed with PR #2776.

By the way, thank you for providing that information! It saved me a lot of time.

akawrykow pushed a commit to akawrykow/llama.cpp that referenced this pull request on Aug 29, 2023
* metal: better memory alloc w/ concurrency dispatch

The ggml-alloc should only free tensors at memory barriers.

* ggml-alloc: avoid return silently

In certain cases, the allocate_node() function may silently return
without performing any memory allocation.