Make FFI callbacks thread safe #12823

smx-smx · 2023-11-29T00:34:25Z

Makes FFI callbacks work, regardless of the thread they are invoked from
Fixes #9214

$REVIEW, before merging: is there any additional handling that needs to be done for ZTS?

… one add zend_ffi_wait_request_barrier helper function add callback_in_progress flag

bwoebi · 2023-11-29T00:49:39Z

I'd like to see a simple test for this, passing the callback to pthread_create() (called via FFI) and having code executed on that other thread, possibly with a small sleep in the callback, showing that it's indeed blocking the main thread.

smx-smx · 2023-11-29T00:50:35Z

Hmm an issue i just noticed is that, if i want to use platform-independent mutexes, i cannot use tsrm_mutex_lock (because it's available only #if ZTS).
Is there an alternative?

dstogov · 2023-11-29T08:01:07Z

Few existing FFI tests are failed.

smx-smx · 2023-11-29T18:44:39Z

I'd like to see a simple test for this, passing the callback to pthread_create() (called via FFI) and having code executed on that other thread, possibly with a small sleep in the callback, showing that it's indeed blocking the main thread.

The way I implemented this, it's the other way around.
The callback thread posts an interrupt request to the main thread and then goes to sleep, waiting for the callback to be serviced within the interrupt handler (within the main thread).
This means that putting a long sleep after the threads creation, such as sleep(1), will delay the execution of the callback (until the sleep ends).

Is it preferrable to have the callback serviced by the requesting thread? I guess we could do it by having the interrupt handler stall, signal the FFI thread, and wait until it finishes... but is there an advantage in doing this?

add sync barrier on gshutdown too

dstogov

I don't think this should be merged.

Transferring execution of all callbacks to main thread can't be safe and will definitely lead to more problems, deadlocks, Thread Local Storage mess, global context pollution, etc...

Often this just doesn't make any sense by design. E.g. callback_threads.phpt starts threads, but executes their code in context of main thread.

It's better to just disable calls to FFI callbacks in context of non-main threads.

smx-smx · 2023-11-30T12:01:22Z

The callback_threads.phpt test case is a non typical example, just to simulate the scenario.
In real world cases, you might have a callback that is triggered by numerous (native) worker threads.
This is comparable to a Windows Forms application, where a thread wants to interact with the GUI and needs to do so from the UI Thread.
Here, instead of having a UI thread, we have the PHP interpreter thread and other threads that trigger the callback.
Sometimes you just cannot avoid this due to the design of the system where PHP is used. You might get concurrent events, triggering the issue.
The idea is that, instead of crashing, we can just queue the invocation events and execute them one at a time.
This PR doesn't implement such FIFO scheduling yet. It manages concurrent invocations by stalling the other worker threads until the current callback has been executed.

The alternative would be to manage this in the native code, but it would require wrapping all PHP callbacks in a native trampoline to manage the mutex locking/unlocking. This requires an extra (native) layers for the PHP code that wants to declare a callback and would make the callers more complicated.

In essence, I am not aiming to enable multiple PHP threads.
The goal is to acquire the existing PHP thread/context from other threads and synchronize concurrent accesses.

What I could do is reverse the flow, to have the callback thread run the callback instead.

smx-smx · 2023-11-30T23:39:23Z

I modified the logic so that the interrupt handler now notifies the thread that invoked the callback, and goes to sleep.
the thread executes the callback and then unlocks the interrupt handler

~~EDIT: i forgot to mention that i had to disable ZEND_CHECK_STACK_LIMIT, due to the stack tracking code assuming it's always within the same thread. I'll look into fixing this too~~
It's now fixed

this was caused by the use of 2 separate mutexes

dstogov · 2023-12-01T12:31:06Z

ext/ffi/php_ffi.h

+#include <ffi.h>
+#include <pthread.h>


Windows build is broken. There are no <pthread.h> there.

dstogov · 2023-12-01T12:37:27Z

ext/ffi/ffi.c

+static void zend_ffi_interrupt_function(zend_execute_data *execute_data){ /* {{{ */
+	pthread_mutex_lock(&FFI_G(vm_request_lock));


You should check if this is FFI related interrupt. This may be a POSIX signal or something else...

dstogov · 2023-12-01T12:50:34Z

ext/ffi/ffi.c

+static void zend_ffi_callback_trampoline(ffi_cif* cif, void* ret, void** args, void* data) /* {{{ */
+{
+	// wait for a previously initiated request to complete
+	zend_ffi_wait_request_barrier(false);


This may a regular in-main-thread callback, that doesn't require any locks.
You should check FFI_G(callback_tid) == FFI_G(main_tid) first.

Ideally, we should make distinct between regular and "thread" callback

$libc->pthread_create( FFI::addr($tid), NULL, FFI::thread_callback($thread_func), FFI::addr($arg) );

dstogov

This looks like a cooperative scheduler implemented on top of VM interrupts + pthread synchronisation for serialization of execution of callbacks in a single thread.

Is it possible to implement these ideas separately from FFI, provide some API, and extend FFI with new functionality using that API? This way we should achieve a better designed solution. I the current state, this looks like a hack that misses many things (e.g. exceptions and errors).

May be you should wrap "serialized" callbacks with fibers and then schedule fibers?

fixes parse errors in clangd

- skip locking in main thread

jcupitt · 2024-02-15T12:29:57Z

Hello, I think we've just been bitten by this (spurious stack overflow exceptions off the main thread): libvips/php-vips#237 . As a workaround, it looks like we're going to have to ask users to disable all stack overflow checks.

How about just disabling this check for execution off the main thread? I imagine it's a rare case, so it would only cause a small drop in the usefulness of this test.

There are probably complications I'm unaware of, of course!

smx-smx added 3 commits November 29, 2023 01:23

ffi: thread safe callbacks (preliminary)

16be72b

ffi: make sure there are no in progress requests before posting a new…

1de9669

… one add zend_ffi_wait_request_barrier helper function add callback_in_progress flag

code style

56c24d4

smx-smx requested a review from dstogov as a code owner November 29, 2023 00:34

github-actions bot added the Extension: ffi label Nov 29, 2023

smx-smx added 3 commits November 29, 2023 19:15

ffi: trace the requester and main thread IDs

6423d3a

ffi: fix deadlock when the callback invocation is from the main thread

98eb079

ffi: add tests/callback_threads

d762cfe

smx-smx added 2 commits November 29, 2023 21:19

ffi: fix mutex unlock before zend_error_noreturn (fixes bug79177.phpt)

29d6550

ffi: remove wrongly placed restore of interrupt handler

ba483e6

add sync barrier on gshutdown too

dstogov requested changes Nov 30, 2023

View reviewed changes

smx-smx added 3 commits December 1, 2023 00:13

ffi: have callbacks be handled by the thread that invoked them

bce091d

ffi: fix bug79177 once again

083cc9e

ffi: cleanup

34353aa

smx-smx added 2 commits December 1, 2023 01:07

ffi: initialize stack info for the new thread

9164b16

ffi: fix vm_ack <-> vm_unlock deadlock

20dd9b1

this was caused by the use of 2 separate mutexes

dstogov reviewed Dec 1, 2023

View reviewed changes

dstogov requested changes Dec 1, 2023

View reviewed changes

smx-smx added 4 commits December 2, 2023 21:52

ffi: add missing includes for php_ffi.h

e20a564

fixes parse errors in clangd

enable TSRM mutex APIs outside of ZTS

af12a99

tsrm: add cond API (POSIX only for now)

261a1d3

zend_globals_macros.h: add missing include

2f71fa3

smx-smx added 2 commits December 3, 2023 04:21

ffi: first version using fibers for callbacks

391ca94

ffi: fix tests

e8b2b5f

- skip locking in main thread

smx-smx requested a review from iluuu1994 as a code owner December 3, 2023 04:18

github-actions bot added the Category: Engine label Dec 3, 2023

tsrm: implement win32 cond API

e2fce83

smx-smx force-pushed the ffi-ts-master branch from dc72df6 to e2fce83 Compare December 3, 2023 04:22

smx-smx marked this pull request as draft December 3, 2023 17:48

smx-smx force-pushed the ffi-ts-master branch 3 times, most recently from ca48818 to 40a6537 Compare December 3, 2023 20:19

fix build errors

dc5496f

smx-smx force-pushed the ffi-ts-master branch from 40a6537 to dc5496f Compare December 3, 2023 21:54

kleisauke mentioned this pull request Feb 14, 2024

Fix CI libvips/php-vips#237

Merged

Simon34545 mentioned this pull request Jan 24, 2025

Compatibility with WebOS 24 Simon34545/lginputhook#26

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make FFI callbacks thread safe #12823

Make FFI callbacks thread safe #12823

smx-smx commented Nov 29, 2023 •

edited

Loading

bwoebi commented Nov 29, 2023 •

edited

Loading

smx-smx commented Nov 29, 2023 •

edited

Loading

dstogov commented Nov 29, 2023

smx-smx commented Nov 29, 2023 •

edited

Loading

dstogov left a comment

smx-smx commented Nov 30, 2023 •

edited

Loading

smx-smx commented Nov 30, 2023 •

edited

Loading

dstogov Dec 1, 2023

dstogov Dec 1, 2023

dstogov Dec 1, 2023

dstogov left a comment

jcupitt commented Feb 15, 2024

		static void zend_ffi_interrupt_function(zend_execute_data execute_data){ / {{{ */
		pthread_mutex_lock(&FFI_G(vm_request_lock));

		#include <ffi.h>
		#include <pthread.h>

Make FFI callbacks thread safe #12823

Are you sure you want to change the base?

Make FFI callbacks thread safe #12823

Conversation

smx-smx commented Nov 29, 2023 • edited Loading

bwoebi commented Nov 29, 2023 • edited Loading

smx-smx commented Nov 29, 2023 • edited Loading

dstogov commented Nov 29, 2023

smx-smx commented Nov 29, 2023 • edited Loading

dstogov left a comment

Choose a reason for hiding this comment

smx-smx commented Nov 30, 2023 • edited Loading

smx-smx commented Nov 30, 2023 • edited Loading

dstogov Dec 1, 2023

Choose a reason for hiding this comment

dstogov Dec 1, 2023

Choose a reason for hiding this comment

dstogov Dec 1, 2023

Choose a reason for hiding this comment

dstogov left a comment

Choose a reason for hiding this comment

jcupitt commented Feb 15, 2024

smx-smx commented Nov 29, 2023 •

edited

Loading

bwoebi commented Nov 29, 2023 •

edited

Loading

smx-smx commented Nov 29, 2023 •

edited

Loading

smx-smx commented Nov 29, 2023 •

edited

Loading

smx-smx commented Nov 30, 2023 •

edited

Loading

smx-smx commented Nov 30, 2023 •

edited

Loading