redict/tests
Oran Agra 573246f73c
if diskless repl child is killed, make sure to reap the pid (#7742)
Starting redis 6.0 and the changes we made to the diskless master to be
suitable for TLS, I made the master avoid reaping (wait3) the pid of the
child until we know all replicas are done reading their rdb.

I did that in order to avoid a state where the rdb_child_pid is -1 but
we don't yet want to start another fork (still busy serving that data to
replicas).

It turns out that the solution used so far was problematic in case the
fork child was being killed (e.g. by the kernel OOM killer), in that
case there's a chance that we currently disabled the read event on the
rdb pipe, since we're waiting for a replica to become writable again.
and in that scenario the master would have never realized the child
exited, and the replica will remain hung too.
Note that there's no mechanism to detect a hung replica while it's in
rdb transfer state.

The solution here is to add another pipe which is used by the parent to
tell the child it is safe to exit. this mean that when the child exits,
for whatever reason, it is safe to reap it.

Besides that, i'm re-introducing an adjustment to REPLCONF ACK which was
part of #6271 (Accelerate diskless master connections) but was dropped
when that PR was rebased after the TLS fork/pipe changes (5a47794).
Now that RdbPipeCleanup no longer calls checkChildrenDone, and the ACK
has chance to detect that the child exited, it should be the one to call
it so that we don't have to wait for cron (server.hz) to do that.
2020-09-06 16:43:57 +03:00
..
assets Fix test "server is up" detection after logging changes. 2016-12-19 16:49:58 +01:00
cluster Fix tests/cluster/cluster.tcl about wrong usage of lrange. (#6702) 2020-08-04 18:00:58 +03:00
helpers revert an accidental test code change done as part of the tls project 2019-12-01 16:10:09 +02:00
integration if diskless repl child is killed, make sure to reap the pid (#7742) 2020-09-06 16:43:57 +03:00
modules Module API: fix missing RM_CLIENTINFO_FLAG_SSL. (#7666) 2020-08-17 17:46:54 +03:00
sentinel TLS: Configuration options. 2019-10-07 21:07:27 +03:00
support if diskless repl child is killed, make sure to reap the pid (#7742) 2020-09-06 16:43:57 +03:00
tmp minor fixes to the new test suite, html doc updated 2010-05-14 18:48:33 +02:00
unit if diskless repl child is killed, make sure to reap the pid (#7742) 2020-09-06 16:43:57 +03:00
instances.tcl Improve valgrind support for cluster tests (#7725) 2020-09-06 11:11:49 +03:00
test_helper.tcl test infra - add durable mode to work around test suite crashing 2020-09-06 09:59:19 +03:00