This PR introduces a couple of changes to improve cluster test stability:
1. Increase the cluster node timeout to 3 seconds, similar to the normal cluster tests,
but introduce a new mechanism to increase the ping rate so that the tests are still fast.
The new setting is exposed as a debug config.
2. Set `cluster-replica-no-failover yes` on a wider array of tests that are sensitive
to failovers. Spurious failovers were occurring on the ARM CI.
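For example, a test that is sensitive to failovers might pin both settings through the usual server overrides (a sketch only; the tags and exact override list here are illustrative, not the actual diff):
```
# Illustrative only: start a cluster-enabled server with a 3 second node
# timeout and failovers disabled on replicas.
start_server {tags {"cluster"} overrides {
    cluster-enabled yes
    cluster-node-timeout 3000
    cluster-replica-no-failover yes
}} {
    # ... test body ...
}
```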
There is a race condition in the test:
```
*** [err]: redis-cli --cluster add-node with cluster-port in tests/unit/cluster/cli.tcl
Expected '5' to be equal to '4' {assert_equal 5 [CI 0 cluster_known_nodes]} proc ::test)
```
When using the CLI to add a node, there can be a race condition in which all nodes
report the cluster state as ok even though the added node has not yet met all the
other cluster nodes.
This comment and the fix were taken from #11221, and the same fix is also applied in
several other similar places.
This adds a utility function to wait until all cluster nodes see the same cluster size.
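A minimal sketch of what such a helper can look like, assuming the test suite's `wait_for_condition`, `CI` (cluster info field) and `fail` helpers; the proc name and arguments are illustrative rather than the actual implementation:
```
# Illustrative helper: block until every node reports the expected number
# of known nodes, or fail the test after roughly 50 seconds per node.
proc wait_for_cluster_size {num_nodes expected_size} {
    for {set j 0} {$j < $num_nodes} {incr j} {
        wait_for_condition 1000 50 {
            [CI $j cluster_known_nodes] == $expected_size
        } else {
            fail "Cluster node $j never saw $expected_size known nodes"
        }
    }
}
```
In the cli test above, calling such a helper right after `--cluster add-node` would remove the race, since the assertion on `cluster_known_nodes` then runs only after every node has met the new node.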
Bugfix:
When a slot is force-assigned to another master, the old master loses ownership of
the slot and calls delKeysInSlot() to delete all keys in that slot. These delete
operations should be replicated to the replicas, to avoid data divergence between
the master and its replicas.
Additionally, in this case, we now call:
* signalModifiedKey (to invalidate WATCH)
* moduleNotifyKeyspaceEvent (keyspace notification for modules)
* dirty++ (to signal that the persistence file may be outdated)
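A test-level illustration of the intended behavior (a sketch only: it assumes the slot has already been force-assigned away from node 0, that node 3 is node 0's replica, and that `R`, `wait_for_condition` and `fail` are the usual test-suite helpers):
```
# After the slot is force-assigned away from node 0, the keys it deleted
# via delKeysInSlot() should also disappear on its replica, because the
# deletions are now replicated.
set slot [R 0 cluster keyslot "somekey"]
wait_for_condition 100 50 {
    [R 0 cluster countkeysinslot $slot] == 0 &&
    [R 3 cluster countkeysinslot $slot] == 0
} else {
    fail "keys in slot $slot were not removed on both master and replica"
}
```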
Co-authored-by: weimeng <weimeng@didiglobal.com>
Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>
Redis 7.0 has #9890, which added an assertion that fires when we reach beforeSleep
with an unflushed propagation queue.
But it turns out that when processCommand calls getNodeByQuery and decides to reject
the command, a key can be lazy-expired and deleted without the propagation queue
being flushed afterwards.
This change prevents lazy expiry from deleting the key at this stage (i.e. when it is
not part of a command being processed in `call`).
Gossip the cluster node blacklist in ping and pong messages.
This means that CLUSTER FORGET doesn't need to be sent to all nodes in a cluster.
It can be sent to one or more nodes and is then propagated to the rest of them.
For each blacklisted node, its node id and its remaining blacklist TTL are gossiped in a
cluster bus ping extension (introduced in #9530).
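In practice this means the command only needs to be issued once. A sketch of how a test might verify the propagation, assuming a 3-node cluster where node 2 is an empty node being removed, and the usual `R`, `CI` and `wait_for_condition` helpers:
```
# Forget the node on a single node; gossip should spread the blacklist
# entry so the remaining node forgets it as well.
set node_id [R 2 cluster myid]
R 0 cluster forget $node_id
wait_for_condition 100 100 {
    [CI 0 cluster_known_nodes] == 2 &&
    [CI 1 cluster_known_nodes] == 2
} else {
    fail "CLUSTER FORGET was not propagated to all nodes"
}
```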
A timing issue like this was reported in the FreeBSD daily CI:
```
*** [err]: Sanity test push cmd after resharding in tests/unit/cluster/cli.tcl
Expected 'CLUSTERDOWN The cluster is down' to match '*MOVED*'
```
We additionally wait, in a wait_for_condition, for each node to reach a consensus on
the cluster state, to avoid the CLUSTERDOWN error.
The fix is just like #10495; quoting madolson's comment:
Cluster check just verifies that the config state is self-consistent; waiting for
cluster_state to be okay is an independent check that all the nodes actually believe
each other are healthy.
At the same time I noticed that unit/moduleapi/cluster.tcl has the exact same test,
which may have the same problem, so it was modified as well.
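A sketch of that kind of wait, assuming a 3-node setup and the `CI`/`wait_for_condition` helpers from the test suite (node count and retry counts are illustrative):
```
# Don't start issuing commands against the resharded slots until every
# node reports cluster_state:ok; cluster check alone only validates that
# the config is self-consistent.
wait_for_condition 1000 50 {
    [CI 0 cluster_state] eq "ok" &&
    [CI 1 cluster_state] eq "ok" &&
    [CI 2 cluster_state] eq "ok"
} else {
    fail "Cluster doesn't stabilize"
}
```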
In the newly added cluster hostnames test, the primary fails over during the reboot
under valgrind, so we end up validating the wrong node. This change simply configures
the replica not to take over, which seems to fix the test.
We could also have raised the timeout, but that slows down the test.