redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-23 00:28:26 -05:00

Author	SHA1	Message	Date
Binbin	871fa12fec	Sentinel: fix reconnect test timing issue (#10424 ) We need to wait for `sentinelTimer` to kick in, and then trigger the reconnect. As for another change, we should better call `server_set_password` before calling SENTINEL SET auth-pass. Fixes problem introeuced in #10400	2022-03-14 11:13:14 +02:00
Moti Cohen	a6bf509810	Sentinel: fix no reconnect after auth-pass is changed (#10400 ) When updating SENTINEL with master’s new password (command: `SENTINEL SET mymaster auth-pass some-new-password`), sentinel might still keep the old connection and avoid reconnecting with the new password. This is because of wrong logic that traces the last ping (pong) time to servers. In fact it worked fine until `8631e64` changed the condition to send ping. To resolve it with minimal risk, let’s disconnect master and replicas once changing password/user. Based on earlier work of yz1509.	2022-03-13 10:13:47 +02:00
Binbin	beb94c901e	Fix INFO SENTINEL memory leak (#10268 ) * Fix INFO SENTINEL memory leak Introduced in #6891 * remove the copy-paste sentence	2022-02-09 07:33:24 +02:00
Wen Hui	2e1bc942aa	Make INFO command variadic (#6891 ) This is an enhancement for INFO command, previously INFO only support one argument for different info section , if user want to get more categories information, either perform INFO all / default or calling INFO for multiple times. Description of the feature The goal of adding this feature is to let the user retrieve multiple categories via the INFO command, and still avoid emitting the same section twice. A use case for this is like Redis Sentinel, which periodically calling INFO command to refresh info from monitored Master/Slaves, only Server and Replication part categories are used for parsing information. If the INFO command can return just enough categories that client side needs, it can save a lot of time for client side parsing it as well as network bandwidth. Implementation To share code between redis, sentinel, and other users of INFO (DEBUG and modules), we have a new `genInfoSectionDict` function that returns a dict and some boolean flags (e.g. `all`) to the caller (built from user input). Sentinel is later purging unwanted sections from that, and then it is forwarded to the info `genRedisInfoString`. Usage Examples INFO Server Replication INFO CPU Memory INFO default commandstats Co-authored-by: Oran Agra <oran@redislabs.com>	2022-02-08 13:14:42 +02:00
Binbin	d7fcb3c5a1	Fix SENTINEL SET config rewrite test (#10232 ) Change the sentinel config file to a directory in SENTINEL SET test. So it will now fail on the `rename` in `rewriteConfigOverwriteFile`. The test used to set the sentinel config file permissions to `000` to simulate failure. But it fails on centos7 / freebsd / alpine. (introduced in #10151) Other changes: 1. More error messages after the config rewrite failure. 2. Modify arg name `force_all` in `rewriteConfig` to `force_write`. (was rename in #9304) 3. Fix a typo in debug quicklist-packed-threshold, then -> than. (#9357)	2022-02-04 11:39:51 +02:00
Wen Hui	65ef543f8c	Sentinel: return an error if configuration save fails (#10151 ) When performing `SENTINEL SET`, Sentinel updates the local configuration file. Before this commit, failure to update the file would still result with an `+OK` reply. Now, a `-ERR Failed to save config file` error will be returned. Co-authored-by: Yossi Gottlieb <yossigo@gmail.com>	2022-02-03 13:20:35 +02:00
Binbin	7e5ded2ad0	Fix timing issue in sentinel CKQUORUM test (#10036 ) A test failure was reported in Daily CI (test-centos7-tls). `CKQUORUM detects failover authorization cannot be reached`. ``` CKQUORUM detects failover authorization cannot be reached: FAILED: Expected 'invalid command name "OK 4 usable Sentinels. Quorum and failover authorization can be reached"' to match 'NOQUORUM' ``` It seems that current sentinel does not confirm that the other sentinels are actually `down`, and then check the quorum. It at least take 3 seconds on my machine, and we can see there will be a timing issue with the hard code `after 5000`. In this commit, we check the response of `SENTINEL SENTINELS mymaster` to ensure that other sentinels are actually `down` in the view the current sentinel. Solve the timing issue due to sentinel monitor mechanism.	2022-01-23 13:54:50 +02:00
Binbin	23325c135f	sub-command support for ACL CAT and COMMAND LIST. redisCommand always stores fullname (#10127 ) Summary of changes: 1. Rename `redisCommand->name` to `redisCommand->declared_name`, it is a const char * for native commands and SDS for module commands. 2. Store the [sub]command fullname in `redisCommand->fullname` (sds). 3. List subcommands in `ACL CAT` 4. List subcommands in `COMMAND LIST` 5. `moduleUnregisterCommands` now will also free the module subcommands. 6. RM_GetCurrentCommandName returns full command name Other changes: 1. Add `addReplyErrorArity` and `addReplyErrorExpireTime` 2. Remove `getFullCommandName` function that now is useless. 3. Some cleanups about `fullname` since now it is SDS. 4. Delete `populateSingleCommand` function from server.h that is useless. 5. Added tests to cover this change. 6. Add some module unload tests and fix the leaks 7. Make error messages uniform, make sure they always contain the full command name and that it's quoted. 7. Fixes some typos see the history in #9504, fixes #10124 Co-authored-by: Oran Agra <oran@redislabs.com> Co-authored-by: guybe7 <guy.benoish@redislabs.com>	2022-01-23 10:05:06 +02:00
Wen Hui	68a8d0b46d	Add sentinel config set test case (#10114 )	2022-01-19 11:57:51 +02:00
Binbin	febc3f63b2	Fix recent daily CI test failures (#9966 ) Recent PRs have introduced some failures, this commit try to fix these CI failures. Here are the changes: 1. Enable debug-command in sentinel test. ``` Master reboot in very short time: ERR DEBUG command not allowed. If the enable-debug-command option is set to "local", you can run it from a local connection, otherwise you need to set this option in the configuration file, and then restart the server. ``` 2. Enable protected-config in sentinel test. ``` SDOWN is triggered by misconfigured instance replying with errors: ERR CONFIG SET failed (possibly related to argument 'dir') - can't set protected config ``` 3. Enable debug-command in cluster test. ``` Verify slaves consistency: ERR DEBUG command not allowed. If the enable-debug-command option is set to "local", you can run it from a local connection, otherwise you need to set this option in the configuration file, and then restart the server. ``` 4. quicklist fill should be signed int. The reason for the modification is to eliminate the warning. Modify `int fill: QL_FILL_BITS` to `signed int fill: QL_FILL_BITS` The first three were introduced at #9920 (same issue). And the last one was introduced at #9962.	2021-12-20 12:31:13 +02:00
Wen Hui	2afa41f628	Sentinel master reboot fix (#9438 ) Add master-reboot-down-after-period as a configurable parameter, to make it possible to trigger a failover from a master that is responding with `-LOADING` for a long time after being restarted.	2021-11-30 18:46:15 +02:00
Wen Hui	53ad5627b7	Sentinel: Fix failed daily tests, due to race condition (#9501 )	2021-09-15 13:39:50 +03:00
Wen Hui	763fd09416	Speed up sentinel tests (#9408 ) Use sentinel debug to reduce default timeouts and allow tests to execute faster.	2021-09-05 13:26:29 +03:00
Wen Hui	63e2a6d212	Add sentinel debug option command (#9291 ) This makes it possible to tune many parameters that were previously hard coded. We don't intend these to be user configurable, but only used by tests to accelerate certain conditions which would otherwise take a long time and slow down the test suite. Co-authored-by: Lucas Guang Yang <l84193800@china.huawei.com>	2021-08-05 11:12:55 +03:00
Wen Hui	db41536454	Remove duplicate zero-port sentinels (#9240 ) The issue is that when a sentinel with the same address and IP is turned on with a different runid, its port is set to 0 but it is still present in the dictionary master->sentinels which contain all the sentinels for a master. This causes a problem when we do INFO SENTINEL because it takes the size of the dictionary of sentinels. This might also cause a problem for failover if enough sentinels have their port set to 0 since the number of voters in failover is also determined by the size of the dictionary of sentinels. This commits removes the sentinels with the port set to zero from the dictionary of sentinels. Fixes #8786	2021-07-29 12:32:28 +03:00
Binbin	0bfccc55e2	Fixed some typos, add a spell check ci and others minor fix (#8890 ) This PR adds a spell checker CI action that will fail future PRs if they introduce typos and spelling mistakes. This spell checker is based on blacklist of common spelling mistakes, so it will not catch everything, but at least it is also unlikely to cause false positives. Besides that, the PR also fixes many spelling mistakes and types, not all are a result of the spell checker we use. Here's a summary of other changes: 1. Scanned the entire source code and fixes all sorts of typos and spelling mistakes (including missing or extra spaces). 2. Outdated function / variable / argument names in comments 3. Fix outdated keyspace masks error log when we check `config.notify-keyspace-events` in loadServerConfigFromString. 4. Trim the white space at the end of line in `module.c`. Check: https://github.com/redis/redis/pull/7751 5. Some outdated https link URLs. 6. Fix some outdated comment. Such as: - In README: about the rdb, we used to said create a `thread`, change to `process` - dbRandomKey function coment (about the dictGetRandomKey, change to dictGetFairRandomKey) - notifyKeyspaceEvent fucntion comment (add type arg) - Some others minor fix in comment (Most of them are incorrectly quoted by variable names) 7. Modified the error log so that users can easily distinguish between TCP and TLS in `changeBindAddr`	2021-06-10 15:39:33 +03:00
Oran Agra	370ab4c4db	Solve sentinel test issue in TLS due to recent tests change. (#8728 ) `5629dbe71` added a change that configures the tcp (plaintext) port alongside the tls port, this causes the INFO command for tcp_port to return that instead of the tls port when running in tls, and that broke the sentinel tests that query it. the fix is to add a method that gets the right port from CONFIG instead of relying on the tcp_port info field.	2021-04-01 09:44:44 +03:00
Jérôme Loyet	91f4f41665	Add replica-announced config option (#8653 ) The 'sentinel replicas <master>' command will ignore replicas with `replica-announced` set to no. The goal of disabling the config setting replica-announced is to allow ghost replicas. The replica is in the cluster, synchronize with its master, can be promoted to master and is not exposed to sentinel clients. This way, it is acting as a live backup or living ghost. In addition, to prevent the replica to be promoted as master, set replica-priority to 0.	2021-03-30 23:40:22 +03:00
Yossi Gottlieb	8c42d1257f	Fix errors with sentinel leaked fds test. (#8482 ) * Don't run test script on non-Linux. * Verify that reported fds do indeed exist also in parent, to avoid false negatives on some systems (namely CentOS). Co-authored-by: Andy Pan <panjf2000@gmail.com>	2021-02-11 15:25:01 +02:00
Yossi Gottlieb	dbcc0a85d0	Fix and cleanup Sentinel leaked fds test. (#8469 ) * For consistency, use tclsh for the script as well * Ignore leaked fds that originate from grandparent process, since we only care about fds redis-sentinel itself is responsible for * Check every test iteration to catch problems early * Some cleanups, e.g. parameterization of file name, etc.	2021-02-08 17:02:46 +02:00
Wen Hui	eacccd2acb	fix sentinel tests error (#8422 ) This commit fixes sentinel announces hostnames test error in certain linux environment Before this commit, we only check localhost is resolved into 127.0.0.1, however in ubuntu or some other linux environments "localhost" will be resolved into ::1 ipv6 address first if the network stack is capable.	2021-01-30 11:18:58 +02:00
Yossi Gottlieb	bb7cd97439	Add hostname support in Sentinel. (#8282 ) This is both a bugfix and an enhancement. Internally, Sentinel relies entirely on IP addresses to identify instances. When configured with a new master, it also requires users to specify and IP and not hostname. However, replicas may use the replica-announce-ip configuration to announce a hostname. When that happens, Sentinel fails to match the announced hostname with the expected IP and considers that a different instance, triggering reconfiguration, etc. Another use case is where TLS is used and clients are expected to match the hostname to connect to with the certificate's SAN attribute. To properly implement this configuration, it is necessary for Sentinel to redirect clients to a hostname rather than an IP address. The new 'resolve-hostnames' configuration parameter determines if Sentinel is willing to accept hostnames. It is set by default to no, which maintains backwards compatibility and avoids unexpected DNS resolution delays on systems with DNS configuration issues. Internally, Sentinel continues to identify instances by their resolved IP address and will also report the IP by default. The new 'announce-hostnames' parameter determines if Sentinel should prefer to announce a hostname, when available, rather than an IP address. This applies to addresses returned to clients, as well as their representation in the configuration file, REPLICAOF configuration commands, etc. This commit also introduces SENTINEL CONFIG GET and SENTINEL CONFIG SET which can be used to introspect or configure global Sentinel configuration that was previously was only possible by directly accessing the configuration file and possibly restarting the instance. Co-authored-by: myl1024 <myl92916@qq.com> Co-authored-by: sundb <sundbcn@gmail.com>	2021-01-28 12:09:11 +02:00
Wen Hui	1aad55b66f	Sentinel: Fix Config Dependency and Rewrite Sequence (#8271 ) This commit fixes a well known and an annoying issue in Sentinel mode. Cause of this issue: Currently, Redis rewrite process works well in server mode, however in sentinel mode, the sentinel config has variant semantics for different configurations, in example configuration https://github.com/redis/redis/blob/unstable/sentinel.conf, we put comments on these. However the rewrite process only treat the sentinel config as a single option. During rewrite process, it will mess up with the lines and comments. Approaches: In order to solve this issue, we need to differentiate different subconfig options in sentinel separately, for example, sentinel monitor <master-name> <ip> <redis-port> <quorum> we can treat it as sentinel monitor option, instead of the sentinel option. This commit also fixes the dependency issue when putting configurations in sentinel.conf. For example before this commit,we must put `sentinel monitor <master-name> <ip> <redis-port> <quorum>` before `sentinel auth-pass <master-name> <password>` for a single master, otherwise the server cannot start and will return error. This commit fixes this issue, as long as the monitoring master was configured, no matter the sequence is, the sentinel can start and run properly.	2021-01-26 09:31:54 +02:00
Andy Pan	8449a5df87	Sentinel tests, disable FD leak check, and print more details (#8376 ) * Print more details about fd leaks * temporarily prevent the leaks from failing the tests Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-22 12:11:58 +02:00
Andy Pan	6401920d70	Fix sentinel FD leak test, checking the wrong OS name (#8364 )	2021-01-20 10:17:20 +02:00
Andy Pan	fb66e2e249	Use FD_CLOEXEC in Sentinel, so that FDs don't leak to the scripts it runs (#8242 ) Sentinel uses execve to run scripts, so it needs to use FD_CLOEXEC on all file descriptors, so that they're not accessible by the script it runs. This commit includes a change to the sentinel tests, which verifies no FDs are left opened when the script is executed.	2021-01-19 22:57:30 +02:00
Mykhailo Pylyp	cb2c7c4d34	Recalculate hardcoded variables from $::instances_count in sentinel tests (#7561 ) Co-authored-by: MemuraiUser <githubuser@janeasystems.com>	2020-09-13 18:39:59 +03:00
Yossi Gottlieb	61733ded14	TLS: Configuration options. Add configuration options for TLS protocol versions, ciphers/cipher suites selection, etc.	2019-10-07 21:07:27 +03:00
Yossi Gottlieb	b087dd1db6	TLS: Connections refactoring and TLS support. * Introduce a connection abstraction layer for all socket operations and integrate it across the code base. * Provide an optional TLS connections implementation based on OpenSSL. * Pull a newer version of hiredis with TLS support. * Tests, redis-cli updates for TLS support.	2019-10-07 21:06:13 +03:00
Chris Lamb	132be8aed5	Correct "did not received" -> "did not receive" typos/grammar.	2018-08-26 14:45:39 +02:00
antirez	438317796b	Sentinel: test command renaming feature.	2018-06-26 16:08:43 +02:00
antirez	8b76d55f2e	Sentinel: new test unit 07 that tests master down conditions.	2016-07-22 16:39:26 +02:00
antirez	3e9ce38b0a	Sentinel: check Slave INFO state more often when disconnected. During the initial handshake with the master a slave will report to have a very high disconnection time from its master (since technically it was disconnected since forever, so the current UNIX time in seconds is reported). However when the slave is connected again the Sentinel may re-scan the INFO output again only after 10 seconds, which is a long time. During this time Sentinels will consider this instance unable to failover, so a useless delay is introduced. Actaully this hardly happened in the practice because when a slave's master is down, the INFO period for slaves changes to 1 second. However when a manual failover is attempted immediately after adding slaves (like in the case of the Sentinel unit test), this problem may happen. This commit changes the INFO period to 1 second even in the case the slave's master is not down, but the slave reported to be disconnected from the master (by publishing, last time we checked, a master disconnection time field in INFO). This change is required as a result of an unrelated change in the replication code that adds a small delay in the master-slave first synchronization.	2016-07-22 10:51:25 +02:00
antirez	d614f1c37e	Sentinel: CKQUORUM tests	2015-05-19 12:26:09 +02:00
antirez	65090401b7	Sentinel / Cluster test: exit with non-zero error code on failures.	2015-03-30 14:29:01 +02:00
Matt Stancliff	28343966a4	Spell software correctly	2014-09-29 06:49:07 -04:00
antirez	e21e0ba3dc	Sentinel test: more correct sentinels config reset. In the initialization test for each instance we used to unregister the old master and register it again to clear the config. However there is a race condition doing this: as soon as we unregister and re-register "mymaster", another Sentinel can update the new configuration with the old state because of gossip "hello" messages. So the correct procedure is instead, unregister "mymaster" from all the sentinel instances, and re-register it everywhere again.	2014-06-23 14:07:47 +02:00
antirez	f62dfa0f50	Sentinel test: tolerate larger delays in init tests.	2014-06-19 15:58:45 +02:00
antirez	d06d8d6ffa	Sentinel test: unit 02, avoid some time related false positives.	2014-06-19 15:56:28 +02:00
antirez	f16ad11c71	Sentinel test: add manual failover test.	2014-06-19 10:33:12 +02:00
Matt Stancliff	f7d9827330	Add correct exit value to failed tests	2014-06-18 08:10:04 -04:00
antirez	e8631a6991	Cluster / Sentinel test: instances count moved to run.tcl.	2014-04-29 16:17:15 +02:00
antirez	897adc1c8c	Sentinel test files / directories layout improved. The test now runs in a self-contained directory. The general abstractions to run the tests in an environment where mutliple instances are executed at the same time was extrapolated into instances.tcl, that will be reused to test Redis Cluster.	2014-04-24 11:08:22 +02:00

43 Commits