redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-23 00:28:26 -05:00

Author	SHA1	Message	Date
Valentino Geron	2029976dc3	Optimization: moduleLoadString should try to create embedded string if not plain (#11050 ) Before this change, if the module has an embedded string, then uses RedisModule_SaveString and RedisModule_LoadString, the result would be a raw string instead of an embedded string. Now the `RDB_LOAD_ENC` flag to `moduleLoadString` only affects integer encoding, but not embedded strings (which still hold an sds in the robj ptr, so they're actually still raw strings for anyone who reads them). Co-authored-by: Valentino Geron <valentino@redis.com>	2022-07-31 17:29:59 +03:00
Huang Zhw	61451b02cb	tracking pending invalidation message of flushdb sent by (#11068 ) trackingHandlePendingKeyInvalidations should use proto.	2022-07-31 16:14:39 +03:00
Binbin	90f35cea81	Avoid false positive out-of-bounds in writeForgottenNodePingExt (#11053 ) In clusterMsgPingExtForgottenNode, sizeof(name) is CLUSTER_NAMELEN, and sizeof(clusterMsgPingExtForgottenNode) is > CLUSTER_NAMELEN. Doing a (name + sizeof(clusterMsgPingExtForgottenNode)) sanitizer generates an out-of-bounds error which is a false positive in here	2022-07-28 15:14:18 -07:00
Binbin	e7144693e2	Fix bgsaveerr issue in psync wrong offset test (#11043 ) The kill above is sometimes successful and sometimes already too late. The PING in pysnc wrong offset test got rejected by bgsaveerr because lastbgsave_status is C_ERR. In theory, using diskless can avoid PING being affected, because when the replica is dropped, we will kill the child with SIGUSR1, and this will not affect lastbgsave_status. Anyway, this kill is not particularly needed here, dropping the kill is the best one, since we do have the waitForBgsave, so just let it take care of the bgsave. No need for fast termination.	2022-07-27 14:58:25 +03:00
guybe7	45c99d7092	Adds RM_Microseconds and RM_CachedMicroseconds (#11016 ) RM_Microseconds Return the wall-clock Unix time, in microseconds RM_CachedMicroseconds Returns a cached copy of the Unix time, in microseconds. It is updated in the server cron job and before executing a command. It is useful for complex call stacks, such as a command causing a key space notification, causing a module to execute a RedisModule_Call, causing another notification, etc. It makes sense that all these callbacks would use the same clock.	2022-07-27 14:40:05 +03:00
Binbin	00097bf4aa	Change the return value of rdbLoad function to enums (#11039 ) The reason we do this is because in #11036, we added error log message when failing to open RDB file for reading. In loadDdataFromDisk we call rdbLoad and also check errno, now the logging corrupts errno (reported in alpine daily). It is not safe to rely on errno as we do today, so we change the return value of rdbLoad function to enums, like we have when loading an AOF.	2022-07-26 15:13:13 +03:00
Huang Zhw	6f0a27e38e	When client tracking is on, invalidation message of flushdb in a (#11038 ) When FLUSHDB / FLUSHALL / SWAPDB is inside MULTI / EXEC, the client side tracking invalidation message was interleaved with transaction response.	2022-07-26 13:28:37 +03:00
Meir Shpilraien (Spielrein)	020e046b42	Fix #11030 , use lua_rawget to avoid triggering metatables and crash. (#11032 ) Fix #11030, use lua_rawget to avoid triggering metatables. #11030 shows how return `_G` from the Lua script (either function or eval), cause the Lua interpreter to Panic and the Redis processes to exit with error code 1. Though return `_G` only panic on Redis 7 and 6.2.7, the underline issue exists on older versions as well (6.0 and 6.2). The underline issue is returning a table with a metatable such that the metatable raises an error. The following example demonstrate the issue: ``` 127.0.0.1:6379> eval "local a = {}; setmetatable(a,{__index=function() foo() end}) return a" 0 Error: Server closed the connection ``` ``` PANIC: unprotected error in call to Lua API (user_script:1: Script attempted to access nonexistent global variable 'foo') ``` The Lua panic happened because when returning the result to the client, Redis needs to introspect the returning table and transform the table into a resp. In order to scan the table, Redis uses `lua_gettable` api which might trigger the metatable (if exists) and might raise an error. This code is not running inside `pcall` (Lua protected call), so raising an error causes the Lua to panic and exit. Notice that this is not a crash, its a Lua panic that exit with error code 1. Returning `_G` panics on Redis 7 and 6.2.7 because on those versions `_G` has a metatable that raises error when trying to fetch a none existing key. ### Solution Instead of using `lua_gettable` that might raise error and cause the issue, use `lua_rawget` that simply return the value from the table without triggering any metatable logic. This is promised not to raise and error. The downside of this solution is that it might be considered as breaking change, if someone rely on metatable in the returned value. An alternative solution is to wrap this entire logic with `pcall` (Lua protected call), this alternative require a much bigger refactoring. ### Back Porting The same fix will work on older versions as well (6.2, 6.0). Notice that on those version, the issue can cause Redis to crash if inside the metatable logic there is an attempt to accesses Redis (`redis.call`). On 7.0, there is not crash and the `redis.call` is executed as if it was done from inside the script itself. ### Tests Tests was added the verify the fix	2022-07-26 10:33:50 +03:00
Viktor Söderqvist	5032de50f2	Gossip forgotten nodes on `CLUSTER FORGET` (#10869 ) Gossip the cluster node blacklist in ping and pong messages. This means that CLUSTER FORGET doesn't need to be sent to all nodes in a cluster. It can be sent to one or more nodes and then be propagated to the rest of them. For each blacklisted node, its node id and its remaining blacklist TTL is gossiped in a cluster bus ping extension (introduced in #9530).	2022-07-26 10:28:13 +03:00
YaacovHazan	33bd8fb981	Add error log message when failing to open RDB file for reading (#11036 ) When failing to open the rdb file, there was no specific error printed (unlike a corrupt file), so it was not clear what failed and why.	2022-07-25 10:09:58 +03:00
Binbin	03fff10ab4	fsync the old aof file when open a new INCR AOF (#11004 ) In rewriteAppendOnlyFileBackground, after flushAppendOnlyFile(1), and before openNewIncrAofForAppend, we should call redis_fsync to fsync the aof file. Because we may open a new INCR AOF in openNewIncrAofForAppend, in the case of using everysec policy, the old AOF file may not be fsynced in time (or even at all). When using everysec, we don't want to pay the disk latency from the main thread, so we will do a background fsync. Adding a argument for bioCreateCloseJob, a `need_fsync` flag to indicate that a fsync is required before the file is closed. So we will fsync the old AOF file before we close it. A cleanup, we make union become a union, since the free_* args and the fd / fsync args are never used together. Co-authored-by: Oran Agra <oran@redislabs.com>	2022-07-25 09:16:35 +03:00
chenyang8094	39d216a326	Register abs-expire apis (#11025 ) RM_SetAbsExpire and RM_GetAbsExpire were not actually operational since they were introduced, due to omission in API registration.	2022-07-24 08:50:21 +03:00
Pavel Krush	5879e490b8	fixed complexity of bzmpop and zmpop commands (#11026 ) Swap M and N in the complexity formula of [B]ZMPOP Co-authored-by: Pavel Krush <neon@pushwoosh.com>	2022-07-24 08:38:04 +03:00
Tian	d00b8af892	Don't update node ip when peer fd is closed (#10696 )	2022-07-20 16:59:27 -07:00
guybe7	6d6e932fa6	Adds LASTID to XCLAIM docs (#11017 ) It seems it was overlooked when we first created the json files	2022-07-20 10:12:06 +03:00
Tian	cc2848132f	Make cluster config file saving atomic and fsync acl (#10924 ) As an outstanding part mentioned in #10737, we could just make the cluster config file and ACL file saving done with a more safe and atomic pattern (write to temp file, fsync, rename, fsync dir). The cluster config file uses an in-place overwrite and truncation (which was also used by the main config file before #7824). The ACL file is using the temp file and rename approach, but was missing an fsync. Co-authored-by: 朱天 <zhutian03@meituan.com>	2022-07-20 09:11:01 +03:00
Wen Hui	56828bab59	Fix EVALSHA_RO and EVAL_RO command json file (#11015 ) these are missing from the RO_ commands, present in the other ones. Co-authored-by: Ubuntu <lucas.guang.yang1@huawei.com>	2022-07-19 18:54:45 +03:00
Yossi Gottlieb	b550a55cbf	CI: Update vmaction. (#11013 )	2022-07-19 15:30:06 +03:00
Binbin	95b88f672a	Set RM_StringCompare input args as const (#11010 ) Following #10996, it forgot to modify RM_StringCompare in module.c Modified RM_StringCompare, compareStringObjectsWithFlags, compareStringObjects and collateStringObjects.	2022-07-19 08:59:39 +03:00
Binbin	5ce64ab010	Fix timing issue in cluster test (#11008 ) A timing issue like this was reported in freebsd daily CI: ``` *** [err]: Sanity test push cmd after resharding in tests/unit/cluster/cli.tcl Expected 'CLUSTERDOWN The cluster is down' to match 'MOVED' ``` We additionally wait for each node to reach a consensus on the cluster state in wait_for_condition to avoid the cluster down error. The fix just like #10495, quoting madolson's comment: Cluster check just verifies the the config state is self-consistent, waiting for cluster_state to be okay is an independent check that all the nodes actually believe each other are healthy. At the same time i noticed that unit/moduleapi/cluster.tcl has an exact same test, may have the same problem, also modified it.	2022-07-18 20:35:13 -07:00
Oran Agra	2825b6057b	Fix heap overflow corruption in XAUTOCLAIM (CVE-2022-31144) (#11002 ) The temporary array for deleted entries reply of XAUTOCLAIM was insufficient, but also in fact the COUNT argument should be used to control the size of the reply, so instead of terminating the loop by only counting the claimed entries, we'll count deleted entries as well. Fix #10968 Addresses CVE-2022-31144	2022-07-18 11:36:19 +03:00
ranshid	eacca729a5	Avoid using unsafe C functions (#10932 ) replace use of: sprintf --> snprintf strcpy/strncpy --> redis_strlcpy strcat/strncat --> redis_strlcat why are we making this change? Much of the code uses some unsafe variants or deprecated buffer handling functions. While most cases are probably not presenting any issue on the known path programming errors and unterminated strings might lead to potential buffer overflows which are not covered by tests. As part of this PR we change 1. added implementation for redis_strlcpy and redis_strlcat based on the strl implementation: https://linux.die.net/man/3/strl 2. change all occurrences of use of sprintf with use of snprintf 3. change occurrences of use of strcpy/strncpy with redis_strlcpy 4. change occurrences of use of strcat/strncat with redis_strlcat 5. change the behavior of ll2string/ull2string/ld2string so that it will always place null termination ('\0') on the output buffer in the first index. this was done in order to make the use of these functions more safe in cases were the user will not check the output returned by them (for example in rdbRemoveTempFile) 6. we added a compiler directive to issue a deprecation error in case a use of sprintf/strcpy/strcat is found during compilation which will result in error during compile time. However keep in mind that since the deprecation attribute is not supported on all compilers, this is expected to fail during push workflows. NOTE: while this is only an initial milestone. We might also consider using the *_s implementation provided by the C11 Extensions (however not yet widly supported). I would also suggest to start looking at static code analyzers to track unsafe use cases. For example LLVM clang checker supports security.insecureAPI.DeprecatedOrUnsafeBufferHandling which can help locate unsafe function usage. https://clang.llvm.org/docs/analyzer/checkers.html#security-insecureapi-deprecatedorunsafebufferhandling-c The main reason not to onboard it at this stage is that the alternative excepted by clang is to use the C11 extensions which are not always supported by stdlib.	2022-07-18 10:56:26 +03:00
Valentino Geron	82b8203555	remove boolean usage and use 0/1 instead (#10997 ) If we do not use jemalloc (mostly with valgrind) and use an old compiler that does not support C11 we will get compilation error Co-authored-by: Valentino Geron <valentino@redis.com>	2022-07-17 18:28:38 +03:00
Guy Korland	587fc9ce76	Set RedisModule_StringCompare input args as const (#10996 )	2022-07-17 13:53:48 +03:00
Madelyn Olson	3abdec9969	Fix cluster hostnames test causing failover while running valgrind (#10991 ) In the newly added cluster hostnames test, the primary is failing over during the reboot for valgrind so we are validating the wrong node. This change just sets the replica to prevent taking over, which seems to fix the test. We could have also set the timeout higher, but it slows down the test.	2022-07-17 09:57:34 +03:00
Wen Hui	2d3240f31b	Add optional for FCALL and FCALL_RO command json file (#10988 ) According to the Redis functions documentation, FCALL command format could be FCALL function_name numberOfKeys [key1, key2, key3.....] [arg1, arg2, arg3.....] So in the json file of fcall and fcall_ro, we should add optional for key and arg part. Just like EVAL... Co-authored-by: Binbin <binloveplay1314@qq.com>	2022-07-17 09:40:07 +03:00
Valentino Geron	847cdca151	Add an option to specify multiple skip files using `--skipfile` (#10975 ) `--skipfile` can be repeated. For example: ./runtests --skipfile file1.txt --skipfile file2.txt Co-authored-by: Valentino Geron <valentino@redis.com>	2022-07-17 08:47:35 +03:00
Oran Agra	599e59ebc5	Avoid valgrind fishy value warning on corrupt restore payloads (#10937 ) The corrupt dump fuzzer uncovered a valgrind warning saying: ``` ==76370== Argument 'size' of function malloc has a fishy (possibly negative) value: -3744781444216323815 ``` This allocation would have failed (returning NULL) and being handled properly by redis (even before this change), but we also want to silence the valgrind warnings (which are checking that casting to ssize_t produces a non-negative value). The solution i opted for is to explicitly fail these allocations (returning NULL), before even reaching `malloc` (which would have failed and return NULL too). The implication is that we will not be able to support a single allocation of more than 2GB on a 32bit system (which i don't think is a realistic scenario). i.e. i do think we could be facing cases were redis consumes more than 2gb on a 32bit system, but not in a single allocation. The byproduct of this, is that i dropped the overflow assertions, since these will now lead to the same OOM panic we have for failed allocations.	2022-07-13 09:14:38 +03:00
Binbin	20af95a99f	Add range check for server port in redis-cli/benchmark (#9854 ) Validating inputs ahead of time, to give the end user a slightly more useful error.	2022-07-12 20:32:14 -07:00
Madelyn Olson	8a4e3bcd8d	Cluster test improvements (#10920 ) * Restructured testing to allow running cluster tests easily as part of the normal testing	2022-07-12 10:41:29 -07:00
Wen Hui	8221d48165	Normalize the style of help information with other documentation (#10965 )	2022-07-12 09:56:02 -07:00
jimgreen2013	53e7de205f	Simplify arithmetic expression on LP_REPLACE case in lpinsert (#6327 ) Remove unnecessary variable name.	2022-07-12 09:24:59 -07:00
Binbin	693acc0114	Trying to fix cluster test (#10963 ) #10942 break the new test added in #10449 ``` Testing unit: 29-slot-migration-response.tcl Cluster Join and auto-discovery test: FAILED: Cluster failed to join into a full mesh. ``` It looks like we need to wait for the cluster in 28 to become stable.	2022-07-11 15:21:35 +03:00
Oran Agra	b4568fc78a	update help.h (#10961 )	2022-07-11 12:36:03 +03:00
Binbin	35e8ae3eb5	Add cluster-port support to redis-cli --cluster (#10344 ) In #9389, we add a new `cluster-port` config and make cluster bus port configurable, and currently redis-cli --cluster create/add-node doesn't support with a configurable `cluster-port` instance. Because redis-cli uses the old way (port + 10000) to send the `CLUSTER MEET` command. Now we add this support on redis-cli `--cluster`, note we don't need to explicitly pass in the `cluster-port` parameter, we can get the real `cluster-port` of the node in `clusterManagerNodeLoadInfo`, so the `--cluster create` and `--cluster add-node` interfaces have not changed. We will use the `cluster-port` when we are doing `CLUSTER MEET`, also note that `CLUSTER MEET` bus-port parameter was added in 4.0, so if the bus_port (the one in redis-cli) is 0, or equal (port + 10000), we just call `CLUSTER MEET` with 2 arguments, using the old form. Co-authored-by: Madelyn Olson <34459052+madolson@users.noreply.github.com>	2022-07-11 11:23:31 +03:00
Binbin	1e85b89aef	Add --check-system in redis-server usage (#10960 )	2022-07-11 08:31:39 +03:00
Madelyn Olson	e6a1b2ea95	Fix crash during handshake and cluster shards call (#10942 ) * Fix an engine crash when there are nodes in handshaking and a user calls cluster shards	2022-07-10 22:00:44 -07:00
Madelyn Olson	1209dc2277	Only print ACL syntax errors once and include command names in errors (#10922 ) * Only print ACL syntax errors once and include command names in errors	2022-07-09 21:02:22 -07:00
Binbin	8203461120	Fix some outdated comments and some typo (#10946 ) * Fix some outdated comments and some typo	2022-07-06 20:31:59 -07:00
adasarpan404	0a2e51f9ae	Renaming conduct to code of conduct and contributing files to .md (#10941 ) CONTRIBUTING to get better formatting CONDUCT also because github doesn't seem recognize the code of conduct page Co-authored-by: Oran Agra <oran@redislabs.com>	2022-07-06 11:58:44 +03:00
Binbin	0132ed7544	Add pubsubshard_channels field in INFO STATS (#10929 ) We already have `pubsub_channels` and `pubsub_patterns` in INFO stats, now add `pubsubshard_channels` (symmetry). Sharded pubsub was added in #8621	2022-07-06 09:50:08 +03:00
Yossi Gottlieb	45ae605332	TLS: Notify clients on connection shutdown. (#10931 ) Use SSL_shutdown(), in a best-effort manner, when closing a TLS connection. This change better supports OpenSSL 3.x clients that will not silently ignore the socket-level EOF.	2022-07-05 08:41:17 +03:00
Harkrishn Patro	9bcdd1537e	Avoid double multiplication of alloc_count (#10934 )	2022-07-04 10:09:27 -07:00
Wen Hui	f620e6ac73	Add tests for error messages during slot migrations (#10449 ) * Add tests for error messages during slot migrations Co-authored-by: Ubuntu <lucas.guang.yang1@huawei.com> Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>	2022-07-04 10:31:12 -05:00
Harkrishn Patro	a3704d4e87	Optimize number of realloc syscall during multi/exec flow (#10921 ) ## Issue During the MULTI/EXEC flow, each command gets queued until the `EXEC` command is received and during this phase on every command queue, a `realloc` is being invoked. This could be expensive based on the realloc behavior (if copy to a new memory location). ## Solution In order to reduce the no. of syscall, couple of optimization I've used. 1. By default, reserve memory for atleast two commands. `MULTI/EXEC` for a single command doesn't have any significance. Hence, I believe customer wouldn't use it. 2. For further reservation, increase the memory allocation in exponent growth (power of 2). This reduces the no. of `realloc` call from `N` to `log(N)` times. ## Other changes: * Include multi exec queued command array in client memory consumption calculation (affects client eviction too)	2022-07-04 09:47:34 +03:00
Qu Chen	33b7ff387c	Unlock cluster config file upon server shutdown. (#10912 ) Currently in cluster mode, Redis process locks the cluster config file when starting up and holds the lock for the entire lifetime of the process. When the server shuts down, it doesn't explicitly release the lock on the cluster config file. We noticed a problem with restart testing that if you shut down a very large redis-server process (i.e. with several hundred GB of data stored), it takes the OS a while to free the resources and unlock the cluster config file. So if we immediately try to restart the redis server process, it might fail to acquire the lock on the cluster config file and fail to come up. This fix explicitly releases the lock on the cluster config file upon a shutdown rather than relying on the OS to release the lock, which is a cleaner and safer approach to free up resources acquired.	2022-07-04 09:38:19 +03:00
Harkrishn Patro	0ab885a685	Account sharded pubsub channels memory consumption (#10925 ) Account sharded pubsub channels memory consumption in client memory usage computation to accurately evict client based on the set threshold for `maxmemory-clients`.	2022-07-04 09:18:57 +03:00
Binbin	679344a2b0	Set aof rewrite status in some backgroundRewriteDoneHandler errors (#10923 ) We should also set aof_lastbgrewrite_status to C_ERR on these errors. Because aof rewrite did fail, and we did not finish the manifest update. Also maintain the stat_aofrw_consecutive_failures.	2022-07-03 15:36:42 +03:00
Yossi Gottlieb	0b645d6319	Fix TLS issues with large replies (#10909 ) This problem was introduced by `496375f` and seems to more easily reproduce on macOS since OpenSSL writes more frequently return with EAGAIN.	2022-07-03 13:35:58 +03:00
Yossi Gottlieb	69d5576832	Fix TLS tests on newer tcl-tls/OpenSSL. (#10910 ) Before this commit, TLS tests on Ubuntu 22.04 would fail as dropped connections result with an ECONNABORTED error thrown instead of an empty read.	2022-07-03 13:34:14 +03:00

1 2 3 4 5 ...

11301 Commits