redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-24 00:59:02 -05:00

Author	SHA1	Message	Date
Oran Agra	4447ddc8bb	Keep track of meaningful replication offset in replicas too Now both master and replicas keep track of the last replication offset that contains meaningful data (ignoring the trailing pings), and both trim that tail from the replication backlog, and from the offset they try to use for psync. the implication is that if someone missed some pings, or even have excessive pings that the promoted replica has, it'll still be able to psync (avoid full sync). the downside (which was already committed) is that replicas running old code may fail to psync, since the promoted replica trims pings from its backlog. This commit adds a test that reproduces several cases of promotions and demotions with stale and non-stale pings Background: The meaningful offset on the master was added recently to solve a problem where the master is left all alone, injecting PINGs into its backlog when no one is listening and then gets demoted and tries to replicate from a replica that didn't have any of the PINGs (or at least not the last ones). however, consider this case: master A has two replicas (B and C) replicating directly from it. there's no traffic at all, and also no network issues, just many pings in the tail of the backlog. now B gets promoted, A becomes a replica of B, and C remains a replica of A. when A gets demoted, it trims the pings from its backlog, and successfully replicates from B. however, C is still aware of these PINGs, when it'll disconnect and re-connect to A, it'll ask for something that's not in the backlog anymore (since A trimmed the tail of its backlog), and be forced to do a full sync (something it didn't have to do before the meaningful offset fix). Besides that, the psync2 test was always failing randomly here and there, it turns out the reason was PINGs. Investigating it shows the following scenario: cycle 1: redis #1 is master, and all the rest are direct replicas of #1 cycle 2: redis #2 is promoted to master, #1 is a replica of #2 and #3 is replica of #1 now we see that when #1 is demoted it prints: 17339:S 21 Apr 2020 11:16:38.523 * Using the meaningful offset 3929963 instead of 3929977 to exclude the final PINGs (14 bytes difference) 17339:S 21 Apr 2020 11:16:39.391 * Trying a partial resynchronization (request e2b3f8817735fdfe5fa4626766daa938b61419e5:3929964). 17339:S 21 Apr 2020 11:16:39.392 * Successful partial resynchronization with master. and when #3 connects to the demoted #2, #2 says: 17339:S 21 Apr 2020 11:16:40.084 * Partial resynchronization not accepted: Requested offset for secondary ID was 3929978, but I can reply up to 3929964 so the issue here is that the meaningful offset feature saved the day for the demoted master (since it needs to sync from a replica that didn't get the last ping), but it didn't help one of the other replicas which did get the last ping.	2020-04-27 15:52:23 +02:00
Oran Agra	fb732f7a94	optimize memory usage of deferred replies When a deferred reply is added the previous reply node cannot be used, so all the extra space we allocated in it is wasted. in case someone uses deferred replies in a loop, each time adding a small reply, each of these reply nodes (the small string reply) would have consumed a 16k block. now when we add another deferred reply node, we trim the unused portion of the previous reply block. see #7123	2020-04-24 17:20:28 +03:00
antirez	c6ccc2f690	Tracking: NOLOOP further implementation and fixes.	2020-04-21 17:29:18 +02:00
zhenwei pi	5010da6ab0	Threaded IO: set thread name for redis-server Set thread name for each thread of redis-server, this helps us to monitor the utilization and optimise the performance. As suggested by Salvatore, implement this feature for multiple platforms. Currently supports Linux and BSD; other OSes are ignored. An example on Linux: # top -d 5 -p `pidof redis-server ` -H PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 3682671 root 20 0 227744 8248 3836 R 99.2 0.0 0:19.53 redis-server 3682677 root 20 0 227744 8248 3836 S 26.4 0.0 0:04.15 io_thd_3 3682675 root 20 0 227744 8248 3836 S 23.6 0.0 0:03.98 io_thd_1 3682676 root 20 0 227744 8248 3836 S 23.6 0.0 0:03.97 io_thd_2 3682672 root 20 0 227744 8248 3836 S 0.2 0.0 0:00.02 bio_close_file 3682673 root 20 0 227744 8248 3836 S 0.2 0.0 0:00.02 bio_aof_fsync 3682674 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 bio_lazy_free 3682678 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd 3682682 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd 3682683 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd 3682684 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd 3682685 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd 3682687 root 20 0 227744 8248 3836 S 0.0 0.0 0:00.00 jemalloc_bg_thd Another example on FreeBSD-12.1: PID USERNAME PRI NICE SIZE RES STATE C TIME WCPU COMMAND 5212 root 100 0 48M 7280K CPU2 2 0:26 99.52% redis-server{redis-server} 5212 root 38 0 48M 7280K umtxn 4 0:06 26.94% redis-server{io_thd_3} 5212 root 36 0 48M 7280K umtxn 6 0:06 26.84% redis-server{io_thd_1} 5212 root 39 0 48M 7280K umtxn 1 0:06 25.30% redis-server{io_thd_2} 5212 root 20 0 48M 7280K uwait 3 0:00 0.00% redis-server{redis-server} 5212 root 21 0 48M 7280K uwait 2 0:00 0.00% redis-server{bio_close_file} 5212 root 21 0 48M 7280K uwait 3 0:00 0.00% redis-server{bio_aof_fsync} 5212 root 21 0 48M 7280K uwait 0 0:00 0.00% redis-server{bio_lazy_free} Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>	2020-04-18 11:17:21 +08:00
antirez	c9388ecc73	RESP3: fix HELLO map len in Sentinel mode. See #6160.	2020-04-16 11:21:52 +02:00
antirez	7f794a122f	Fix HELLO reply in Sentinel mode, see #6160 .	2020-04-15 15:59:52 +02:00
antirez	f69876280c	Speedup INFO by counting client memory incrementally. Related to #5145. Design note: clients may change type when they turn into replicas or are moved into the Pub/Sub category and so forth. Moreover the recomputation of the bytes used is problematic for obvious reasons: it changes continuously, so as a conservative way to avoid accumulating errors, each client remembers the contribution it gave to the sum, and removes it when it is freed or before updating it with the new memory usage.	2020-04-07 12:07:54 +02:00
Salvatore Sanfilippo	8d6dc136e1	Merge pull request #7041 from hwware/trackingfix CLIENT TRACKING Command Fix: Add More Checking for OPTIN/OPTOUT mode	2020-04-02 11:22:30 +02:00
antirez	1d13ff0b39	Minor changes to #7037 .	2020-03-31 17:10:09 +02:00
Guy Benoish	d6eb3afd13	Modules: Test MULTI/EXEC replication of RM_Replicate Make sure call() doesn't wrap replicated commands with a redundant MULTI/EXEC Other, unrelated changes: 1. Formatting compiler warning in INFO CLIENTS 2. Use CLIENT_ID_AOF instead of UINT64_MAX	2020-03-31 13:55:51 +03:00
hwware	86c76cad74	add check for not switching between optin optout mode directly	2020-03-29 23:20:54 -04:00
hwware	1ef44f8243	add check for not providing both optin optout flag	2020-03-29 23:06:50 -04:00
Yossi Gottlieb	2dab5015b7	Fix crashes related to failed/rejected accepts.	2020-03-22 14:47:44 +02:00
antirez	f9c56dbb09	ACL: default user off should not allow automatic authentication. This fixes issue #7011.	2020-03-20 12:45:48 +01:00
Salvatore Sanfilippo	15338ab694	Merge pull request #6991 from soloestoy/io-threads-bugfix Threaded IO: bugfix client kill may crash redis	2020-03-16 10:53:54 +01:00
zhaozhao.zz	a6a0e05a1a	Threaded IO: bugfix client kill may crash redis	2020-03-15 23:30:25 +08:00
antirez	573c4673ee	Aesthetic changes in PR #6989 .	2020-03-15 16:10:37 +01:00
zhaozhao.zz	606a01df70	Threaded IO: bugfix #6988 process events while blocked	2020-03-15 22:07:36 +08:00
antirez	b6378edcd6	Tracking: optin/out implemented.	2020-02-24 19:09:45 +01:00
Madelyn Olson	c1b5220ee7	Give an error message if you specify redirect twice	2020-02-16 05:41:39 -08:00
Madelyn Olson	d63a435362	Minor CSC fixes and fixed documentation	2020-02-16 05:28:29 -08:00
antirez	47177c9edc	Tracking: fix operators precedence error in bcast check.	2020-02-14 14:27:45 +01:00
antirez	6922ccc0b9	Tracking: fix sending messages bug + tracking off bug.	2020-02-13 16:58:07 +01:00
antirez	40194a2a68	Tracking: BCAST: basic feature now works.	2020-02-12 19:22:04 +01:00
antirez	3f7ba86255	Tracking: BCAST: registration in the prefix table.	2020-02-11 17:26:29 +01:00
antirez	dfe126f3e9	Tracking: BCAST: parsing of the options + skeleton.	2020-02-10 17:18:11 +01:00
Salvatore Sanfilippo	7cf53252ee	Merge pull request #6849 from oranagra/free_client_mutex freeClientAsync don't lock mutex if there's just one thread	2020-02-06 10:29:46 +01:00
Guy Benoish	91c41b6dde	Some refactoring using getClientType instead of CLIENT_SLAVE	2020-02-06 14:12:08 +05:30
Guy Benoish	e33fffbde1	Fix small bugs related to replica and monitor ambiguity 1. server.repl_no_slaves_since can be set when a MONITOR client disconnects 2. c->repl_ack_time can be set by a newline from a MONITOR client 3. Improved comments	2020-02-06 14:10:02 +05:30
Oran Agra	86e302f5f3	freeClientAsync don't lock mutex if there's just one thread	2020-02-06 10:07:17 +02:00
antirez	7d5fb5df3f	Setting N I/O threads should mean N-1 additional + 1 main thread.	2020-01-13 18:53:36 +01:00
antirez	d28948b143	Port PR #6110 to new connection object code.	2020-01-13 12:54:39 +01:00
antirez	a8c912ead6	A few comments about main thread serving I/O as well. Related to #6110.	2020-01-13 12:50:26 +01:00
Salvatore Sanfilippo	24896427fc	Merge pull request #6110 from soloestoy/enhance-io-threaded Enhance IO Threaded: use main thread to handle read/write work	2020-01-13 12:46:40 +01:00
antirez	6e4f70b817	Fix duplicated CLIENT SETNAME reply. Happened when we set the name to "" to cancel the name. Was introduced during the RESP3 refactoring. See #6036.	2019-12-29 15:44:59 +01:00
antirez	5521910de7	Inline protocol: handle empty strings well. This bug is from the first version of Redis. Probably the problem here is that before we used an SDS split function that created empty strings for additional spaces, like in "SET foo bar". AFAIK later we replaced it with the current sdssplitarg() API that has no such problem. As a result, we introduced a bug, where it is no longer possible to do something like: SET foo "" Using the inline protocol. Now it is fixed.	2019-12-29 15:40:40 +01:00
Madelyn Olson	034dcf185c	Add module APIs for custom authentication	2019-12-17 06:59:59 +00:00
Salvatore Sanfilippo	bbdbfabf9e	Merge pull request #6531 from oranagra/rm_save_long_double Module API for loading and saving long double	2019-11-04 17:33:35 +01:00
Oran Agra	779aebc91c	Module API for loading and saving long double looks like each platform implements long double differently (different bit count) so we can't save them as binary, and we also want to avoid creating a new RDB format version, so we save these as hex strings using "%La". This commit includes a change in the arguments of ld2string to support this, as well as tests for coverage and short reads. coded by @guybe7	2019-11-03 16:42:31 +02:00
Oran Agra	51c3ff8d75	Modules hooks: complete missing hooks for the initial set of hooks * replication hooks: role change, master link status, replica online/offline * persistence hooks: saving, loading, loading progress * misc hooks: cron loop, shutdown, module loaded/unloaded * change the way hooks test work, and add tests for all of the above startLoading() now gets flag indicating what is loaded. stopLoading() now gets an indication of success or failure. adding startSaving() and stopSaving() with similar args and role.	2019-10-29 17:59:09 +02:00
antirez	8cccc6b468	Modules hooks: don't call the hook for fake clients.	2019-10-23 18:39:53 +02:00
antirez	94c50314f1	Modules hooks: fix sub event in calls from client change.	2019-10-23 18:39:53 +02:00
antirez	019ac37e9c	Modules hooks: fix define / linker issues. Implement one test event.	2019-10-23 18:39:53 +02:00
Yossi Gottlieb	0db3b0a0ff	Merge remote-tracking branch 'upstream/unstable' into tls	2019-10-16 17:08:07 +03:00
Oran Agra	6b6294807c	TLS: Implement support for write barrier.	2019-10-07 21:06:30 +03:00
Oran Agra	5a47794606	diskless replication rdb transfer uses pipe, and writes to sockets from the parent process. misc: - handle SSL_has_pending by iterating through these in beforeSleep, and setting timeout of 0 to aeProcessEvents - fix issue with epoll signaling EPOLLHUP and EPOLLERR only to the write handlers. (needed to detect the rdb pipe was closed) - add key-load-delay config for testing - trim connShutdown which is no longer needed - rioFdsetWrite -> rioFdWrite - simplified since there's no longer need to write to multiple FDs - don't detect rdb child exited (don't call wait3) until we detect the pipe is closed - Cleanup bad optimization from rio.c, add another one	2019-10-07 21:06:30 +03:00
Yossi Gottlieb	b087dd1db6	TLS: Connections refactoring and TLS support. * Introduce a connection abstraction layer for all socket operations and integrate it across the code base. * Provide an optional TLS connections implementation based on OpenSSL. * Pull a newer version of hiredis with TLS support. * Tests, redis-cli updates for TLS support.	2019-10-07 21:06:13 +03:00
antirez	cb384127d1	RESP3: Use verbatim in CLIENT LIST.	2019-09-18 18:52:13 +02:00
antirez	e57067b677	More strict checks and better comments in flushSlaveOutputBuffers(). Related to #6296.	2019-08-31 14:46:24 +02:00
antirez	2303ba1441	Merge branch 'unstable' of github.com:/antirez/redis into unstable	2019-08-31 14:41:28 +02:00
antirez	ca5ccc58c5	Improve comment in flushSlavesOutputBuffers().	2019-08-31 14:40:09 +02:00
zhaozhao.zz	d3f7eb0c87	networking: flushSlavesOutputBuffers bugfix	2019-08-02 17:17:19 +08:00
Salvatore Sanfilippo	3f1c84751a	Merge pull request #6196 from AngusP/resp3 [RESP3, Minor] in networking.c double representation for -infinity leaves out comma	2019-07-12 13:08:46 +02:00
antirez	c7aaf8db4d	Client side caching: implement CLIENT GETREDIR. This subcommand may simplify the writing of Redis client libraries using the tracking feature and/or improve observability and debugging capabilities.	2019-07-10 18:17:10 +02:00
antirez	46edb55de9	Client side caching: implement trackingInvalidateKey().	2019-07-03 19:16:20 +02:00
antirez	db16a861a1	Client side caching: CLIENT TRACKING subcommand.	2019-07-03 11:58:20 +02:00
Angus Pearson	593f6656c1	RESP3 double representation for -infinity is `,-inf\r\n`, not `-inf\r\n`	2019-07-02 14:28:48 +01:00
antirez	45d64f229e	Client side caching: fields and flags for tracking mode.	2019-06-29 20:08:41 -04:00
zhaozhao.zz	1398fac3f1	Threaded IO: use main thread to handle read work	2019-05-21 11:42:10 +08:00
zhaozhao.zz	8b33975944	Threaded IO: use main thread to handle write work	2019-05-21 11:37:13 +08:00
antirez	9724ca4e22	Make comment in getClientOutputBufferMemoryUsage() describing the present.	2019-05-13 17:30:06 +02:00
antirez	3d053dbb6d	Threaded IO: handleClientsWithPendingReadsUsingThreads top comment.	2019-05-06 18:02:51 +02:00
antirez	1c0c436757	Threaded IO: ability to disable reads from threaded path.	2019-05-06 18:02:51 +02:00
antirez	90d720e7a5	Threaded IO: put fflush() inside tio_debug conditional.	2019-05-06 18:02:51 +02:00
antirez	6ab6a97fe6	Threaded IO: parsing WIP 2: refactoring to parse from thread.	2019-05-06 18:02:51 +02:00
antirez	647a66ebba	Threaded IO: parsing WIP 1: set current_client in a better scoped way.	2019-05-06 18:02:51 +02:00
antirez	463ccf8664	Threaded IO: logging should be safe in I/O threads. Potentially it is possible that we get interleaved writes, even if serverLog() makes sure to write into a buffer and then use printf(), so even this should be ok. However in general POSIX guarantees that writing to the same file pointer object from multiple threads is safe. Anyway currently we reopen the file at each call, but for the standard output logging. The logging functions actually also access global configuration while performing the log (for instance in order to check the log level, the log filename and so forth), however during the I/O threads execution we cannot alter such shared state in any way.	2019-05-06 18:02:51 +02:00
antirez	8d7d2be24f	Threaded IO: process read queue before stopping threads.	2019-05-06 18:02:51 +02:00
antirez	63a0ffd36a	Threaded IO: read side WIP 3.	2019-05-06 18:02:51 +02:00
antirez	a2245f8ff1	Threaded IO: read side WIP 2.	2019-05-06 18:02:51 +02:00
antirez	dd5b105c73	Threaded IO: read side WIP.	2019-05-06 18:02:51 +02:00
antirez	74591fb5bd	Threaded IO: hide more debugging printfs under conditional.	2019-05-06 18:02:51 +02:00
antirez	9814b2a5f3	Threaded IO: make num of I/O threads configurable.	2019-05-06 18:02:51 +02:00
antirez	30091dc29f	Threaded IO: use main thread if num of threads is 1.	2019-05-06 18:02:51 +02:00
Ubuntu	9bf7f302a7	Threaded IO: stop threads when no longer needed + C11 in Makefile. Now threads are stopped even when the connections drop immediately to zero, not allowing the networking code to detect the condition and stop the threads. serverCron() will handle that.	2019-05-06 18:02:51 +02:00
antirez	ea35a81c42	Threaded IO: 3rd version: use the mutex only to stop the thread.	2019-05-06 18:02:51 +02:00
antirez	6f4f36c0fb	Threaded IO: second attempt without signaling conditions.	2019-05-06 18:02:51 +02:00
antirez	a2dbd9bd97	Threaded IO: allow to disable debug printf.	2019-05-06 18:02:51 +02:00
antirez	f468e653b5	Threaded IO: implement handleClientsWithPendingWritesUsingThreads(). This is just an experiment for now, there are a couple of race conditions, mostly harmless for the performance gain experiment that this commit represents so far. The general idea here is to take Redis single threaded and instead fan-out on expensive kernel calls: write(2) in this case, but the same concept could be easily implemented for read(2) and protocol parsing. However just threading writes like in this commit is enough to evaluate if the approach is sound.	2019-05-06 18:02:51 +02:00
Salvatore Sanfilippo	843de8b786	Merge pull request #5971 from devnexen/unstable build fix	2019-04-26 17:32:21 +02:00
David Carlier	4de88828d9	build fix	2019-03-28 06:38:16 +00:00
Oran Agra	acba2fc9b4	slave corrupts replication stream when module blocked client uses large reply (or POSTPONED_ARRAY) when redis appends the blocked client reply list to the real client, it didn't bother to check if it is in fact the master client. so a slave executing that module command will send replies to the master, causing the master to send the slave error responses, which will mess up the replication offset (slave will advance its replication offset, and the master does not)	2019-03-24 14:17:37 +02:00
Oran Agra	29b0a57695	diskless fork kept streaming RDB to a disconnected slave	2019-03-21 20:24:52 +02:00
antirez	68c75f248e	Gopher: reply in gopher mode only if argv[0] starts with slash. As documented but never implemented.	2019-02-27 22:20:31 +01:00
antirez	a7780f716e	Merge branch 'gopher' into unstable	2019-02-25 18:16:58 +01:00
antirez	21f92e9e34	RESP3: SETNAME option for HELLO.	2019-02-25 16:56:58 +01:00
antirez	d4d15315a8	RESP3: AUTH option for HELLO.	2019-02-25 16:55:16 +01:00
antirez	5748439770	RESP3: refactoring of CLIENT SETNAME to implement SETNAME in HELLO.	2019-02-25 16:51:49 +01:00
antirez	3b420034bb	RESP3: allow HELLO to be used with version = 2.	2019-02-25 16:41:00 +01:00
antirez	87594a7470	ACL: move AUTH implementation in acl.c.	2019-02-25 16:33:38 +01:00
antirez	e00b22e090	Gopher: initial request handling.	2019-02-21 23:13:08 +01:00
Madelyn Olson	9131fc56d6	Refactored manual computation of object length	2019-02-21 21:35:00 +00:00
antirez	d5e4a7f439	ACL: when client->user is NULL the client is a superuser. Related to #5832.	2019-02-12 09:44:30 +01:00
zhaozhao.zz	0f42447a0e	ACL: show client's user	2019-02-12 16:03:58 +08:00
antirez	c8391388c2	ACL: remove server.requirepass + some refactoring.	2019-01-18 11:49:30 +01:00
antirez	35fe59935e	ACL: automatically authenticate the nopass default user.	2019-01-15 17:57:49 +01:00
antirez	aced0328e3	ACL: avoid a radix tree lookup for the default user.	2019-01-11 11:32:41 +01:00
antirez	4278104acc	ACL: add a reference to the user in each client.	2019-01-10 16:34:13 +01:00
antirez	f5d918b2bb	ACL: HELLO should stop if the user is not authenticated.	2019-01-09 17:00:30 +01:00
antirez	709a6612eb	RESP3: addReplyString() -> addReplyProto(). The function naming was totally nuts. Let's fix it as we break PRs anyway with RESP3 refactoring and changes.	2019-01-09 17:00:30 +01:00
antirez	e291170385	RESP3: verbatim reply API + DEBUG PROTOCOL support.	2019-01-09 17:00:30 +01:00
antirez	8042afb246	RESP3: Fix addReplyBool() RESP2/3 output.	2019-01-09 17:00:30 +01:00
antirez	809e3a44a7	RESP3: addReplyBool() implemented.	2019-01-09 17:00:29 +01:00
antirez	4f0860cbfd	RESP3: initial implementation of the HELLO command.	2019-01-09 17:00:29 +01:00
antirez	1a17cdfadf	RESP3: addReplyNullArray() added for better RESP2 compat.	2019-01-09 17:00:29 +01:00
antirez	317f8b9d38	RESP3: most null replies converted.	2019-01-09 17:00:29 +01:00
antirez	1b7298e66a	RESP3: addReplyNull() added.	2019-01-09 17:00:29 +01:00
antirez	13966522ea	RESP3: bring RESP2 compatibility to previous changes.	2019-01-09 17:00:29 +01:00
antirez	e14aabf936	RESP3: addReply*Len() support for RESP2 backward comp.	2019-01-09 17:00:29 +01:00
antirez	1ac6926647	RESP3: put RESP version in the client structure.	2019-01-09 17:00:29 +01:00
antirez	57c5a766a2	RESP3: Aggregate deferred lengths functions.	2019-01-09 17:00:29 +01:00
antirez	914ee43108	RESP3: Double replies and aggregate lengths initial functions.	2019-01-09 17:00:29 +01:00
antirez	03e2bb0cfd	Crashing is too much in addReplyErrorLength(). See #5663.	2018-12-11 17:50:18 +01:00
zhaozhao.zz	28c4281495	networking: current_client should not be NULL when trim qb_pos	2018-12-07 19:14:33 +08:00
Madelyn Olson	e2c1f80b46	Fixed a serverPanic when sending an invalid command to a monitor client	2018-12-04 07:17:17 +00:00
antirez	0c875c7751	asyncCloseClientOnOutputBufferLimitReached(): don't free fake clients. Fake clients are used in special situations and are not linked to the normal clients list, freeing them will always result in Redis crashing in one way or the other. It's not common to send replies to fake clients, but we have one usage in the modules API. When a client is blocked, we associate to the blocked client object (that is safe to manipulate in a thread), a fake client that accumulates replies. So because of this bug there was the problem described in issue #5443. The fix was verified to work with the provided example module. To write a regression is very hard and unlikely to be triggered in the future.	2018-10-30 13:38:41 +01:00
zhaozhao.zz	35b7296ff4	Avoid recreate write handler for protected client.	2018-10-09 20:34:11 +08:00
antirez	8e2bbe9105	Free protected clients asynchronously. Related to #4840. Note that when we re-enter the event loop with aeProcessEvents() we don't process timers, nor before/after sleep callbacks, so we should never end calling freeClientsInAsyncFreeQueue() when re-entering the loop.	2018-10-09 13:28:51 +02:00
antirez	69c30965eb	Introduce protectClient() + some refactoring. The idea is to have an API for the cases like -BUSY state and DEBUG RELOAD where we have to manually deinstall the read handler. See #4804.	2018-10-09 13:15:41 +02:00
antirez	cff5f36d94	Slave removal: networking.c logs fixed.	2018-09-11 15:32:28 +02:00
antirez	6f3d357d8f	Slave removal: slave -> replica in redis.conf and output buffer option.	2018-09-11 15:32:28 +02:00
antirez	4e5e0d3719	Clarify why remaining may be zero in readQueryFromClient(). See #5304.	2018-09-04 13:29:27 +02:00
Salvatore Sanfilippo	2ef829d65c	Merge pull request #5304 from soloestoy/fix-unexpected-readlen networking: fix unexpected negative or zero readlen	2018-09-04 13:25:28 +02:00
Salvatore Sanfilippo	d60c17cbb3	Merge pull request #5315 from soloestoy/optimize-parsing-large-bulk networking: optimize parsing large bulk greater than 32k	2018-09-04 12:49:50 +02:00
antirez	6c001bfc0d	Unblocked clients API refactoring. See #4418 .	2018-09-03 18:39:18 +02:00
Salvatore Sanfilippo	2b689ad641	Merge pull request #4418 from soloestoy/fix-multiple-unblock fix multiple unblock for clientsArePaused()	2018-09-03 18:31:02 +02:00
antirez	3e7349fdaf	Make pending buffer processing safe for CLIENT_MASTER client. Related to #5305.	2018-09-03 18:17:31 +02:00
zhaozhao.zz	247d2a734b	networking: optimize parsing large bulk greater than 32k If we are going to read a large object from network try to make it likely that it will start at c->querybuf boundary so that we can optimize object creation avoiding a large copy of data. But only when the data we have not parsed is less than or equal to ll+2. If the data length is greater than ll+2, trimming querybuf is just a waste of time, because at this time the querybuf contains not only our bulk. It's easy to reproduce that: Time1: call `client pause 10000` on slave. Time2: redis-benchmark -t set -r 10000 -d 33000 -n 10000. Then slave hung after 10 seconds.	2018-09-04 00:02:25 +08:00
zhaozhao.zz	e3dfd8c811	fix multiple unblock for clientsArePaused()	2018-09-03 14:26:14 +08:00
antirez	7fa493912e	After slave Lua script leaves busy state, re-process the master buffer. Technically speaking we don't really need to put the master client in the clients that need to be processed, since in practice the PING commands from the master will take care, however it is conceptually more sane to do so.	2018-08-31 16:45:02 +02:00
antirez	9ab91b8c6c	While the slave is busy, just accumulate master input. Processing command from the master while the slave is in busy state is not correct, however we cannot, also, just reply -BUSY to the replication stream commands from the master. The correct solution is to stop processing data from the master, but just accumulate the stream into the buffers and resume the processing later. Related to #5297.	2018-08-31 16:45:02 +02:00
zhaozhao.zz	dce7cefb7c	networking: fix unexpected negative or zero readlen To avoid copying buffers when creating a large Redis Object that exceeds PROTO_IOBUF_LEN (32KB), we just read the remaining data we need, which may be less than PROTO_IOBUF_LEN. But the remaining len may be zero, if the bulklen+2 equals sdslen(c->querybuf), in client pause context. For example: Time1: python >>> import os, socket >>> server="127.0.0.1" >>> port=6379 >>> data1="*3\r\n$3\r\nset\r\n$1\r\na\r\n$33000\r\n" >>> data2="".join("x" for _ in range(33000)) + "\r\n" >>> data3="\n\n" >>> s = socket.socket(socket.AF_INET, socket.SOCK_STREAM) >>> s.settimeout(10) >>> s.connect((server, port)) >>> s.send(data1) 28 Time2: redis-cli client pause 10000 Time3: >>> s.send(data2) 33002 >>> s.send(data3) 2 >>> s.send(data3) Traceback (most recent call last): File "<stdin>", line 1, in <module> socket.error: [Errno 104] Connection reset by peer To fix that, we should check if remaining is greater than zero.	2018-08-31 20:02:09 +08:00
zhaozhao.zz	f2ad89a314	networking: make setProtocolError simple and clear Function setProtocolError just records protocol error details in the server log and sets the client as CLIENT_CLOSE_AFTER_REPLY. It doesn't care about querybuf sdsrange, because we will do it after protocol parsing.	2018-08-23 12:21:28 +08:00
zhaozhao.zz	ef2a95c461	networking: just move qb_pos instead of sdsrange in processInlineBuffer	2018-08-14 14:50:37 +08:00
zhaozhao.zz	e623bd22ba	networking: just return C_OK if multibulk processing saw a <= 0 length.	2018-08-14 13:55:30 +08:00
zhaozhao.zz	14c4ddb5a6	pipeline: do not sdsrange querybuf unless all commands processed This is an optimization for processing pipeline, we discussed a problem in issue #5229: clients may be paused if we apply `CLIENT PAUSE` command, and then querybuf may grow too large, the cost of memmove in sdsrange after parsing a completed command will be horrible. The optimization is to parse all commands in querybuf first, and then call sdsrange only once.	2018-08-14 00:43:42 +08:00
antirez	313b2240ae	In addReplyErrorLength() only panic when replying to slave. See #5135 for more context.	2018-07-18 17:41:16 +02:00
antirez	6183f0590d	Refine comment in addReplyErrorLength() about replying to masters/slaves. See #5135 for some context.	2018-07-18 17:40:07 +02:00
antirez	afc7e08a20	Panic when we are sending an error to our master/slave. Related to #5135, see discussion there.	2018-07-17 17:42:30 +02:00
Oran Agra	d55598988b	fix rare replication stream corruption with disk-based replication The slave sends \n keepalive messages to the master while parsing the rdb, and later sends REPLCONF ACK once a second. rarely, the master receives both a linefeed char and a REPLCONF in the same read, \n*3\r\n$8\r\nREPLCONF\r\n... and it tries to trim two chars (\r\n) from the query buffer, trimming the '*' from *3\r\n$8\r\nREPLCONF\r\n... then the master tries to process a command starting with '3' and replies to the slave a bunch of -ERR and one +OK. although the slave silently ignores these (prints a log message), this corrupts the replication offset at the slave since the slave increases the replication offset, and the master did not. other than the fix in processInlineBuffer, i did several other improvements while hunting this very rare bug. - when redis replies with "unknown command" it includes a portion of the arguments, not just the command name. so it would be easier to understand what was received, in my case, on the slave side, it was -ERR, but the "arguments" were the interesting part (containing info on the error). - about a year ago i added code in addReplyErrorLength to print the error to the log in case of a reply to master (since this string isn't actually transmitted to the master), now changed that block to print a similar log message to indicate an error being sent from the master to the slave. note that the slave is marked as CLIENT_SLAVE only after PSYNC was received, so this will not cause any harm for REPLCONF, and will only indicate problems that are gonna corrupt the replication stream anyway. - two places where c->reply was emptied, and i wanted to reset sentlen this is a precaution (i did not actually see such a problem), since a non-zero sentlen will cause corruption to be transmitted on the socket.	2018-07-17 12:51:49 +03:00
antirez	f9c84d6d39	Hopefully improve commenting of #5126 . Reading the PR gave me the opportunity to better specify what the code was doing in places where I was not immediately sure about what was going on. Moreover I documented the structure in server.h so that people reading the header file will immediately understand what the structure is useful for.	2018-07-16 17:56:54 +02:00
Oran Agra	bf680b6f8c	slave buffers were wasteful and incorrectly counted causing eviction A) slave buffers didn't count internal fragmentation and sds unused space, this caused them to induce eviction although we didn't mean for it. B) slave buffers were consuming about twice the memory of what they actually needed. - this was mainly due to sdsMakeRoomFor growing to twice as much as needed each time but networking.c not storing more than 16k (partially fixed recently in 237a38737). - besides it wasn't able to store half of the new string into one buffer and the other half into the next (so the above mentioned fix helped mainly for small items). - lastly, the sds buffers had up to 30% internal fragmentation that was wasted, consumed but not used. C) inefficient performance due to starting from a small string and reallocing many times. what i changed: - creating dedicated buffers for reply list, counting their size with zmalloc_size - when creating a new reply node from, preallocate it to at least 16k. - when appending a new reply to the buffer, first fill all the unused space of the previous node before starting a new one. other changes: - expose mem_not_counted_for_evict info field for the benefit of the test suite - add a test to make sure slave buffers are counted correctly and that they don't cause eviction	2018-07-16 16:43:42 +03:00
dejun.xdj	61f12973f7	Bugfix: PEL is incorrect when consumer is blocked using xreadgroup with NOACK option. Save NOACK option into client.blockingState structure.	2018-07-09 13:40:29 +02:00
dejun.xdj	289d8d9c2c	CLIENT UNBLOCK: fix client unblock help message.	2018-07-09 13:03:57 +02:00
WuYunlong	0a5805d7f1	fix compile warning in addReplySubcommandSyntaxError	2018-07-09 12:57:12 +02:00
antirez	2edcafb35d	addReplySubSyntaxError() renamed to addReplySubcommandSyntaxError().	2018-07-02 18:49:34 +02:00
Salvatore Sanfilippo	bc6a004588	Merge pull request #4998 from itamarhaber/module_command_help Module command help	2018-07-02 18:46:56 +02:00
antirez	d751d98b50	Change CLIENT LIST TYPE help string. Making it more similar to KILL.	2018-06-29 18:03:00 +02:00
zhaozhao.zz	b9cbd04b57	clients: add type option for client list	2018-06-28 17:43:05 +08:00
zhaozhao.zz	f5538642cc	clients: show pubsub flag in client list	2018-06-28 17:28:38 +08:00
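
The log excerpt quoted in commit 4447ddc8bb ("meaningful offset 3929963 instead of 3929977 ... 14 bytes difference", followed by a psync request at 3929964) can be checked with a little arithmetic. The sketch below is not Redis code. It assumes that a replicated PING travels as the RESP array `*1\r\n$4\r\nPING\r\n`, and `meaningful_offset` is a hypothetical helper name used only for illustration.

```python
# Hypothetical sketch, not taken from the Redis source.
# Assumption: a replicated PING is encoded as this 14-byte RESP array.
PING = b"*1\r\n$4\r\nPING\r\n"


def meaningful_offset(end_offset: int, backlog_tail: bytes) -> int:
    """Return end_offset minus the bytes of all whole PINGs ending the tail."""
    offset = end_offset
    while backlog_tail.endswith(PING):
        backlog_tail = backlog_tail[: -len(PING)]
        offset -= len(PING)
    return offset


# A backlog tail holding one real write followed by a single trailing PING.
tail = b"*3\r\n$3\r\nSET\r\n$1\r\na\r\n$1\r\nb\r\n" + PING
offset = meaningful_offset(3929977, tail)

# The psync request in the quoted log asks for the byte after that offset.
print(len(PING), offset, offset + 1)
```

Under that assumption the numbers line up with the quoted log: one trailing PING accounts for the 14-byte difference, and the partial resynchronization is requested from offset 3929964.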


555 Commits