redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-23 08:38:27 -05:00

Author	SHA1	Message	Date
antirez	292fec058a	PSYNC initial offset fix. This commit attempts to fix a bug involving PSYNC and diskless replication (currently experimental) found by Yuval Inbar from Redis Labs and that was later found to have even more far reaching effects (the bug also exists when diskstore is off). The gist of the bug is that, a Redis master replies with +FULLRESYNC to a PSYNC attempt that fails and requires a full resynchronization. However, the baseline offset sent along with FULLRESYNC was always the current master replication offset. This is not ok, because there are many reasosn that may delay the RDB file creation. And... guess what, the master offset we communicate must be the one of the time the RDB was created. So for example: 1) When the BGSAVE for replication is delayed since there is one already but is not good for replication. 2) When the BGSAVE is not needed as we attach one currently ongoing. 3) When because of diskless replication the BGSAVE is delayed. In all the above cases the PSYNC reply is wrong and the slave may reconnect later claiming to need a wrong offset: this may cause data curruption later.	2015-08-04 17:06:10 +02:00
antirez	c1e94b6b9c	Force slaves to resync after unsuccessful PSYNC. Using chained replication where C is slave of B which is in turn slave of A, if B reconnects the replication link with A but discovers it is no longer possible to PSYNC, slaves of B must be disconnected and PSYNC not allowed, since the new B dataset may be completely different after the synchronization with the master. Note that there are varius semantical differences in the way this is handled now compared to the past. In the past the semantics was: 1. When a slave lost connection with its master, disconnected the chained slaves ASAP. Which is not needed since after a successful PSYNC with the master, the slaves can continue and don't need to resync in turn. 2. However after a failed PSYNC the replication backlog was not reset, so a slave was able to PSYNC successfully even if the instance did a full sync with its master, containing now an entirely different data set. Now instead chained slaves are not disconnected when the slave lose the connection with its master, but only when it is forced to full SYNC with its master. This means that if the slave having chained slaves does a successful PSYNC all its slaves can continue without troubles. See issue #2694 for more details.	2015-07-28 16:35:02 +02:00
antirez	278ea9d16b	replicationHandleMasterDisconnection() belongs to replication.c.	2015-07-28 14:36:50 +02:00
antirez	32f80e2f1b	RDMF: More consistent define names.	2015-07-27 14:37:58 +02:00
antirez	40eb548a80	RDMF: REDIS_OK REDIS_ERR -> C_OK C_ERR.	2015-07-26 23:17:55 +02:00
antirez	2d9e3eb107	RDMF: redisAssert -> serverAssert.	2015-07-26 15:29:53 +02:00
antirez	14ff572482	RDMF: OBJ_ macros for object related stuff.	2015-07-26 15:28:00 +02:00
antirez	554bd0e7bd	RDMF: use client instead of redisClient, like Disque.	2015-07-26 15:20:52 +02:00
antirez	424fe9afd9	RDMF: redisLog -> serverLog.	2015-07-26 15:17:43 +02:00
antirez	cef054e868	RDMF (Redis/Disque merge friendlyness) refactoring WIP 1.	2015-07-26 15:17:18 +02:00
antirez	8366907bed	Use best effort address binding to connect to the master We usually want to reach the master using the address of the interface Redis is bound to (via the "bind" config option). That's useful since the master will get (and publish) the slave address getting the peer name of the incoming socket connection from the slave. However, when this is not possible, for example because the slave is bound to the loopback interface but repliaces from a master accessed via an external interface, we want to still connect with the master even from a different interface: in this case it is not really important that the master will provide any other address, while it is vital to be able to replicate correctly. Related to issues #2609 and #2612.	2015-06-11 14:34:38 +02:00
antirez	6c60526db9	Net: improve prepareClientToWrite() error handling and comments. When we fail to setup the write handler it does not make sense to take the client around, it is missing writes: whatever is a client or a slave anyway the connection should terminated ASAP. Moreover what the function does exactly with its return value, and in which case the write handler is installed on the socket, was not clear, so the functions comment are improved to make the goals of the function more obvious. Also related to #2485.	2015-04-01 10:07:45 +02:00
Oran Agra	159875b5a3	fixes to diskless replication. master was closing the connection if the RDB transfer took long time. and also sent PINGs to the slave before it got the initial ACK, in which case the slave wouldn't be able to find the EOF marker.	2015-03-31 23:42:08 +03:00
antirez	c3ad70901f	Replication: disconnect blocked clients when switching to slave role. Bug as old as Redis and blocking operations. It's hard to trigger since only happens on instance role switch, but the results are quite bad since an inconsistency between master and slave is created. How to trigger the bug is a good description of the bug itself. 1. Client does "BLPOP mylist 0" in master. 2. Master is turned into slave, that replicates from New-Master. 3. Client does "LPUSH mylist foo" in New-Master. 4. New-Master propagates write to slave. 5. Slave receives the LPUSH, the blocked client get served. Now Master "mylist" key has "foo", Slave "mylist" key is empty. Highlights: * At step "2" above, the client remains attached, basically escaping any check performed during command dispatch: read only slave, in that case. * At step "5" the slave (that was the master), serves the blocked client consuming a list element, which is not consumed on the master side. This scenario is technically likely to happen during failovers, however since Redis Sentinel already disconnects clients using the CLIENT command when changing the role of the instance, the bug is avoided in Sentinel deployments. Closes #2473.	2015-03-24 16:00:09 +01:00
antirez	c5dd686ecb	Replication: put server.master client creation into separated function.	2015-02-04 11:26:20 +01:00
antirez	ce269ad3c5	AnetFormatIP(): renamed, commented, now sticks to IP:port format. A few code style changes + consistent format: not nice for humans but better for parsers.	2014-12-11 18:20:30 +01:00
Matt Stancliff	491881e13b	Cleanup all IP formatting code Instead of manually checking for strchr(n,':') everywhere, we can use our new centralized IP formatting functions.	2014-12-11 10:12:18 -05:00
antirez	1b732c09d0	Network bandwidth tracking + refactoring. Track bandwidth used by clients and replication (but diskless replication is not tracked since the actual transfer happens in the child process). This includes a refactoring that makes tracking new instantaneous metrics simpler.	2014-12-03 12:16:25 +01:00
antirez	bb7fea0d5c	Diskless SYNC: fix RDB EOF detection. RDB EOF detection was relying on the final part of the RDB transfer to be a magic 40 bytes EOF marker. However as the slave is put online immediately, and because of sockets timeouts, the replication stream is actually contiguous with the RDB file. This means that to detect the EOF correctly we should either: 1) Scan all the stream searching for the mark. Sucks CPU-wise. 2) Start to send the replication stream only after an acknowledge. 3) Implement a proper chunked encoding. For now solution "2" was picked, so the master does not start to send ASAP the stream of commands in the case of diskless replication. We wait for the first REPLCONF ACK command from the slave, that certifies us that the slave correctly loaded the RDB file and is ready to get more data.	2014-11-11 17:12:12 +01:00
antirez	f5c6ebbfe3	Disconnect timedout slave: regression introduced with diskless repl.	2014-11-11 15:10:58 +01:00
Matt Stancliff	0014966c1e	Networking: add more outbound IP binding fixes Same as the original bind fixes (we just missed these the first time around). This helps Redis not automatically send connections from the first IP on an interface if we are bound to a specific IP address (e.g. with multiple IP aliases on one interface, you want to send from _your_ IP, not from the first IP on the interface).	2014-10-29 15:09:09 -04:00
antirez	9ec22d9223	Diskless replication: missing listRewind() added. This caused BGSAVE to be triggered a second time without any need when we switch from socket to disk target via the command CONFIG SET repl-diskless-sync no and there is already a slave waiting for the BGSAVE to start. Also comments clarified about what is happening.	2014-10-29 12:48:22 +01:00
antirez	4b8f4b90b9	Log slave ip:port in more log messages.	2014-10-27 12:30:07 +01:00
antirez	8a416ca46e	Added a function to get slave name for logs.	2014-10-27 11:58:20 +01:00
antirez	a27befc495	Diskless replication: log BGSAVE delay only when it is non-zero.	2014-10-27 10:48:39 +01:00
antirez	707352439c	Diskless sync delay is now configurable.	2014-10-27 10:36:30 +01:00
antirez	c4dbc7cdec	Remove duplicated log message about starting BGSAVE.	2014-10-24 10:38:42 +02:00
antirez	456003af25	Diskless replication: less debugging printfs around.	2014-10-17 17:11:48 +02:00
antirez	525c488f63	rio fdset target: handle short writes. While the socket is set in blocking mode, we still can get short writes writing to a socket.	2014-10-17 16:45:53 +02:00
antirez	4b16263bd9	Diskless replication: don't send "\n" pings to slaves. This is useful for normal replication in order to refresh the slave when we are persisting on disk, but for diskless replication the child is already receiving data while in WAIT_BGSAVE_END state.	2014-10-17 10:23:44 +02:00
antirez	25a3d9965e	Diskless replication: remove 40 bytes EOF mark from end of RDB file.	2014-10-17 10:23:11 +02:00
antirez	0c5a06f6bb	Diskless replication: swap inverted branches to compute read len.	2014-10-17 10:22:29 +02:00
antirez	80f7f63b64	Diskless replication: don't enter the read-payload branch forever.	2014-10-17 10:21:18 +02:00
antirez	5ee2ccf48e	Diskless replication: EOF:<mark> streaming support slave side.	2014-10-16 17:09:35 +02:00
antirez	43ae606430	Diskless replication: redis.conf and CONFIG SET/GET support.	2014-10-16 10:22:02 +02:00
antirez	42951ab301	Diskless replication: trigger a BGSAVE after a config change. If we turn from diskless to disk-based replication via CONFIG SET, we need a way to start a BGSAVE if there are slaves alerady waiting for a BGSAVE to start. Normally with disk-based replication we do it as soon as the previous child exits, but when there is a configuration change via CONFIG SET, we may have slaves in WAIT_BGSAVE_START state without an RDB background process currently active.	2014-10-16 10:15:18 +02:00
antirez	5f8360eb21	Diskless replication flag renamed repl_diskless -> repl_diskless_sync.	2014-10-16 10:00:50 +02:00
antirez	e9e007555e	Diskless replication: trigger diskless RDB transfer if needed.	2014-10-16 09:03:52 +02:00
antirez	3730d118a3	Diskless replication: handle putting the slave online.	2014-10-15 15:31:19 +02:00
antirez	75f0cd6520	Diskless replication: RDB -> slaves transfer draft implementation.	2014-10-14 10:11:29 +02:00
antirez	16546f5aca	Add some comments in syncCommand() to clarify RDB target.	2014-10-10 16:25:58 +02:00
Aaron Rutkovsky	3a82b8ac64	Fix typos Closes #1513	2014-09-29 06:49:07 -04:00
Jan-Erik Rediger	9f98b29cef	Fix typo: ad -> and Closes #1537	2014-09-29 06:49:06 -04:00
antirez	95b1979c32	No more trailing spaces in Redis source code.	2014-06-26 18:48:40 +02:00
antirez	7970d53997	ROLE command: array len fixed for slave output.	2014-06-21 11:17:18 +02:00
antirez	6a13193d8f	ROLE output improved for slaves. Info about the replication state with the master added.	2014-06-07 17:38:20 +02:00
antirez	d34c2fa3bb	ROLE command added. The new ROLE command is designed in order to provide a client with informations about the replication in a fast and easy to use way compared to the INFO command where the same information is also available.	2014-06-07 17:27:49 +02:00
antirez	0bcc7cb4bf	CLIENT LIST speedup via peerid caching + smart allocation. This commit adds peer ID caching in the client structure plus an API change and the use of sdsMakeRoomFor() in order to improve the reallocation pattern to generate the CLIENT LIST output. Both the changes account for a very significant speedup.	2014-04-28 17:36:57 +02:00
antirez	970de3e9c0	Check for EAGAIN in sendBulkToSlave(). Sometime an osx master with a Linux server over a slow link caused a strange error where osx called the writable function for the socket but actually apparently there was no room in the socket buffer to accept the write: write(2) call returned an EAGAIN error, that was not checked, so we considered write(2) == 0 always as a connection reset, which was unfortunate since the bulk transfer has to start again. Also more errors are logged with the WARNING level in the same code path now.	2014-02-05 16:38:10 +01:00
antirez	6f54032080	Cluster: function clusterGetSlaveRank() added. Return the number of slaves for the same master having a better replication offset of the current slave, that is, the slave "rank" used to pick a delay before the request for election.	2014-01-29 16:39:04 +01:00

1 2 3 4

158 Commits