redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-24 09:08:26 -05:00

Author	SHA1	Message	Date
antirez	fb659cd334	Cluster: ignore empty lines in nodes.conf. Even without the user messing manually with the file, it is still possible to have blank lines (just a single "\n" per line) because of how the nodes.conf update/write process works.	2014-01-15 11:23:41 +01:00
antirez	6c63df3031	Cluster: atomic update of nodes.conf file. The way the file was generated was unsafe and leaded to nodes.conf file corruption (zero length file) on server stop/crash during the creation of the file. The previous file update method was as simple as open with O_TRUNC followed by the write call. While the write call was a single one with the full payload, ensuring no half-written files for POSIX semantics, stopping the server just after the open call resulted into a zero-length file (all the nodes information lost!).	2014-01-15 10:31:20 +01:00
antirez	28273394cb	Cluster: support to read from slave nodes. A client can enter a special cluster read-only mode using the READONLY command: if the client read from a slave instance after this command, for slots that are actually served by the instance's master, the queries will be processed without redirection, allowing clients to read from slaves (but without any kind fo read-after-write guarantee). The READWRITE command can be used in order to exit the readonly state.	2014-01-14 16:33:16 +01:00
antirez	58c8a071a5	Fix RESTORE ttl handling in 32 bit archs. long was used instead of long long in order to handle a 64 bit resolution millisecond timestamp. This fixes issue #1483.	2014-01-09 11:09:23 +01:00
antirez	f510549044	Cluster: clusterProcessPacket() was not 80 cols friendly. The function actually needs to be split into sub-functions at some point in the future.	2013-12-25 17:57:36 +01:00
antirez	66ec1412fe	Redis Cluster: add repl_ping_slave_period to slave data validity time. When the configured node timeout is very small, the data validity time (maximum data age for a slave to try a failover) is too little (ten times the configured node timeout) when the replication link with the master is mostly idle. In this case we'll receive some data from the master only every server.repl_ping_slave_period to refresh the last interaction with the master. This commit adds to the max data validity time the slave ping period to avoid this problem of slaves sensing too old data without a good reason. However this max data validity time is likely a setting that should be configurable by the Redis Cluster user in a way completely independent from the node timeout.	2013-12-22 10:05:16 +01:00
antirez	658aff9d29	Redis Cluster: move node failure reports logging from VERBOSE to NOTICE level.	2013-12-21 00:04:53 +01:00
antirez	5a404c87c1	Redis Cluster: remove no longer relevant comment.	2013-12-20 14:40:11 +01:00
antirez	fda4cba912	Redis Cluster: reconfigure replication when master changes address.	2013-12-20 12:47:22 +01:00
antirez	d7374032c0	Redis Cluster: handshake code refactoring + Gossip IP switch detection. This commit makes it simple to start an handshake with a specific node address, and uses this in order to detect a node IP change and start a new handshake in order to fix the IP if possible.	2013-12-20 12:38:03 +01:00
antirez	a2c938c834	Redis Cluster: delay state change when in the majority again. As specified in the Redis Cluster specification, when a node can reach the majority again after a period in which it was partitioend away with the minorty of masters, wait some time before accepting queries, to provide a reasonable amount of time for other nodes to upgrade its configuration. This lowers the probabilities of both a client and a master with not updated configuration to rejoin the cluster at the same time, with a stale master accepting writes.	2013-12-20 09:56:18 +01:00
antirez	7a666ac419	Cluster: set n->slaves to NULL in clusterNodeResetSlaves(). The value was otherwise undefined, so next time the node was promoted again from slave to master, adding a slave to the list of slaves would likely crash the server or result into undefined behavior.	2013-12-17 14:50:24 +01:00
antirez	fda91dbde3	Cluster: check link is valid before sending UPDATE.	2013-12-17 12:28:37 +01:00
antirez	f57bb36ce7	Cluster: initialize todo_before_sleep flags to 0.	2013-12-17 12:22:02 +01:00
antirez	c70c0c6db7	Cluster: use proper type mstime_t for ping delay var.	2013-12-17 10:27:36 +01:00
antirez	47815d38e0	Fixed clearNodeFailureIfNeeded() time type to mstime_t. This prevented 32bit cluster instances from clearing the FAIL flag when needed.	2013-12-17 09:45:52 +01:00
antirez	e88e6a6334	Cluster: use long long for timestamps in clusterGenNodesDescription(). Ping sent and pong received fields need to be casted to long long to be printed correctly into 32 bit systems.	2013-12-17 09:38:11 +01:00
antirez	11e81a1e9a	Fixed grammar: before H the article is a, not an.	2013-12-05 16:35:32 +01:00
antirez	6fa42b7507	Cluster: nodes re-addition blacklist API.	2013-12-02 11:12:23 +01:00
antirez	8f18345ef0	Cluster: basic data structures for nodes black list.	2013-11-29 17:37:06 +01:00
antirez	3db825fde4	Cluster: some code about clusterHandleSlaveFailover() marginally improved. 80 cols friendly, some minor change to the code to make it simpler.	2013-11-29 16:17:05 +01:00
antirez	a5e7358a12	Cluster: removed not needed newline at end of redisLog() msg.	2013-11-08 17:28:02 +01:00
antirez	28071caf38	Cluster: send a single UPDATE packet for now.	2013-11-08 17:25:49 +01:00
antirez	d289c628b1	Cluster: replace hardcoded 4096 for bus msg len with sizeof().	2013-11-08 17:19:19 +01:00
antirez	94a07d5901	Cluster: slots update refactored + UPDATE msg processing. Now there is a function that handles the update of the local slot configuration every time we have some new info about a node and its set of served slots and configEpoch. Moreoever the UPDATE packets are now processed when received (it was a work in progress in the previous commit).	2013-11-08 17:02:10 +01:00
antirez	dc43f66eac	Cluster: UPDATE msg data structure and sending function.	2013-11-08 16:26:50 +01:00
antirez	6c6572be95	Cluster: refactoring of slots update code and more. The commit also introduces detection of nodes publishing not updated configuration. More work in progress to send an UPDATE packet to inform of the config change.	2013-11-08 10:32:16 +01:00
antirez	1a0cea33a0	Cluster: initialize senderConfigEpoch and senderCurrentEpoch for warnings suppression.	2013-11-05 12:01:07 +01:00
antirez	0c9f60a628	Cluster: there is a lower limit for the handshake timeout.	2013-10-11 10:34:32 +02:00
antirez	1447d28c0f	Cluster: data_age conversion to milliseconds fixed.	2013-10-09 16:36:06 +02:00
antirez	573c2fea91	Cluster: clusterCron() freq is now 10h. Still ping 1 node every sec. After the change in clusterCron() frequency of call, we still want to ping just one random node every second.	2013-10-09 16:29:17 +02:00
antirez	ba42428633	Cluster: time switched from seconds to milliseconds. All the internal state of cluster involving time is now using mstime_t and mstime() in order to use milliseconds resolution. Also the clusterCron() function is called with a 10 hz frequency instead of 1 hz. The cluster node_timeout must be also configured in milliseconds by the user in redis.conf.	2013-10-09 16:19:26 +02:00
antirez	929b6a4480	Cluster: cluster stuff moved from redis.h to cluster.h.	2013-10-09 15:38:05 +02:00
antirez	ae2763f564	Cluster: masters don't vote for a slave with stale config. When a slave requests our vote, the configEpoch he claims for its master and the set of served slots must be greater or equal to the configEpoch of the nodes serving these slots in the current configuraiton of the master granting its vote. In other terms, masters don't vote for slaves having a stale configuration for the slots they want to serve.	2013-10-08 12:45:35 +02:00
antirez	f7d6ad4366	Cluster: fix slave data age computation when master is still connected.	2013-10-07 16:07:13 +02:00
antirez	2c3301b9f5	Cluster: log message improved when FAIL is cleared from a slave node.	2013-10-07 15:44:58 +02:00
antirez	72f38cd70f	Cluster: slave nodes advertise master slots bitmap and configEpoch.	2013-10-07 11:31:12 +02:00
antirez	7afc0dd59a	Cluster: new clusterDoBeforeSleep() API. The new API is able to remember operations to perform before returning to the event loop, such as checking if there is the failover quorum for a slave, save and fsync the configuraiton file, and so forth. Because this operations are performed before returning on the event loop we are sure that messages that are sent in the same event loop run will be delivered after the configuration is already saved, that is a requirement sometimes. For instance we want to publish a new epoch only when it is already stored in nodes.conf in order to avoid returning back in the logical clock when a node is restarted. This new API provides a big performance advantage compared to saving and possibly fsyncing the configuration file multiple times in the same event loop run, especially in the case of big clusters with tens or hundreds of nodes.	2013-10-03 09:58:06 +02:00
antirez	211dcbe339	Cluster: update cluster config when slave changes master.	2013-10-02 12:27:12 +02:00
antirez	6c4d904baf	Cluster: bus messages stats in CLUSTER info.	2013-10-02 10:10:08 +02:00
antirez	abe81781ae	Cluster: FAIL messages from unknown senders are handled better. Previously the event was not logged but instead the node reported an unknown packet type received.	2013-10-02 09:42:45 +02:00
antirez	7970ebd80a	Cluster: senderCurrentEpoch == node currentEpoch was too strict. We can accept a vote as long as its epoch is >= the epoch at which we started the voting process. There is no need for it to be exactly the same.	2013-10-01 17:21:28 +02:00
antirez	f1bfd8233b	Cluster: fix typo in clusterProcessPacket() comment.	2013-10-01 15:40:20 +02:00
antirez	1dedf9aa36	Cluster: time field removed from cluster messages header. The new algorithm does not check replies time as checking for the currentEpoch in the reply ensures that the reply is about the current election process.	2013-09-30 16:19:44 +02:00
antirez	2d0844ee37	Cluster: log message shortened.	2013-09-30 11:51:58 +02:00
antirez	4dc247eb31	Cluster: detect cluster reconfiguration when master slots drop to 0. The old algorithm used a PROMOTED flag and explicitly checks about slave->master convertions. Wit the new cluster meta-data propagation algorithm we just look at the configEpoch to check if we need to reconfigure slots, then: 1) If a node is a master but it reaches zero served slots becuase of reconfiguration. 2) If a node is a slave but the master reaches zero served slots because of a reconfiguration. We switch as a replica of the new slots owner.	2013-09-30 11:45:26 +02:00
antirez	62b1591439	Cluster: re-order failover operations to make it safer. We need to: 1) Increment the configEpoch. 2) Save it to disk and fsync the file. 3) Broadcast the PONG with the new configuration. If other nodes will receive the updated configuration we need to be sure to restart with this new config in the event of a crash.	2013-09-30 10:16:48 +02:00
antirez	b187517719	Cluster: when upading the configEpoch for a node, save config on disk ASAP.	2013-09-30 10:16:25 +02:00
antirez	03ca903983	Cluster: fsync data when saving the cluster config.	2013-09-30 10:13:07 +02:00
antirez	026e63392e	Cluster: update the node configEpoch when newer is detected.	2013-09-27 09:55:41 +02:00

... 4 5 6 7 8 ...

532 Commits