redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-24 00:59:02 -05:00

Author	SHA1	Message	Date
antirez	271733f4f8	Cluster: discard pong times in the future. However we allow for 500 milliseconds of tolerance, in order to avoid often discarding semantically valid info (the node is up) because of natural few milliseconds desync among servers even when NTP is used. Note that anyway we should ping the node from time to time regardless and discover if it's actually down from our point of view, since no update is accepted while we have an active ping on the node. Related to #3929.	2017-04-15 10:12:08 +02:00
antirez	02777bb252	Cluster: always add PFAIL nodes at end of gossip section. To rely on the fact that nodes in PFAIL state will be shared around by randomly adding them in the gossip section is a weak assumption, especially after changes related to sending less ping/pong packets. We want to always include gossip entries for all the nodes that are in PFAIL state, so that the PFAIL -> FAIL state promotion can happen much faster and reliably. Related to #3929.	2017-04-14 13:39:49 +02:00
antirez	8c829d9e43	Cluster: fix gossip section ping/pong times encoding. The gossip section times are 32 bit, so cannot store the milliseconds time but just the seconds approximation, which is good enough for our uses. At the same time however, when comparing the gossip section times of other nodes with our node's view, we need to convert back to milliseconds. Related to #3929. Without this change the patch to reduce the traffic in the bus message does not work.	2017-04-14 11:01:22 +02:00
antirez	6878a3fedd	Cluster: add clean-logs command to create-cluster script.	2017-04-14 10:52:00 +02:00
antirez	8f7bf2841a	Cluster: decrease ping/pong traffic by trusting other nodes reports. Cluster of bigger sizes tend to have a lot of traffic in the cluster bus just for failure detection: a node will try to get a ping reply from another node no longer than when the half the node timeout would elapsed, in order to avoid a false positive. However this means that if we have N nodes and the node timeout is set to, for instance M seconds, we'll have to ping N nodes every M/2 seconds. This NM/2 pings will receive the same number of pongs, so a total of NM packets per node. However given that we have a total of N nodes doing this, the total number of messages will be NNM. In a 100 nodes cluster with a timeout of 60 seconds, this translates to a total of 10010030 packets per second, summing all the packets exchanged by all the nodes. This is, as you can guess, a lot... So this patch changes the implementation in a very simple way in order to trust the reports of other nodes: if a node A reports a node B as alive at least up to a given time, we update our view accordingly. The problem with this approach is that it could result into a subset of nodes being able to reach a given node X, and preventing others from detecting that is actually not reachable from the majority of nodes. So the above algorithm is refined by trusting other nodes only if we do not have currently a ping pending for the node X, and if there are no failure reports for that node. Since each node, anyway, pings 10 other nodes every second (one node every 100 milliseconds), anyway eventually even trusting the other nodes reports, we will detect if a given node is down from our POV. Now to understand the number of packets that the cluster would exchange for failure detection with the patch, we can start considering the random PINGs that the cluster sent anyway as base line: Each node sends 10 packets per second, so the total traffic if no additioal packets would be sent, including PONG packets, would be: Total messages per second = N102 However by trusting other nodes gossip sections will not AWALYS prevent pinging nodes for the "half timeout reached" rule all the times. The math involved in computing the actual rate as N and M change is quite complex and depends also on another parameter, which is the number of entries in the gossip section of PING and PONG packets. However it is possible to compare what happens in cluster of different sizes experimentally. After applying this patch a very important reduction in the number of packets exchanged is trivial to observe, without apparent impacts on the failure detection performances. Actual numbers with different cluster sizes should be published in the Reids Cluster documentation in the future. Related to #3929.	2017-04-14 10:43:53 +02:00
antirez	c5d6f577f0	Cluster: collect more specific bus messages stats. First step in order to change Cluster in order to use less messages. Related to issue #3929.	2017-04-13 19:22:35 +02:00
antirez	1409c545da	Cluster: hash slots tracking using a radix tree.	2017-03-27 16:37:22 +02:00
antirez	f917e0da4c	Fix MIGRATE closing of cached socket on error. After investigating issue #3796, it was discovered that MIGRATE could call migrateCloseSocket() after the original MIGRATE c->argv was already rewritten as a DEL operation. As a result the host/port passed to migrateCloseSocket() could be anything, often a NULL pointer that gets deferenced crashing the server. Now the socket is closed at an earlier time when there is a socket error in a later stage where no retry will be performed, before we rewrite the argument vector. Moreover a check was added so that later, in the socket_err label, there is no further attempt at closing the socket if the argument was rewritten. This fix should resolve the bug reported in #3796.	2017-02-09 09:58:38 +01:00
Salvatore Sanfilippo	6cf1a325d6	Merge pull request #3643 from andyli028/unstable Modify MIN->MAX	2016-12-19 08:19:10 +01:00
antirez	b53e73e159	MIGRATE: Remove upfront ttl initialization. After the fix for #3673 the ttl var is always initialized inside the loop itself, so the early initialization is not needed. Variables declaration also moved to a more local scope.	2016-12-14 12:43:55 +01:00
Salvatore Sanfilippo	c9f0456d81	Merge pull request #3673 from badboy/reset-ttl-on-migrating Reset the ttl for additional keys	2016-12-14 12:41:00 +01:00
antirez	04542cff92	Replication: fix the infamous key leakage of writable slaves + EXPIRE. BACKGROUND AND USE CASEj Redis slaves are normally write only, however the supprot a "writable" mode which is very handy when scaling reads on slaves, that actually need write operations in order to access data. For instance imagine having slaves replicating certain Sets keys from the master. When accessing the data on the slave, we want to peform intersections between such Sets values. However we don't want to intersect each time: to cache the intersection for some time often is a good idea. To do so, it is possible to setup a slave as a writable slave, and perform the intersection on the slave side, perhaps setting a TTL on the resulting key so that it will expire after some time. THE BUG Problem: in order to have a consistent replication, expiring of keys in Redis replication is up to the master, that synthesize DEL operations to send in the replication stream. However slaves logically expire keys by hiding them from read attempts from clients so that if the master did not promptly sent a DEL, the client still see logically expired keys as non existing. Because slaves don't actively expire keys by actually evicting them but just masking from the POV of read operations, if a key is created in a writable slave, and an expire is set, the key will be leaked forever: 1. No DEL will be received from the master, which does not know about such a key at all. 2. No eviction will be performed by the slave, since it needs to disable eviction because it's up to masters, otherwise consistency of data is lost. THE FIX In order to fix the problem, the slave should be able to tag keys that were created in the slave side and have an expire set in some way. My solution involved using an unique additional dictionary created by the writable slave only if needed. The dictionary is obviously keyed by the key name that we need to track: all the keys that are set with an expire directly by a client writing to the slave are tracked. The value in the dictionary is a bitmap of all the DBs where such a key name need to be tracked, so that we can use a single dictionary to track keys in all the DBs used by the slave (actually this limits the solution to the first 64 DBs, but the default with Redis is to use 16 DBs). This solution allows to pay both a small complexity and CPU penalty, which is zero when the feature is not used, actually. The slave-side eviction is encapsulated in code which is not coupled with the rest of the Redis core, if not for the hook to track the keys. TODO I'm doing the first smoke tests to see if the feature works as expected: so far so good. Unit tests should be added before merging into the 4.0 branch.	2016-12-13 10:59:54 +01:00
Jan-Erik Rediger	2a32f0371e	Reset the ttl for additional keys Before, if a previous key had a TTL set but the current one didn't, the TTL was reused and thus resulted in wrong expirations set. This behaviour was experienced, when `MigrateDefaultPipeline` in redis-trib was set to >1 Fixes #3655	2016-12-08 14:27:21 +01:00
andyli	8abf9729f0	Modify MIN->MAX	2016-11-29 16:34:41 +08:00
antirez	cfdb3a2214	Cluster: handle zero bytes at the end of nodes.conf.	2016-11-16 14:13:18 +01:00
antirez	a3f893b800	RESTORE: accept RDB dumps with older versions. Reference issue #3218. Checking the code I can't find a reason why the original RESTORE code was so opinionated about restoring only the current version. The code in to `rdb.c` appears to be capable as always to restore data from older versions of Redis, and the only places where it is needed the current version in order to correctly restore data, is while loading the opcodes, not the values itself as it happens in the case of RESTORE. For the above reasons, this commit enables RESTORE to accept older versions of values payloads.	2016-06-16 15:53:57 +02:00
antirez	971e3c51b6	Cluster: make getNodeByQuery() responsible of -CLUSTERDOWN errors. This fixes a bug introduced by `d827dbf`, and makes the code consistent with the logic of always allowing, while the cluster is down, commands that don't target any key. As a side effect the code is also simpler now.	2016-05-05 11:33:43 +02:00
Salvatore Sanfilippo	330715afd8	Merge pull request #3039 from itamarhaber/patch-3 Fixes a typo in the comments	2016-05-05 10:15:17 +02:00
antirez	4fdde78c72	New masters with slots are now targets of migration if others are. This fixes issue #3043. Before this fix, after a complete resharding of a master slots to other nodes, the master remains empty and the slaves migrate away to other masters with non-zero nodes. However the old master now empty, is no longer considered a target for migration, because the system has no way to tell it had slaves in the past. This fix leaves the algorithm used in the past untouched, but adds a new rule. When a new or old master which is empty and without slaves, are assigend with their first slot, if other masters in the cluster have slaves, they are automatically considered to be targets for replicas migration.	2016-05-02 18:37:30 +02:00
antirez	b841f3ad1a	Cluster: store busport with different separator in CLUSTER NODES. We need to be able to correctly parse the node address in the case of IPv6 addresses.	2016-02-02 08:20:04 +01:00
antirez	92b9de2417	Cluster announce: WIP, allow building again.	2016-02-01 18:16:25 +01:00
antirez	e27b9b1cec	Merge branch 'cluster-docker' into unstable	2016-02-01 18:01:22 +01:00
antirez	c285862621	Cluster: include node IDs in SLOTS output. CLUSTER SLOTS now includes IDs in the nodes description associated with a given slot range. Certain client libraries implementations need a way to reference a node in an unique way, so they were relying on CLUSTER NODES, that is not a stable API and may change frequently depending on Redis Cluster future requirements.	2016-01-29 12:00:40 +01:00
antirez	d0a8512eda	Cluster anounce-ip/port WIP.	2016-01-29 09:06:37 +01:00
antirez	4abf486ca3	Cluster announce port: set port/bport for myself at startup.	2016-01-29 09:06:37 +01:00
antirez	1c038379f7	Cluster: persist bus port in nodes.conf.	2016-01-29 09:06:37 +01:00
antirez	dc98907e50	Cluster announce ip: take myself->ip always in sync.	2016-01-29 09:06:37 +01:00
antirez	11436b1449	Cluster announce ip / port initial implementation.	2016-01-29 09:06:37 +01:00
Itamar Haber	9e46bf22ed	Fixes a typo	2016-01-28 21:47:18 +02:00
antirez	5bbb09ed2c	Cluster: check packets length before accessing far fields.	2016-01-27 16:35:21 +01:00
antirez	fe44a7cb60	Cluster: mismatch sender ID log put back at DEBUG level.	2016-01-26 14:21:18 +01:00
antirez	d6c5922f75	Cluster: fix missing ntohs() call to access gossip section port.	2016-01-26 14:18:13 +01:00
antirez	592419b4ca	Better address udpate strategy when processing gossip sections. The change covers the case where: 1. There is a node we can't reach (in fail or pfail state). 2. We see a different address for this node, in the gossip section sent to us by a node that, instead, is able to talk with the node we cannot talk to. In this case it's a good bet to switch to the address reported by this node, since there was an address switch and it is able to talk with the node and we are not. However previosuly this was done in a dangerous way, by initiating an handshake. The handshake, using the MEET packet, forces the receiver to join our cluster, and this is not a good idea. If the node in question really just switched address, but is the same node, it already knows about us, so we just need to perform an address update and a reconnection. So with this commit instead we just update the address of the node, release the node link if any, and attempt to reconnect in the next clusterCron() cycle. The commit also improves debugging messages printed by Cluster during address or ID switches.	2016-01-26 12:32:53 +01:00
antirez	83b862a30e	Minor MIGRATE refactoring. Centralize cleanup of newargv in a single place. Add more comments to help a bit following a complex function. Related to issue #3016.	2016-01-19 09:53:04 +01:00
antirez	f5a1e608cc	More variadic MIGRATE fixes. Another leak was fixed in the case of syntax error by restructuring the allocation strategy for the two dynamic vectors. We also make sure to always close the cached socket on I/O errors so that all the I/O errors are handled the same, even if we had a previously queued error of a different kind from the destination server. Thanks to Kevin McGehee. Related to issue #3016.	2016-01-19 09:28:43 +01:00
antirez	00d3a40f82	Various fixes to MIGRATE with multiple keys. In issue #3016 Kevin McGehee identified multiple very serious issues in the new implementation of MIGRATE. This commit attempts to restructure the code in oder to avoid mistakes, an analysis of the new implementation is in progress in order to check for possible edge cases.	2016-01-18 16:49:21 +01:00
antirez	fc3ca8ff87	Cluster: fix setting nodes slaveof pointer to NULL on node release. With this commit we preserve the list of nodes that have .slaveof set to the node, even when the node is turned into a slave, and make sure to fix the .slaveof pointers to NULL when a node is freed from memory, regardless of the fact it's a slave or a master. Basically we try to remember the logical master in the current configuration even if the logical master advertised it as a slave already. However we still remember the associations, so that when a node is freed we can fix them. This should fix issue #3002.	2016-01-14 17:34:49 +01:00
antirez	02c40c9dc2	CLUSTER BUMPEPOCH initial implementation fixed.	2016-01-11 15:39:11 +01:00
antirez	b58796f520	Cluster: CLUSTER BUMPEPOCH introduced to help redis-trib fix. Sometimes during "fixes" we have to setup a new configuration and assign slots to nodes. With BUMPEPOCH we can make sure the new configuration of the node will win if there are conflicting configurations (for example another node is also claiming the same slot because the cluster is totally messed up).	2016-01-11 15:01:14 +01:00
antirez	524be1e465	Cluster: don't allow CLUSTER SETSLOT with slaves.	2016-01-11 15:00:45 +01:00
antirez	e15e518a67	Allow MIGRATE to always be called on local keys for open slots. Extend the MIGRATE extra freedom to be able to be called in the context of the local slot, anytime there is a slot open in one or the other direction (importing or migrating). This is useful for redis-trib to fix the cluster when it has in an odd state. Thix fix allows "redis-trib fix" to make its work in certain cases where previously an error was reported.	2016-01-08 15:04:16 +01:00
antirez	36704d653b	Fix typos & grammar in clusterBumpConfigEpochWithoutConsensus() comment.	2016-01-08 12:07:54 +01:00
antirez	00d637f2cc	Cluster: don't send -ASK to MIGRATE. For non existing keys, we don't want to send -ASK redirections to MIGRATE, since when moving slots from the migrating node to the importing node, we want just to ignore keys that are no longer there. They may be expired or deleted between the GETKEYSINSLOT call and the MIGRATE call. Otherwise this causes an error during migrations with redis-trib (or equivalent cluster management tools).	2016-01-06 12:14:49 +01:00
antirez	b9aeb98156	Suppress harmless warnings.	2015-12-16 12:36:32 +01:00
antirez	ac0a731057	MIGRATE: Fix new argument rewriting refcount handling.	2015-12-11 14:26:41 +01:00
antirez	d85fc1e9cf	MIGRATE: fix replies processing and argument rewriting. We need to process replies after errors in order to delete keys successfully transferred. Also argument rewriting was fixed since it was broken in several ways. Now a fresh argument vector is created and set if we are acknowledged of at least one key.	2015-12-11 14:04:47 +01:00
antirez	9ebf7a6776	Pipelined multiple keys MIGRATE.	2015-12-11 13:38:26 +01:00
antirez	adc2fe6993	Cluster: replica migration with delay. We wait a fixed amount of time (5 seconds currently) much greater than the usual Cluster node to node communication latency, before migrating. This way when a failover occurs, before detecting the new master as a target for migration, we give the time to its natural slaves (the slaves of the failed over master) to announce they switched to the new master, preventing an useless migration operation.	2015-12-11 09:19:06 +01:00
antirez	4159055f83	Remove debugging message left there for error.	2015-12-10 08:56:33 +01:00
antirez	e0f22df995	Fix replicas migration by adding a new flag. Some time ago I broken replicas migration (reported in #2924). The idea was to prevent masters without replicas from getting replicas because of replica migration, I remember it to create issues with tests, but there is no clue in the commit message about why it was so undesirable. However my patch as a side effect totally ruined the concept of replicas migration since we want it to work also for instances that, technically, never had slaves in the past: promoted slaves. So now instead the ability to be targeted by replicas migration, is a new flag "migrate-to". It only applies to masters, and is set in the following two cases: 1. When a master gets a slave, it is set. 2. When a slave turns into a master because of fail over, it is set. This way replicas migration targets are only masters that used to have slaves, and slaves of masters (that used to have slaves... obviously) and are promoted. The new flag is only internal, and is never exposed in the output nor persisted in the nodes configuration, since all the information to handle it are implicit in the cluster configuration we already have.	2015-12-09 23:03:18 +01:00
antirez	a0d41e51c2	Redis Cluster: hint about validity factor when slave can't failover.	2015-11-27 08:59:17 +01:00
antirez	c69c6c80fb	Lazyfree: ability to free whole DBs in background.	2015-10-01 13:02:26 +02:00
antirez	a7c5be18a8	Lazyfree: Sorted sets convereted to plain SDS. (several commits squashed)	2015-10-01 13:02:24 +02:00
antirez	02b1d5213d	RDMF: use representClusterNodeFlags() generic name.	2015-07-27 15:08:58 +02:00
antirez	3325a9b11f	RDMF: more names updated.	2015-07-27 15:03:10 +02:00
antirez	32f80e2f1b	RDMF: More consistent define names.	2015-07-27 14:37:58 +02:00
antirez	40eb548a80	RDMF: REDIS_OK REDIS_ERR -> C_OK C_ERR.	2015-07-26 23:17:55 +02:00
antirez	2d9e3eb107	RDMF: redisAssert -> serverAssert.	2015-07-26 15:29:53 +02:00
antirez	14ff572482	RDMF: OBJ_ macros for object related stuff.	2015-07-26 15:28:00 +02:00
antirez	554bd0e7bd	RDMF: use client instead of redisClient, like Disque.	2015-07-26 15:20:52 +02:00
antirez	424fe9afd9	RDMF: redisLog -> serverLog.	2015-07-26 15:17:43 +02:00
antirez	cef054e868	RDMF (Redis/Disque merge friendlyness) refactoring WIP 1.	2015-07-26 15:17:18 +02:00
Jan-Erik Rediger	d28c51d166	Do not attempt to lock on Solaris	2015-06-24 14:57:15 +02:00
antirez	a401a84eb2	Don't try to bind the source address for MIGRATE Related to issues #2609 and #2612.	2015-06-11 14:34:38 +02:00
antirez	9b7f8b1c9b	Cluster: redirection refactoring + handling of blocked clients. There was a bug in Redis Cluster caused by clients blocked in a blocking list pop operation, for keys no longer handled by the instance, or in a condition where the cluster became down after the client blocked. A typical situation is: 1) BLPOP <somekey> 0 2) <somekey> hash slot is resharded to another master. The client will block forever int this case. A symmentrical non-cluster-specific bug happens when an instance is turned from master to slave. In that case it is more serious since this will desynchronize data between slaves and masters. This other bug was discovered as a side effect of thinking about the bug explained and fixed in this commit, but will be fixed in a separated commit.	2015-03-24 11:56:24 +01:00
antirez	94030fa4d7	Two cluster.c comments improved.	2015-03-21 12:12:23 +01:00
antirez	2950824ab6	Cluster: TAKEOVER option for manual failover.	2015-03-21 11:54:32 +01:00
antirez	a7010ae208	Cluster: non-conditional steps of slave failover refactored into a function.	2015-03-20 17:56:21 +01:00
antirez	230d141420	Cluster: separate unknown master check from the rest. In no case we should try to attempt to failover if myself->slaveof is NULL.	2015-03-20 16:56:59 +01:00
antirez	4f2555aa17	Cluster: refactoring around configEpoch handling. This commit moves the process of generating a new config epoch without consensus out of the clusterCommand() implementation, in order to make it reusable for other reasons (current target is to have a CLUSTER FAILOVER option forcing the failover when no master majority is reachable). Moreover the commit moves other functions which are similarly related to config epochs in a new logical section of the cluster.c file, just for clarity.	2015-03-20 16:42:52 +01:00
antirez	25c0f5ac63	Cluster: better cluster state transiction handling. Before we relied on the global cluster state to make sure all the hash slots are linked to some node, when getNodeByQuery() is called. So finding the hash slot unbound was checked with an assertion. However this is fragile. The cluster state is often updated in the clusterBeforeSleep() function, and not ASAP on state change, so it may happen to process clients with a cluster state that is 'ok' but yet certain hash slots set to NULL. With this commit the condition is also checked in getNodeByQuery() and reported with a identical error code of -CLUSTERDOWN but slightly different error message so that we have more debugging clue in the future. Root cause of issue #2288.	2015-03-20 09:59:28 +01:00
antirez	438a1a84e8	Cluster: more robust slave check in CLUSTER REPLICATE. There are rare conditions where node->slaveof may be NULL even if the node is a slave. To check by flag is much more robust.	2015-03-18 12:10:14 +01:00
antirez	93b1320fac	Cluster: fix CLUSTER NODES optimization error in 'j' increment.	2015-03-13 13:16:35 +01:00
antirez	e1b6c9dd18	Cluster: CLUSTER NODES speedup.	2015-03-13 11:26:04 +01:00
Michel Martens	6201eb0c55	Add command CLUSTER MYID	2015-03-10 16:43:19 +00:00
antirez	c77081a45a	Migrate: replace conditional with pre-computed value.	2015-02-27 22:33:54 +01:00
antirez	832b0c7cce	Improvements to PR #2425 1. Remove useless "cs" initialization. 2. Add a "select" var to capture a condition checked multiple times. 3. Avoid duplication of the same if (!copy) conditional. 4. Don't increment dirty if copy is given (no deletion is performed), otherwise we propagate MIGRATE when not needed.	2015-02-26 10:27:56 +01:00
Tommy Wang	7fda935ad3	Add last_dbid to migrateCachedSocket to avoid redundant SELECT Avoid redundant SELECT calls when continuously migrating keys to the same dbid within a target Redis instance.	2015-02-26 10:18:43 +01:00
Salvatore Sanfilippo	d83c810265	Merge pull request #2301 from mattsta/fix/lengths Improve type correctness	2015-02-24 17:22:53 +01:00
antirez	233729fe7f	Cluster: some bias towwards FAIL/PFAIL nodes in gossip sections. This improves PFAIL -> FAIL switch. Too late at this point in the RC releases to add proper PFAIL/FAIL separate dictionary to do this in a less randomized way. Tested in practice with experiments that this helps. PFAIL -> FAIL average with 20 nodes and node-timeout set to 5 seconds takes 2.5 seconds without this commit, 1 second with this commit.	2015-01-30 11:55:36 +01:00
antirez	69b4f00d28	More correct wanted / maxiterations values in clusterSendPing().	2015-01-30 11:23:27 +01:00
antirez	e5a22064cc	Cluster: magical 10% of nodes explained in comments.	2015-01-29 15:43:35 +01:00
antirez	1efacfe53d	CLUSTER count-failure-reports command added.	2015-01-29 15:02:10 +01:00
antirez	3fd43062c8	Cluster: use a number of gossip sections proportional to cluster size. Otherwise it is impossible to receive the majority of failure reports in the node_timeout*2 window in larger clusters. Still with a 200 nodes cluster, 20 gossip sections are a very reasonable amount of bytes to send. A side effect of this change is also fater cluster nodes joins for large clusters, because the cluster layout makes less time to propagate.	2015-01-29 14:20:59 +01:00
antirez	9802ec3c83	Cluster: initialized not used fileds in gossip section. Otherwise we risk sending not initialized data to other nodes, that may contain anything. This was actually not possible only because the initialization of the buffer where the cluster packets header is created was larger than the 3 gossip sections we use, so the memory was already all filled with zeroes by the memset().	2015-01-24 07:52:24 +01:00
Matt Stancliff	051a43e03a	Fix cluster migrate memory leak Fixes valgrind error: 48 bytes in 1 blocks are definitely lost in loss record 196 of 373 at 0x4910D3: je_malloc (jemalloc.c:944) by 0x42807D: zmalloc (zmalloc.c:125) by 0x41FA0D: dictGetIterator (dict.c:543) by 0x41FA48: dictGetSafeIterator (dict.c:555) by 0x459B73: clusterHandleSlaveMigration (cluster.c:2776) by 0x45BF27: clusterCron (cluster.c:3123) by 0x423344: serverCron (redis.c:1239) by 0x41D6CD: aeProcessEvents (ae.c:311) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832)	2015-01-21 18:47:16 +01:00
Matt Stancliff	29049507ec	Fix potential invalid read past end of array If array has N elements, we can't read +1 if we are already at N. Also, we need to move elements by their storage size in the array, not just by individual bytes.	2015-01-21 18:01:03 +01:00
Matt Stancliff	30152554ea	Fix cluster reset memory leak [maybe] Fixes valgrind errors: 32 bytes in 4 blocks are definitely lost in loss record 107 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80A9AFC: clusterSetMaster (cluster.c:801) by 0x80AEDC9: clusterCommand (cluster.c:3994) by 0x80682A5: call (redis.c:2049) by 0x8068A20: processCommand (redis.c:2309) by 0x8076497: processInputBuffer (networking.c:1143) by 0x8073BAF: readQueryFromClient (networking.c:1208) by 0x8060E98: aeProcessEvents (ae.c:412) by 0x806123B: aeMain (ae.c:455) by 0x806C3DB: main (redis.c:3832) 64 bytes in 8 blocks are definitely lost in loss record 143 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80AAB40: clusterProcessPacket (cluster.c:801) by 0x80A847F: clusterReadHandler (cluster.c:1975) by 0x30000FF: ??? 80 bytes in 10 blocks are definitely lost in loss record 148 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80AAB40: clusterProcessPacket (cluster.c:801) by 0x80A847F: clusterReadHandler (cluster.c:1975) by 0x2FFFFFF: ???	2015-01-21 17:51:57 +01:00
Matt Stancliff	72b8574cca	Fix sending uninitialized bytes Fixes valgrind error: Syscall param write(buf) points to uninitialised byte(s) at 0x514C35D: ??? (syscall-template.S:81) by 0x456B81: clusterWriteHandler (cluster.c:1907) by 0x41D596: aeProcessEvents (ae.c:416) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832) Address 0x5f268e2 is 2,274 bytes inside a block of size 8,192 alloc'd at 0x4932D1: je_realloc (jemalloc.c:1297) by 0x428185: zrealloc (zmalloc.c:162) by 0x4269E0: sdsMakeRoomFor.part.0 (sds.c:142) by 0x426CD7: sdscatlen (sds.c:251) by 0x4579E7: clusterSendMessage (cluster.c:1995) by 0x45805A: clusterSendPing (cluster.c:2140) by 0x45BB03: clusterCron (cluster.c:2944) by 0x423344: serverCron (redis.c:1239) by 0x41D6CD: aeProcessEvents (ae.c:311) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832) Uninitialised value was created by a stack allocation at 0x457810: nodeUpdateAddressIfNeeded (cluster.c:1236)	2015-01-21 17:50:17 +01:00
antirez	2601e3e461	Cluster: node deletion cleanup / centralization.	2015-01-21 16:03:43 +01:00
antirez	59ad6ac5fe	Cluster: set the slaves->slaveof filed to NULL when master is freed. Related to issue #2289.	2015-01-21 15:55:53 +01:00
Matt Stancliff	53c082ec39	Improve networking type correctness read() and write() return ssize_t (signed long), not int. For other offsets, we can use the unsigned size_t type instead of a signed offset (since our replication offsets and buffer positions are never negative).	2015-01-19 14:10:12 -05:00
antirez	cf76af6b9f	Cluster: fetch my IP even if msg is not MEET for the first time. In order to avoid that misconfigured cluster nodes at some time may force an IP update on other nodes, it is required that nodes update their own address only on MEET messages. However it does not make sense to do this the first time a node is contacted and yet does not have an IP, we just risk that myself->ip remains not assigned if there are messages lost or cluster creation procedures that don't make sure everybody is targeted by at least one incoming MEET message. Also fix the logging of the IP switch avoiding the :-1 tail.	2015-01-13 10:50:34 +01:00
antirez	5b0f4a83ac	Cluster: clusterMsgDataGossip structure, explict padding + minor stuff. Also explicitly set version to 0, add a protocol version define, improve comments in the gossip structure. Note that the structure layout is the same after the change, we are just making the padding explicit with an additional not used 16 bits field. So this commit is still able to talk with the previous versions of cluster nodes.	2015-01-13 10:40:09 +01:00
antirez	237ab727b9	Suppress valgrind error about write sending uninitialized data. Valgrind checks that the buffers we transfer via syscalls are all composed of bytes actually initialized. This is useful, it makes we able to avoid leaking informations in non initialized parts fo messages transferred to other hosts. This commit fixes one of such issues.	2015-01-13 09:31:37 +01:00
antirez	6274a6789d	Cluster: initialize mf_end. Can't be initialized by resetManualFailover() since it's actual state the function uses, so we need to initialize it at startup time. Not really a bug in practical terms, but showed up into valgrind and is not technically correct anyway.	2015-01-12 15:55:00 +01:00
Matt Stancliff	ad41a7c404	Add addReplyBulkSds() function Refactor a common pattern into one function so we don't end up with copy/paste programming.	2014-12-23 09:31:02 -05:00
Matt Stancliff	a772747ffc	Cluster: Notify user on accept error If we woke up to accept a connection, but we can't accept it, inform the user of the error going on with their networking. (The previous message was the same for success or error!)	2014-12-17 10:49:32 -05:00
antirez	1aef29e079	Fix comment in clusterHandleSlaveFailover().	2014-12-16 15:03:12 +01:00
antirez	90c7d8cfa1	Make sure buffer is enough in clusterSendPing().	2014-12-15 10:18:22 +01:00

1 2 3 4 5 ...

574 Commits