redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-24 09:08:26 -05:00

Author	SHA1	Message	Date
Salvatore Sanfilippo	24bc807b5c	Merge pull request #576 from saj/fix-slave-ping-period Bug fix: slaves being pinged every second	2012-09-05 06:59:37 -07:00
antirez	36741b2c81	Scripting: Force SORT BY constant determinism inside SORT itself. SORT is able to return (faster than when ordering) unordered output if the "BY" clause is used with a constant value. However we try to play well with scripting requirements of determinism providing always sorted outputs when SORT (and other similar commands) are called by Lua scripts. However we used the general mechanism in place in scripting in order to reorder SORT output, that is, if the command has the "S" flag set, the Lua scripting engine will take an additional step when converting a multi bulk reply to Lua value, calling a Lua sorting function. This is suboptimal as we can do it faster inside SORT itself. This is also broken as issue #545 shows us: basically when SORT is used with a constant BY, and additionally also GET is used, the Lua scripting engine was trying to order the output as a flat array, while it was actually a list of key-value pairs. What we do now is to recognize if the caller of SORT is the Lua client (since we can check this using the REDIS_LUA_CLIENT flag). If so, and if a "don't sort" condition is triggered by the BY option with a constant string, we force the lexicographical sorting. This commit fixes this bug and improves the performance, and at the same time simplifies the implementation. This does not mean I'm smart today, it means I was stupid when I committed the original implementation ;)	2012-09-05 01:17:49 +02:00
antirez	9bd0e097aa	Sentinel: reply -IDONTKNOW to get-master-addr-by-name on lack of info. If we don't have any clue about a master since it never replied to INFO so far, reply with an -IDONTKNOW error to SENTINEL get-master-addr-by-name requests.	2012-09-04 16:06:53 +02:00
antirez	8bdde086ac	Sentinel: more easy master redirection if master is a slave. Before this commit Sentinel used to redirect master ip/addr if the current instance reported to be a slave only if this was the first INFO output received, and the role was found to be slave. Now instead also if we find that the runid is different, and the reported role is slave, we also redirect to the reported master ip/addr. This unifies the behavior of Sentinel in the case of a reboot (where it will see the first INFO output with the wrong role and will perform the redirection), with the behavior of Sentinel in the case of a change in what it sees in the INFO output of the master.	2012-09-04 15:52:04 +02:00
antirez	bb66fc3120	Send an async PING before starting replication with master. During the first synchronization step of the replication process, a Redis slave connects with the master in a non blocking way. However once the connection is established the replication continues sending the REPLCONF command, and sometimes the AUTH command if needed. Those commands are sent in a partially blocking way (blocking with timeout in the order of seconds). Because it is common for a blocked master to accept connections even if it is actually not able to reply to the slave requests, it was easy for a slave to block if the master had serious issues, but was still able to accept connections in the listening socket. For this reason we now send an asynchronous PING request just after the non blocking connection ended in a successful way, and wait for the reply before continuing with the replication process. It is very unlikely that a master replying to PING can't reply to the other commands. This solution was proposed by Didier Spezia (Thanks!) so that we don't need to turn all the replication process into a non blocking affair, but still the probability of a slave blocked is minimal even in the event of a failing master. Also we now use getsockopt(SO_ERROR) in order to check errors ASAP in the event handler, instead of waiting for actual I/O to return an error. This commit fixes issue #632.	2012-09-02 12:24:38 +02:00
antirez	e323635c2d	Scripting: Reset Lua fake client reply_bytes after command execution. Lua scripting uses a fake client in order to run commands in the context of a client, accumulate the reply, and convert it into a Lua object to return to the caller. This client is reused again and again, and is referenced by the server.lua_client globally accessible pointer. However after every call to redis.call() or redis.pcall(), that is handled by the luaRedisGenericCommand() function, the reply_bytes field of the client was not set back to zero. This field is used to estimate the amount of memory currently used in the reply. Because of the lack of reset, script after script executed, this value used to get bigger and bigger, and in the end on 32 bit systems it triggered the following assert: redisAssert(c->reply_bytes < ULONG_MAX-(1024*64)); On 64 bit systems this does not happen because it takes too much time to reach values near to 2^64 for users to see the practical effect of the bug. Now in the cleanup stage of luaRedisGenericCommand() we reset the reply_bytes counter to zero, avoiding the issue. It is not practical to add a test for this bug, but the fix was manually tested using a debugger. This commit fixes issue #656.	2012-08-31 11:15:02 +02:00
antirez	46c31a150a	Scripting: require at least one argument for redis.call(). Redis used to crash with a call like the following: EVAL "redis.call()" 0 Now the explicit check for at least one argument prevents the problem. This commit fixes issue #655.	2012-08-31 10:28:13 +02:00
antirez	6276434ad2	Sentinel: do not crash against slaves not publishing the runid. Older versions of Redis (before 2.4.17) don't publish the runid field in INFO. This commit makes Sentinel able to handle that without crashing.	2012-08-30 18:01:52 +02:00
antirez	58186b9dcf	Sentinel: INFO command implementation.	2012-08-29 12:44:24 +02:00
antirez	8246e58abe	Sentinel: add Redis execution mode to INFO output. The new "redis_mode" field in the INFO output will show if Redis is running in standalone mode, cluster, or sentinel mode.	2012-08-29 11:44:01 +02:00
antirez	3ec701e059	Sentinel: Sentinel-side support for slave priority. The slave priority that is now published by Redis in INFO output is now used by Sentinel in order to select the slave with minimum priority for promotion, and in order to consider slaves with priority set to 0 as not able to play the role of master (they will never be promoted by Sentinel). The "slave-priority" field is now one of the fields that Sentinel publishes when describing an instance via the SENTINEL commands such as "SENTINEL slaves mastername".	2012-08-28 17:45:01 +02:00
antirez	169a44cbd6	Sentinel: Redis-side support for slave priority. A Redis slave can now be configured with a priority, that is an integer number that is shown in INFO output and can be get and set using the redis.conf file or the CONFIG GET/SET command. This field is used by Sentinel during slave election. A slave with lower priority is preferred. A slave with priority zero is never elected (and is considered to be impossible to elect even if it is the only slave available). A next commit will add support in the Sentinel side as well.	2012-08-28 17:20:26 +02:00
antirez	c14e0ecafd	Sentinel: suppress harmless warning by initializing 'table' to NULL. Note that the assertion guarantees that one of the if branches setting table is always entered.	2012-08-28 12:56:05 +02:00
antirez	784b93087c	Incrementally flush RDB on disk while loading it from a master. This fixes issue #539. Basically if there is enough free memory the OS may buffer the RDB file that the slave transfers on disk from the master. The file may actually be flushed on disk at once by the operating system when it gets closed by Redis, causing the close system call to block for a long time. This patch is a modified version of one provided by yoav-steinberg of @garantiadata (the original version was posted in the issue #539 comments), and tries to flush the OS buffers incrementally (every 8 MB of loaded data).	2012-08-28 12:47:33 +02:00
antirez	1caa627e4e	Fix a forget zmalloc_oom() -> zmalloc_oom_handler() replacement.	2012-08-24 15:40:22 +02:00
antirez	6fdc635447	Better Out of Memory handling. The previous implementation of zmalloc.c was not able to handle out of memory in an application-specific way. It just logged an error on standard error, and aborted. The result was that in the case of an actual out of memory in Redis where malloc returned NULL (In Linux this actually happens under specific overcommit policy settings and/or with no or little swap configured) the error was not properly logged in the Redis log. This commit fixes this problem, fixing issue #509. Now the out of memory is properly reported in the Redis log and a stack trace is generated. The approach used is to provide a configurable out of memory handler to zmalloc (otherwise the default one logging the event on the standard output is used).	2012-08-24 12:55:37 +02:00
antirez	850789ce73	Sentinel: send SCRIPT KILL on -BUSY reply and SDOWN instance. From the point of view of Redis an instance replying -BUSY is down, since it is effectively not able to reply to user requests. However a looping script is a recoverable condition in Redis if the script still did not perform any write to the dataset. In that case performing a fail over is not optimal, so Sentinel now tries to restore the normal server condition killing the script with a SCRIPT KILL command. If the script already performed some write before entering an infinite (or long enough to timeout) loop, SCRIPT KILL will not work and the fail over will be triggered anyway.	2012-08-24 12:29:54 +02:00
antirez	01477753e6	Sentinel: fixed a crash on script execution. The call to sentinelScheduleScriptExecution() lacked the final NULL argument to signal the end of arguments. This resulted into a crash.	2012-08-24 12:10:24 +02:00
Salvatore Sanfilippo	724371d748	Merge pull request #628 from pietern/unstable-zip Fix ziplist edge case	2012-08-22 02:32:27 -07:00
antirez	227b429364	redis-benchmark: disable big buffer cleanup in hiredis context. This new hiredis feature allows us to reuse a previous context reader buffer even if already very big in order to maximize performances with big payloads (Usually hiredis re-creates buffers when they are too big and unused in order to save memory).	2012-08-21 17:31:44 +02:00
Pieter Noordhuis	3cc2904e3e	Set p to its new offset before modifying it	2012-08-13 14:13:09 -07:00
Pieter Noordhuis	5117c20ab5	Add ziplist test for deleting next to last entries	2012-08-13 14:09:40 -07:00
antirez	cada7f9671	Sentinel: SENTINEL FAILOVER command implemented. This command can be used in order to force a Sentinel instance to start a failover for the specified master, as leader, forcing the failover even if the master is up. The commit also adds some minor refactoring and other improvements to functions already implemented that make them able to work when the master is not in SDOWN condition. For instance slave selection assumed that we ask INFO every second to every slave, this is true only when the master is in SDOWN condition, so slave selection did not work when the master was not in SDOWN condition.	2012-08-03 12:41:27 +02:00
antirez	6275004ca6	Sentinel: client reconfiguration script execution. This commit adds support to optionally execute a script when one of the following events happen: * The failover starts (with a slave already promoted). * The failover ends. * The failover is aborted. The script is called with enough parameters (documented in the example sentinel.conf file) to provide information about the old and new ip:port pair of the master, the role of the sentinel (leader or observer) and the name of the master. The goal of the script is to inform clients of the configuration change in a way specific to the environment Sentinel is running, that can't be implemented in a general way inside Sentinel itself.	2012-08-02 18:40:30 +02:00
antirez	fd92b366b0	Sentinel: when leader in wait-start, sense another leader as race. When we are in wait start, if another leader (or any other external entity) turns a slave into a master, abort the failover, and detect it as an observer. Note that the wait-start state is mainly there for this reason but the abort was yet not implemented. This adds a new sentinel event -failover-abort-race.	2012-07-31 17:11:26 +02:00
antirez	91c15ed1b5	Sentinel: sentinelRefreshInstanceInfo() comments improved a bit.	2012-07-31 16:18:15 +02:00
Michael Parker	f1d187bb3e	Use correct variable name for value to convert. Note by @antirez: this code was never compiled because utils.c lacked the float.h include, so we never noticed this variable was misspelled in the past. This should provide a noticeable speed boost when saving certain types of databases with many sorted sets inside.	2012-07-31 11:48:00 +02:00
antirez	75084e057d	Sentinel: abort failover when in wait-start if master is back. When we are a Leader Sentinel in wait-start state, starting with this commit the failover is aborted if the master returns online. This improves the way we handle a notable case of net split, that is the split between Sentinels and Redis servers, that will be a very common case of split because Sentinels will often be installed in the client's network and servers can be in a different arm of the network. When Sentinels and Redis servers are isolated the master is in ODOWN condition since the Sentinels can agree about this state, however the failover does not start since there are no good slaves to promote (in this specific case all the slaves are unreachable). However when the split is resolved, Sentinels may sense the slave back a moment before they sense the master is back, so the failover may start without a good reason (since the master is actually working too). Now this condition is reversible, so the failover will be aborted immediately after if the master is detected to be working again, that is, not in SDOWN nor in ODOWN condition.	2012-07-31 10:19:34 +02:00
antirez	7f5bdba434	Merge remote-tracking branch 'origin/unstable' into unstable	2012-07-28 20:55:17 +02:00
antirez	3f194a9d25	Sentinel: scripts execution engine improved. We no longer use a vanilla fork+execve but take a queue of jobs of scripts to execute, with retry on error, timeouts, and so forth. Currently this is used only for notifications but soon the ability to also call clients reconfiguration scripts will be added.	2012-07-28 20:54:27 +02:00
Jan-Erik Rediger	c6c19c8372	Include sys/wait.h to avoid compiler warning gcc warned about an implicit declaration of function 'wait3'. Including this header fixes this.	2012-07-28 12:33:01 +03:00
Salvatore Sanfilippo	ed7b308c1c	Merge pull request #587 from saj/truncate-short-write-from-aof Truncate short write from the AOF	2012-07-27 03:56:48 -07:00
Salvatore Sanfilippo	04950a9e4d	Merge pull request #586 from saj/aof_last_bgrewrite_status New in INFO: aof_last_bgrewrite_status	2012-07-27 03:55:20 -07:00
antirez	ce7b838fb9	Sentinel: don't start a failover as leader if there is no good slave.	2012-07-26 12:09:40 +02:00
antirez	baace5fc42	Sentinel: ability to execute notification scripts.	2012-07-25 16:33:37 +02:00
Salvatore Sanfilippo	42c571864e	Merge pull request #603 from mrb/fix_sentinel_config_warning Fix warning in redis.c for sentinel config load	2012-07-25 07:15:53 -07:00
mrb	f1c8661e74	Fix warning in redis.c for sentinel config load	2012-07-25 09:55:53 -04:00
antirez	672102c2ce	Sentinel: abort failover if no good slave is available. The previous behavior of the state machine was to wait some time and retry the slave selection, but this is not robust enough against drastic changes in the conditions of the monitored instances. What we do now when the slave selection fails is to abort the failover and return back monitoring the master. If the ODOWN condition is still present a new failover will be triggered and so forth. This commit also refactors the code we use to abort a failover.	2012-07-25 11:32:19 +02:00
antirez	9e5bef38e6	Sentinel: reset pending_commands in a more generic way.	2012-07-24 18:57:26 +02:00
antirez	a23a5b6c7d	Prevent a spurious +sdown event on switch. When we reset the master we should start with clean timestamps for ping replies otherwise we'll detect a spurious +sdown event, because on +master-switch event the previous master instance was probably in +sdown condition. Since we updated the address we should count time from scratch again. Also this commit makes sure to explicitly reset the count of pending commands, now we can do this because of the new way the hiredis link is closed.	2012-07-24 18:46:04 +02:00
antirez	d918e6f127	Sentinel: debugging message removed.	2012-07-24 18:20:05 +02:00
antirez	75fb6e5b8a	Sentinel: changes to connection handling and redirection. We disconnect the Redis instances hiredis link in a more robust way now. Also we change the way we perform the redirection for the +switch-master event, that is not just an instance reset with an address change. Using the same system we now implement the +redirect-to-master event that is triggered by an instance that is configured to be master but found to be a slave at the first INFO reply. In that case we monitor the master instead, logging the incident as an event.	2012-07-24 18:15:44 +02:00
antirez	2179c26916	Sentinel: check that instance still exists in reply callbacks. We can't be sure the instance object still exists when the reply callback is called.	2012-07-24 16:37:57 +02:00
antirez	d876d6feac	Sentinel: more robust failover detection as observer. Sentinel observers detect failover checking if a slave attached to the monitored master turns into its replication state from slave to master. However while this change may in theory only happen after a SLAVEOF NO ONE command, in practice it is very easy to reboot a slave instance with a wrong configuration that turns it into a master, especially if it was a past master before a successful failover. This commit changes the detection policy so that if an instance goes from slave to master, but at the same time the runid has changed, we sense a reboot, and in that case we don't detect a failover at all. This commit also introduces the "reboot" sentinel event, that is logged at "warning" level (so this will trigger an admin notification). The commit also fixes a problem in the disconnect handler that assumed that the instance object always existed, that is not the case. Now we no longer assume that redisAsyncFree() will call the disconnection handler before returning.	2012-07-24 12:42:40 +02:00
antirez	6b5daa2df2	First implementation of Redis Sentinel. This commit implements the first, beta quality implementation of Redis Sentinel, a distributed monitoring system for Redis with notification and automatic failover capabilities. More info at http://redis.io/topics/sentinel	2012-07-23 13:14:44 +02:00
antirez	03f412ddef	Merge remote-tracking branch 'origin/unstable' into unstable	2012-07-22 17:18:42 +02:00
antirez	5d73073f6e	Allow Pub/Sub in contexts where other commands are blocked. Redis loading data from disk, and a Redis slave disconnected from its master with serve-stale-data disabled, are two conditions where commands are normally refused by Redis, returning an error. However there is no reason to disable Pub/Sub commands as well, given that this layer does not interact with the dataset. To allow Pub/Sub in as many contexts as possible is especially interesting now that Redis Sentinel uses Pub/Sub of a Redis master as a communication channel between Sentinels. This commit allows Pub/Sub to be used in the above two contexts where it was previously denied.	2012-07-22 17:18:16 +02:00
antirez	b62bdf1c64	Don't assume that "char" is signed. For the C standard char can be either signed or unsigned, it's up to the compiler, but Redis assumed that it was signed in a few places. The practical effect of this patch is that now Redis 2.6 will run correctly in every system where char is unsigned, notably the Raspberry Pi and other ARM systems with GCC. Thanks to Georgi Marinov (@eesn on twitter) that reported the problem and allowed me to use his Raspberry Pi via SSH to trace and fix the issue!	2012-07-18 12:04:58 +02:00
Saj Goonatilleke	55302e9e28	Truncate short write from the AOF If Redis only manages to write out a partial buffer, the AOF file won't load back into Redis the next time it starts up. It is better to discard the short write than waste time running redis-check-aof.	2012-07-18 10:35:17 +10:00
Saj Goonatilleke	48553a29e8	New in INFO: aof_last_bgrewrite_status Behaves like rdb_last_bgsave_status -- even down to reporting 'ok' when no rewrite has been done yet. (You might want to check that aof_last_rewrite_time_sec is not -1.)	2012-07-18 09:54:55 +10:00

1622 Commits