redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-23 00:28:26 -05:00

Author	SHA1	Message	Date
Matt Stancliff	e942f3ce0f	Cluster: Restore proper trib master iteration This got removed in `2e5c394` during a new feature addition. The prior commit had "break if masters.length == masters_count" but we are guaranteed to aready have that condition met since otherwise we would haven't gotten this far. Without this break statement, it's possible some masters may be forgotten and have zero replicas while other masters have more than their requested number of replicas. Thanks to carlos for pointing out this regression at: https://groups.google.com/forum/#!topic/redis-db/_WVVqDw5B7c	2014-03-24 10:17:44 -04:00
antirez	93253c2762	Sample and cache RSS in serverCron(). Obtaining the RSS (Resident Set Size) info is slow in Linux and OSX. This slowed down the generation of the INFO 'memory' section. Since the RSS does not require to be a real-time measurement, we now sample it with server.hz frequency (10 times per second by default) and use this value both to show the INFO rss field and to compute the fragmentation ratio. Practically this does not make any difference for memory profiling of Redis but speeds up the INFO call significantly.	2014-03-24 12:00:20 +01:00
antirez	30639c8ca9	sdscatvprintf(): Try to use a static buffer. For small content the function now tries to use a static buffer to avoid a malloc/free cycle that is too costly when the function is used in the context of performance critical code path such as INFO output generation. This change was verified to have positive effects in the execution speed of the INFO command.	2014-03-24 10:20:33 +01:00
antirez	d3efe04c47	Cache uname() output across INFO calls. Uname was profiled to be a slow syscall. It produces always the same output in the context of a single execution of Redis, so calling it at every INFO output generation does not make too much sense. The uname utsname structure was modified as a static variable. At the same time a static integer was added to check if we need to call uname the first time.	2014-03-24 10:00:08 +01:00
antirez	a9caca0424	sdscatvprintf(): guess buflen using format length. sdscatvprintf() uses a loop where it tries to output the formatted string in a buffer of the initial length, if there was not enough room, a buffer of doubled size is tried and so forth. The initial guess for the buffer length was very poor, an hardcoded "16". This caused the printf to be processed multiple times without a good reason. Given that printf functions are already not fast, the overhead was significant. The new heuristic is to use a buffer 4 times the length of the format buffer, and 32 as minimal size. This appears to be a good balance for typical uses of the function inside the Redis code base. This change improved INFO command performances 3 times.	2014-03-24 09:44:11 +01:00
antirez	6972f18cbd	Add test-lru.rb to utils. This is a program useful to evaluate the Redis LRU algorithm behavior.	2014-03-21 09:52:05 +01:00
antirez	4d2e8fa189	Use getLRUClock() instead of server.lruclock to create objects. Thanks to Matt Stancliff for noticing this error. It was in the original code but somehow I managed to remove the change from the commit...	2014-03-21 09:08:20 +01:00
antirez	5fa3248bad	The default maxmemory policy is now noeviction. This is safer as by default maxmemory should just set a memory limit without any key to be deleted, unless the policy is set to something more relaxed.	2014-03-21 08:03:34 +01:00
antirez	a98369929e	Use 24 bits for the lru object field and improve resolution. There were 2 spare bits inside the Redis object structure that are now used in order to enlarge 4x the range of the LRU field. At the same time the resolution was improved from 10 to 1 second: this still provides 194 days before the LRU counter overflows (restarting from zero). This is not a problem since it only causes lack of eviction precision for objects not touched for a very long time, and the lack of precision is only temporary.	2014-03-20 17:56:27 +01:00
antirez	f4da796c53	Default LRU samples is now 5.	2014-03-20 17:05:42 +01:00
antirez	c641b670c3	Use new dictGetRandomKeys() API to get samples for eviction. The eviction quality degradates a bit in my tests, but since the API is faster, it allows to raise the number of samples, and overall is a win.	2014-03-20 16:52:12 +01:00
antirez	82b53c650c	struct dictEntry -> dictEntry.	2014-03-20 16:20:37 +01:00
antirez	5317f5e99a	Added dictGetRandomKeys() to dict.c: mass get random entries. This new function is useful to get a number of random entries from an hash table when we just need to do some sampling without particularly good distribution. It just jumps at a random place of the hash table and returns the first N items encountered by scanning linearly. The main usefulness of this function is to speedup Redis internal sampling of the key space, for example for key eviction or expiry.	2014-03-20 15:50:46 +01:00
antirez	22c9cfaf57	LRU eviction pool implementation. This is an improvement over the previous eviction algorithm where we use an eviction pool that is persistent across evictions of keys, and gets populated with the best candidates for evictions found so far. It allows to approximate LRU eviction at a given number of samples better than the previous algorithm used.	2014-03-20 11:57:29 +01:00
antirez	6d5790d682	Fix OBJECT IDLETIME return value converting to seconds. estimateObjectIdleTime() returns a value in milliseconds now, so we need to scale the output of OBJECT IDLETIME to seconds.	2014-03-20 11:55:18 +01:00
antirez	ad6b0f70b2	Obtain LRU clock in a resolution dependent way. For testing purposes it is handy to have a very high resolution of the LRU clock, so that it is possible to experiment with scripts running in just a few seconds how the eviction algorithms works. This commit allows Redis to use the cached LRU clock, or a value computed on demand, depending on the resolution. So normally we have the good performance of a precomputed value, and a clock that wraps in many days using the normal resolution, but if needed, changing a define will switch behavior to an high resolution LRU clock.	2014-03-20 11:47:12 +01:00
antirez	1faf82663f	Specify lruclock in redisServer structure via REDIS_LRU_BITS. The padding field was totally useless: removed.	2014-03-20 11:37:27 +01:00
antirez	d77e231682	Specify LRU resolution in milliseconds.	2014-03-20 11:33:25 +01:00
antirez	fe30847016	Set LRU parameters via REDIS_LRU_BITS define.	2014-03-20 11:22:47 +01:00
antirez	e150ec7d0c	Unify stats reset for CONFIG RESETSTAT / initServer(). Now CONFIG RESETSTAT makes sure to reset all the fields, and in the future it will be simpler to avoid missing new fields.	2014-03-19 12:55:49 +01:00
antirez	b9e90a70fa	Sentinel: sentinelRefreshInstanceInfo() minor refactoring. Test sentinel.tilt condition on top and return if it is true. This allows to remove the check for the tilt condition in the remaining code paths of the function.	2014-03-18 15:35:47 +01:00
antirez	258d377d8e	Sentinel test: 02 unit better coverage + refactoring.	2014-03-18 15:18:51 +01:00
antirez	58f104e2ee	Sentinel test: foreach_instance_id implements 'break'.	2014-03-18 15:06:52 +01:00
antirez	2586ea7637	Sentinel: instance_is_killed proc added to sentinel.tcl.	2014-03-18 14:58:27 +01:00
antirez	218cc5fc39	Sentinel: propagate down-after-ms changes to slaves and sentinels.	2014-03-18 14:37:44 +01:00
antirez	bb6d850160	Sentinel: down-after-milliseconds is not master-specific. addReplySentinelRedisInstance() modified so that this field is displayed for all the kind of instances: Sentinels, Masters, Slaves.	2014-03-18 11:21:17 +01:00
antirez	ae0b7680b3	Sentinel failure detection implementation improved. Failure detection in Sentinel is ping-pong based. It used to work by remembering the last time a valid PONG reply was received, and checking if the reception time was too old compared to the current current time. PINGs were sent at a fixed interval of 1 second. This works in a decent way, but does not scale well when we want to set very small values of "down-after-milliseconds" (this is the node timeout basically). This commit reiplements the failure detection making a number of changes. Some changes are inspired to Redis Cluster failure detection code: * A new last_ping_time field is added in representation of instances. If non zero, we have an active ping that was sent at the specified time. When a valid reply to ping is received, the field is zeroed again. * last_ping_time is not reset when we reconnect the link or send a new ping, so from our point of view it represents the time we started waiting for the instance to reply to our pings without receiving a reply. * last_ping_time is now used in order to check if the instance is timed out. This means that we can have a node timeout of 100 milliseconds and yet the system will work well since the new check is not bound to the period used to send pings. * Pings are now sent every second, or often if the value of down-after-milliseconds is less than one second. With a lower limit of 10 HZ ping frequency. * Link reconnection code was improved. This is used in order to try to reconnect the link when we are at 50% of the node timeout without a valid reply received yet. However the old code triggered unnecessary reconnections when the node timeout was very small. Now that should be ok. The new code passes the tests but more testing is needed and more unit tests stressing the failure detector, so currently this is merged only in the unstable branch.	2014-03-17 18:33:45 +01:00
antirez	3a2ff55617	Sentinel: use CLIENT SETNAME when connecting to Redis. This makes debugging / monitoring of Sentinels simpler since you can identify sentinels in CLIENT LIST output of Redis instances.	2014-03-15 14:59:23 +01:00
Salvatore Sanfilippo	c65b75e740	Merge pull request #1608 from mattsta/fix-sentinel-current-epoch-segfault Fix segfault from accessing array out of bounds	2014-03-14 22:56:24 +01:00
Matt Stancliff	584052ee6b	Fix segfault from accessing array out of bounds argc == 2; argv[2] == crash	2014-03-14 17:38:05 -04:00
antirez	ed813863f0	Sentinel: be safe under crash-recovery assumptions. Sentinel's main safety argument is that there are no two configurations for the same master with the same version (configuration epoch). For this to be true Sentinels require to be authorized by a majority. Additionally Sentinels require to do two important things: * Never vote again for the same epoch. * Never exchange an old vote for a fresh one. The first prerequisite, in a crash-recovery system model, requires to persist the master->leader_epoch on durable storage before to reply to messages. This was not the case. We also make sure to persist the current epoch in order to never reply to stale votes requests from other Sentinels, after a recovery. The configuration is persisted by making use of fsync(), this is considered in the context of this code a good enough guarantee that after a restart our durable state is restored, however this may not always be the case depending on the kind of hardware and operating system used.	2014-03-14 14:58:44 +01:00
antirez	365094028b	Sentinel: fake PUBLISH command to receive HELLO messages. Now the way HELLO messages are received is unified. Now it is no longer needed for Sentinels to converge to the higher configuration for a master to be able to chat via some Redis instance, the are able to directly exchanges configurations. Note that this commit does not include the (trivial) change needed to send HELLO messages to Sentinel instances as well, since for an error I committed the change in the previous commit that refactored hello messages processing into a separated function.	2014-03-14 11:07:42 +01:00
antirez	9dfe426fc8	Sentinel: HELLO processing refactored into sentinelProcessHelloMessage().	2014-03-14 11:07:42 +01:00
antirez	133fccb03f	Cluster: flag the transaction as dirty for the new redirections.	2014-03-13 15:11:53 +01:00
antirez	429aff4ef4	Linenoise updated, multiline mode enabled in redis-cli.	2014-03-13 15:11:08 +01:00
antirez	cc11d103c0	redis-trib: call MIGRATE via r.client.call as fix for redis-rb API changes. See issue #1593. Thanks to @badboy for suggesting the direct client.call fix.	2014-03-11 16:10:13 +01:00
antirez	df32eb6827	redis-trib: new subcommand 'call'. Exec command in all nodes. Example: ./redis-trib.rb call 192.168.1.11:7000 config get cluster-node-timeout	2014-03-11 14:58:55 +01:00
antirez	2e5c394fa8	redis-trib: create subcommand is now able to assign spare slaves. Example: if the user will try to configure a cluster with 9 nodes, asking for 1 slave for master, redis-trib will configure a 4 masters cluster with 1 slave each as usually, but this time will assign the spare node as a slave of one of the masters.	2014-03-11 14:17:28 +01:00
antirez	e26f4486b0	Cluster: update node configEpoch on UPDATE messages. The UPDATE message contains the configEpoch of the node configuration advertised in the packet. Update it if needed.	2014-03-11 11:53:09 +01:00
antirez	a2ff90919f	Cluster: set slot error if we receive an update for a busy slot. By manually modifying nodes configurations in random ways, it is possible to create the following scenario: A is serving keys for slot 10 B is manually configured to serve keys for slot 10 A receives an update from B (or another node) where it is informed that the slot 10 is now claimed by B with a greater configuration epoch, however A still has keys from slot 10. With this commit A will put the slot in error setting it in IMPORTING state, so that redis-trib can detect the issue.	2014-03-11 11:49:47 +01:00
antirez	1ed0ad77f0	Cluster: clarified a comment in clusterUpdateSlotsConfigWith().	2014-03-11 11:32:40 +01:00
antirez	8287945ff8	Cluster: flush importing/migrating state when master is turned into slave.	2014-03-11 11:22:06 +01:00
antirez	2e8e0ad44e	Cluster: clusterCloseAllSlots() added.	2014-03-11 11:16:18 +01:00
antirez	8eae54aa1e	DEBUG ERROR implemented. The new "error" subcommand of the DEBUG command can reply with an user selected error, specified as its sole argument: DEBUG ERROR "LOADING please wait..." The error is generated just prefixing the command argument with a "-" character, and replacing newlines with spaces (since error replies can't include newlines). The goal of the command is to help in Client libraries unit tests by making simple to simulate a command call triggering a given error.	2014-03-10 23:01:55 +01:00
antirez	2705306ba1	DEBUG CMDKEYS: provide some guarantee to getKeysFromCommand(). getKeysFromCommand() is designed to be called with the command arguments passing the basic arity checks described in the command table. DEBUG CMDKEYS must provide the same guarantees for calling getKeysFromCommand() to be safe.	2014-03-10 16:43:38 +01:00
antirez	5b864617bc	Cluster: make sortGetKeys() able to handle multiple STORE options. It does not make sense to pass multiple store options, so, better to handle it ;-)	2014-03-10 16:39:07 +01:00
antirez	c4ef1d6494	DEBUG CMDKEYS added for getKeysFromCommand() testing. Examples: redis 127.0.0.1:6379> debug cmdkeys set foo bar 1) "foo" redis 127.0.0.1:6379> debug cmdkeys mget a b c 1) "a" 2) "b" 3) "c" redis 127.0.0.1:6379> debug cmdkeys zunionstore foo 2 a b 1) "a" 2) "b" 3) "foo" redis 127.0.0.1:6379> debug cmdkeys ping (empty list or set)	2014-03-10 16:36:08 +01:00
antirez	3e1d772677	Cluster: don't allow BY option of SORT as well. There is the exception of a "constant" BY pattern that is used in order to signal to don't sort at all. In this case no lookup is needed so it is possible to support this case in Cluster mode.	2014-03-10 16:28:18 +01:00
antirez	04cf02e8dc	Cluster: SORT get keys helper implemented.	2014-03-10 16:26:08 +01:00
antirez	21765c8588	Cluster: evalGetKeys() fixed: was not setting keys count.	2014-03-10 16:23:42 +01:00

1 2 3 4 5 ...

4044 Commits