redict

mirror of https://codeberg.org/redict/redict.git synced 2025-01-24 09:08:26 -05:00

Author	SHA1	Message	Date
antirez	3d5b2d41b5	RDB: clarify a condition in rdbLoadRio().	2020-04-09 11:09:40 +02:00
antirez	30adc62232	RDB: load files faster avoiding useless free+realloc. Reloading of the RDB generated by DEBUG POPULATE 5000000 SAVE is now 25% faster. This commit also prepares the ability to have more flexibility when loading stuff from the RDB, since we no longer use dbAdd() but can control exactly how things are added in the database.	2020-04-09 10:24:46 +02:00
qetu3790	2c42f6a8b8	fix comments about RESIZE DB opcode in rdb.c fix comments about RESIZE DB opcode in rdb.c	2020-04-06 20:52:32 +08:00
Salvatore Sanfilippo	2f53ad30ee	Merge pull request #6813 from guybe7/debug_object_pass_keyname DEBUG OBJECT should pass keyname to module when loading	2020-04-02 16:20:48 +02:00
WuYunlong	f6029fb925	Fix master replica inconsistency for upgrading scenario. Before this commit, when upgrading a replica, expired keys will not be loaded, thus causing replica having less keys in db. To this point, master and replica's keys is logically consistent. However, before the keys in master and replica are physically consistent, that is, they have the same dbsize, if master got a problem and the replica got promoted and becomes new master of that partition, and master updates a key which does not exist on master, but physically exists on the old master(new replica), the old master would refuse to update the key, thus causing master and replica data inconsistent. How could this happen? That's all because of the wrong judgement of roles while starting up the server. We can not use server.masterhost to judge if the server is master or replica, since it fails in cluster mode. When we start the server, we load rdb and do want to load expired keys, and do not want to have the ability to active expire keys, if it is a replica.	2020-03-18 16:22:07 +08:00
Oran Agra	6fff2cf9b6	module api docs for aux_save and aux_load	2020-02-16 15:43:19 +02:00
Oran Agra	fe7e8dc955	Add handling of short read of module id in rdb	2020-02-05 19:47:09 +02:00
Guy Benoish	d72b7ed4fb	DEBUG OBJECT should pass keyname to module when loading	2020-01-30 19:15:12 +05:30
Salvatore Sanfilippo	e916058f0b	Merge pull request #6557 from oranagra/rm_lru_lfu_revized rename RN_SetLRUOrLFU -> RM_SetLRU and RN_SetLFU	2019-11-19 11:58:07 +01:00
Oran Agra	28c20b4ef9	rename RN_SetLRUOrLFU -> RM_SetLRU and RN_SetLFU - the API name was odd, separated to two apis one for LRU and one for LFU - the LRU idle time was in 1 second resolution, which might be ok for RDB and RESTORE, but i think modules may need higher resolution - adding tests for LFU and for handling maxmemory policy mismatch	2019-11-10 09:27:01 +02:00
antirez	0f026af185	Merge branch 'unstable' of github.com:/antirez/redis into unstable	2019-11-06 17:44:42 +01:00
antirez	824f5f0b7a	Update PR #6537 patch to for generality. After the thread in #6537 and thanks to the suggestions received, this commit updates the original patch in order to: 1. Solve the problem of updating the time in multiple places by updating it in call(). 2. Avoid introducing a new field but use our cached time. This required some minor refactoring to the function updating the time, and the introduction of a new cached time in microseconds in order to use less gettimeofday() calls.	2019-11-05 10:14:34 +01:00
Oran Agra	51c3ff8d75	Modules hooks: complete missing hooks for the initial set of hooks * replication hooks: role change, master link status, replica online/offline * persistence hooks: saving, loading, loading progress * misc hooks: cron loop, shutdown, module loaded/unloaded * change the way hooks test work, and add tests for all of the above startLoading() now gets flag indicating what is loaded. stopLoading() now gets an indication of success or failure. adding startSaving() and stopSaving() with similar args and role.	2019-10-29 17:59:09 +02:00
Salvatore Sanfilippo	c328c807e7	Merge pull request #6496 from oranagra/module_tests_valgrind Make module tests pass with valgrind, and fix a leak in diskless load	2019-10-28 10:05:52 +01:00
Wander Hillen	fb1f4f4e7e	Merge branch 'unstable' into minor-typos	2019-10-25 10:18:26 +02:00
Oran Agra	be352633a0	Make module tests pass with valgrind, and fix a leak in diskless load	2019-10-24 09:45:25 +03:00
Yossi Gottlieb	0db3b0a0ff	Merge remote-tracking branch 'upstream/unstable' into tls	2019-10-16 17:08:07 +03:00
Yossi Gottlieb	c469f6ad9e	Code review minor changes (names, comments).	2019-10-15 17:21:51 +03:00
Oran Agra	5a47794606	diskless replication rdb transfer uses pipe, and writes to sockets form the parent process. misc: - handle SSL_has_pending by iterating though these in beforeSleep, and setting timeout of 0 to aeProcessEvents - fix issue with epoll signaling EPOLLHUP and EPOLLERR only to the write handlers. (needed to detect the rdb pipe was closed) - add key-load-delay config for testing - trim connShutdown which is no longer needed - rioFdsetWrite -> rioFdWrite - simplified since there's no longer need to write to multiple FDs - don't detect rdb child exited (don't call wait3) until we detect the pipe is closed - Cleanup bad optimization from rio.c, add another one	2019-10-07 21:06:30 +03:00
Yossi Gottlieb	b087dd1db6	TLS: Connections refactoring and TLS support. * Introduce a connection abstraction layer for all socket operations and integrate it across the code base. * Provide an optional TLS connections implementation based on OpenSSL. * Pull a newer version of hiredis with TLS support. * Tests, redis-cli updates for TLS support.	2019-10-07 21:06:13 +03:00
antirez	de1f82aa33	Function renamed hasForkChild() -> hasActiveChildProcess().	2019-09-27 12:03:09 +02:00
antirez	beb1356b5b	Improve error message in BGSAVE.	2019-09-27 11:59:37 +02:00
Salvatore Sanfilippo	6129758558	Merge branch 'unstable' into modules_fork	2019-09-27 11:24:06 +02:00
suntiawnen	781f3fd8be	fix rdb function rdbLoadIntegerObject comment	2019-09-06 12:01:44 +08:00
Oran Agra	3bfcae247a	Fix to module aux data rdb format for backwards compatibility with old check-rdb When implementing the code that saves and loads these aux fields we used rdb format that was added for that in redis 5.0, but then we added the 'when' field which meant that the old redis-check-rdb won't be able to skip these. this fix adds an opcode as if that 'when' is part of the module data.	2019-09-05 14:11:37 +03:00
Oran Agra	3b6aeea44c	Implement module api for aux data in rdb Other changes: * fix memory leak in error handling of rdb loading of type OBJ_MODULE	2019-07-22 21:15:33 +03:00
antirez	bc5cb168f5	RDB: fix MODULE_AUX loading by continuing to next opcode. Thanks to @JohnSully for noticing this problem.	2019-07-19 11:12:46 +02:00
antirez	5f450e4928	RDB: make sure to abort on LZF encoding error.	2019-07-18 18:59:38 +02:00
antirez	bd0f06c18c	RDB: handle encoding errors with rdbExitReportCorruptRDB(). Without such change, the diskless replicas, when loading RDB files from the socket will not abort when a broken RDB file gets loaded. This is potentially unsafe, because right now Redis is not able to guarantee that encoding errors are safe from the POV of memory corruptions (for instance the LZF library may not be safe against untrusted data?) so better to abort when the RDB file we are going to load is corrupted. Instead I/O errors are still returned to the caller without aborting, so that in case of short read the diskless replica can try again.	2019-07-18 18:51:45 +02:00
antirez	947319cadd	RDB: update rdbLoadRio comment about EOF condition.	2019-07-18 12:37:55 +02:00
antirez	42b6305964	RDB: try to make error handling code more readable.	2019-07-17 17:30:02 +02:00
antirez	5189db3d81	RDB: modify rdbReportError() var name for clarity.	2019-07-17 16:46:22 +02:00
antirez	48d91cf4cc	Rio: remember read/write error conditions.	2019-07-17 16:46:22 +02:00
Oran Agra	c56b4ddc6f	prevent diskless replica from terminating on short read now that replica can read rdb directly from the socket, it should avoid exiting on short read and instead try to re-sync. this commit tries to have minimal effects on non-diskless rdb reading. and includes a test that tries to trigger this scenario on various read cases.	2019-07-17 16:46:22 +02:00
Oran Agra	56258c6b7d	Module API for Forking * create module API for forking child processes. * refactor duplicate code around creating and tracking forks by AOF and RDB. * child processes listen to SIGUSR1 and dies exitFromChild in order to eliminate a valgrind warning of unhandled signal. * note that BGSAVE error reply has changed. valgrind error is: Process terminating with default action of signal 10 (SIGUSR1)	2019-07-17 16:40:24 +03:00
Oran Agra	2de544cfcc	diskless replication on slave side (don't store rdb to file), plus some other related fixes The implementation of the diskless replication was currently diskless only on the master side. The slave side was still storing the received rdb file to the disk before loading it back in and parsing it. This commit adds two modes to load rdb directly from socket: 1) when-empty 2) using "swapdb" the third mode of using diskless slave by flushdb is risky and currently not included. other changes: -------------- distinguish between aof configuration and state so that we can re-enable aof only when sync eventually succeeds (and not when exiting from readSyncBulkPayload after a failed attempt) also a CONFIG GET and INFO during rdb loading would have lied When loading rdb from the network, don't kill the server on short read (that can be a network error) Fix rdb check when performed on preamble AOF tests: run replication tests for diskless slave too make replication test a bit more aggressive Add test for diskless load swapdb	2019-07-08 15:37:48 +03:00
Yossi Gottlieb	a88264d934	Add RedisModule_GetKeyNameFromIO().	2019-03-15 10:23:27 +02:00
zhaozhao.zz	fb81d1b3f8	Fix compile warning when log aux field	2019-03-04 19:43:00 +08:00
antirez	b9b140e2e2	Use the RDB info fields to provide info to users. Fix #5790 and 5878. Maybe a better option was to have such fields named with the first byte '%' as those are info fields for specification, however now to break it in a backward incompatible way is not an option, so let's use the fields actively to provide info when sensible, otherwise ignore when they are not really helpful.	2019-03-02 21:17:42 +01:00
antirez	4dc69497f5	Refactoring: always kill AOF/RDB child via helper functions.	2019-01-21 11:28:44 +01:00
antirez	edd3939bef	Abort instead of crashing when loading bad stream master key. See #5612.	2018-11-28 16:24:53 +01:00
Jack Drogon	93238575f7	Fix typo	2018-07-03 18:19:46 +02:00
Salvatore Sanfilippo	70b7fa2c52	Merge pull request #5049 from youjiali1995/fix-load-rdb Fix rdbLoadIntegerObject() to create shared objects when needed.	2018-06-22 15:47:45 +02:00
youjiali1995	df6644fe0e	Fix rdbLoadIntegerObject() to create shared objects when needed.	2018-06-21 19:23:16 +08:00
Salvatore Sanfilippo	c1e8240565	Merge pull request #5039 from oranagra/rdb_dbsize_hint 64 bit RDB_OPCODE_RESIZEDB in rdb saving	2018-06-20 11:18:31 +02:00
Guy Benoish	b5197f1fc9	Enhance RESTORE with RDBv9 new features RESTORE now supports: 1. Setting LRU/LFU 2. Absolute-time TTL Other related changes: 1. RDB loading will not override LRU bits when RDB file does not contain the LRU opcode. 2. RDB loading will not set LRU/LFU bits if the server's maxmemory-policy does not match.	2018-06-20 15:11:08 +07:00
Oran Agra	5cd3c9529d	64 bit RDB_OPCODE_RESIZEDB in rdb saving this complication in the code is from times were rdbSaveLen didn't support 64 bits.	2018-06-19 16:43:12 +03:00
Salvatore Sanfilippo	94658303e9	Merge pull request #4758 from soloestoy/rdb-save-incremental-fsync Rdb save incremental fsync	2018-06-16 10:59:37 +02:00
antirez	032ea657d7	RDB: Apply fix to rdbLoadMillisecondTime() only for new RDB versions. This way we let big endian systems to still load old RDB versions. However newver versions will be saved and loaded in a way that make RDB expires cross-endian again. Thanks to @oranagra for the reporting and the discussion about this problem, leading to this fix.	2018-06-12 18:21:39 +02:00
antirez	b38682199b	Fix rdbSaveKeyValuePair() integer overflow. Again thanks to @oranagra. The object idle time does not fit into an int sometimes: use the native type that the serialization function will get as argument, which is uint64_t.	2018-06-12 17:31:04 +02:00
antirez	f70e88c1f6	RDB: store times consistently in little endian. I'm not sure how this escaped the attention of Redis users for years, but finally @oranagra reported this issue... Thanks to Oran.	2018-06-12 17:22:03 +02:00
Salvatore Sanfilippo	9fa2e7020e	Merge pull request #4861 from soloestoy/rdb-dict-expand RDB: expand dict if needed when rdb load object	2018-06-08 12:12:34 +02:00
antirez	49147f36e9	Don't expire keys while loading RDB from AOF preamble. The AOF tail of a combined RDB+AOF is based on the premise of applying the AOF commands to the exact state that there was in the server while the RDB was persisted. By expiring keys while loading the RDB file, we change the state, so applying the AOF tail later may change the state. Test case: * Time1: SET a 10 * Time2: EXPIREAT a $time5 * Time3: INCR a * Time4: PERSIT A. Start bgrewiteaof with RDB preamble. The value of a is 11 without expire time. * Time5: Restart redis from the RDB+AOF: consistency violation. Thanks to @soloestoy for providing the patch. Thanks to @trevor211 for the original issue report and the initial fix. Check issue #4950 for more info.	2018-05-29 12:37:42 +02:00
WuYunlong	2a887bd53f	Fix rdb save by allowing dumping of expire keys, so that when we add a new slave, and do a failover, eighter by manual or not, other local slaves will delete the expired keys properly.	2018-05-29 12:35:15 +02:00
Salvatore Sanfilippo	1ab3c82375	Merge pull request #4908 from soloestoy/aof-rdb-preamble-compatible-checksum-no AOF & RDB: be compatible with rdbchecksum no	2018-05-23 17:11:00 +02:00
antirez	b85aae78df	Fix rdb.c dictionary iterator release in 2 more places.	2018-05-09 12:06:37 +02:00
antirez	cd87b3c71f	Fix rdb.c dictionary iterator release. Some times it was not released on error, sometimes it was released two times because the error path expected the "di" var to be NULL if the iterator was already released. Thanks to @oranagra for pinging me about potential problems of this kind inside rdb.c.	2018-05-09 11:03:27 +02:00
zhaozhao.zz	edb92db533	AOF & RDB: be compatible with rdbchecksum no	2018-05-08 19:22:13 +08:00
zhaozhao.zz	24036b4d32	RDB: expand dict if needed when rdb load object	2018-04-22 22:30:44 +08:00
antirez	8b0cfb1e66	RDB: Implement future-proof module AUX data loading.	2018-03-16 13:47:10 +01:00
Wander Hillen	dda8cc1821	More typos	2018-03-16 09:59:17 +01:00
zhaozhao.zz	54cae05ea7	rdb: incremental fsync when redis saves rdb	2018-03-16 00:44:50 +08:00
antirez	8176a2ee76	RDB: LRU/LFU branches missed continue.	2018-03-15 16:33:18 +01:00
antirez	1ce50a7adf	RDB: Ability to load LFU/LRU info.	2018-03-15 16:24:53 +01:00
antirez	d7a5c0eb71	RDB: Ability to save LFU/LRU info. This is a big win for caching use cases, since on reloading Redis will still have some idea about what is worth to evict and what not. However this only solves part of the problem because the information is only partially propagated to slaves (on write operations). Reads will not affect slaves LFU and LRU counters, so after a failover the eviction decisions are kinda random until keys start to collect some aging/freq info. However since new slaves are initially populated via RDB file transfer, this means that if we spin up a new slave from a master, and perform an immediate manual failover (for instance in order to upgrade the master), the slave will have eviction informations to use for some time. The LFU/LRU info is persisted only if the maxmemory policy is set to one of the relevant type, even if no actual "maxmemory" memory limit is set.	2018-03-15 13:15:55 +01:00
antirez	f3d9520ccb	CG: fix CG RDB loading not found conditional.	2018-03-15 12:54:10 +01:00
antirez	f7d4c3acdf	Streams: trap more errors in stream loading + RDB check type name.	2018-03-15 12:54:10 +01:00
antirez	13ff7bc3ef	CG: fix RDB saving when there are no consumer groups.	2018-03-15 12:54:10 +01:00
antirez	9f60a6bcee	CG: RDB loading, fix inverted conditional.	2018-03-15 12:54:10 +01:00
antirez	f4e1a4de25	CG: RDB loading first implementation.	2018-03-15 12:54:10 +01:00
antirez	db7a5f23b4	CG: RDB saving part 2, consumers.	2018-03-15 12:54:10 +01:00
antirez	8fb6048ed0	CG: RDB saving part 1, metadata and PEL.	2018-03-15 12:54:10 +01:00
charsyam	76386c48b8	refactoring-make-condition-clear-for-rdb	2018-02-27 21:55:20 +09:00
Salvatore Sanfilippo	d8830200b4	Merge pull request #3828 from oranagra/sdsnewlen_pr add SDS_NOINIT option to sdsnewlen to avoid unnecessary memsets.	2018-02-27 04:04:32 -08:00
Oran Agra	60a4f12f8b	fix processing of large bulks (above 2GB) - protocol parsing (processMultibulkBuffer) was limitted to 32big positions in the buffer readQueryFromClient potential overflow - rioWriteBulkCount used int, although rioWriteBulkString gave it size_t - several places in sds.c that used int for string length or index. - bugfix in RM_SaveAuxField (return was 1 or -1 and not length) - RM_SaveStringBuffer was limitted to 32bit length	2017-12-29 12:24:19 +02:00
antirez	60d26acfc8	Refactoring: improve luaCreateFunction() API. The function in its initial form, and after the fixes for the PSYNC2 bugs, required code duplication in multiple spots. This commit modifies it in order to always compute the script name independently, and to return the SDS of the SHA of the body: this way it can be used in all the places, including for SCRIPT LOAD, without duplicating the code to create the Lua function name. Note that this requires to re-compute the body SHA1 in the case of EVAL seeing a script for the first time, but this should not change scripting performance in any way because new scripts definition is a rare event happening the first time a script is seen, and the SHA1 computation is anyway not a very slow process against the typical Redis script and compared to the actua Lua byte compiling of the body. Note that the function used to assert() if a duplicated script was loaded, however actually now two times over three, we want the function to handle duplicated scripts just fine: this happens in SCRIPT LOAD and in RDB AUX "lua" loading. Moreover the assert was not defending against some obvious failure mode, so now the function always tests against already defined functions at start.	2017-12-04 11:25:20 +01:00
antirez	65a9740fa8	Fix loading of RDB files lua AUX fields when the script is defined. In the case of slaves loading the RDB from master, or in other similar cases, the script is already defined, and the function registering the script should not fail in the assert() call.	2017-12-01 16:01:10 +01:00
antirez	f24d3a7de0	Streams: delta encode IDs based on key. Add count + deleted fields. We used to have the master ID stored at the start of the listpack, however using the key directly makes more sense in order to create a space efficient representation: anyway the key at the radix tree is very unlikely to change because of how the stream is implemented. Moreover on nodes merging, to rewrite the merged listpacks is anyway the most sensible operation, and we can use the iterator and the append-to-stream function in order to avoid re-implementing the code needed for merging. This commit also adds two items at the start of the listpack: the number of valid items inside the listpack, and the number of items marked as deleted. This means that there is no need to scan a listpack in order to understand if it's a good candidate for garbage collection, if the ration between valid/deleted items triggers the GC.	2017-12-01 10:24:24 +01:00
antirez	98d184db12	Streams: Save stream->length in RDB.	2017-12-01 10:24:24 +01:00
antirez	edd70c1993	Streams: RDB loading. RDB saving modified. After a few attempts it looked quite saner to just add the last item ID at the end of the serialized listpacks, instead of scanning the last listpack loaded from head to tail just to fetch it. It's a disk space VS CPU-and-simplicity tradeoff basically.	2017-12-01 10:24:24 +01:00
antirez	485014cc74	Streams: RDB saving.	2017-12-01 10:24:24 +01:00
antirez	452ad2e928	PSYNC2: just store script bodies into RDB. Related to #4483. As suggested by @soloestoy, we can retrieve the SHA1 from the body. Given that in the new implementation using AUX fields we ended copying around a lot to create new objects and strings, extremize such concept and trade CPU for space inside the RDB file.	2017-11-30 18:38:26 +01:00
antirez	f11a7585a8	PSYNC2: Save Lua scripts state into RDB file. This is currently needed in order to fix #4483, but this can be useful in other contexts, so maybe later we may want to remove the conditionals and always save/load scripts. Note that we are using the "lua" AUX field here, in order to guarantee backward compatibility of the RDB file. The unknown AUX fields must be discarded by past versions of Redis.	2017-11-30 18:37:52 +01:00
antirez	4d063bb6ba	PSYNC2: reorganize comments related to recent fixes. Related to PR #4412 and issue #4407.	2017-11-24 11:08:29 +01:00
Salvatore Sanfilippo	9d86ae4597	Merge pull request #4412 from soloestoy/bugfix-psync2 PSYNC2: safe free backlog when reach the time limit and others	2017-11-24 10:56:18 +01:00
zhaozhao.zz	ea2e51c630	PSYNC2: persist cached_master's dbid inside the RDB	2017-11-22 12:11:26 +08:00
zhaozhao.zz	93037f7642	PSYNC2: make repl_stream_db never be -1 it means that after this change all the replication info in RDB is valid, and it can distinguish us from the older version.	2017-11-22 12:05:34 +08:00
antirez	a1944c3e4d	Fix saving of zero-length lists. Normally in modern Redis you can't create zero-len lists, however it's possible to load them from old RDB files generated, for instance, using Redis 2.8 (see issue #4409). The "Right Thing" would be not loading such lists at all, but this requires to hook in rdb.c random places in a not great way, for a problem that is at this point, at best, minor. Here in this commit instead I just fix the fact that zero length lists, materialized as quicklists with the first node set to NULL, were iterated in the wrong way while they are saved, leading to a crash. The other parts of the list implementation are apparently able to deal with empty lists correctly, even if they are no longer a thing.	2017-11-06 12:37:03 +01:00
zhaozhao.zz	b8579c225c	PSYNC2: clarify the scenario when repl_stream_db can be -1	2017-11-02 10:45:33 +08:00
zhaozhao.zz	885c4f856e	PSYNC2 & RDB: fix the missing rdbSaveInfo for BGSAVE	2017-11-01 17:52:43 +08:00
antirez	bb3b5ddd19	PSYNC2: More refinements related to #4316 .	2017-09-20 11:28:13 +02:00
zhaozhao.zz	b541ccef25	PSYNC2: make persisiting replication info more solid This commit is a reinforcement of commit `c1c99e9`. 1. Replication information can be stored when the RDB file is generated by a mater using server.slaveseldb when server.repl_backlog is not NULL, or set repl_stream_db be -1. That's safe, because NULL server.repl_backlog will trigger full synchronization, then master will send SELECT command to replicaiton stream. 2. Only do rdbSave* when rsiptr is not NULL, if we do rdbSave* without rdbSaveInfo, slave will miss repl-stream-db. 3. Save the replication informations also in the case of SAVE command, FLUSHALL command and DEBUG reload.	2017-09-20 11:18:10 +02:00
antirez	c1c99e9f4e	PSYNC2: Fix the way replication info is saved/loaded from RDB. This commit attempts to fix a number of bugs reported in #4316. They are related to the way replication info like replication ID, offsets, and currently selected DB in the master client, are stored and loaded by Redis. In order to avoid inconsistencies the changes in this commit try to enforce that: 1. Replication information are only stored when the RDB file is generated by a slave that has a valid 'master' client, so that we can always extract the currently selected DB. 2. When replication informations are persisted in the RDB file, all the info for a successful PSYNC or nothing is persisted. 3. The RDB replication informations are only loaded if the instance is configured as a slave, otherwise a master can start with IDs that relate to a different history of the data set, and stil retain such IDs in the future while receiving unrelated writes.	2017-09-19 23:03:39 +02:00
antirez	fc7ecd8d35	AOF check utility: ability to check files with RDB preamble.	2017-07-10 13:38:23 +02:00
antirez	2b36950e9b	Free IO context if any in RDB loading code. Thanks to @oranagra for spotting this bug.	2017-07-06 11:20:49 +02:00
antirez	365dd037dc	RDB modules values serialization format version 2. The original RDB serialization format was not parsable without the module loaded, becuase the structure was managed only by the module itself. Moreover RDB is a streaming protocol in the sense that it is both produce di an append-only fashion, and is also sometimes directly sent to the socket (in the case of diskless replication). The fact that modules values cannot be parsed without the relevant module loaded is a problem in many ways: RDB checking tools must have loaded modules even for doing things not involving the value at all, like splitting an RDB into N RDBs by key or alike, or just checking the RDB for sanity. In theory module values could be just a blob of data with a prefixed length in order for us to be able to skip it. However prefixing the values with a length would mean one of the following: 1. To be able to write some data at a previous offset. This breaks stremaing. 2. To bufferize values before outputting them. This breaks performances. 3. To have some chunked RDB output format. This breaks simplicity. Moreover, the above solution, still makes module values a totally opaque matter, with the fowllowing problems: 1. The RDB check tool can just skip the value without being able to at least check the general structure. For datasets composed mostly of modules values this means to just check the outer level of the RDB not actually doing any checko on most of the data itself. 2. It is not possible to do any recovering or processing of data for which a module no longer exists in the future, or is unknown. So this commit implements a different solution. The modules RDB serialization API is composed if well defined calls to store integers, floats, doubles or strings. After this commit, the parts generated by the module API have a one-byte prefix for each of the above emitted parts, and there is a final EOF byte as well. So even if we don't know exactly how to interpret a module value, we can always parse it at an high level, check the overall structure, understand the types used to store the information, and easily skip the whole value. The change is backward compatible: older RDB files can be still loaded since the new encoding has a new RDB type: MODULE_2 (of value 7). The commit also implements the ability to check RDB files for sanity taking advantage of the new feature.	2017-06-27 13:19:16 +02:00
antirez	e498d9ee3e	Collect fork() timing info only if fork succeeded.	2017-05-19 11:10:36 +02:00
antirez	c33493277a	Clarify why we save ziplist elements in revserse order. Also get rid of variables that are now kinda redundant, since the dictionary iterator was removed. This is related to PR #3949.	2017-04-18 11:01:47 +02:00
spinlock	23ec36909e	rdb: saving skiplist in reversed order to accelerate the deserialisation process	2017-04-17 13:22:34 +08:00
oranagra	f86df924b0	add SDS_NOINIT option to sdsnewlen to avoid unnecessary memsets. this commit also contains small bugfix in rdbLoadLzfStringObject a bug that currently has no implications.	2017-02-23 03:04:08 -08:00

1 2 3 4 5 ...

340 Commits