bitcask

mirror of https://github.com/taigrr/bitcask synced 2025-01-18 04:03:17 -08:00

Author	SHA1	Message	Date
Tai Groot	a72a4d0494	update sift to match docstring	2022-02-04 02:11:21 -08:00
James Mills	5429693cc8	Add ErrBadConfig and ErrBadMetadata as errors that consumers can check and use (#241 ) cc @taigrr This PR will _hopefully_ help to fix some critical issues in the real world with several or more [Yarn.social](https://yarn.social) pods running [yarnd](https://git.mills.io/yarnsocial/yarn) where starting back up after a power failure or crash can sometimes result in an empty `config.json` or empty `meta.json` or both! I'm not actually sure how this can arise, and as yet I haven't been able to reproduce it (_I can only assume this has to be failure cases outside of our control_); but in any case the application and database are recoverable by simply `rm config.json` and/or `rm meta.json`. So this PR makes errors loading the config and metadata first-class errors and exported error types that consumers of the library can use to perform automated recovery without requiring human intervention. Basically in this case it's no big deal we lost the database config or metadata, we can simply carry on. Co-authored-by: James Mills <prologic@shortcircuit.net.au> Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/241 Co-authored-by: James Mills <james@mills.io> Co-committed-by: James Mills <james@mills.io>	2021-10-30 21:07:42 +00:00
James Mills	9b0daa8a30	Add RangeScan() support (#160 ) Co-authored-by: James Mills <1290234+prologic@users.noreply.github.com> Co-authored-by: James Mills <prologic@shortcircuit.net.au> Co-authored-by: Tai Groot <tai@taigrr.com> Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/160 Co-authored-by: James Mills <james@mills.io> Co-committed-by: James Mills <james@mills.io>	2021-07-21 02:36:06 +00:00
Tai Groot	ef187f8315	[ADD] Sift and ScanSift (+ tests) (#232 ) Added Sift and ScanSift functions for review without tests (for now) fix docstrings Added tests for Sift and ScanSift Note this also fixes a bug in the Scan() function where the RMutex is not locked, allowing a potential race condition closes #231 Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/232 Co-authored-by: Tai Groot <tai@taigrr.com> Co-committed-by: Tai Groot <tai@taigrr.com>	2021-07-21 00:19:25 +00:00
James Mills	b094cd33d3	Fix runGC behaviour to correctly delete all expired keys (#229 ) Fixes #228 Co-authored-by: James Mills <prologic@shortcircuit.net.au> Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/229 Co-authored-by: James Mills <james@mills.io> Co-committed-by: James Mills <james@mills.io>	2021-07-20 20:42:22 +00:00
Tai Groot	92535e654b	[FIX] race condition from #216 (#227 ) [ADDED] new tests for TTL expiration race condition, see #216 [REMOVED] removes cleanup / automatic expiration from get() function to resolve #216 Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/227 Co-authored-by: Tai Groot <tai@taigrr.com> Co-committed-by: Tai Groot <tai@taigrr.com>	2021-07-18 23:41:40 +00:00
James Mills	5e4d863ab7	Use package github.com/gofrs/flock as flock implementation. (#224 ) Supersedes #219 after rebasing on master after migrating off Github. Co-authored-by: Nicolò Santamaria <nicolo.santamaria@protonmail.com> Co-authored-by: James Mills <prologic@shortcircuit.net.au> Co-authored-by: Tai Groot <taigrr@noreply@mills.io> Reviewed-on: https://git.mills.io/prologic/bitcask/pulls/224 Co-authored-by: James Mills <prologic@noreply@mills.io> Co-committed-by: James Mills <prologic@noreply@mills.io>	2021-07-15 21:33:20 +00:00
James Mills	90dd53c573	Rename all Go module paths	2021-07-10 17:47:38 +10:00
James Mills	b98b684bb4	Refactor TTL with a new API PutWithTTL() and reduce memory allocs (#220 )	2021-07-09 17:21:35 +10:00
Yash Suresh Chandra	e7c6490762	Purge api added to remove expired keys (#204 ) * purge api added * merged with master, import order fix * purge api renamed to RunGC Co-authored-by: yash <yash.chandra@grabpay.com>	2021-06-02 06:47:30 +10:00
Yash Suresh Chandra	5c6ceadac1	Add support for keys with ttl (#177 ) * ttl support first commit * imports fix * put api args correction * put options added * upgrade method added * upgrade log added * v0 to v1 migration script added * error assertion added * temp migration dir fix Co-authored-by: yash <yash.chandra@grabpay.com>	2020-12-21 17:41:43 +10:00
Yash Suresh Chandra	f397bec88f	retain lock file after merge (#201 ) * Add test case for Locking after Merge * retain lock file after merge * remove replacing lock file (not needed) Co-authored-by: James Mills <prologic@shortcircuit.net.au> Co-authored-by: yash <yash.chandra@grabpay.com>	2020-12-19 02:25:58 +10:00
James Mills	8a60b5a370	Fix a bug when MaxValueSize == 0 on Merge operations	2020-12-18 07:35:16 +10:00
Haleem Assal	29e1cf648b	Save metadata on Sync (#197 ) * Save metadata on Sync * Add test	2020-12-15 06:32:48 +10:00
Yash Suresh Chandra	3a6235ea03	exclusive lock before closing db in merge (#196 ) Co-authored-by: yash <yash.chandra@grabpay.com>	2020-12-13 21:28:54 +10:00
James Mills	0ab7d79246	Add support for unlimited key/value sizes	2020-12-12 02:16:36 +10:00
Georges Varouchas	38156e8461	Gv/issue 165 unlock race condition (#175 ) * add failing test case to highlight the race condition on bug note : the test "TestLock" is non deterministic, its outcome depends on the sequence of instructions yielded by the go scheduler on each run. There are two values, "goroutines" and "succesfulLockCount", which can be edited to see how the test performs. With the committed value, resp "20" and "50", I had a 100% failure on my local machine, running linux (Ubuntu 20.04). Sample test output : $ go test . -run TestLock --- FAIL: TestLock (0.17s) lock_test.go:91: [runner 14] lockCounter was > 1 on 5 occasions, max seen value was 2 lock_test.go:91: [runner 03] lockCounter was > 1 on 2 occasions, max seen value was 3 lock_test.go:91: [runner 02] lockCounter was > 1 on 3 occasions, max seen value was 3 lock_test.go:91: [runner 00] lockCounter was > 1 on 1 occasions, max seen value was 2 lock_test.go:91: [runner 12] lockCounter was > 1 on 7 occasions, max seen value was 3 lock_test.go:91: [runner 01] lockCounter was > 1 on 8 occasions, max seen value was 2 lock_test.go:91: [runner 04] lockCounter was > 1 on 6 occasions, max seen value was 4 lock_test.go:91: [runner 13] lockCounter was > 1 on 1 occasions, max seen value was 2 lock_test.go:91: [runner 17] lockCounter was > 1 on 4 occasions, max seen value was 2 lock_test.go:91: [runner 10] lockCounter was > 1 on 3 occasions, max seen value was 2 lock_test.go:91: [runner 08] lockCounter was > 1 on 6 occasions, max seen value was 2 lock_test.go:91: [runner 09] lockCounter was > 1 on 4 occasions, max seen value was 2 lock_test.go:91: [runner 05] lockCounter was > 1 on 1 occasions, max seen value was 2 lock_test.go:91: [runner 19] lockCounter was > 1 on 3 occasions, max seen value was 3 lock_test.go:91: [runner 07] lockCounter was > 1 on 4 occasions, max seen value was 3 lock_test.go:91: [runner 11] lockCounter was > 1 on 9 occasions, max seen value was 2 lock_test.go:91: [runner 15] lockCounter was > 1 on 1 occasions, max seen value was 3 lock_test.go:91: [runner 16] lockCounter was > 1 on 1 occasions, max seen value was 3 FAIL FAIL github.com/prologic/bitcask 0.176s FAIL * flock: create a wrapper module, local to bitcask, around gofrs.Flock the racy TestLock has been moved to bitcask/flock * flock: add test for expected regular locking behavior * flock: replace gofrs/flock with local implementation * update go.sum * Add build constraint for flock_unix.go Co-authored-by: James Mills <prologic@shortcircuit.net.au>	2020-12-11 20:56:58 +10:00
Yash Suresh Chandra	e1cdffd8f1	new merge approach (#191 ) * new merge approach * code refactor * comment added * isMerging flag added to allow 1 merge operation at a time * get api modified. merge updated (no recursive read locks) Co-authored-by: yash <yash.chandra@grabpay.com> Co-authored-by: James Mills <prologic@shortcircuit.net.au>	2020-12-11 20:48:41 +10:00
yashschandra	158f6d9888	Get space that can be reclaimed (#189 ) * get reclaimable space added * import order fix Co-authored-by: yash <yash.chandra@grabpay.com>	2020-12-01 06:07:00 +10:00
yashschandra	f4357e6f18	local live backup support (#185 ) * live backup first commit * exclude lock file in backup * create path if not exist for backup Co-authored-by: yash <yash.chandra@grabpay.com> Co-authored-by: James Mills <prologic@shortcircuit.net.au>	2020-11-30 07:49:02 +10:00
James Mills	720f03c6c2	Fix a race condition around .Close() and .Sync()	2020-11-17 19:30:44 +10:00
Bryan Stenson	295301a44c	Add configuration options for FileMode (#183 ) * Add configuration options for FileMode Add two additional configuration values, and their corresponding default values: * DirFileModeBeforeUmask - Dir FileMode is used on all directories created. DefaultDirFileModeBeforeUmask is 0700. * FileFileModeBeforeUmask - File FileMode is used on all files created, except for the "lock" file (managed by the Flock library). DefaultFileFileModeBeforeUmask is 0600. When using these bits of configuration, keep in mind these FileMode values are set BEFORE any umask rules are applied. For example, if the user's umask is 022, setting DirFileModeBeforeUmask to 777 will result in directories with FileMode set to 755 (this umask prevents the write bit from being applied to group and world permissions). * moving defer statements after checking for errors use os.ModePerm const instead of os.FileMode(777) * fix spelling/grammar * skip these tests for Windows as they appear to break - Windows is less POSIX-y than it claims * ignore "lock" file for default case too -- this was incorrectly passing before including this, as my local dev station has umask 022	2020-11-05 08:06:45 +10:00
Ignacio Hagopian	8dca9cd2a7	Auto recovery (#153 ) * implement autorepair Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com> * fix misspell Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com> * Update internal/data/recover.go Co-authored-by: James Mills <prologic@shortcircuit.net.au> * Update internal/utils.go Co-authored-by: James Mills <prologic@shortcircuit.net.au> * Update internal/data/recover.go Co-authored-by: James Mills <prologic@shortcircuit.net.au> * skip failing test on windows Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com> Co-authored-by: James Mills <prologic@shortcircuit.net.au>	2020-05-08 03:48:36 +10:00
Ignacio Hagopian	7b24d87695	don't allow empty keys (#151 ) * bitcask: don't allow empty keys Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com> * go mod tidy Signed-off-by: Ignacio Hagopian <jsign.uy@gmail.com>	2020-05-06 10:47:41 +10:00
Alain Gilbert	ca06e332d6	Add DeleteAll function (#116 )	2019-12-23 21:35:59 +10:00
Alain Gilbert	be3fd71ebe	Fix loadIndex to be deterministic (#115 )	2019-12-20 14:45:10 +10:00
Leonid Zharikov	4dfe42cb3b	Export method reopen (#113 )	2019-11-16 21:08:45 +10:00
Ignacio Hagopian	498ea4069c	codebeat: Code quality improvement (#103 ) * codebeat: improve & bugfix * codebeat: refactor to improve readability * bugfix * bugfix * internal/data/codec: improve code coverage	2019-09-24 07:19:07 +10:00
Ignacio Hagopian	5be114adab	Makefile setup & key/value coherent datatypes & refactoring (#98 ) * internal/data: comment exported functions * internal/data: make smaller codec exported api surface * make key and value sizes serializing bubble up to everything * Makefile setup & go mod tidy	2019-09-12 10:44:26 -03:00
James Mills	d59d5ad8c2	Improves Test Coverage by covering error cases (#95 ) * Add Unit Test for testing a corrupted config * Add Unit Test for testing errors from .Stats() * Refactor Datafile into an interface and add Unit Tests for testing Merge() errors * Refactor indexer into an interface and add Unit Tests for .Close() errors * Add Unit Tests for .Delete() errors * Add Unit Tests for testing Put/Get errors * Add Unit Test for testing Open errors (bad path for example) * Refactor out bitcask.writeConfig * Add more tests for config errors * Add unit test for options that might error * Add more test cases for close errors * Add test case for rotating datafiles * Fix a possible data race in .Stats() * Add test case for checksum errors * Add test case for Sync errors with Put and WithSync enabled * Refactor and use testify.mock for mocks and generate mocks for all interfaces * Refactor TestCloseErrors * Refactored TestDeleteErrors * Refactored TestGetErrors * Refactored TestPutErrors * Refactored TestMergeErrors and fixed a bug with .Fold() * Add test case for Scan() errors * Apparently only Scan() can return nil Node()s?	2019-09-09 07:18:38 +10:00
Ignacio Hagopian	13e35b7acc	bitcask: fix data races & use Encode() to serialize config (#94 )	2019-09-07 09:09:08 +10:00
Ignacio Hagopian	0d3a9213ed	cmd/bitcask: recovery tool (#92 ) * cmd/bitcask: recovery tool * refactor configuration & use it in recover tool	2019-09-07 07:57:30 +10:00
Ignacio Hagopian	a2b5ae2287	fix: check of persisted index values (#91 )	2019-09-04 22:42:32 +10:00
James Mills	1c7df7f9c7	Removed unused readConfig() (#87 )	2019-09-04 21:25:31 +10:00
Ignacio Hagopian	93cc1d409f	codec_index: check sizes, new tests for data corruption & refactor (#84 ) * bitcask/codec_index: check key and data sizes * codec_index: tests for key and data size overflows * codec_index: simplify internal funcs for unused returns	2019-09-04 12:26:26 +10:00
James Mills	50d3971e86	Fixed a bug with incorrect offsets populating the trie (#82 )	2019-09-02 19:44:11 +10:00
Ignacio Hagopian	877bf982b1	fix go vet (#80 )	2019-09-02 10:20:56 +10:00
James Mills	abbbeb8e1d	Replace keydir with ART trie (#75 ) * Replace keydir with ART trie * Address some review feedback * Address review feedback (consts)	2019-09-02 08:38:56 +10:00
James Mills	b3d6f734b6	Use an Adaptive Radix Tree (#71 )	2019-08-30 08:13:24 +10:00
Awn	e8bee948bc	Make optimised scan functionality optional (#68 )	2019-08-16 10:51:59 +10:00
James Mills	c5a565cd82	Adds WithSync(...) option to turn on sync after write durability (#63 ) * Added WithSync(...) option to turn on sync after write durability * Add Sync/NoSync benchmark variants for Put()	2019-08-12 06:47:46 +10:00
James Mills	7204a33512	Fix and cleanup some unnecessary internal sub-packages and duplication	2019-08-08 22:28:25 +10:00
Awn	af43cfa8f1	Remove merge function (#60 ) * tidy: clean up some leftovers Fixes #56 Fixes #57 Fixes #58 * api: remove standalone merge function Fixes #55	2019-08-08 19:51:45 +10:00
Ignacio Hagopian	fd179b4a86	custom high-performance encoder implementation (#52 )	2019-08-08 09:21:46 +10:00
James Mills	755b1879b5	Use []byte byte slices as keys directly avoiding serializing string(s) (#46 ) (#51 )	2019-08-08 08:14:48 +10:00
James Mills	d0c913ccee	Revert "Use []byte byte slices as keys directly avoiding serializing string(s) (#46 )" (#50 ) This reverts commit 3c1808cad3f19c23c6e4aacd4cfbbc4a04da1c08.	2019-08-08 08:06:38 +10:00
James Mills	3c1808cad3	Use []byte byte slices as keys directly avoiding serializing string(s) (#46 )	2019-08-08 07:59:11 +10:00
James Mills	5d1dd6657a	Fixed handling of missing config.json from cli behavior	2019-08-07 21:47:51 +10:00
James Mills	82e26449fa	Added the same functional options to the bitcask CLI and persist options to the db store (#40 )	2019-08-07 10:23:10 +10:00
Ignacio Hagopian	f2b5515e03	update trie dependency to take advantage of improvements (#45 )	2019-08-06 08:05:41 +10:00

83 Commits