The system will allow an update to a stream, and subsequently all of its attached consumers, to be placed in another cluster, either directly or via tag placement.
The meta layer will scale the underlying peerset appropriately to straddle the two clusters for both the stream and consumers, taking into account the consumer type.
Control will then pass to the current leaders of the assets, who will monitor the catchup status of the new peers.
(Note: we can optimize this later to traverse a GW only once for any given asset, but for now this is simpler.)
Once the original leaders have determined the assets are synced, they will pass leadership to a member of the new peerset.
Once the new leader has been elected, it will forward a request for the meta layer to shrink the peerset by removing the old peers.
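As a rough illustration from the client side (stream name and cluster are made up for the example), a move can be requested with nats.go by updating the stream's placement; the server then performs the expand/catchup/transfer/shrink sequence described above:
```
package main

import "github.com/nats-io/nats.go"

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		panic(err)
	}
	defer nc.Close()
	js, err := nc.JetStream()
	if err != nil {
		panic(err)
	}

	// Fetch the current config, change only the placement, and update.
	si, err := js.StreamInfo("ORDERS")
	if err != nil {
		panic(err)
	}
	cfg := si.Config
	// Either a direct cluster placement or tag-based placement works.
	cfg.Placement = &nats.Placement{Cluster: "east"}
	if _, err := js.UpdateStream(&cfg); err != nil {
		panic(err)
	}
}
```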
Signed-off-by: Derek Collison <derek@nats.io>
Some warnings, especially those dealing with JS limits that were
printed on a per-message basis, are now limited to ~1 per second
if the content of the warning is already found in a map.
This also applies to "client" warnings, but the client portion of the
warning is not taken into account, which helps reduce logging
of similar content coming from different clients.
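A minimal sketch of the idea (not the actual server code), assuming a map keyed by warning content together with the time it was last logged:
```
package main

import (
	"log"
	"sync"
	"time"
)

var (
	mu       sync.Mutex
	lastWarn = map[string]time.Time{}
)

// rateLimitedWarnf logs a warning at most ~once per second per unique content.
func rateLimitedWarnf(format string, args ...interface{}) {
	// The client-specific portion is deliberately excluded from the key.
	key := format
	mu.Lock()
	last, ok := lastWarn[key]
	now := time.Now()
	if ok && now.Sub(last) < time.Second {
		mu.Unlock()
		return
	}
	lastWarn[key] = now
	mu.Unlock()
	log.Printf("WARNING: "+format, args...)
}

func main() {
	for i := 0; i < 5; i++ {
		// Only the first call actually logs within the one-second window.
		rateLimitedWarnf("JetStream resource limits exceeded for account %q", "A")
	}
}
```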
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Adds a unit test to cover this scenario.
Improves reporting of the correct error.
Only shows info for non-existing tiers where streams exist.
Signed-off-by: Matthias Hanel <mh@synadia.com>
* Added max_ack_pending setting to JS account limits.
Because of this addition, defaults now have to be set later (they depend on
these new limits).
Also reorganized the code to more closely track how stream create looks.
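A hypothetical sketch of how such a per-account limit might be enforced at consumer create time (types and field names here are illustrative, not the server's):
```
package main

import (
	"errors"
	"fmt"
)

type accountLimits struct {
	MaxAckPending int // 0 means unlimited in this sketch
}

type consumerConfig struct {
	MaxAckPending int
}

// checkConsumerLimits rejects a consumer whose ack-pending setting exceeds
// the account-level limit.
func checkConsumerLimits(cfg consumerConfig, limits accountLimits) error {
	if limits.MaxAckPending > 0 && cfg.MaxAckPending > limits.MaxAckPending {
		return errors.New("consumer max ack pending exceeds account limit")
	}
	return nil
}

func main() {
	err := checkConsumerLimits(
		consumerConfig{MaxAckPending: 10_000},
		accountLimits{MaxAckPending: 1_000},
	)
	fmt.Println(err)
}
```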
Signed-off-by: Matthias Hanel <mh@synadia.com>
Also fixed a bug that could cause memory-based replicated consumers to no longer work after snapshots and server restarts.
The snapshot logic would allow non-state-changing updates to continuously grow the raft logs. We were also too conservative on when we snapshotted and why.
Also added the ability for FileStore.Compact() to reclaim space in the block file from the head of the last changed block.
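An illustrative sketch of the snapshot-decision change (the threshold and names are assumed): entries that do not change state still grow the raft log, so log growth itself must eventually trigger a snapshot and compaction.
```
package main

import "fmt"

// shouldSnapshot decides based on both state changes and log growth, so that
// no-op entries cannot grow the log forever.
func shouldSnapshot(entriesSinceSnapshot int, stateChanged bool) bool {
	const maxEntries = 8192 // hypothetical threshold
	return stateChanged || entriesSinceSnapshot > maxEntries
}

func main() {
	fmt.Println(shouldSnapshot(10000, false)) // true: compact despite no state change
}
```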
Signed-off-by: Derek Collison <derek@nats.io>
* Adding server limits (max ack pending/dedupe window) to js config
Also shifting the consumer config check to jsConsumerCreate, as in clustered
mode this was enforced in the wrong place.
Signed-off-by: Matthias Hanel <mh@synadia.com>
Also fixed a bug where we were incorrectly not spinning up the monitoring loop for a stream when going from 3->1->3.
Signed-off-by: Derek Collison <derek@nats.io>
Previously we relied more heavily on Go's garbage collector, since when we loaded a block for an underlying stream we would pass references upward to avoid copies.
Now we always copy when passing back to the upper layers, which allows us to not only expire our cache blocks but also pool and reuse them.
The upper layers were also changed to allow the pooling layer at that level to optionally interoperate with the storage layer.
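A simplified sketch of the new pattern, with assumed names (not the actual store code): blocks come from a pool, and message bytes are always copied out before a block can be recycled, so a reused buffer is never observable by a caller.
```
package main

import (
	"fmt"
	"sync"
)

var blockPool = sync.Pool{
	New: func() interface{} { return make([]byte, 0, 64*1024) },
}

// loadMsg copies the message bytes out of a pooled block instead of handing
// out a reference into the block itself.
func loadMsg(block []byte, off, ln int) []byte {
	out := make([]byte, ln)
	copy(out, block[off:off+ln])
	return out
}

func main() {
	block := blockPool.Get().([]byte)[:0]
	block = append(block, []byte("hello world")...)
	msg := loadMsg(block, 0, 5)
	blockPool.Put(block[:0]) // safe: msg does not alias the pooled buffer
	fmt.Println(string(msg))
}
```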
Also fixed some flappers and a bug where de-dupe might not be re-formed correctly.
Signed-off-by: Derek Collison <derek@nats.io>
I got this panic in a test:
```
=== RUN TestJetStreamClusterAccountLoadFailure
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x78 pc=0xb1501b]
goroutine 47853 [running]:
github.com/nats-io/nats-server/v2/server.(*jetStream).processLeaderChange(0xc000b60580, 0x0)
/home/travis/gopath/src/github.com/nats-io/nats-server/server/jetstream_cluster.go:3638 +0x9b
github.com/nats-io/nats-server/v2/server.(*jetStream).monitorCluster(0xc000b60580)
/home/travis/gopath/src/github.com/nats-io/nats-server/server/jetstream_cluster.go:853 +0x60f
created by github.com/nats-io/nats-server/v2/server.(*Server).startGoRoutine
/home/travis/gopath/src/github.com/nats-io/nats-server/server/server.go:3017 +0x87
FAIL github.com/nats-io/nats-server/v2/server 227.888s
```
which, on that branch, points to this line in processLeaderChange():
```
} else if node := js.getMetaGroup().GroupLeader(); node == _EMPTY_ {
```
which I guess meant that getMetaGroup() was returning `nil`.
Refactored a bit to get the group leader in 2 steps.
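Roughly, the refactor looks like this (a simplified sketch of the change, not the exact code):
```
// Fetch the meta group first and check it for nil before asking for its
// leader, instead of chaining the two calls.
meta := js.getMetaGroup()
if meta == nil {
	return
}
if node := meta.GroupLeader(); node == _EMPTY_ {
	// ... handle the no-leader case as before ...
}
```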
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Removed the warnings; instead, there is a sync.Map where they are
registered/unregistered, and they can be inspected via an undocumented
monitor page.
Added the notion of "in progress", which is the number of messages
that have been pop()'ed. When recycle() is invoked this count
goes down.
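A sketch of the accounting (names assumed; not the actual queue implementation):
```
package main

import (
	"fmt"
	"sync/atomic"
)

// msgQueue is a stand-in for the server's internal queue type.
type msgQueue struct {
	inProgress int64
	msgs       []interface{}
}

// pop hands out the pending messages and counts them as "in progress".
func (q *msgQueue) pop() []interface{} {
	out := q.msgs
	q.msgs = nil
	atomic.AddInt64(&q.inProgress, int64(len(out)))
	return out
}

// recycle returns a popped batch, decrementing the "in progress" count.
func (q *msgQueue) recycle(batch []interface{}) {
	atomic.AddInt64(&q.inProgress, -int64(len(batch)))
}

func main() {
	q := &msgQueue{msgs: []interface{}{"a", "b", "c"}}
	batch := q.pop()
	fmt.Println(atomic.LoadInt64(&q.inProgress)) // 3
	q.recycle(batch)
	fmt.Println(atomic.LoadInt64(&q.inProgress)) // 0
}
```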
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
We would also hang during a stream list if no stream info requests were sent because the asset was offline.
Signed-off-by: Derek Collison <derek@nats.io>
Also had to change all references from `path.` to `filepath.` when
dealing with files, so that it works properly on Windows.
Also fixed lots of tests to defer the shutdown of the server
after the removal of the storage, and fixed some config file
directories to use the single quote `'` to surround the file path,
again to work on Windows.
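For reference, the difference between the two standard-library packages:
```
package main

import (
	"fmt"
	"path"
	"path/filepath"
)

func main() {
	// path.Join always uses forward slashes, regardless of OS.
	fmt.Println(path.Join("jetstream", "streams")) // "jetstream/streams"
	// filepath.Join uses the OS-specific separator, e.g. '\' on Windows.
	fmt.Println(filepath.Join("jetstream", "streams"))
}
```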
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
When a filtered consumer has no state, meaning no messages are being processed, it will still receive updates to properly track the delivered sequence as it relates to the entire stream.
Since we did not have state, we were inadvertently skipping the compaction logic for the raft store.
Signed-off-by: Derek Collison <derek@nats.io>
This change improves logging on startup to more easily map a RAFT log directory, etc., to its stream/consumer.
Signed-off-by: Derek Collison <derek@nats.io>
In clustering mode, the number of consumers in stream info may be
wrong in the presence of non-durable consumers. Ephemerals are handled
by specific nodes, so the StreamInfo response would contain only the
count of consumers that the stream leader is handling.
This fix overrides the stream state's consumer count with the
number of consumers from the stream assignment record.
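Sketched with assumed types and names (not the actual server code), the fix amounts to:
```
// streamAssignment stands in for the cluster-wide assignment record, which
// knows about every assigned consumer, durable or ephemeral.
type streamAssignment struct {
	consumers map[string]struct{}
}

// fixConsumerCount overrides the locally-known count with the count from the
// assignment record, not just the leader's local consumers.
func fixConsumerCount(count *int, sa *streamAssignment) {
	if sa != nil {
		*count = len(sa.consumers)
	}
}
```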
Resolves #2895
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
We will only send if all peers in our group are >= 2.7.1, and we will check for updates.
When a consumer follower takes over, it will notify all pending requests that they are now invalid.
Signed-off-by: Derek Collison <derek@nats.io>
This should help with GC pressure; however, it may have an effect
on performance (based on some benchmarks). Calling sync.Pool.Get/Put
too often has a performance impact...
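For context, the general sync.Pool pattern being used:
```
package main

import (
	"bytes"
	"fmt"
	"sync"
)

var bufPool = sync.Pool{
	New: func() interface{} { return new(bytes.Buffer) },
}

func main() {
	// Get reuses a previously Put buffer when one is available, avoiding an
	// allocation and reducing GC pressure.
	buf := bufPool.Get().(*bytes.Buffer)
	buf.Reset()
	buf.WriteString("reused instead of reallocated")
	fmt.Println(buf.String())
	bufPool.Put(buf) // return for reuse; do not touch buf after this point
}
```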
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Under load, we could have a message committed to the underlying store while a consumer was being created, and then have it increase num pending again when the stream signals the consumers.
This fix simply remembers the last sequence of the state when we calculate sgap, and tests against it before adding in the stream code.
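A rough sketch of the guard (names assumed; the real consumer state differs):
```
// consumerSketch is hypothetical, to show the idea only.
type consumerSketch struct {
	npc  int64  // num pending
	npcm uint64 // highest stream sequence covered when npc was computed
}

// incNumPendingIfNew only counts messages beyond what the initial
// calculation already covered, avoiding double counting.
func (o *consumerSketch) incNumPendingIfNew(seq uint64) {
	if seq > o.npcm {
		o.npc++
	}
}
```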
Signed-off-by: Derek Collison <derek@nats.io>
This allows stream placement to overflow to adjacent clusters.
We also do more balanced placement based on resources (store or mem). We can continue to expand this as well.
We also introduce an account requirement that stream configs contain a MaxBytes value.
We now track account limits and server limits more distinctly, and do not reserve server resources based on account limits themselves.
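From the client side (stream name and size made up), a stream config that satisfies such a MaxBytes requirement looks like:
```
package main

import "github.com/nats-io/nats.go"

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		panic(err)
	}
	defer nc.Close()
	js, _ := nc.JetStream()

	// Without MaxBytes set, an account that enforces the requirement would
	// reject this stream create.
	_, err = js.AddStream(&nats.StreamConfig{
		Name:     "EVENTS",
		Subjects: []string{"events.>"},
		MaxBytes: 1 << 30, // 1 GiB
	})
	if err != nil {
		panic(err)
	}
}
```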
Signed-off-by: Derek Collison <derek@nats.io>
Along a leaf node connection, unless the system account is shared AND the JetStream domain name is identical, default JetStream traffic (without a domain set) will be denied.
As a consequence, any client that wants to access a domain other than the one of the server it is connected to must specify that domain name.
Affected by this change are setups where a leaf node had no local JetStream, OR the server the leaf node connected to had no local JetStream.
That is, one of the two accounts connected via a leaf node remote has no JetStream enabled.
The side that does not have JetStream enabled will lose JetStream access, and its clients must set `nats.Domain` manually.
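For example, a client on that side would select the remote domain explicitly (the domain name "hub" is made up):
```
package main

import "github.com/nats-io/nats.go"

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		panic(err)
	}
	defer nc.Close()

	// Target the JetStream domain on the other side of the leaf node.
	js, err := nc.JetStream(nats.Domain("hub"))
	if err != nil {
		panic(err)
	}
	_ = js
}
```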
For workarounds on how to restore the old behavior, look at:
https://github.com/nats-io/nats-server/pull/2693#issuecomment-996212582
New config values added:
* `default_js_domain` is a mapping from account to domain, settable when JetStream is not enabled in an account.
* `extension_hint` provides hints for a non-clustered server to start in clustered mode (and be usable to extend a cluster).
* `js_domain` is a way to set the JetStream domain to use for MQTT.
Signed-off-by: Matthias Hanel <mh@synadia.com>