nats-server

mirror of https://github.com/gogrlx/nats-server.git synced 2026-04-17 03:24:40 -07:00

Author	SHA1	Message	Date
Matthias Hanel	0c5f3688a7	[ADDED] Tiered limits and fix limit issues on updates (#2945 ) * Adding tiered limits and fix limit issues on updates Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-03-28 20:47:54 -04:00
Derek Collison	004e5ce2c6	Merge pull request #2958 from nats-io/fix_2955 [FIXED] Scaling up an R1 stream would not replicate existing messages.	2022-03-28 12:18:20 -07:00
Derek Collison	7607d37799	Make sure to prevent flappers if possible Signed-off-by: Derek Collison <derek@nats.io>	2022-03-28 09:34:48 -07:00
Derek Collison	6b379329d8	Fix for #2955 . When scaling up a stream with existing messages the existing messages were not being replicated. Also fixed a bug where we were incorrectly not spining up the monitoring loop for a stream when going from 3->1->3. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-26 07:26:46 -07:00
Ivan Kozlovic	6ad93d9b34	Fix some flappers Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-25 18:24:17 -06:00
Ivan Kozlovic	4739eebfc4	[FIXED] JetStream: possible deadlock during consumer leadership change Would possibly show up when a consumer leader changes for a consumer that had redelivered messages and for instance messages were inbound on the stream. Resolves #2912 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-25 12:21:51 -06:00
Derek Collison	ef8f543ea5	Improve memory usage through JetStream storage layer. Previously we would rely more heavily on Go's garbage collector since when we loaded a block for an underlying stream we would pass references upward to avoimd copies. Now we always copy when passing back to the upper layers which allows us to not only expire our cache blocks but pool and reuse them. The upper layers also had changes made to allow the pooling layer at that level to interoperate with the storage layer optionally. Also fixed some flappers and a bug where de-dupe might not be reformed correctly. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-24 17:45:15 -06:00
Derek Collison	7fd5f4dc24	Update Go client Signed-off-by: Derek Collison <derek@nats.io>	2022-03-24 17:45:15 -06:00
Ivan Kozlovic	2253bb6f1a	JS: BackOff list caused too frequent checkPending() calls Since the "next" timer value is set to the AckWait value, which is the first element in the BackOff list if present, the check would possibly happen at this interval, even when we were past the first redelivery and the backoff interval had increased. The end-user would still see the redelivery be done at the durations indicated by the BackOff list, but internally, we would be checking at the initial BackOff's ack wait. I added a test that uses the store's interface to detect how many times the checkPending() function is invoked. For this test it should have been invoked twice, but without the fix it was invoked 15 times. Also fixed an unrelated test that could possibly deadlock causing tests to be aborted due to inactivity on Travis. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-23 12:46:17 -06:00
Ivan Kozlovic	29ff67e2ac	Tests: Replace all Ack() with AckSync() for now For reason explained in previous commit, for tests that were expecting the number of ack/pending to be of a certain value after an Ack(), they would be flapping. Replaced all references and we can go back to selectively call Ack() when AckSync() is not needed. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-17 20:25:01 -06:00
Derek Collison	e204a7961d	When detecting exact duplicates for URLs for routes, gws or leafnodes, enter a warning and ignore. If misconfigured could prevent the JetStream system from electing a leader. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-17 14:52:01 -07:00
Derek Collison	dbfa47f9b1	Improve state preservation for consumers, specifically DeliverNew variants when no activity has been present. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-16 20:55:14 -07:00
Derek Collison	e4ebc4648e	When a stream or consumer was offline we would not properly respond to a delete. We also would hang if no stream info requests were sent during a stream list due to the asset being offline. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-15 21:11:23 -07:00
Ivan Kozlovic	b4128693ed	Ensure file path is correct during stream restore Also had to change all references from `path.` to `filepath.` when dealing with files, so that it works properly on Windows. Fixed also lots of tests to defer the shutdown of the server after the removal of the storage, and fixed some config files directories to use the single quote `'` to surround the file path, again to work on Windows. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-09 13:31:51 -07:00
Ivan Kozlovic	0cb0f6d380	Merge pull request #2914 from nats-io/fix_2913 [FIXED] Consumer with no activity can lose quorum	2022-03-09 11:55:50 -07:00
Matthias Hanel	9a2da9ed8c	Adding denies $KV.>/$OBJ.> along leaf connections on differing domain (#2916 ) * Adding denies $KV.>/$OBJ.> along leaf connections on differing domain Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-03-09 13:17:59 -05:00
Derek Collison	3216eb5ee5	When a consumer has no state we are now compacting the log, but were not snapshotting. This caused issues on leader change and losing quorum. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-09 07:21:25 -05:00
Derek Collison	58da4b917a	Made improvements to scale up and down for streams and consumers. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-06 16:59:02 -08:00
Derek Collison	eb1ed5574d	Merge pull request #2904 from nats-io/peer_remove_bad_consumer_state [FIXED] Inconsistent durable consumer state after stream peer removal	2022-03-06 10:24:31 -08:00
Ivan Kozlovic	196319b106	[FIXED] JetStream: Some stream advisories missing The "deleted" advisory was missing because the stream's send loop was closed before the advisory was pushed to the queue to be sent. Added tests, both for single and clustered mode to test all stream advisories. Resolves #2886 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-06 10:52:42 -07:00
Derek Collison	31a19729b0	When removing a stream peer with an attached durable consumer, the consumer could become inconsistent. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-06 05:42:22 -08:00
Derek Collison	4b9bc29e53	If we had not heard from a source or mirror we would still calculate the delta since now. This would wrap and create a large number which overflowed JSON's 2^53 limit. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-05 12:46:55 -08:00
Derek Collison	ad6020ae72	Fix for #2885 . When a filtered consumer who has no state, meaning no messages are being processed, it still will receive updates to properly track the delivered sequence as it relates to the entire stream. Since we did not have state we were inadvertently skipping the compaction logic for the raft store. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-04 08:53:16 -08:00
Ivan Kozlovic	dfe96944d2	[FIXED] JetStream stream info consumers count in clustered mode In clustering mode, the number of consumers in stream info may be wrong in presence of non durable consumers. Ephemeral are handled by specific nodes. The StreamInfo response would contain only the consumer count that the stream leader is handling. This fix overrides the stream's state consumers count with the number of consumers from the stream assignment record. Resolves #2895 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-03 09:46:35 -07:00
Derek Collison	1c8f7de848	On filtered subjects when consumers were staggered we need to disqualify a filtered consumer if not applicable. Signed-off-by: Derek Collison <derek@nats.io>	2022-02-16 18:24:27 -08:00
Derek Collison	ca1132a01d	Allow stream placement by tags. Signed-off-by: Derek Collison <derek@nats.io>	2022-02-15 17:07:32 -08:00
Derek Collison	fb15dfd9b7	Allow replica updates during stream update. Also add in HAAssets count to Jsz. Signed-off-by: Derek Collison <derek@nats.io>	2022-02-13 19:33:46 -08:00
Derek Collison	5a93b0e9d8	Allow pull requests to specify a heartbeat when idle to detect when a request is invalidated. Signed-off-by: Derek Collison <derek@nats.io>	2022-02-11 09:51:51 -08:00
Derek Collison	ecfe42630a	Merge pull request #2858 from nats-io/add_consumer_with_info Make sure we snapshot initial consumer info during consumer creation.	2022-02-09 17:05:01 -08:00
Derek Collison	0cc7302be9	A stream name is tied to its identity and can not be changed on a restore. Signed-off-by: Derek Collison <derek@nats.io>	2022-02-09 12:38:45 -08:00
Ivan Kozlovic	3dcf0246c6	[FIXED] Adding a consumer could return inaccurate consumer info The issue is that the consumer info returned by the consumer create API is gathered after the consumer is added and possibly after starting to deliver pending messages. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-02-09 09:04:16 -07:00
Ivan Kozlovic	30c431a9a3	[FIXED] JetStream: BackOff redeliveries would always use first in list If the consumer's sequence was not the same than the stream's sequence, then the redelivery would always use the first duration from the BackOff list. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-01-31 17:44:08 -07:00
Derek Collison	0d158728d1	Merge pull request #2824 from nats-io/fix-nodeinfo Store JetStream Config in node info map	2022-01-31 13:58:12 -08:00
Derek Collison	a57bd96def	Updating a push consumer to be pull would succeed but cause a panic if used. This disallows that upgrade. We had a check in place for pull to push, but not the reverse. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-28 13:11:58 -08:00
Jaime Piña	ae8eedb88e	Store JetStream Config in node info map	2022-01-27 14:46:41 -08:00
Derek Collison	6486cd8fc8	Added in /healthz endpoint for health and liveness probes in environments like k8s. Currently this code returns a 200 and { "status": "ok" } iff all configured ports are open and if JetStream is configured and we have contact with the metaleader and the cluster and all streams are up to date. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-24 19:30:10 -08:00
Derek Collison	bd78b1a99b	Formal json version for NAK delay Signed-off-by: Derek Collison <derek@nats.io>	2022-01-24 15:01:52 -08:00
Derek Collison	d486c24199	Allow a consumer to be configured with BackOffs. This allows a consumer to have exponential backoffs vs static AckWait and MaxDeliver. When BackOff is set it will overridde AckWait to BackOff[0] and MaxDeliver will be len(BackOff)+1. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-24 14:57:36 -08:00
Derek Collison	579bf336ad	Allow NAK to take a delay parameter to delay redelivery for a certain amount of time. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-24 14:57:28 -08:00
Derek Collison	d332684322	Fixed data race and fuxed bug that we would not clear our waiting queue when a leader stepped down. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-24 13:01:25 -08:00
Derek Collison	d962500827	Track reply subjects for pending pull requests across clustered consumers. We will only send if all peers in our group are >= 2.7.1 and we will check for updates. When a consumer follower takes over it will notify all pending requests that those requests are invalid now. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-21 16:31:59 -08:00
Derek Collison	103f710479	Fixed consumer info num pending bug. Under load we could have a message committed to the underlying store when a consumer was being created and then it increase num pending again when the stream signals the consumers. This fix just remembers the last seq of the state when we calculate sgap and test before adding in the stream code. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-12 20:03:26 -08:00
Derek Collison	5592d923c4	Updated pull consumers. Cleaned up code, made more consistent, utilize loopAndGather. Allow pull consumers to have AckAll as well as AckExplicit. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-10 16:59:01 -08:00
Derek Collison	d02ad88297	Only report peers that we have seen a stats/usage update for Signed-off-by: Derek Collison <derek@nats.io>	2022-01-07 10:42:06 -08:00
Derek Collison	52da55c8c6	Implement overflow placement for JetStream streams. This allows stream placement to overflow to adjacent clusters. We also do more balanced placement based on resources (store or mem). We can continue to expand this as well. We also introduce an account requirement that stream configs contain a MaxBytes value. We now track account limits and server limits more distinctly, and do not reserver server resources based on account limits themselves. Signed-off-by: Derek Collison <derek@nats.io>	2022-01-06 19:33:08 -08:00
Ivan Kozlovic	3053039ff3	[FIXED] JetStream: interest across gateways If the interest existed prior to the initial creation of the consumer, the gateway "watcher" would not be started, which means that interest moving across the super-cluster after that would not be detected. The watcher runs every second and not sure if this is costly or not, so we may want to go a different approach of having a separate interest change channel that would be specific to gateways. But this means adding a new sublist where the interest would be registered and that sublist would need to be updated when processing GW RSub and RUnsub? Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2021-12-16 17:20:16 -07:00
Matthias Hanel	3e8b66286d	Js leaf deny (#2693 ) Along a leaf node connection, unless the system account is shared AND the JetStream domain name is identical, the default JetStream traffic (without a domain set) will be denied. As a consequence, all clients that wants to access a domain that is not the one in the server they are connected to, a domain name must be specified. Affected from this change are setups where: a leaf node had no local JetStream OR the server the leaf node connected to had no local JetStream. One of the two accounts that are connected via a leaf node remote, must have no JetStream enabled. The side that does not have JetStream enabled, will loose JetStream access and it's clients must set `nats.Domain` manually. For workarounds on how to restore the old behavior, look at: https://github.com/nats-io/nats-server/pull/2693#issuecomment-996212582 New config values added: `default_js_domain` is a mapping from account to domain, settable when JetStream is not enabled in an account. `extension_hint` are hints for non clustered server to start in clustered mode (and be usable to extend) `js_domain` is a way to set the JetStream domain to use for mqtt. Signed-off-by: Matthias Hanel <mh@synadia.com>	2021-12-16 16:53:20 -05:00
Ivan Kozlovic	40c0f03153	[FIXED] Monitoring: tls configuration not updated on reload When creating the http server, we need to provide a TLS configuration. After a config reload, the new TLS config would not be reflected. We had the same issue with Websocket and was fixed with the use of tls.Config.GetConfigForClient API, which makes the TLS handshake to ask for a TLS config. That fix for websocket was simply not applied to the HTTPs monitoring case. I have also fixed some flappers due to the use of localhost instead of 127.0.0.1 (connections possibly would resolve to some IPv6 address that the server would not accept, etc..) Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2021-11-30 10:18:46 -07:00
Derek Collison	49c5c873ca	Better handling of stream mismatch scenarios. 1. When a snapshot did not yield actionable data, we were not setting new last sequence if we have to readjust based on snapshot. This could lead to spinning on stream reset for followers. 2. When a stream has lots of failures by design, like KV abstraction, if we cleared the clfs state we would endlessly spin trying to reset the stream. Signed-off-by: Derek Collison <derek@nats.io>	2021-11-18 14:00:41 -08:00
R.I.Pienaar	270ff87beb	allow streams api to be filtered like list api Signed-off-by: R.I.Pienaar <rip@devco.net>	2021-11-18 13:59:12 +01:00

1 2 3 4 5 ...

291 Commits