nats-server

mirror of https://github.com/gogrlx/nats-server.git synced 2026-04-17 03:24:40 -07:00

Author	SHA1	Message	Date
Derek Collison	12730af5c2	Make sure mirror re-syncs after origin is moved. Speed up mirror and sources heartbeats. Signed-off-by: Derek Collison <derek@nats.io>	2022-04-15 07:01:55 -07:00
Ivan Kozlovic	bd61d51a1c	[IMPROVED] JetStream: reduce unnecessary leader election - Wait of some sort of routing to be in place before starting the raft run loop - Remove use of lock in apiDispatch that was not necessary but could have cause a route to block, causing memory growth, etc.. Unrelated rename of some tests so that they start with TestJetStream and TestJetStreamCluster for cluster tests, fixed some flappers and ensure that tests that change RAFT timeouts put them back to default values on exit. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-04-14 10:47:14 -06:00
Derek Collison	9748925f13	Improvements to stream and consumer move. During elected stepdown and transfer allow the new leader to take over before we stepdown. We could receive a leader change, so make sure to also check migration state. Signed-off-by: Derek Collison <derek@nats.io>	2022-04-14 07:27:29 -07:00
Matthias Hanel	ec3f9258af	[Adding] max_ha_assets to limit placement on server with more ha assets (#3032 ) * [Adding] max_ha_assets to limit placement on server with more ha assets server running more than max_ha_assets #raft nodes will not be used to place new streams and fail if not enough free server can be found. Durable Consumer creation on such server will fail as their peer size is bound to the same set as their stream. This also avoids updating placement where no new placement is needed. This is the case when, on update, placement tags get removed. Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-14 01:53:41 -04:00
Derek Collison	3c0bced76e	Move test to no race, rename others Signed-off-by: Derek Collison <derek@nats.io>	2022-04-12 16:23:36 -07:00
Ivan Kozlovic	50c3986863	[FIXED] JetStream stream catchup issues - A stream could become leader when it should not, causing messages to be lost. - A catchup could stall because the server sending data could bail out of the runCatchup routine but still send the EOF signal. - Deadlock with monitoring of Jsz Signed-off-by: Ivan Kozlovic <ivan@synadia.com> Signed-off-by: Derek Collison <derek@nats.io> Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-04-12 16:05:12 -06:00
Derek Collison	b7718e2b7a	First pass support for stream alternates Signed-off-by: Derek Collison <derek@nats.io>	2022-04-11 18:47:19 -06:00
Derek Collison	04cce6df68	Merge pull request #3020 from nats-io/move-updates [IMPROVED] Raft layer for general stability and leader election.	2022-04-11 17:33:13 -07:00
Matthias Hanel	02d25cc640	[FIXED] Consumer deliver subject incorrect when imported and crossing gateway (#3025 ) followup to #3017 Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-11 20:27:25 -04:00
Derek Collison	3ed1ecc032	Remove old code Signed-off-by: Derek Collison <derek@nats.io>	2022-04-11 12:00:29 -07:00
Matthias Hanel	13e5ab10bd	fix js nex interest check where leaf node masked gw subj propagation (#3016 ) basically a gw subject propagation issue could be hidden behind a leaf node. also change error text when this was the case Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-11 14:04:09 -04:00
Derek Collison	95f3a3f919	Resolved conflicts with main Signed-off-by: Derek Collison <derek@nats.io>	2022-04-11 06:24:47 -07:00
Derek Collison	c3612b57c7	Fixes for some flapping tests Signed-off-by: Derek Collison <derek@nats.io>	2022-04-10 13:02:03 -07:00
Derek Collison	37cbac99e7	Improvements to the raft layer for general stability and support of scale up and down and asset move. Also fixed a bug that would allow a leadership transfer when catching up. Signed-off-by: Derek Collison <derek@nats.io>	2022-04-10 08:59:39 -07:00
Derek Collison	2510d671de	Skip flapper for now, will fix in separate PR Signed-off-by: Derek Collison <derek@nats.io>	2022-04-09 11:55:04 -07:00
Derek Collison	cd7f16f28a	Tweak timing for test to prevent flapping Signed-off-by: Derek Collison <derek@nats.io>	2022-04-09 11:13:49 -07:00
Derek Collison	331c2faaa6	When using a stream import for a push consumer's messages, if the message crossed a route we dropped the delivered subject. Signed-off-by: Derek Collison <derek@nats.io>	2022-04-09 06:42:22 -07:00
Derek Collison	3663d595fc	Disallow moving a stream that is already being moved Signed-off-by: Derek Collison <derek@nats.io>	2022-04-07 17:09:55 -07:00
Matthias Hanel	5662141932	Adding unique_tag to ensure matching tags are not used twice (#3011 ) Allows to not place a stream in the same availability zone twice. Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-07 18:11:00 -04:00
Matthias Hanel	2db7d9fe2f	unit test to make sure tiered limits and stream moves work together (#3007 ) This needs testing because stream move adjusts the replication factor Because adjusting replication factor and moving is illegal, this case does not need to be tested In order to support one off configurations, added same modification callout to super cluster as is used with cluster Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-05 18:11:04 -04:00
Ivan Kozlovic	9b5797f63c	Undo sending bad request on no-interest in apiDispatch This broke cross-account functionality. Ported the test from the Go client that showed the failure after PR#2997 was merged. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-04-05 08:51:28 -06:00
Derek Collison	7e38ebcb6e	Allow assets such as streams and their associated consumers to migrate between clusters. The system will allow an update to a stream, and subsequently all attached consumers, to be placed in another cluster either directly or via tag placement. The meta layer will scale the underlying peerset appropriately to straddle the two clusters for both the stream and consumers, taking into account the consumer type. Control will then pass to the current leaders of the assets who will monitor the catchup status of the new peers. (Note we can optimize this later to only traverse once across a GW for any given asset, but for now this is simpler) Once the original leaders have determined the assets are synched it will pass leadership to a member of the new peerset. Once the new leader has been elected, it will forward a request for the meta layer to shrink the peerset by removing the old peers. Signed-off-by: Derek Collison <derek@nats.io>	2022-04-04 18:28:36 -07:00
Matthias Hanel	b7bc842c8b	Add a config modification callback to createJetStreamCluster (#2998 ) * Add a config modification callback to createJetStreamCluster named createJetStreamClusterAndModHook allowing the generated config to be altered prior to server start Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-04-04 17:39:58 -04:00
Ivan Kozlovic	29ea280fe7	[FIXED] JetStream: send "bad request" response for malformed API requests An example was a "consumer info" request with a consumer name that had tokens, which is illegal. This results in the request being dropped in apiDispatch() because there was no interest. The server will now return a "bad request" error in such case. Resolves #2995 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-04-04 11:25:55 -06:00
Ivan Kozlovic	19783a9f11	[CHANGED] Rate limit similar warnings Some warnings, especially when dealing with JS limits that were printed on a per-message basis, are now limited to ~1 per second if the content of the warning is already found in a map. This is also for "client" warnings, but the client porting of the warning is not taken into account so that helps with reducing logging for similar content, but coming from different clients. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-04-01 15:24:03 -06:00
Ivan Kozlovic	34650e9dd5	Fixed data race and some flappers Data race that has been seen: ``` Read at 0x00c00134bec0 by goroutine 159: github.com/nats-io/nats-server/v2/server.(client).msgHeaderForRouteOrLeaf() /home/travis/gopath/src/github.com/nats-io/nats-server/server/client.go:2935 +0x254 github.com/nats-io/nats-server/v2/server.(client).processMsgResults() /home/travis/gopath/src/github.com/nats-io/nats-server/server/client.go:4364 +0x2147 (...) Previous write at 0x00c00134bec0 by goroutine 201: github.com/nats-io/nats-server/v2/server.(Server).addRoute() /home/travis/gopath/src/github.com/nats-io/nats-server/server/route.go:1475 +0xdb4 github.com/nats-io/nats-server/v2/server.(client).processRouteInfo() /home/travis/gopath/src/github.com/nats-io/nats-server/server/route.go:641 +0x1704 ``` Also fixed some flappers and removed use of `s.js.` since we have already captured `js` in Jsz monitoring. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-31 10:05:34 -06:00
Derek Collison	76b56b6b0e	Fix for a flapping test Signed-off-by: Derek Collison <derek@nats.io>	2022-03-29 19:15:40 -07:00
Derek Collison	607858f213	Improved consumer snapshot logic in clustered mode and disk usage. Also fixed a bug that could cause memory based replicated consumers to no longer work after snapshots and server restarts. The snapshot logic would allow non-state changing updates to continously grow the raft logs. We also were too conservative on when we snapshotted and why. Also added in ability to have FileStore.Compact() reclaim space from the block file from the head of last changed block. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-29 18:02:49 -07:00
Matthias Hanel	0c5f3688a7	[ADDED] Tiered limits and fix limit issues on updates (#2945 ) * Adding tiered limits and fix limit issues on updates Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-03-28 20:47:54 -04:00
Derek Collison	004e5ce2c6	Merge pull request #2958 from nats-io/fix_2955 [FIXED] Scaling up an R1 stream would not replicate existing messages.	2022-03-28 12:18:20 -07:00
Derek Collison	7607d37799	Make sure to prevent flappers if possible Signed-off-by: Derek Collison <derek@nats.io>	2022-03-28 09:34:48 -07:00
Derek Collison	6b379329d8	Fix for #2955 . When scaling up a stream with existing messages the existing messages were not being replicated. Also fixed a bug where we were incorrectly not spining up the monitoring loop for a stream when going from 3->1->3. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-26 07:26:46 -07:00
Ivan Kozlovic	6ad93d9b34	Fix some flappers Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-25 18:24:17 -06:00
Ivan Kozlovic	4739eebfc4	[FIXED] JetStream: possible deadlock during consumer leadership change Would possibly show up when a consumer leader changes for a consumer that had redelivered messages and for instance messages were inbound on the stream. Resolves #2912 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-25 12:21:51 -06:00
Derek Collison	ef8f543ea5	Improve memory usage through JetStream storage layer. Previously we would rely more heavily on Go's garbage collector since when we loaded a block for an underlying stream we would pass references upward to avoimd copies. Now we always copy when passing back to the upper layers which allows us to not only expire our cache blocks but pool and reuse them. The upper layers also had changes made to allow the pooling layer at that level to interoperate with the storage layer optionally. Also fixed some flappers and a bug where de-dupe might not be reformed correctly. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-24 17:45:15 -06:00
Derek Collison	7fd5f4dc24	Update Go client Signed-off-by: Derek Collison <derek@nats.io>	2022-03-24 17:45:15 -06:00
Ivan Kozlovic	2253bb6f1a	JS: BackOff list caused too frequent checkPending() calls Since the "next" timer value is set to the AckWait value, which is the first element in the BackOff list if present, the check would possibly happen at this interval, even when we were past the first redelivery and the backoff interval had increased. The end-user would still see the redelivery be done at the durations indicated by the BackOff list, but internally, we would be checking at the initial BackOff's ack wait. I added a test that uses the store's interface to detect how many times the checkPending() function is invoked. For this test it should have been invoked twice, but without the fix it was invoked 15 times. Also fixed an unrelated test that could possibly deadlock causing tests to be aborted due to inactivity on Travis. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-23 12:46:17 -06:00
Ivan Kozlovic	29ff67e2ac	Tests: Replace all Ack() with AckSync() for now For reason explained in previous commit, for tests that were expecting the number of ack/pending to be of a certain value after an Ack(), they would be flapping. Replaced all references and we can go back to selectively call Ack() when AckSync() is not needed. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-17 20:25:01 -06:00
Derek Collison	e204a7961d	When detecting exact duplicates for URLs for routes, gws or leafnodes, enter a warning and ignore. If misconfigured could prevent the JetStream system from electing a leader. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-17 14:52:01 -07:00
Derek Collison	dbfa47f9b1	Improve state preservation for consumers, specifically DeliverNew variants when no activity has been present. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-16 20:55:14 -07:00
Derek Collison	e4ebc4648e	When a stream or consumer was offline we would not properly respond to a delete. We also would hang if no stream info requests were sent during a stream list due to the asset being offline. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-15 21:11:23 -07:00
Ivan Kozlovic	b4128693ed	Ensure file path is correct during stream restore Also had to change all references from `path.` to `filepath.` when dealing with files, so that it works properly on Windows. Fixed also lots of tests to defer the shutdown of the server after the removal of the storage, and fixed some config files directories to use the single quote `'` to surround the file path, again to work on Windows. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-09 13:31:51 -07:00
Ivan Kozlovic	0cb0f6d380	Merge pull request #2914 from nats-io/fix_2913 [FIXED] Consumer with no activity can lose quorum	2022-03-09 11:55:50 -07:00
Matthias Hanel	9a2da9ed8c	Adding denies $KV.>/$OBJ.> along leaf connections on differing domain (#2916 ) * Adding denies $KV.>/$OBJ.> along leaf connections on differing domain Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-03-09 13:17:59 -05:00
Derek Collison	3216eb5ee5	When a consumer has no state we are now compacting the log, but were not snapshotting. This caused issues on leader change and losing quorum. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-09 07:21:25 -05:00
Derek Collison	58da4b917a	Made improvements to scale up and down for streams and consumers. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-06 16:59:02 -08:00
Derek Collison	eb1ed5574d	Merge pull request #2904 from nats-io/peer_remove_bad_consumer_state [FIXED] Inconsistent durable consumer state after stream peer removal	2022-03-06 10:24:31 -08:00
Ivan Kozlovic	196319b106	[FIXED] JetStream: Some stream advisories missing The "deleted" advisory was missing because the stream's send loop was closed before the advisory was pushed to the queue to be sent. Added tests, both for single and clustered mode to test all stream advisories. Resolves #2886 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-03-06 10:52:42 -07:00
Derek Collison	31a19729b0	When removing a stream peer with an attached durable consumer, the consumer could become inconsistent. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-06 05:42:22 -08:00
Derek Collison	4b9bc29e53	If we had not heard from a source or mirror we would still calculate the delta since now. This would wrap and create a large number which overflowed JSON's 2^53 limit. Signed-off-by: Derek Collison <derek@nats.io>	2022-03-05 12:46:55 -08:00

1 2 3 4 5 ...

319 Commits