This fixes an issue introduced in #3080.
The consumer filter subject check was skipped on recovery.
The intent was to bypass the check against the upstream stream's subjects,
but the skip also applied to the downstream stream's subjects.
This became a problem when the downstream stream was itself an upstream:
during recovery the stream subject was not checked, which
led to delivery of filtered messages that should never have been
delivered.
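For context, a minimal Go-client sketch of the kind of topology where this shows up (stream names and subjects are illustrative): B sources a filtered subset of A, and C in turn sources B, so B is both a downstream and an upstream.

```go
// Illustrative chained-sourcing setup; after a restart, the recovered
// sourcing consumers are where the skipped filter subject check let
// messages slip through.
package main

import (
	"log"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// Upstream stream A captures everything under "orders.>".
	if _, err := js.AddStream(&nats.StreamConfig{
		Name:     "A",
		Subjects: []string{"orders.>"},
	}); err != nil {
		log.Fatal(err)
	}

	// B sources only a filtered subset of A...
	if _, err := js.AddStream(&nats.StreamConfig{
		Name: "B",
		Sources: []*nats.StreamSource{{
			Name:          "A",
			FilterSubject: "orders.eu.>",
		}},
	}); err != nil {
		log.Fatal(err)
	}

	// ...and C sources B, making B both a downstream and an upstream.
	if _, err := js.AddStream(&nats.StreamConfig{
		Name:    "C",
		Sources: []*nats.StreamSource{{Name: "B"}},
	}); err != nil {
		log.Fatal(err)
	}
}
```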
Signed-off-by: Matthias Hanel <mh@synadia.com>
- didRemove in applyMetaEntries() could be reset when processing multiple entries
- change "no race" test names to include JetStream
- separate raft nodes' leader stepdown and stop in the server shutdown process
- in InstallSnapshot, call wal.Compact() with lastIndex+1
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
This is a regression introduced in v2.8.3. If a message reaches
the max redeliver count, it stops being delivered to the consumer.
However, after a server or cluster restart, those messages would
be redelivered again.
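For reference, a hedged Go-client sketch of the setting involved (stream/consumer names and values are illustrative): once a message has been delivered MaxDeliver times without an ack, it must stay out of the redelivery loop, including across a restart.

```go
// Illustrative only: a consumer whose messages stop being redelivered
// after 5 delivery attempts; the fix ensures this still holds after a
// server or cluster restart.
package main

import (
	"log"
	"time"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	if _, err := js.AddConsumer("ORDERS", &nats.ConsumerConfig{
		Durable:    "worker",
		AckPolicy:  nats.AckExplicitPolicy,
		AckWait:    30 * time.Second,
		MaxDeliver: 5, // after 5 attempts the message must not come back
	}); err != nil {
		log.Fatal(err)
	}
}
```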
Resolves #3361
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
The Move/Cancel/Downscale mechanism did not take into account that
the consumer's replica count can be set independently.
This also alters peer selection to be able to skip the unique tag
prefix check for the server that will be replaced.
Say you have 3 AZs and want to add another server to az:1
in order to replace a server in the same zone.
Without this change, the uniqueTagPrefix check would filter out
the replacement server and cause a failure.
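A hedged config sketch of that scenario (tag values are illustrative): each server carries an az tag and JetStream is configured with a unique tag prefix so replicas land in distinct zones; the replacement server for n1 deliberately reuses az:1.

```
# Illustrative fragment for the server being replaced (n1); its
# replacement is configured the same way and also carries az:1.
server_name: n1
server_tags: ["az:1"]
jetstream {
  unique_tag: "az:"   # replicas normally must land in distinct az:* zones
}
```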
The cancel move response could not be received due to
the wrong account name.
Signed-off-by: Matthias Hanel <mh@synadia.com>
Make sure when processing a peer removal that the stream assignment agrees.
When a new leader takes over it can resend a peer removal, and if the stream/consumer really was rescheduled we could remove it by accident.
Also need to make sure that when we remove a stream we remove the node as part of the stream assignment.
Otherwise, if the same asset returned to this server we would not start up the monitoring loop.
Simplified the migration logic in monitorStream to be driven by the leader only.
Improved unit tests.
Added a failure when the server is not in the peer list.
The move command no longer requires a server.
Signed-off-by: Matthias Hanel <mh@synadia.com>
This effectively means that requests with batch > 1 will process a message and go to the end of the line.
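A conceptual sketch of that queueing behavior (this is not the server code; names and structure are illustrative): each waiting request gets one message at a time and is re-queued at the tail while it still has batch capacity left.

```go
// fairness sketch: a pull request with batch > 1 is served one message
// and then goes to the end of the line instead of draining its batch.
package main

type pullRequest struct {
	reply     string // hypothetical reply subject for the request
	remaining int    // messages still owed on this batch
}

// deliverRoundRobin hands out available messages one at a time across
// the pending requests, re-queueing a request while capacity remains.
func deliverRoundRobin(pending []*pullRequest, msgs []string, send func(reply, msg string)) []*pullRequest {
	for len(msgs) > 0 && len(pending) > 0 {
		req := pending[0]
		pending = pending[1:]

		send(req.reply, msgs[0])
		msgs = msgs[1:]

		req.remaining--
		if req.remaining > 0 {
			pending = append(pending, req) // back of the line
		}
	}
	return pending
}

func main() {
	reqs := []*pullRequest{{reply: "r1", remaining: 3}, {reply: "r2", remaining: 1}}
	deliverRoundRobin(reqs, []string{"m1", "m2", "m3"}, func(reply, msg string) {
		println(reply, "<-", msg) // r1<-m1, r2<-m2, r1<-m3
	})
}
```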
Signed-off-by: Derek Collison <derek@nats.io>
When creating a pull consumer with InactiveThreshold set, if the
application is doing pull requests with "no_wait" at regular intervals
(lower than the InactiveThreshold), the JS consumer should be considered
active and not deleted.
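A hedged Go-client sketch of that pattern (stream/consumer names and intervals are illustrative), issuing raw "no_wait" next requests against the JS API more often than the InactiveThreshold:

```go
// Each no_wait poll should count as activity on the consumer, so it is
// not cleaned up even if no messages ever arrive.
package main

import (
	"log"
	"time"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// Pull consumer the server may delete after 30s of inactivity.
	if _, err := js.AddConsumer("ORDERS", &nats.ConsumerConfig{
		Durable:           "poller",
		AckPolicy:         nats.AckExplicitPolicy,
		InactiveThreshold: 30 * time.Second,
	}); err != nil {
		log.Fatal(err)
	}

	// Poll with no_wait every 10s (well under the threshold); a 404
	// status reply simply means no messages were available.
	next := "$JS.API.CONSUMER.MSG.NEXT.ORDERS.poller"
	for {
		if _, err := nc.Request(next, []byte(`{"batch":1,"no_wait":true}`), 2*time.Second); err != nil {
			log.Println("next request:", err)
		}
		time.Sleep(10 * time.Second)
	}
}
```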
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Also, for direct get and for pull requests, if we are not on a client connection, check how long we have been away from the readloop.
If need be, execute in a separate goroutine.
Signed-off-by: Derek Collison <derek@nats.io>
This could happen when a consumer had not sent anything to the
attached NATS subscription and there was a consumer leader
step down or server restart.
Signed-off-by: Derek Collison <derek@nats.io>
Before PR #3099, `waitQueue.isEmpty()` returned `wq.len() == 0`
and `waitQueue.len()` protected against the pointer being
nil (returning 0 in that case).
The change in #3099 caused `waitQueue.isEmpty()` to return `wq.n == 0`,
which means that if `wq` was nil, it would crash.
This PR restores `waitQueue.isEmpty()` to return `wq.len() == 0` and
adds back the nil protection in `len()`, similar to
how it was prior to PR #3099.
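A minimal sketch of the restored shape (only the `n` field is taken from the description above; everything else is illustrative):

```go
// Nil-safe len() with isEmpty() defined in terms of it, so calling
// isEmpty() on a nil waitQueue no longer panics.
package main

import "fmt"

type waitQueue struct {
	n int // number of waiting requests
}

func (wq *waitQueue) len() int {
	if wq == nil {
		return 0
	}
	return wq.n
}

func (wq *waitQueue) isEmpty() bool {
	return wq.len() == 0
}

func main() {
	var wq *waitQueue
	fmt.Println(wq.isEmpty()) // true, and no panic on the nil queue
}
```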
Resolves #3117
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
There was an issue with MaxWaiting==1 that was causing a request
with expiration to actually not expire. This was because processWaiting
would not pick it up, because wq.rp was equal to wq.wp
(that is, the read pointer was equal to the write pointer for a slice
of capacity 1).
The other issue was that when reaching the maximum number of waiting pull
requests, a new request would evict an old one with a "408 Request Canceled".
There is no reason for that; instead, the server will first try to
find existing expired requests (since some of the expiration
is done lazily), but if none is expired and the queue is full,
the server will return a "409 Exceeded MaxWaiting" to the new
request, and not a "408 Request Canceled" to an old one.
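A conceptual sketch of the capacity-1 pitfall (not the server code; fields other than rp/wp are illustrative): with a one-slot ring, "full" and "empty" both look like rp == wp, so a scan keyed only off the pointers never sees the lone waiting request.

```go
package main

import "fmt"

type ring struct {
	reqs   []string
	rp, wp int // read / write positions
	n      int // number of queued entries
}

func newRing(capacity int) *ring { return &ring{reqs: make([]string, capacity)} }

func (r *ring) add(req string) {
	r.reqs[r.wp] = req
	r.wp = (r.wp + 1) % len(r.reqs)
	r.n++
}

func main() {
	r := newRing(1)
	r.add("pull request with expiration")

	fmt.Println("rp == wp:", r.rp == r.wp) // true, even though one entry is queued
	fmt.Println("n:", r.n)                 // 1: the count, not the pointers, must drive the scan
}
```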
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
When updating usage, there is a lock inversion in that the jetStream
lock was acquired while under the stream's (mset) lock, which is
not correct. Also, updateUsage was locking the jsAccount lock, which
again, is not really correct since jsAccount contains streams, so
it should be jsAccount->stream, not the other way around.
Removed the locking of jetStream to check for clustered state since
js.clustered is immutable.
Replaced the use of the jsAccount lock for updating usage with a dedicated lock.
I originally moved all the update/limit fields in jsAccount to a new
structure to make sure that I would see all the code that updates
or reads those fields, and all the functions, so that I could
make sure the new lock was used when calling these. Once that
work was done, and to reduce code changes, I put the fields back
into jsAccount (although I grouped them under the new usageMu mutex
field).
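A simplified sketch of the resulting locking shape (struct contents are illustrative; only usageMu is named above): usage updates take the dedicated mutex only, so they can be made from under a stream's lock without reaching for the jsAccount or jetStream locks.

```go
package main

import "sync"

type jsAccount struct {
	mu      sync.RWMutex // protects streams and other account state
	usageMu sync.Mutex   // dedicated lock for usage/limit fields only
	memUsed int64        // illustrative usage field
}

// updateUsage can be called while a stream holds its own lock; it only
// ever takes usageMu, never mu, so the jsAccount->stream order holds.
func (jsa *jsAccount) updateUsage(delta int64) {
	jsa.usageMu.Lock()
	jsa.memUsed += delta
	jsa.usageMu.Unlock()
}

func main() {
	jsa := &jsAccount{}
	jsa.updateUsage(1024)
}
```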
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
When deciding to compact a file, we need to remove the empty records
from the raw bytes, otherwise, for small messages, we would
end up calling compact() too many times.
When removing a message from the stream, in FIFO cases we would
write the index at most every 2 seconds when doing it in place,
but when dealing with out-of-order deletes, we would do it for
every single delete, which can be costly. We are now writing
only every 500ms for non-FIFO cases.
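An illustrative sketch of that write throttle (not the actual filestore code; the 2s/500ms intervals are the ones mentioned above):

```go
package main

import (
	"fmt"
	"time"
)

type indexWriter struct {
	lastWrite time.Time
}

// maybeWrite flushes the delete index at most every 2s for in-order
// (FIFO) removals and at most every 500ms for out-of-order removals,
// instead of on every single delete.
func (w *indexWriter) maybeWrite(fifo bool, flush func()) {
	interval := 500 * time.Millisecond // out-of-order deletes
	if fifo {
		interval = 2 * time.Second // in-order, in-place deletes
	}
	if time.Since(w.lastWrite) >= interval {
		flush()
		w.lastWrite = time.Now()
	}
}

func main() {
	w := &indexWriter{}
	w.maybeWrite(false, func() { fmt.Println("index flushed") })
	w.maybeWrite(false, func() { fmt.Println("never printed: too soon after the last flush") })
}
```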
Also fixed some unrelated code:
- The decision to install a snapshot was based on an incorrect logical
expression.
- In checkPending(), protect against the timer being nil, which
could happen when the consumer is stopped or there is a leadership change.
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Step down timing for consumers or streams.
Signals loss of leadership and sleeps before stepping down.
This makes it less likely that messages are being processed during step down.
When becoming leader, the consumer's stream sequence number got reset,
even though the consumer already existed.
Proper cleanup of redelivery data structures and timer.
Signed-off-by: Matthias Hanel <mh@synadia.com>
* [Fixed] limits enforcement issues
Stream create had checks that stream restore did not have.
Moved the code into a commonly used function, checkStreamCfg.
Also introduced (clustered/non-clustered) StreamLimitsCheck functions to
perform checks specific to clustered/non-clustered data structures.
Check for a valid stream config and limits/reservations before
receiving all the data; the request now fails right away.
Added a JetStream limit "max_request_batch" to limit the fetch batch size.
Shortened the max name length from 256 to 255, a more common file name limit.
Added a check for loops in cyclic source stream configurations (see the sketch below).
Other features related to limits.
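As mentioned above, a hedged Go-client sketch of the cyclic-source case the loop check is meant to reject (stream names are illustrative): A sources B while B sources A, so messages would circulate between the two streams forever.

```go
package main

import (
	"fmt"
	"log"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	if _, err := js.AddStream(&nats.StreamConfig{
		Name:     "A",
		Subjects: []string{"a.>"},
	}); err != nil {
		log.Fatal(err)
	}
	if _, err := js.AddStream(&nats.StreamConfig{
		Name:    "B",
		Sources: []*nats.StreamSource{{Name: "A"}},
	}); err != nil {
		log.Fatal(err)
	}

	// Closing the cycle: with the loop check in place, this update is
	// expected to be rejected rather than silently creating A -> B -> A.
	_, err = js.UpdateStream(&nats.StreamConfig{
		Name:     "A",
		Subjects: []string{"a.>"},
		Sources:  []*nats.StreamSource{{Name: "B"}},
	})
	fmt.Println("closing the cycle:", err)
}
```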
Signed-off-by: Matthias Hanel <mh@synadia.com>
- Updated tests that were checking for the error to include the limit
- Moved some tests above the benchmark ones
Signed-off-by: Ivan Kozlovic <ivan@synadia.com>
Basically, a gateway subject propagation issue could be hidden behind a leaf
node.
Also changed the error text when this was the case.
Signed-off-by: Matthias Hanel <mh@synadia.com>
The system will allow an update that places a stream, and subsequently all attached consumers, in another cluster, either directly or via tag placement.
The meta layer will scale the underlying peerset appropriately to straddle the two clusters for both the stream and consumers, taking into account the consumer type.
Control will then pass to the current leaders of the assets who will monitor the catchup status of the new peers.
(Note we can optimize this later to only traverse once across a GW for any given asset, but for now this is simpler)
Once the original leaders have determined the assets are synched, they will pass leadership to a member of the new peerset.
Once the new leader has been elected, it will forward a request for the meta layer to shrink the peerset by removing the old peers.
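A hedged Go-client sketch of kicking off such a move (names, replica count, and subjects are illustrative, and the rest of the config should match the existing stream): updating the stream's placement to the target cluster starts the expand / catch-up / shrink sequence described above.

```go
package main

import (
	"log"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Close()

	js, err := nc.JetStream()
	if err != nil {
		log.Fatal(err)
	}

	// Move the existing stream ORDERS (and its consumers) to cluster "C2".
	if _, err := js.UpdateStream(&nats.StreamConfig{
		Name:      "ORDERS",
		Subjects:  []string{"orders.>"},
		Replicas:  3,
		Placement: &nats.Placement{Cluster: "C2"},
	}); err != nil {
		log.Fatal(err)
	}
}
```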
Signed-off-by: Derek Collison <derek@nats.io>
* added max_ack_pending setting to JS account limits
Because of the addition, defaults now have to be set later (they depend on
these new limits now).
Also re-organized the code to more closely track how stream create looks.
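A hedged sketch of where this lands in an account's JetStream limits (the surrounding keys are the usual account limit keys from memory; exact spelling may differ):

```
accounts {
  APP {
    jetstream {
      max_mem: 1G
      max_file: 10G
      max_streams: 10
      max_consumers: 100
      max_ack_pending: 1000   # the new per-account cap added here
    }
  }
}
```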
Signed-off-by: Matthias Hanel <mh@synadia.com>