nats-server

mirror of https://github.com/gogrlx/nats-server.git synced 2026-04-02 03:38:42 -07:00

Author	SHA1	Message	Date
Derek Collison	855f790e3c	Improve performance and latency with large number of sparse consumers. When a stream had a large number of consumers on a server that were sparse, the signaling mechanism would do a linear scan to signal matching consumers. As usage patterns have continued to have more consumers that are filteres and sparse, meaning a message is destined for a single or small number of consumers. This change moves selection to a sublist that tracks only active consumer leaders for selection, which optimizes selection of consumers to signal when the number of consumers is large. Signed-off-by: Derek Collison <derek@nats.io>	2022-12-13 10:09:59 -08:00
R.I.Pienaar	e234e773ab	Rate limit frequent debug log lines With the increase use of subject based limits not being able to store due to limits exceeded happens frequently and makes running the server in debug quite noisy, so we rate limit this log line even in debug Signed-off-by: R.I.Pienaar <rip@devco.net>	2022-12-09 11:53:52 +01:00
Derek Collison	5f7c8e21a2	Fixed issues with multiple concurrent stream create requests. First issue was applications not getting any response. However, there was also a more serious issue that would create multiple raft groups for each concurrent request. The servers would only run one stream monitor loop, however they would update the state to the new raft group's name, so on server restart the stream would be using a different raft group then existing servers. Signed-off-by: Derek Collison <derek@nats.io>	2022-12-04 19:13:51 -08:00
Ivan Kozlovic	74a16b0097	Merge pull request #3640 from nats-io/fix_3639 [FIXED] JetStream: WorkQueue not preventing overlapping consumers	2022-11-16 17:22:35 -07:00
Ivan Kozlovic	49faba9e33	[FIXED] JetStream: WorkQueue not preventing overlapping consumers A stream with a WorkQueue retention policy is supposed to allow more than one consumer if they user filtered subjects, but those subjects should not overlap. There was an issue that if a new consumer had a filter subject "wider" than an existing one, the error was not detected and the new consumer was incorrectly accepted. Resolves #3639 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-11-16 17:09:30 -07:00
Derek Collison	e4b5b7ec64	Make mirror consumers use filtered version of consumer create like source consumers. Signed-off-by: Derek Collison <derek@nats.io>	2022-11-16 15:12:14 -08:00
Derek Collison	9f241f3322	Offload signaling to consumers when number is large. Signed-off-by: Derek Collison <derek@nats.io>	2022-11-15 11:25:07 -08:00
Derek Collison	b6ef2c8910	Auto cleanup dangling messages from interest policy streams on server start. Signed-off-by: Derek Collison <derek@nats.io>	2022-11-14 15:02:22 -08:00
Ivan Kozlovic	c16ccd34c3	[FIXED] JetStream: Sources with OptStartTime gets redelivered If start by time is before what we remember during recovery use that instead Resolves #3559 Signed-off-by: Derek Collison <derek@nats.io> Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-11-03 16:09:06 -06:00
Ivan Kozlovic	3c7aa554f7	[FIXED] JetStream: return error on negative replicas count If a stream is created or updated with a negative replicas count, and error is now returned. Same for consumers. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-10-10 12:32:41 -06:00
Derek Collison	91edd1a8d0	With snapshots both streams are present on restart so sources or mirrors that have a subject change from the origin would not recover. We now suppress that if we know we are recovering an existing stream. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-29 09:17:15 -06:00
Derek Collison	61a3cff274	Also require MaxMsgsPerSubject to be set per peer review feedback. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-22 06:56:32 -07:00
Derek Collison	2d737edba6	Allow discard new per subject for certain KV type scenarios. Requires general DiscardNewPolicy. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-22 06:38:29 -07:00
Derek Collison	8b2315eadd	When filtering a source stream use new consumer create API subject. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-16 14:45:33 -07:00
Ivan Kozlovic	170ff49837	[ADDED] JetStream: peer (the hash of server name) in statsz/jsz A request to `$SYS.REQ.SERVER.PING.JSZ` would now return something like this: ``` ... "meta_cluster": { "name": "local", "leader": "A", "peer": "NUmM6cRx", "replicas": [ { "name": "B", "current": true, "active": 690369000, "peer": "b2oh2L6w" }, { "name": "Server name unknown at this time (peerID: jZ6RvVRH)", "current": false, "offline": true, "active": 0, "peer": "jZ6RvVRH" } ], "cluster_size": 3 } ``` Note the "peer" field following the "leader" field that contains the server name. The new field is the node ID, which is a hash of the server name. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-09-16 15:31:37 -06:00
Ivan Kozlovic	dc2e4b714a	Merge pull request #3473 from nats-io/js_raft_remove_by_peer_id [ADDED] JetStream: ability to remove a server by peer ID instead of name	2022-09-15 13:52:20 -06:00
Ivan Kozlovic	f113163b9f	Change ByID boolean to Peer string and add Peer id in replicas output The CLI will now be able to display the peer IDs in MetaGroupInfo if it choses to do so, and possibly help user select the peer ID from a list with a new command to remove by peer ID instead of by server name. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-09-15 10:39:23 -06:00
Derek Collison	2aaf22b0de	For ackMsg, make sure sequence is still relevant as well. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-14 16:47:35 -07:00
Derek Collison	6c97733bb8	Optimize needAck. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-14 16:25:50 -07:00
Ivan Kozlovic	b69ffe244e	Fixed some tests Code change: - Do not start the processMirrorMsgs and processSourceMsgs go routine if the server has been detected to be shutdown. This would otherwise leave some go routine running at the end of some tests. - Pass the fch and qch to the consumerFileStore's flushLoop otherwise in some tests this routine could be left running. Tests changes: - Added missing defer NATS connection close - Added missing defer server shutdown Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-09-08 11:28:23 -06:00
Derek Collison	b850a95d4c	Remove auto-promotion of direct get. Force stream config to set AllowDirect to true. Signed-off-by: Derek Collison <derek@nats.io>	2022-09-06 13:33:39 -07:00
Derek Collison	452e3c3955	Make sure update covered Signed-off-by: Derek Collison <derek@nats.io>	2022-08-31 16:05:32 -07:00
Derek Collison	b649ae8bc3	We detected overlap between different streams but not same stream with multiple subjects. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-31 15:51:39 -07:00
Derek Collison	56e177c329	Allow stream msgs to be compressed within the raft layer and during upper layer catchups. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-30 16:10:57 -07:00
Derek Collison	d04763eb7d	CAS operations improved, hold lock past store. Use separate lock for consumer list and storage updates. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-24 18:30:44 -07:00
Derek Collison	c48bd6ea54	Inline turning on direct subs for non-clustered streams Signed-off-by: Derek Collison <derek@nats.io>	2022-08-23 11:20:18 -07:00
Derek Collison	212adf5775	General improvements to clustered streams during server restart and KV/CAS scenarios. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-22 18:36:15 -07:00
Matthias Hanel	c02d1ad69e	fix consumer subject validation on recovery (#3389 ) This fixes an issue introduced in #3080 The consumer filter subject check was skipped on recovery. The intent was to bypass the upstream stream subjects. But it also filtered the downstream stream subject. This became a problem when the downstream was itself an upstream. Then during recover, the stream subject was not checked, which lead to delivery of filtered messages that should never have been delivered. Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-22 14:30:00 -07:00
Derek Collison	1228a32bc5	Merge pull request #3380 from nats-io/direct-get-delay Don't immediately listen on the direct get subjects.	2022-08-17 17:26:52 -07:00
Matthias Hanel	904b7aeefc	fixed consumer source update receiving upstream msgs (#3364 ) if an origin stream contains: 1M msgs with subject foo and 1M msgs with subject bar IF the source consumer changes their filter from foo to bar Then it would have received messages for subject bar. This happens because this tail was filtered and their respective seqno was not communicated to the consumer. This is somewhat unexpected. It is also coincidental. Had the last message in the stream had subject foo then this wouldn't happen. Therefore, when completely changing the subject say, from foo to bar, we only receive messages received after the time the change was made. However, if the old and new subject overlap in any way, we go by sequence number. Meaning in these cases the outlined behavior remains in order to not induce artificial message loss for the part of the subject space that is covered by old and new filter. Signed-off-by: Matthias Hanel <mh@synadia.com> Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-17 17:16:03 -07:00
Derek Collison	ce2d5fa173	Don't immediately listen on the direct get subjects. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-17 16:39:34 -07:00
Derek Collison	827b34a77a	Add support for AES cipher encryption for filestore. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-15 14:21:37 -07:00
Ivan Kozlovic	00345cac64	[FIXED] JetStream: subject overlap error should be returned In standalone mode, when attempting to create a stream which has subjects that overlap with an existing stream, the generic stream create error "10049" was returned instead of the more accurate "10065" error code corresponding to subject overlap, as it was the case in clustered mode. Resolves #3362 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-08-11 13:32:29 -06:00
Ivan Kozlovic	1f428310b0	Fixed message timestamp formatting for direct message get feature In normal message get, the returned format is RFC3339Nano, which is what is being used when using JSON marshaling. However, for the direct get we had to pass a string to construct the header and we were using time.Time.String() which was using a different layout. So use time.Time.MarshalJSON() to be consistent with the non-direct get message. Libraries that already parsed the non RFC3339Nano time format can be updated since none should have been released yet (since the feature in the server is not released yet) Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-08-10 12:53:08 -06:00
Matthias Hanel	c6e37cf7af	Fix race between stream stop and monitorStream (#3350 ) * Fix race between stream stop and monitorStream monitorCluster stops the stream, when doing so, monitorStream needs to be stopped to avoid miscounting of store size. In a test stop and reset of store size happened first and then was followed by storing more messages via monitorStream Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-10 19:01:21 +02:00
Matthias Hanel	2cf2868406	fixed consumer restart on source filter update (#3355 ) * fixed consumer restart on source filter update When a stream source filter subject was updated, the internal consumer was not re created If the upstream stream contains a tail of previously filtered messages, these will now be delivered Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-10 18:47:19 +02:00
Matthias Hanel	5588c3d0de	Added check for source/mirror filter subjects (#3356 ) * Added check for source/mirror filter subjects When the origin stream exists, the sourec/mirror filter subject will be checked against the stream subjects. If there is no overlap, an error will be returned Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-10 18:46:52 +02:00
Derek Collison	3407112292	Write lock not needed Signed-off-by: Derek Collison <derek@nats.io>	2022-08-08 11:06:10 -06:00
Ivan Kozlovic	d90854a45f	Merge pull request #3341 from nats-io/go_1_19 Move to Go 1.19, remote io/util, fix data race and a flapper	2022-08-05 12:49:06 -06:00
Matthias Hanel	c56f3b9fbd	Adding account purge operation (#3319 ) * Adding account purge operation The new request is available for the system account. The subject to send the request to is $JS.API.ACCOUNT.PURGE.* With the name of the account to purge instead of the wildcard. Also added directory cleanup code such that server do not end up with empty streams directories and account dirs that only contain streams Also adding ACCOUNT to leaf node domain rewrite table Addresses #3186 and #3306 by providing a way to get rid of the streams for existing and non existing accounts Signed-off-by: Matthias Hanel <mh@synadia.com>	2022-08-05 18:24:19 +02:00
Ivan Kozlovic	3c9a7cc6e5	Move to Go 1.19, remote io/util, fix data race and a flapper Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-08-05 09:55:37 -06:00
Todd Beets	9f8b4461f3	change set test to > 0	2022-08-04 17:37:40 -07:00
Ivan Kozlovic	b73afbdcb1	[FIXED] JetStream: reject stream update with changes to RePublish The update was not rejected, yet the republish update was not taking place. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-08-03 16:37:39 -06:00
Derek Collison	748890adb1	Auto-set and upgrade AllowDirect when MaxMsgsPerSubject is set. Also allow mirrors to inherit properly. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-03 12:36:52 -07:00
Derek Collison	c82c49451c	Allow direct get by subject to be all subject based. This avoids marshalling or unmarshalling but also allows subject based permissioning. Signed-off-by: Derek Collison <derek@nats.io>	2022-08-02 18:19:33 -07:00
Ivan Kozlovic	1a6c5f1c90	[FIXED] JetStream: Some scaling up issues - Send snapshot only if leader - When processing snapshot, start with a smaller inactivity interval that will double up to 10sec or use 10sec directly once we get a message. Reason for that is that it is possible that the request for snapshot is sent while the leader has not yet setup the subscription that receives the requests (or subscription has not fully reached the cluster). - Don't remember snapfile on err. - Do not consider current if we have not had any activity. - Stabilize stream scale up under active heavy publishing. - Due to the publish pressure move the check for followers direct subs spinning up til after we stop publishing. Signed-off-by: Derek Collison <derek@nats.io> Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-07-25 18:44:18 -06:00
Ivan Kozlovic	1da5ecfb96	[IMPROVED] JetStream: stream already exists error description The `JSStreamNameExistErr` will now include in the description that the stream exists with a different configuration, because that is the error clients would get when trying to add a stream with a different configuration (otherwise this is a no-op and client don't get an error). Since that error was used in case of restore, a new error is added but uses the same description prefix "stream name already in use" but adds ", cannot restore" to indicate that this is a restore failure because the stream already exists. Resolves #3273 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-07-21 10:20:07 -06:00
Todd Beets	302f85dcc2	Merge pull request #3271 from nats-io/repub-fix-srcignored Fix and test for RePub src ignored at runtime.	2022-07-18 15:02:09 -07:00
Todd Beets	b145952e78	Fix and test for RePub src ignored at runtime.	2022-07-15 13:50:35 -07:00
Ivan Kozlovic	0887f3d66a	Check for mirror configuration With the previous commit, this should no longer be possible, but otherwise we got report of a panic when accessing the mirror configuration. The following test would be able to produce the panic: ``` s := RunBasicJetStreamServer() if config := s.JetStreamConfig(); config != nil { defer removeDir(t, config.StoreDir) } defer s.Shutdown() nc, js := jsClientConnect(t, s) defer nc.Close() _, err := js.AddStream(&nats.StreamConfig{Name: "SOURCE"}) require_NoError(t, err) cfg := &nats.StreamConfig{ Name: "M", Mirror: &nats.StreamSource{Name: "SOURCE"}, } _, err = js.AddStream(cfg) require_NoError(t, err) err = js.DeleteStream("SOURCE") require_NoError(t, err) cfg.Mirror = nil _, err = js.UpdateStream(cfg) require_NoError(t, err) time.Sleep(5 * time.Second) ``` Again, now that we reject the mirror config update, the panic should no longer happen, but adding preventive code in case we allow in the future. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2022-07-15 11:18:23 -06:00

... 2 3 4 5 6 ...

516 Commits