nats-server

mirror of https://github.com/gogrlx/nats-server.git synced 2026-04-16 19:14:41 -07:00

Author	SHA1	Message	Date
Ivan Kozlovic	d99d0eb069	[FIXED] Added defensive code for handling of leafnode connection This is a port of PR #1652/#1660. The code in the 2.1.x branch is not sensitive to the issue fixed in these PRs because marking the connection as closed (for instance due to a TCP error in sendProtoNow) will not set `c.nc` to nil, so there won't be the nil dereference issue that was found in the main branch. However, porting the code for extra safety. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-10-21 17:21:53 -06:00
Ivan Kozlovic	9ff8bcde2e	[FIXED] Possible panic if server receives a maliciously crafted JWT This addresses [CVE-2020-26521](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-26521) This is mainly a port of #1624 with some other updates related to tests. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-10-21 10:22:57 -06:00
Waldemar Quevedo	7a88eee090	[FIXED] Better support for distinguishedNameMatch in TLS Auth Signed-off-by: Waldemar Quevedo <wally@synadia.com>	2020-09-03 10:12:06 -06:00
Ivan Kozlovic	073789b860	Fix flappers Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 17:07:04 -06:00
Ivan Kozlovic	ff89a5f277	[FIXED] Handling of real duplicate subscription That is, if the server receives "SUB foo 1" more than once from the same client, we would register in the client map this subscription only once, and add to the account's sublist only once, however we would have updated shadow subscriptions and route/gateway maps for each SUB protocol, which would result in inability to send unsubscribe to routes when the client goes away or unsubscribes. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 15:10:22 -06:00
Ivan Kozlovic	2f877e7755	Fixed flapper Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 15:05:12 -06:00
Ivan Kozlovic	7e22004c3a	[FIXED] Unsubscribe may not be propagated through a leaf node There is a race between the processing of a subscription and the init/send of subscriptions when accepting a leaf node connection that may cause internally a subscription's subject to be counted many times, which would then prevent the send of an LS- when the subscription's interest goes away. Imagine this sequence of events, each side represents a "thread" of execution: ``` client readLoop leaf node readLoop ---------------------------------------------------------- recv SUB foo 1 sub added to account's sublist recv CONNECT auth, added to acc. updateSmap smap["foo"]++ -> 1 no LS+ because !allSubsSent init smap finds sub in acc sl smap["foo"]++ -> 2 sends LS+ foo allSubsSent == true recv UNSUB 1 updateSmap smap["foo"]-- -> 1 no LS- because count != 0 ---------------------------------------------------------- ``` Equivalent result but with slightly different execution: ``` client readLoop leaf node readLoop ---------------------------------------------------------- recv SUB foo 1 sub added to account's sublist recv CONNECT auth, added to acc. init smap finds sub in acc sl smap["foo"]++ -> 1 sends LS+ foo allSubsSent == true updateSmap smap["foo"]++ -> 2 no LS+ because count != 1 recv UNSUB 1 updateSmap smap["foo"]-- -> 1 no LS- because count != 0 ---------------------------------------------------------- ``` The approach for the fix is to delay the creation of the smap until we actually initialize the map and send the subs on processing of the CONNECT. In the meantime, as soon as the LN connection is registered and available in updateSmap, we check whether smap is nil or not. If nil, we do nothing. In "init smap" we keep track of the subscriptions that have been added to smap. This map will be short lived, just enough to protect against races above. In updateSmap, when smap is not nil, we need to check, if we are adding, that the subscription has not already been handled. 
The temporary subscription map will be ultimately emptied/set to nil with the use of a timer (if not emptied in place when processing smap updates). Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 14:48:42 -06:00
Ivan Kozlovic	fe31ab3796	[FIXED] Possible removal of interest on queue subs with leaf nodes Server was incorrectly processing a queue subscription removal as both a plain sub and queue sub, which may have resulted in drop of interest even when some queue subs remained. Resolves #1421 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 14:39:19 -06:00
Ivan Kozlovic	df676b7c63	[FIXED] Race condition during implicit Gateway reconnection Say server in cluster A accepts a connection from a server in cluster B. The gateway is implicit, in that A does not have a configured remote gateway to B. Then the server in B is shutdown, which A detects and initiates a single reconnect attempt (since it is implicit and if the reconnect retries is not set). While this happens, a new server in B is restarted and connects to A. If that happens before the initial reconnect attempt failed, A will register that new inbound and does not attempt to solicit because it already has a remote entry for gateway B. At this point when the reconnect to old server B fails, then the remote GW entry is removed, and A will not create an outbound connection to the new B server. We fix that by checking if there is a registered inbound when we get to the point of removing the remote on a failed implicit reconnect. If there is one, we try the reconnection. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 14:36:54 -06:00
Ivan Kozlovic	6e3401eb81	[FIXED] Allow response permissions to work across accounts This is a port of https://github.com/nats-io/nats-server/pull/1487 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-08-27 14:33:53 -06:00
Waldemar Quevedo	9a2d095885	Add support to match domainComponent (DC) in RDNSequence with TLS Auth Currently when using TLS based authentication, any domain components that could be present in the cert will be omitted since Go's ToRDNSequence is not including them: `202c43b2ad/src/crypto/x509/pkix/pkix.go (L226-L245)` This commit adds support to include the domain components in case present, also roughly following the order suggested at: https://tools.ietf.org/html/rfc2253 Signed-off-by: Waldemar Quevedo <wally@synadia.com>	2020-05-11 17:41:11 -07:00
Ivan Kozlovic	1cf21fc4ee	Fix some leafnode test flappers Make use of some existing helpers and add checkFor in some places since accounting updates may not be instantaneous. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-04-15 15:15:26 -06:00
Derek Collison	a301d6731b	Re-order client close Signed-off-by: Derek Collison <derek@nats.io>	2020-04-14 09:54:57 -07:00
Derek Collison	aff10aa16b	Fix for #1344 Signed-off-by: Derek Collison <derek@nats.io>	2020-04-14 09:26:35 -07:00
Derek Collison	ef85a1b836	Fix for #1336 Signed-off-by: Derek Collison <derek@nats.io>	2020-04-10 17:30:03 -07:00
Matthias Hanel	e8ce738808	Test of service across accounts and leaf node. Tests #1336 Signed-off-by: Matthias Hanel <mh@synadia.com>	2020-04-10 15:55:10 -04:00
Derek Collison	f9d9ac193a	Use prefix to make sure we use right subject Signed-off-by: Derek Collison <derek@nats.io>	2020-04-10 10:49:05 -07:00
Derek Collison	090abc939d	Fix for stream imports and leafnodes, #1332 Signed-off-by: Derek Collison <derek@nats.io>	2020-04-10 10:36:20 -07:00
Derek Collison	e843a27bba	When a responder was on a leaf node and the requestor was connected to the same server as the leafnode we did not propagate the service reply wildcard properly. This fixes that. Signed-off-by: Derek Collison <derek@nats.io>	2020-04-10 08:35:09 -07:00
Derek Collison	699502de8f	Detection for loops with leafnodes. We need to send the unique LDS subject to all leafnodes to properly detect setups like triangles. This will have the server who completes the loop be the one that detects the error solely based on its own loop detection subject. Other changes are just to fix tests that were not waiting for the new LDS sub. Signed-off-by: Derek Collison <derek@nats.io>	2020-04-08 20:00:40 -07:00
Derek Collison	82f585d83a	Updated to also resend leafnode connect on GW connect via first INFO Signed-off-by: Derek Collison <derek@nats.io>	2020-04-08 09:55:19 -07:00
Derek Collison	43fbe0ffed	This commit allows new servers in a supercluster to be informed of accounts with active leafnode connections. This is needed to put those accounts into interest only mode for inbound gateway connections. Also added code to make sure we were doing proper account tracking and would track the global account as well, which used to be excluded. Fixes #977 Signed-off-by: Derek Collison <derek@nats.io>	2020-04-07 16:22:15 -07:00
Ivan Kozlovic	a19dcbee2d	Release v2.1.6 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-03-31 10:29:06 -06:00
Matthias Hanel	6f77a54118	[FIXED] loop detection by checking for duplicate lds subscriptions This is in addition to checking if the own subscription comes back. The duplicated lds subscription must come from a different client. Added unit tests. Also prefixed lds with '$' to mark it as system subject going forward. This moves the loop detection check past other checks. These checks should not trigger in cases where a loop is initially detected. Fixes #1305 Signed-off-by: Matthias Hanel <mh@synadia.com>	2020-03-17 19:06:35 -04:00
Ivan Kozlovic	8a5b9269d7	Merge pull request #1301 from nats-io/ulimit-unittest Modifying unit test error message to hint at ulimit -n possibly being too low	2020-03-04 12:52:15 -07:00
Matthias Hanel	68efc95a60	Modifying unit test error message to hint at ulimit -n possibly being too low Signed-off-by: Matthias Hanel <mh@synadia.com>	2020-03-04 14:30:35 -05:00
Matthias Hanel	a8e6af30a3	On client connect, send first ping after ping interval. On connect message resend reset timer with setFirstPingTimer, so RTT can be obtained quicker. Disable short first ping in default server options for client_test. In log_test prevent immediate scheduling by setting ping interval. Signed-off-by: Matthias Hanel <mh@synadia.com>	2020-03-02 20:10:15 -05:00
Ivan Kozlovic	7ab9c76f2b	Fixed benchmark tests to be able to run with Go 1.13+ Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-02-26 12:12:18 -07:00
Ivan Kozlovic	8e4b449119	Fixed flappers Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-02-19 13:19:08 -07:00
Ivan Kozlovic	47b08335a4	[FIXED] Reset of tlsName only for x509.HostnameError For issue #1256, we cleared the possibly saved tlsName on handshake failure. However, this meant that for normal use cases, if a reconnect failed for any reason we would not be able to reconnect if it is an IP until we get back to the URL that contained the hostname. We now clear only if the handshake error is of x509.HostnameError type, which includes errors such as: ``` "x509: Common Name is not a valid hostname: <x>" "x509: cannot validate certificate for <x> because it doesn't contain any IP SANs" "x509: certificate is not valid for any names, but wanted to match <x>" "x509: certificate is valid for <x>, not <y>" ``` Applied the same logic to solicited gateway connections, and fixed the fact that the tlsConfig should be cloned (since we set the ServerName). I have also made a change for leafnode connections similar to what we are doing for gateway connections, which is to use the saved tlsName only if tlsConfig.ServerName is empty, which may not be the case for users that embed NATS Server and pass directly tls configuration. In other words, if the option TLSConfig.ServerName is not empty, always use this value. Relates to #1256 Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-01-28 13:16:38 -07:00
Derek Collison	643e73c0c5	Fix for #1256 , mixed IP and DNS for cluster and TLS with leafnodes Signed-off-by: Derek Collison <derek@nats.io>	2020-01-22 11:25:09 -08:00
Ivan Kozlovic	bdd7fa86e9	Update flapping test We need to wait for the route close to be processed before attempting to recreate it. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-01-10 12:01:59 -07:00
Ivan Kozlovic	c097357b52	[FIXED] More than expected switch to Interest-Only mode for account When an account is switched to interest-only mode due to no interest, it was not possible to switch that account more than once. But the function switchAccountToInterestMode() that triggers a switch could possibly do it more than once. This should not cause problems but increased the number of traces in a big super cluster. Also fixed some flappers and a data race. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-01-09 13:35:08 -07:00
Ivan Kozlovic	b42856afa2	Set expectConnect flag for CLIENT only if auth required Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-01-07 10:48:11 -07:00
Ivan Kozlovic	c73be88ac0	Updated based on comments Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2020-01-06 16:57:48 -07:00
Ivan Kozlovic	947798231b	[UPDATED] TCP Write and SlowConsumer handling - All writes will now be done by the writeLoop, unless when the writeLoop has not been started yet (likely in connection init). - Slow consumers for non CLIENT connections will be reported but not failed. The idea is that routes, gateway, etc.. connections should stay connected as much as possible. However if a flush operation times out and no data at all has been written, the connection will be closed (regardless of type). - Slow consumers due to max pending is only for CLIENT connections. This allows sending of SUBs through routes, etc.. to not have to be chunked. - The backpressure to CLIENT connections is increased (up to 1sec) based on the sub's connection pending bytes level. - Connection is flushed on close from the writeLoop as to not block the "fast path". Some tests have been fixed and adapted since now closeConnection() is not flushing/closing/removing connection in place. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-12-31 15:06:27 -07:00
Ivan Kozlovic	1b2754475b	Refactor async client tests Updated all tests that use "async" clients. - start the writeLoop (this is in preparation for changes in the server that will not do send-in-place for some protocols, such as PING, etc..) - Added missing defers in several tests - fixed an issue in client.go where test was wrong possibly causing a panic. - Had to skip a test for now since it would fail without server code change. The next step will be ensure that all protocols are sent through the writeLoop and that the data is properly flushed on close (important for -ERR for instance). Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-12-12 11:58:24 -07:00
Derek Collison	ffc3c0da70	Fixed #1144 , qsub performance improvements Signed-off-by: Derek Collison <derek@nats.io>	2019-12-09 22:08:59 +01:00
Ivan Kozlovic	ae99fc3a2a	Fixed issues reported by staticcheck Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-12-04 17:04:58 -07:00
Ivan Kozlovic	63138509f7	Tune some code/test for Windows Running test suite on a Windows VM, I notice several failures. Updated the compute of the RTT to be at least 1ns. I think that this is just an issue with the VM I am running, but that change will have no impact for normal situations (since setting the rtt to the very minimum duration (1ns) instead of 0) and will prevent some tests from failing. Because of those same timer granularity issues, I had to add some delays between some actions in order for time.Sub()/Since() to actually report something more than 0. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-11-21 14:32:46 -07:00
Ivan Kozlovic	977c290bf2	[FIXED] Handling of split buffer for LEAF messages Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-11-18 11:55:18 -07:00
Derek Collison	07253c0517	Merge pull request #1196 from nats-io/daisy Allow interest propagation with daisy-chained leafnodes	2019-11-17 17:46:23 -08:00
Derek Collison	07da68ce56	Allow interest propagation with daisy chained leafnodes Signed-off-by: Derek Collison <derek@nats.io>	2019-11-17 17:35:20 -08:00
Ivan Kozlovic	e0bc81d0ed	Make the Leafnode internal sub on _GR_.> This is needed for mapped gateway replies. We had used an extra token when implementing the new prefix, but it was then removed, but the leafnode subscription on _GR_...*.> was not updated. We now subscribe on _GR_.> There was a test that was passing because we were using inboxes that caused the pattern to match. Replaced with single token reply so that it would have caught this bug. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-11-17 17:37:09 -07:00
Derek Collison	7b1bea61e2	Merge pull request #1192 from nats-io/load_account Do not fetch accounts on system events.	2019-11-16 18:33:23 -08:00
Derek Collison	f60266bc2e	Merge pull request #1190 from nats-io/import_reply Introduced wildcard handling of _R_ mapped replies.	2019-11-16 18:07:18 -08:00
Derek Collison	093b57ed40	Do not fetch accounts on system events. Noticed we would lookup accounts, but would also fetch them when tracking remote connections, etc. Signed-off-by: Derek Collison <derek@nats.io>	2019-11-16 18:05:42 -08:00
Ivan Kozlovic	3e1728d623	[FIXED] Some accounts locking issues - Risk of deadlock when checking if issuer claim are trusted. There was a RLock() in one thread, then a request for Lock() in another that was waiting for RLock() to return, but the first thread was then doing RLock() which was not acquired because this was blocked by the Lock() request (see `e2160cc571`) - Use proper account/locking mode when checking if stream/service exports/signer have changed. - Account registration race (regression from https://github.com/nats-io/nats-server/pull/890) - Move test from #890 to "no race" test since only then could it detect the double registration. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-11-16 16:59:38 -07:00
Derek Collison	6ad8287bbe	Introduced wildcard handling of _R_ mapped replies. We had too much special processing, so reduced to a single wildcard which will propagate across routes and gateways and is consistent with gateway handling of globally routed subjects and timeouts. Signed-off-by: Derek Collison <derek@nats.io>	2019-11-16 12:50:53 -08:00
Ivan Kozlovic	bdf5cf63b3	Shutdown on Ctrl+C Changed code on Windows to not use svc code if running in interactive mode. The original code was running svc.debug.Run() which uses service code (Execute()) but from the command line. We don't need that. Also reduced salt on bcrypt password for a config file that started to cause failures due to test taking too long to finish. Signed-off-by: Ivan Kozlovic <ivan@synadia.com>	2019-11-14 20:05:32 -07:00

447 Commits