nats-server

mirror of https://github.com/gogrlx/nats-server.git synced 2026-04-17 03:24:40 -07:00

Author	SHA1	Message	Date
Ivan Kozlovic	461ab96a58	Additional fix to #631 This is the result of flapping tests in go-nats that were caused by a defect (see PR https://github.com/nats-io/go-nats/pull/348). However, during debugging, I realized that there were also things that were not quite right on the server side. This change should make the notification of cluster topology changes to clients more robust.	2018-03-05 20:03:46 -07:00
Ivan Kozlovic	aeca31ce51	[FIXED] Cluster topology change possibly not sent to clients When a server accepts a route, it will keep track of that server's `connectURLs` array. However, if the server was creating a route to that other server at the same time, it will promote the route as a solicited one. The content of that array was not transferred, which means that on a disconnect, it was possible that the cluster topology change was not properly sent to clients.	2018-03-04 14:38:03 -07:00
Ivan Kozlovic	1acf330e07	[ADDED] Notification to clients when servers leave the cluster Until now, a server would only notify clients of servers that join the cluster. Moreover, a server would send information to its clients only if new servers were added. This PR changes this by sending to clients that support async INFO the list of URLs for all servers in the cluster any time that there is a change (joining or leaving the cluster). As of now, clients will not be affected by the change (and will not benefit from it, for example by removing servers from their server pool). This will be addressed in each supported client once this is merged.	2018-02-27 14:22:13 -07:00
Ivan Kozlovic	163ba3f6a7	Merge pull request #608 from nats-io/fix_issue_447 [ADDED] Client and Cluster Advertise. Issue #447	2018-02-26 09:12:58 -07:00
Ivan Kozlovic	acf4a31e4b	Major updates + support for config reload of client/cluster advertise	2018-02-05 20:15:36 -07:00
Tyler Treat	379701758c	Use correct log level for route errors Also move route created log message to when the route has actually been created successfully.	2017-12-11 08:10:42 -06:00
Peter Miron	306a3f9507	Resolving Ivan's feedback.	2017-11-29 15:35:05 -05:00
Peter Miron	4829592107	removed support for array of Advertise addresses. Added support for Route advertise address.	2017-11-29 11:41:08 -05:00
Ivan Kozlovic	fa86ff93eb	[CHANGED] Server notifies clients when server rejoins cluster When the option Cluster.NoAdvertise is false, a server will send an INFO protocol message to its clients when a server has joined the cluster. Previously, the protocol would be sent only if the joining server's "client URLs" (the addresses where clients connect to) were new. It will now be sent regardless of whether the server joins (for the first time) or rejoins the cluster. Clients are still by default invoking the DiscoveredServersCB callback only if they themselves detect that new URLs were added. A separate PR may be filed against the client libraries' repos to be able to invoke the callback anytime an async INFO protocol is received. Based on @madgrenadier PR #597.	2017-11-17 12:21:34 -07:00
Ivan Kozlovic	20926a6176	Added megacheck This tool combines staticcheck, gosimple and unused. Fixed reports from unused.	2017-08-11 17:28:18 -06:00
Ivan Kozlovic	2befd973cc	Fixed DATA RACE and ensure route is not created/accepted on shutdown - Created a setter for the closed flag. - Check if route is closed under lock and set a boolean if so, so we don't check c.route outside of c's mutex. - Ensure that we do not create a route on shutdown, which would leave a connection hanging (was seen in some config reload tests).	2017-07-19 10:42:18 -06:00
Ivan Kozlovic	0e2882d741	[FIXED] Handling of duplicate routes When A connects to B and B connects to A (either based on static configuration - explicit routes, or because of auto-discovery - implicit routes), it is possible that each server initially registers the route from the opposite TCP connection. It will then result in each server dropping the connection. We were previously setting a retry flag in the first accepted route based on the name of servers, which means that regardless of duplicate detection, the server with the "smaller" server name would try to reconnect when the route connection was closed. For instance, suppose that server B connects to server A, when B disconnects, A would try to reconnect once to B. This became problematic in the case of configuration reload, because removing the route from B to A would still result in a route created from A to B. Also, when a route attempts a reconnect, a random delay is added to avoid repeated failure cycles that may occur in case where A connects to B and B to A.	2017-07-18 18:25:56 -06:00
Tyler Treat	56ab619498	First pass at implementing cluster reload	2017-06-16 15:53:07 -05:00
Peter Miron	d1f38f38a2	changes to support random ports for clusters and profiler.	2017-06-10 10:35:01 -04:00
Tyler Treat	cc30af8ede	Address code review feedback	2017-06-05 17:43:42 -05:00
Tyler Treat	c468abd15f	Merge branch 'master' of github.com:nats-io/gnatsd into config_reload	2017-06-05 13:41:04 -05:00
Tyler Treat	28160f1de2	Remove global logger gnatsd currently uses a global logger. This can cause some problems (especially around the config-reload work), but global variables are also just an anti-pattern in general. The current behavior is particularly surprising because the global logger is configured through calls to the Server. This addresses issue #500 by removing the global logger and making it a field on Server.	2017-05-31 16:06:31 -05:00
Tyler Treat	9902c3da84	First pass at implementing config reload	2017-05-30 16:18:36 -05:00
miraclesu	26ef3f8a70	Revise queue msg action We think it marks a queue subscription via QRSID prefix.	2017-05-09 16:16:15 +08:00
miraclesu	b570f8de9b	Add test for invalid queue sid	2017-05-09 16:16:15 +08:00
miraclesu	29d1573124	[IMPROVED] Route sid parse performance	2017-05-09 16:16:15 +08:00
Derek Collison	f7ba3d175e	Correct invocation of misspell with fixes	2017-04-21 09:21:33 -07:00
Ivan Kozlovic	a804516540	Fix gosimple report	2017-03-22 22:52:33 -06:00
Ivan Kozlovic	d3555053d0	Change option/parameter name	2016-12-22 14:59:27 -07:00
Ivan Kozlovic	a8dfaeae3d	[ADDED] Ability to configure number of connect retries for implicit routes When a server is told to connect to a server (with auto-discovery), it tries to connect once. There has been a report where that connection fails, but would probably succeed if tried again (#408). This new parameter makes it possible to configure the number of times a failed implicit connect should be tried. Resolves #408	2016-12-20 18:37:23 -07:00
Ivan Kozlovic	5f471b6e7f	Replace GetListenEndpoint() with ReadyForConnections() The RunServer() function (and the various variants) call Server.Start() in a go-routine, but do not return until it has verified that the server is ready to accept connections. To do so, it uses GetListenEndpoint() to get a suitable connect address (replacing "0.0.0.0" or "::" with localhost - important on Windows). It then creates a raw TCP connection to ensure the server is started, repeating the process in case of failure up to 10 seconds. This PR replaces this with a function that checks that the client listener, and the route listener if configured, are set. This removes the need to get a connect address and create test TCP connections. The reason for this change is that NATS Streaming, when starting the NATS Server (unless configured to connect to a remote one), calls RunServerWithAuth(), which when getting "localhost" from GetListenEndpoint(), would fail trying to resolve it. This happened for the NATS Streaming Docker image built with Go 1.7+.	2016-12-09 14:03:45 -07:00
Derek Collison	8fbacaaea1	Cleanup for cluster opts	2016-12-02 14:29:22 -08:00
Ivan Kozlovic	4997637270	[FIXED] assignment copies lock value for crypto/tls.Config Running `go vet ./...` with `go 1.7.3` would report the following: ``` server/route.go:342: assignment copies lock value to tlsConfig: crypto/tls.Config contains sync.Once contains sync.Mutex server/server.go:479: assignment copies lock value to config: crypto/tls.Config contains sync.Once contains sync.Mutex ``` Add a “clone” function while waiting for this to be addressed by the language itself (https://go-review.googlesource.com/#/c/28075/)	2016-10-20 14:59:29 -06:00
Ivan Kozlovic	811e0868ed	[FIXED] Data RACE on Unsubscribe when client connection is closed Resolves #331	2016-08-17 16:46:34 -06:00
Ivan Kozlovic	82dbb3a5ab	[ADDED] Option to not advertise to clients cluster's IPs By default, a server is now sending to its clients the client URLs of all servers in the cluster. This allows clients to be able to reconnect to any server in the cluster even if those clients were not configured with the list of servers in the cluster. However, there may be cases where it would make sense to disable this feature. This now can be done with this option/command line parameter. Resolves #322	2016-08-12 19:24:12 -06:00
Ivan Kozlovic	3b8412049e	[FIXED] Cluster's listener with IPv6 Trying to use IPv6 address for the cluster host would fail. Also, there were some unclosed channels in case of accept loop setup failures. Resolves #323	2016-08-12 15:54:15 -06:00
Ivan Kozlovic	fda5bd7ac7	[ADDED] Server sends INFO with cluster URLs to clients with support Clients that will be at the ClientProtoInfo protocol level (or above) will now receive an asynchronous INFO protocol when the server they connect to adds a new route. This means that when the cluster adds a new server, all clients in the cluster should now be notified of this new addition.	2016-07-26 10:55:55 -06:00
Ivan Kozlovic	188f7bf84c	Fix possible blocking on socket write or connection close (when using TLS) Ensure that all socket writes are protected with deadlines. For connection Close(), also use deadlines since in case of TLS, the Close() will send an alert (do a write) if the handshake was completed. If the peer is not reading, this would cause the Close() to hang.	2016-05-23 19:57:54 -06:00
Antonin Amand	1eb12a1501	fix concurrent map access	2016-05-19 17:14:48 +02:00
Ivan Kozlovic	3a999c1299	Add tracking of most go routines started by the server Refactor the way client is initialized. We need to ensure that clients are not added to the clients map and readLoop started if the server is in the process of being shutdown otherwise there is a chance that the server already gathered the list of connections to close and this one would not be included, leaving a readLoop running. Same occurs for routes, with the complexity that the readLoop is started well before the route connection is added to the server routes' list. We need a temporary map that contains those connections to be able to close them on server Shutdown. Fixed some flapping tests.	2016-04-21 11:48:39 -06:00
Ivan Kozlovic	3aa09ecc01	Ensure Shutdown() waits for outstanding routes go routines We need to make sure that when Shutdown() returns, routes go routines that try to connect or reconnect have returned. Otherwise, this may affect tests running one after the other (a server from one test may connect to a server in the next test).	2016-04-21 11:48:39 -06:00
Derek Collison	b3388db53f	Enable dynamic write buffers for client connections	2016-04-15 18:16:13 -07:00
Derek Collison	4f333416bb	Revert race on interest graph since it could cause dropped interest propagation, fix test instead	2016-04-15 15:46:29 -07:00
Derek Collison	3e2c3714bc	Fix race in interest propagation to new routes	2016-04-15 13:16:13 -07:00
Derek Collison	df02bc0bcf	Removed sublist, hash and hashmap, no longer needed.	2016-04-02 12:52:48 -07:00
Colin Sullivan	2baac47820	Address issues found by golint. * No functional changes * Did not address the ALL_CAPS issues * Did not modify public APIs and field names.	2016-03-15 15:21:13 -06:00
Ivan Kozlovic	6263c66a40	Fixed code and tests to run on Windows Mainly tests, but also a fix in route.go to reject a route when the server is being shutdown.	2016-03-07 18:47:20 -07:00
Ivan Kozlovic	3ea412798a	Optimizations -No need to store ip url string in c.route and resolve remote IP when forwarding the INFO to known servers. -When checking if a route is explicit, use strings.ToLower() once for the url being checked.	2016-02-25 20:00:21 -07:00
Ivan Kozlovic	7c0a3b49a6	Fix cluster formation when servers connect quickly Both seed and chained cases are now handled properly when servers connect quickly and concurrently to one another. When accepting a route, the server will forward the new route INFO protocol to its known routes. In turn those routes will connect to the new server (if not already connected). A retry for implicit route was introduced to mitigate the issue with two servers connecting to each other and electing the opposite connection as the winner, resulting in both connections being dropped. The server with smaller ID will try once to reconnect. Some tests were fixed to handle possible extra INFO protocol. New tests added. Fix issue: https://github.com/nats-io/gnatsd/issues/206	2016-02-24 19:44:25 -07:00
Ivan Kozlovic	ce79f524be	Fix scheme for routes returned When server returns routes through INFO, use "nats-route://" scheme. A test was checking that. Add test to check that hostname is replaced with IP.	2016-02-11 17:20:33 -07:00
Ivan Kozlovic	fc38a8336f	Fix wrong check for error	2016-02-11 13:29:34 -07:00
Ivan Kozlovic	cef87212b9	Fix route url to use remote IP instead of configured url when sending in INFO protocol.	2016-02-11 11:23:41 -07:00
Ivan Kozlovic	112413a466	Fix infinite server attempt to connect route to itself Attempt to address issue #175. Instead of trying to detect if route URL will point to route listen address, detects that the route remoteID is server's ID. If so, closes the connection and stop trying.	2016-02-10 14:19:54 -07:00
Derek Collison	8393c3c994	Basic INFO for cluster auto-discovery, Addresses #126	2015-12-16 12:36:24 -08:00
Ivan Kozlovic	5036bbbf36	Fix TLS issue where server started to receive TLS data on non TLS connection. Without the server fix, tls_test.go would likely report an error. The server would show a parser error with protocol snippet containing "random" bytes, likely encrypted data.	2015-12-07 19:44:12 -07:00


85 Commits