summaryrefslogtreecommitdiffstats
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* [KERNEL] Fix build on 5.15 kernels >= 5.15.132HEADmasterSimon Rettberg2024-03-151-0/+1
|
* [KERNEL] Set fake RTT to 500ms on manual server switchSimon Rettberg2024-01-181-2/+2
| | | | | 5ms was apparently way too low and lead to frequent switch-aways from the local dnbd3-server on heavy load during bootup.
* [SERVER] Log errno if opening image file failsSimon Rettberg2023-06-301-3/+5
|
* [SERVER] Threadpool: Add active thread count to error messageSimon Rettberg2023-06-141-1/+1
|
* [KERNEL] Add Rocky 9 supportSimon Rettberg2023-06-072-8/+20
| | | | | Co-authored-by: Jonathan Bauer <jonathan.bauer@rz.uni-freiburg.de> Co-authored-by: Manuel Messner <mm@skelett.io>
* [FUSE] Fix termination condition for reading host listSimon Rettberg2023-02-141-1/+1
|
* [FUSE] Make initial connect entirely parallel wrt serversSimon Rettberg2023-02-121-111/+232
| | | | | | | | | | | | Previously, we only did the actual socket connect in a concurrent matter. Once a connection was successfully established, we did a blocking handshake on the protocol level. If the server war particularly slow, this was bad as we would not try other servers until after this. Throw out the previous non-blocking async connect logic, switching to a multi-threaded approach, that spawns one thread per host/ip, offset by 200ms, until one of the attempts succeeds.
* [SHARED] sock_printable: Always null-terminateSimon Rettberg2023-02-121-1/+4
|
* [FUSE] Fix AIOOB accessSimon Rettberg2023-02-121-2/+1Star
|
* [KERNEL] Rename logigng macros, change debug level of a few messagesSimon Rettberg2023-02-061-21/+25
|
* [KERNEL] Actually fix build on 6.0+Simon Rettberg2023-01-111-5/+5
|
* [KERNEL] Update for Linux 6.1Simon Rettberg2023-01-092-7/+25
|
* Fix linter issues to satisfy Linux kernel code styleManuel Bentele2022-10-052-8/+11
|
* [SERVER] Don't establish uplink when loading image on startupSimon Rettberg2022-09-131-1/+1
| | | | | | | Bringing up a proxy that has been offline for some time will trigger lots of loads and replication on other proxies when booting up again. Just wait until a client actually needs an image before establishing an uplink connection.
* [KERNEL] Adapt to Linux 5.18Simon Rettberg2022-06-142-1/+14
|
* [FUSE] Adapt to changed macro namesSimon Rettberg2022-05-201-4/+4
|
* [KERNEL] IOCTL_SWITCH: Always boost/fake RTT valuesSimon Rettberg2022-03-241-17/+17
| | | | Even if we didn't switch because we already use the requested server.
* [KERNEL] Fix possible stall when switching serverSimon Rettberg2022-03-041-1/+9
| | | | | | | If we switch to a different server when we only have something in the send list but nothing in the recv list, the send worker would not have gotten invoked. Now we unconditionally trigger the send worker when asked to re-queue any pending requests.
* [KERNEL] Fix copy&paste error (passing wrong sock)Simon Rettberg2022-02-231-2/+2
|
* [KERNEL] Refactor to use workqueues and blk-mq onlySimon Rettberg2022-02-1810-1162/+892Star
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Using workqueues frees us from having to manage the lifecycle of three dedicated threads. Discovery (alt server checks) and sending keepalive packets is now done using work on the power efficient system queue. Sending and receiving happens via dedicated work queues with higher priority. blk-mq has also been around for quite a while in the kernel, so switching to it doesn't hurt backwards compatibility. As the code is now refactored to work more as blk-mq is designed, backwards compatibility even improved while at the same time freeing us from an arsenal of macros that were required to make the blk-mq port look and feel like the old implementation. For example, the code now compiles on CentOS 7 with kernel 3.10 without requiring special macros to detect the heavily modified RedHat kernel with all its backported features. A few other design limitations have been rectified along the way, e.g. switching to another server now doesn't internally disconnect from the current one first, which theoretically could lead to a non-working setup, if the new server isn't reachable and then - because of some transient network error - switching back also fails. As the discover-thread was torn down from the disconnect call, the connection would also not repair itself eventually. we now establish the new connection in parallel to the old one, and only if that succeeds do we replace the old one with it, similar to how the automatic alt-server switch already does it.
* [KERNEL] Add missing include to fix compile on 4.14.xSimon Rettberg2022-02-111-0/+1
|
* [KERNEL] Add support for Linux kernel 5.15.x LTSManuel Bentele2021-11-302-4/+40
|
* [KERNEL] Fix wurstfingered missing ;Simon Rettberg2021-11-081-1/+1
|
* [KERNEL] Explicitly pass proper addrlen on connect; improve debug logSimon Rettberg2021-11-081-3/+7
|
* [KERNEL] Don't log connect failures as errors for RTT checksSimon Rettberg2021-10-191-16/+24
| | | | | | | This spams scary red errors to dmesg when really an unreachable alt server isn't that much of a deal during normal operation. Change the log level to debug instead. Might even consider not printing anything at all.
* [BUILD] Add CMake option to enable build of dnbd3-benchManuel Bentele2021-06-241-1/+3
|
* [BUILD] Add check for stdatomic.h supportManuel Bentele2021-06-163-0/+9
|
* [KERNEL] Add support for Linux kernels without blk-mq (e.g. CentOS 7)Manuel Bentele2021-06-164-20/+205
|
* [SERVER] Add minRequestSize: Enlarge relayed requestsSimon Rettberg2021-05-113-16/+48
| | | | | | | | | | Any request from a client being relayed to an uplink server will have its size extended to this value. It will also be applied to background replication requests, if the BGR mode is FULL. As request coalescing is currently very primitive, this setting should usually be left diabled, and bgrWindowSize used instead, if appropriate. If you enable this, set it to something large (1M+), or it might have adverse effects.
* [SERVER] Fix UBSimon Rettberg2021-05-111-1/+1
|
* [SERVER] Honor uplinkTimeout directly when connecting to alt-serverSimon Rettberg2021-05-102-5/+3Star
|
* [KERNEL] Improve debug output in net.cSimon Rettberg2021-04-203-7/+13
|
* [KERNEL] Even more RTT fakery on manual server switchSimon Rettberg2021-04-201-5/+12
|
* [KERNEL] Clean alt-server list first when connectingSimon Rettberg2021-04-161-11/+8Star
| | | | | | | When establishing a new connection on a disconnected device, the old list of alt-servers was retained. This would lead to us connecting to the wrong server, as the number of newly passed servers was used when looping over the list of alt-servers to actually connect.
* [KERNEL] Fix Linter errorsManuel Bentele2021-04-162-3/+5
|
* [KERNEL] Removes duplicate word 'of' in license headersManuel Bentele2021-04-1616-16/+16
|
* [SERVER] Mark uplink requests with BGR/prefetch flags and handle themSimon Rettberg2021-04-141-5/+20
| | | | | | | | | Incoming requests from clients might actually be prefetch jobs from another downstream proxy. Don't do prefetching for those, as this would cascade upwars in the proxy chain (prefetch for a prefetch of a prefetch) Incoming requests might also be background replication. Don't relay those if we're not configured for background replication as well.
* [SERVER] Set TCP_NODELAY on outgoing connectionsSimon Rettberg2021-04-141-1/+4
| | | | | | | This will send all (block) requests immediately at sometimes more overhead, but slighly less delays. Since the outgoing connection on a client is only used very lightly, this tradeoff should always make sense.
* [SERVER] Make prefetching synchronousSimon Rettberg2021-04-143-168/+226
| | | | | | | | | There is a race condition where we process the next request from the same client faster than the OS will schedule the async prefetch job, rendering it a NOOP in the best case (request ranges match) or fetching redundant data from the upstream server (prefetch range is larger than actual request by client). Make prefetching synchronous to prevent this race condition.
* [CLIENT] Use SO_GETPEERCRED instead of braindead setuid crapSimon Rettberg2021-04-141-56/+69
| | | | | | If you need daemon mode, run as root with --daemon, normal users can then request devices to be connected using the same binary WITHOUT havind the suid bit set on it.
* [KERNEL] Deduplicate code, clean up, split into functionsSimon Rettberg2021-04-142-402/+339Star
|
* [KERNEL] Fix CMD name in debug messagesSimon Rettberg2021-04-141-3/+3
|
* [KERNEL] Improve socket connectSimon Rettberg2021-03-291-34/+59
| | | | | | | | - Remove the ugly timeout hack that apparently isn't required after all. - Set a few socket options that appear to make sense in out use case (no linger, only one SYN retry, NODELAY). - Adapt socket timeout in panic mode, in case we're on a very bad connection.
* [KERNEL] Overhaul sysfs filesSimon Rettberg2021-03-262-40/+18Star
| | | | | | Remove superflous, reduntant or otherwise useless information. Use space as separator instead of comma for better readability and easier parsing in shell etc.
* [KERNEL] Implement best_count logic for load balancingSimon Rettberg2021-03-264-22/+48
| | | | | | | Similar logic already exists in the fuse client: Count how many times in a row a server was fastest when measuring RTTs, and lower the switching threshold more the higher the count gets.
* [KERNEL] Use sockaddr instead of dnbd3_host_t where possibleSimon Rettberg2021-03-246-241/+216Star
| | | | | | | | Convert dnbd3_host_t to struct sockaddr immediately when adding alt servers, so we don't have to convert it every time we establish a connection. Additionally we can now use %pISpc in printf-like functions instead of having if/else constructs whenever we want to print an address.
* [KERNEL] Set fake low RTT after manual server switchSimon Rettberg2021-03-232-0/+8
| | | | | This avoids automatically switching back right after adding and switching to a server.
* [KERNEL] Synchronous add/remove of alt-servers via IOCTLSimon Rettberg2021-03-234-165/+229
|
* [SERVER] Fix compiler warningSimon Rettberg2021-03-221-1/+1
|
* [KERNEL] Enable assertions if CONFIG_DEBUG_DRIVER is setManuel Bentele2021-03-161-0/+6
|