dnbd3.git - Distributed Network Block Device 3 --rewrite for Kernel 3.x

	Commit message	Author	Age	Files	Lines
*	[KERNEL] Adapt to Linux 5.18	Simon Rettberg	2022-06-14	2	-1/+14
\|
*	[FUSE] Adapt to changed macro names	Simon Rettberg	2022-05-20	1	-4/+4
\|
*	[KERNEL] IOCTL_SWITCH: Always boost/fake RTT values	Simon Rettberg	2022-03-24	1	-17/+17
\| \| \| \|	Even if we didn't switch because we already use the requested server.
*	[KERNEL] Fix possible stall when switching server	Simon Rettberg	2022-03-04	1	-1/+9
\| \| \| \| \| \| \|	If we switch to a different server when we only have something in the send list but nothing in the recv list, the send worker would not have gotten invoked. Now we unconditionally trigger the send worker when asked to re-queue any pending requests.
*	[KERNEL] Fix copy&paste error (passing wrong sock)	Simon Rettberg	2022-02-23	1	-2/+2
\|
*	[KERNEL] Refactor to use workqueues and blk-mq only	Simon Rettberg	2022-02-18	10	-1162/+892
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Using workqueues frees us from having to manage the lifecycle of three dedicated threads. Discovery (alt server checks) and sending keepalive packets are now done using work on the power efficient system queue. Sending and receiving happens via dedicated work queues with higher priority. blk-mq has also been around for quite a while in the kernel, so switching to it doesn't hurt backwards compatibility. As the code is now refactored to work more as blk-mq is designed, backwards compatibility has even improved while at the same time freeing us from an arsenal of macros that were required to make the blk-mq port look and feel like the old implementation. For example, the code now compiles on CentOS 7 with kernel 3.10 without requiring special macros to detect the heavily modified RedHat kernel with all its backported features. A few other design limitations have been rectified along the way, e.g. switching to another server now doesn't internally disconnect from the current one first, which theoretically could lead to a non-working setup if the new server isn't reachable and then - because of some transient network error - switching back also fails. As the discover-thread was torn down from the disconnect call, the connection would also not repair itself eventually. We now establish the new connection in parallel to the old one, and only if that succeeds do we replace the old one with it, similar to how the automatic alt-server switch already does it.
*	[KERNEL] Add missing include to fix compile on 4.14.x	Simon Rettberg	2022-02-11	1	-0/+1
\|
*	[KERNEL] Add support for Linux kernel 5.15.x LTS	Manuel Bentele	2021-11-30	2	-4/+40
\|
*	[KERNEL] Fix wurstfingered missing ;	Simon Rettberg	2021-11-08	1	-1/+1
\|
*	[KERNEL] Explicitly pass proper addrlen on connect; improve debug log	Simon Rettberg	2021-11-08	1	-3/+7
\|
*	[KERNEL] Don't log connect failures as errors for RTT checks	Simon Rettberg	2021-10-19	1	-16/+24
\| \| \| \| \| \| \|	This spams scary red errors to dmesg when really an unreachable alt server isn't that much of a deal during normal operation. Change the log level to debug instead. Might even consider not printing anything at all.
*	[BUILD] Add CMake option to enable build of dnbd3-bench	Manuel Bentele	2021-06-24	1	-1/+3
\|
*	[BUILD] Add check for stdatomic.h support	Manuel Bentele	2021-06-16	3	-0/+9
\|
*	[KERNEL] Add support for Linux kernels without blk-mq (e.g. CentOS 7)	Manuel Bentele	2021-06-16	4	-20/+205
\|
*	[SERVER] Add minRequestSize: Enlarge relayed requests	Simon Rettberg	2021-05-11	3	-16/+48
\| \| \| \| \| \| \| \| \| \|	Any request from a client being relayed to an uplink server will have its size extended to this value. It will also be applied to background replication requests, if the BGR mode is FULL. As request coalescing is currently very primitive, this setting should usually be left disabled, and bgrWindowSize used instead, if appropriate. If you enable this, set it to something large (1M+), or it might have adverse effects.
*	[SERVER] Fix UB	Simon Rettberg	2021-05-11	1	-1/+1
\|
*	[SERVER] Honor uplinkTimeout directly when connecting to alt-server	Simon Rettberg	2021-05-10	2	-5/+3
\|
*	[KERNEL] Improve debug output in net.c	Simon Rettberg	2021-04-20	3	-7/+13
\|
*	[KERNEL] Even more RTT fakery on manual server switch	Simon Rettberg	2021-04-20	1	-5/+12
\|
*	[KERNEL] Clean alt-server list first when connecting	Simon Rettberg	2021-04-16	1	-11/+8
\| \| \| \| \| \| \|	When establishing a new connection on a disconnected device, the old list of alt-servers was retained. This would lead to us connecting to the wrong server, as the number of newly passed servers was used when looping over the list of alt-servers to actually connect.
*	[KERNEL] Fix Linter errors	Manuel Bentele	2021-04-16	2	-3/+5
\|
*	[KERNEL] Removes duplicate word 'of' in license headers	Manuel Bentele	2021-04-16	16	-16/+16
\|
*	[SERVER] Mark uplink requests with BGR/prefetch flags and handle them	Simon Rettberg	2021-04-14	1	-5/+20
\| \| \| \| \| \| \| \| \|	Incoming requests from clients might actually be prefetch jobs from another downstream proxy. Don't do prefetching for those, as this would cascade upwards in the proxy chain (prefetch for a prefetch of a prefetch). Incoming requests might also be background replication. Don't relay those if we're not configured for background replication as well.
*	[SERVER] Set TCP_NODELAY on outgoing connections	Simon Rettberg	2021-04-14	1	-1/+4
\| \| \| \| \| \| \|	This will send all (block) requests immediately, sometimes at more overhead but with slightly less delay. Since the outgoing connection on a client is only used very lightly, this tradeoff should always make sense.
*	[SERVER] Make prefetching synchronous	Simon Rettberg	2021-04-14	3	-168/+226
\| \| \| \| \| \| \| \| \|	There is a race condition where we process the next request from the same client faster than the OS will schedule the async prefetch job, rendering it a NOOP in the best case (request ranges match) or fetching redundant data from the upstream server (prefetch range is larger than actual request by client). Make prefetching synchronous to prevent this race condition.
*	[CLIENT] Use SO_GETPEERCRED instead of braindead setuid crap	Simon Rettberg	2021-04-14	1	-56/+69
\| \| \| \| \| \|	If you need daemon mode, run as root with --daemon; normal users can then request devices to be connected using the same binary WITHOUT having the suid bit set on it.
*	[KERNEL] Deduplicate code, clean up, split into functions	Simon Rettberg	2021-04-14	2	-402/+339
\|
*	[KERNEL] Fix CMD name in debug messages	Simon Rettberg	2021-04-14	1	-3/+3
\|
*	[KERNEL] Improve socket connect	Simon Rettberg	2021-03-29	1	-34/+59
\| \| \| \| \| \| \| \|	- Remove the ugly timeout hack that apparently isn't required after all. - Set a few socket options that appear to make sense in our use case (no linger, only one SYN retry, NODELAY). - Adapt socket timeout in panic mode, in case we're on a very bad connection.
*	[KERNEL] Overhaul sysfs files	Simon Rettberg	2021-03-26	2	-40/+18
\| \| \| \| \| \|	Remove superfluous, redundant or otherwise useless information. Use space as separator instead of comma for better readability and easier parsing in shell etc.
*	[KERNEL] Implement best_count logic for load balancing	Simon Rettberg	2021-03-26	4	-22/+48
\| \| \| \| \| \| \|	Similar logic already exists in the fuse client: Count how many times in a row a server was fastest when measuring RTTs, and lower the switching threshold more the higher the count gets.
*	[KERNEL] Use sockaddr instead of dnbd3_host_t where possible	Simon Rettberg	2021-03-24	6	-241/+216
\| \| \| \| \| \| \| \|	Convert dnbd3_host_t to struct sockaddr immediately when adding alt servers, so we don't have to convert it every time we establish a connection. Additionally we can now use %pISpc in printf-like functions instead of having if/else constructs whenever we want to print an address.
*	[KERNEL] Set fake low RTT after manual server switch	Simon Rettberg	2021-03-23	2	-0/+8
\| \| \| \| \|	This avoids automatically switching back right after adding and switching to a server.
*	[KERNEL] Synchronous add/remove of alt-servers via IOCTL	Simon Rettberg	2021-03-23	4	-165/+229
\|
*	[SERVER] Fix compiler warning	Simon Rettberg	2021-03-22	1	-1/+1
\|
*	[KERNEL] Enable assertions if CONFIG_DEBUG_DRIVER is set	Manuel Bentele	2021-03-16	1	-0/+6
\|
*	[KERNEL] Refactor code to satisfy Linux kernel code style	Manuel Bentele	2021-03-12	11	-621/+535
\|
*	[BUILD] Build picohttpparser as independent library	Manuel Bentele	2021-03-11	3	-15/+15
\|
*	[BUILD] Enable lint targets if lint programs are found	Manuel Bentele	2021-03-11	2	-2/+566
\|
*	[BUILD] Disable lint/formatting for non-kernel for now	Simon Rettberg	2021-03-05	1	-0/+12
\|
*	[BUILD] Add support in CMake to validate (lint) the source code	Manuel Bentele	2021-03-04	6	-30/+88
\|
*	[FUSE] Fix build: Add dnbd3-build to dependencies	Simon Rettberg	2020-12-08	1	-1/+1
\|
*	[BUILD] Fix dnbd3-client build, Fix source-only build	Simon Rettberg	2020-12-02	1	-1/+1
\|
*	[BUILD] Include branch and build timestamp in binaries	Simon Rettberg	2020-12-02	5	-7/+13
\|
*	[CLIENT] print help and version number correctly	Manuel Bentele	2020-12-02	1	-8/+5
\| \| \| \| \| \|	This change prevents dnbd3-client from printing the help text twice if the help parameter is given. In addition, correct exit codes are now set when the program terminates after printing the help text.
*	[SERVER] replaced non-existent FUSE define to match CMake's build defines	Manuel Bentele	2020-11-27	1	-3/+3
\|
*	[SERVER] Fix warnings	Simon Rettberg	2020-11-23	4	-7/+9
\|
*	[BUILD] update search paths for 'libatomic' to support build on FreeBSD	Manuel Bentele	2020-11-23	1	-1/+1
\|
*	[BUILD] add CMake find package search to find 'libatomic' automatically	Manuel Bentele	2020-11-23	1	-0/+3
\|
*	[KERNEL] Fix race condition for request_queuereceive in receive thread	Simon Rettberg	2020-11-20	1	-8/+7
\| \| \| \| \| \| \| \| \| \|	Formerly, the request that was about to be received was looked up in the receive queue without removing it, then the request payload was received from the socket while the lock was not being held, and finally, the lock was acquired again and the request removed from the queue. This is dangerous as another thread can concurrently take the request from the queue while the receive thread reads the payload from the socket, leading to a double-free by calling blk_mq_end_request twice.
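
The pattern behind the receive-queue race fix above can be illustrated with a small userspace sketch. This is not the dnbd3 kernel code: the types `pending_req` and `recv_queue`, the helper names, and the `atomic_flag` spinlock standing in for a kernel spinlock are all assumptions made for illustration. The point it tries to show is that lookup and removal happen in the same critical section, so only one caller can ever obtain (and later complete) a given request.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a request waiting for its reply payload. */
struct pending_req {
	uint64_t handle;
	struct pending_req *next;
};

/* Hypothetical receive queue; atomic_flag acts as a simple spinlock. */
struct recv_queue {
	atomic_flag lock;
	struct pending_req *head;
};

static void queue_lock(struct recv_queue *q)
{
	while (atomic_flag_test_and_set_explicit(&q->lock, memory_order_acquire))
		; /* spin until the flag was previously clear */
}

static void queue_unlock(struct recv_queue *q)
{
	atomic_flag_clear_explicit(&q->lock, memory_order_release);
}

static void queue_init(struct recv_queue *q)
{
	atomic_flag_clear(&q->lock);
	q->head = NULL;
}

static void queue_add(struct recv_queue *q, struct pending_req *req)
{
	assert(req != NULL);
	queue_lock(q);
	req->next = q->head;
	q->head = req;
	queue_unlock(q);
}

/*
 * Find the request with the given handle AND unlink it while the lock
 * is held. After this returns, no other thread can find the request in
 * the queue, so the caller may read the payload without the lock and
 * complete the request exactly once.
 */
static struct pending_req *queue_take(struct recv_queue *q, uint64_t handle)
{
	struct pending_req **pp, *req = NULL;

	queue_lock(q);
	for (pp = &q->head; *pp != NULL; pp = &(*pp)->next) {
		if ((*pp)->handle == handle) {
			req = *pp;
			*pp = req->next;
			req->next = NULL;
			break;
		}
	}
	queue_unlock(q);
	return req;
}
```

A second `queue_take` for the same handle returns NULL, which is what rules out the double completion described in the commit message.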