dnbd3.git - Distributed Network Block Device 3 --rewrite for Kernel 3.x

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SERVER] Fix rid 0 proxy lookup if local version is newer	Simon Rettberg	2020-08-14	1	-5/+15
\| \| \| \| \|	There was a logic bug that would favor a remotely looked up image rid, even if we already found a higher revision locally.
*	[SERVER] Add FUSE mode	Simon Rettberg	2020-07-28	1	-1/+1
\| \| \| \| \|	Still needs some cleanup and optimizations, variable naming sucks, comments, etc.
*	[SERVER] Fix: NULL pointer access in saveLoadAllCacheMaps()	Simon Rettberg	2020-07-21	1	-0/+2
\| \| \| \|	Entries in _images array might ne NULL
*	[SERVER] Fix: No replication if autoFreeDiskSpaceDelay is disabled	Simon Rettberg	2020-06-30	1	-4/+12
\|
*	[SERVER] Check local and remote for updates on rid == 0	Simon Rettberg	2020-03-31	1	-4/+8
\|
*	[SERVER] image_ensureDiskSpace should only deletes proxied images	Simon Rettberg	2020-03-20	1	-18/+19
\|
*	[SERVER] Remember atime in .meta file	Simon Rettberg	2020-03-20	1	-62/+136
\|
*	[SERVER] Forbid hidden files when scanning image dir	Simon Rettberg	2020-03-20	1	-1/+2
\|
*	[SERVER] Fix warnings, add assertions	Simon Rettberg	2020-03-20	1	-2/+5
\|
*	[SERVER] Add name param to threadpool_run	Simon Rettberg	2020-03-19	1	-0/+2
\|
*	[SERVER] Rewrite uplink queue handling	Simon Rettberg	2020-03-13	1	-2/+1
\| \| \| \| \| \|	- Now uses linked lists instead of huge array - Does prefetch data on client requests - Can have multiple replication requests in-flight
*	[SERVER] Fix data type	Simon Rettberg	2020-03-09	1	-2/+2
\|
*	[SERVER] Add printf macro for image (name:rid as %s:%d)	Simon Rettberg	2020-03-06	1	-18/+14
\|
*	[SERVER] Handle "warn unused result" cases	Simon Rettberg	2020-03-06	1	-2/+6
\|
*	[SERVER] Reload cache maps periodically for local images	Simon Rettberg	2020-03-06	1	-46/+83
\| \| \| \| \| \|	If an image is incomplete, but has no upstream server that can be used for replication, reload the cache map from disk periodically, in case some other server instance is writing to the image.
*	[SERVER] Add timer task for saving cache maps	Simon Rettberg	2020-03-04	1	-1/+135
\| \| \| \| \| \| \| \| \|	Cache maps will now be saved periodically, but only if either they have a "dirty" bit set, which happens if any bits in the map get cleared again (due to corruption), or if new data has been replicated from an uplink server. This either means at least one byte received and 5 minutes have passed, or at least 500MB have been downloaded. The timer currently runs every 20 seconds.
*	[SERVER] Likewise, get rid of same loops in client handler	Simon Rettberg	2020-03-04	1	-16/+16
\|
*	[SERVER] Get rid of two loops in image_updateCacheMap	Simon Rettberg	2020-03-03	1	-22/+18
\|
*	[SERVER] Expose image->problem bools as bitmask in RPC json data	Simon Rettberg	2020-03-03	1	-2/+11
\|
*	[SERVER] Remove "working" flag, introduce fine-grained flags	Simon Rettberg	2020-03-03	1	-93/+100
\| \| \| \| \| \| \| \|	Tracking the "working" state of images using one boolean is insufficient regarding the different ways in which providing an image can fail. Introduce separate flags for different conditions, like "file not readable", "file not writable", "no uplink server available", "file content has changed".
*	[SERVER] Introduce ignoreAllocErrors	Simon Rettberg	2020-02-24	1	-2/+7
\| \| \| \| \|	If enabled, a failed fallocate will not abort image replication, but retry with sparse mode.
*	[SERVER] Lookup image on storage even in proxy mode	Simon Rettberg	2020-01-28	1	-8/+11
\| \| \| \| \| \| \|	In proxy mode, when rid 0 is requested, we now first query our uplink servers for the latest revision and if this fails, like in non-proxy mode, we'll see what the latest version on disk is.
*	[SERVER] Fix checking images without cache map	Simon Rettberg	2019-10-29	1	-7/+11
\|
*	[SERVER] Make buffer when reading for crc check larger	Simon Rettberg	2019-09-11	1	-1/+1
\|
*	[SERVER] Make integrity checks on startup async	Simon Rettberg	2019-09-10	1	-25/+24
\|
*	[SERVER] rpc: Add cachemap feature	Simon Rettberg	2019-09-06	1	-0/+16
\|
*	[SERVER] Introduce autoFreeDiskSpaceDelay	Simon Rettberg	2019-09-05	1	-6/+8
\| \| \| \| \| \| \| \|	This setting allows you to control the formerly hard-coded timeout of 10 hours before a proxy would start deleting old images in order to free up space for new images. Setting it to -1 entirely disables automatic deletion, in case you have an external process for freeing up disk space.
*	[SERVER] Support limiting alt-servers to specific namespace	Simon Rettberg	2019-09-04	1	-1/+1
\| \| \| \| \| \|	Not really namespace but simple string matching for the image path. Path is matched from start with no support for glob or regex, so usually you want to have a trailing '/' to limit to certain directories.
*	[SERVER] Fix indentation	Simon Rettberg	2019-09-03	1	-4/+4
\|
*	[SERVER] Fix image_updateCachemap()	Simon Rettberg	2019-09-03	1	-4/+8
\|
*	[SERVER] No uplink_init when checking working state; improve logging	Simon Rettberg	2019-08-30	1	-8/+10
\|
*	[SERVER] Use weakref for cache maps	Simon Rettberg	2019-08-29	1	-76/+132
\| \| \| \| \| \|	Gets rid of a bunch of locking, especially the hot path in net.c where clients are requesting data. Many clients unsing the same incomplete image previously created a bottleneck here.
*	[SERVER] Reintroduce check whether readFd is actually != -1	Simon Rettberg	2019-08-28	1	-1/+3
\|
*	[SERVER] Make signal handling more POSIX	Simon Rettberg	2019-08-28	1	-8/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	According to POSIX, a signal sent to a PID can be delivered to an arbitrary thread of that process that hasn't the signal blocked. This seens to never happen on Linux, but would mess things up since the code expected the main signal handler to only be executed by the main thread. This should now be fixed by examining the destination PID of the signal as well as the ID of the thread currently running the signal handler. If we notice the signal wasn't sent by our own PID and the handler is not currently run by the main thread, we re-send the signal to the main thread. Otherwise, if the signal was sent by our own PID but the handler is not run in the main thread, do nothing. This way we can use pthread_kill() to wake up threads that might be stuck in a blocking syscall when it's time to shut down.
*	[SERVER] Remove old comments	Simon Rettberg	2019-08-28	1	-30/+0
\|
*	[SERVER] Handle closeUnusedFd via timer	Simon Rettberg	2019-08-28	1	-17/+19
\|
*	[SERVER] Use reference counting for uplink	Simon Rettberg	2019-08-27	1	-22/+17
\| \| \| \|	First step towards less locking for proxy mode
*	[SERVER] Get rid of alt-servers thread, per-uplink rtt history	Simon Rettberg	2019-08-22	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	Alt-Server checks are now run using the threadpool, so we don't need a queue and dedicated thread anymore. The rtt history is now kept per uplink, so many uplinks won't overwhelm the history, making its time window very short. Also the fail counter is now split up; a global one for when the server actually isn't reachable, a local (per-uplink) one for when the server is reachable but doesn't serve the requested image.
*	[SERVER] Add struct representing active connection to uplink server	Simon Rettberg	2019-08-18	1	-1/+1
\|
*	[SERVER] Better lock debugging: Always check lock order	Simon Rettberg	2019-08-07	1	-5/+5
\| \| \| \| \| \|	Lock order is predefined in locks.h. Immediately bail out if a lock with lower priority is obtained while the same thread already holds one with higher priority.
*	[SERVER] Make image->users atomic and get rid of some locking	Simon Rettberg	2019-08-02	1	-52/+39
\| \| \| \| \| \| \| \|	With this change it should be safe to read the users count of an image without locking first, assuming you already have a reference on the image or are otherwise sure it cannot be freed, i.e. in an active uplink. Updating users, or checking whether it's 0 in order to free the image should only be done while holding the imageListLock.
*	[SERVER] Turn all spinlocks into mutexes	Simon Rettberg	2019-07-26	1	-97/+99
\| \| \| \| \| \| \| \|	Just assume sane platforms offer smart mutexes that have a fast-path with spinlocks internally for locks that have little to no congestion. In all other cases, mutexes should perform better anyways.
*	[SERVER] Export image idle time in json rpc	Simon Rettberg	2019-01-31	1	-3/+6
\| \| \| \|	Counter in seconds for how long this image hasn't been used.
*	[SERVER] Use O_DIRECT for integrity checks	Simon Rettberg	2018-07-04	1	-4/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The idea is that for full image checks, we don't want to pollute the fs cache with gigabytes of data that won't be needed again soon. This would certainly hurt performance on servers that dont have hundreds of GBs of RAM. For single block checks during replication this has the advantage that we don't check the block in memory before it hit the disk once, but actually flush the data to disk, then remove it from the page cache, and only then read it again, from disk. TODO: Might be worth making this a config option
*	[SERVER] Refactor uplink/cache handling, improve crc checking	Simon Rettberg	2018-07-04	1	-216/+73
\| \| \| \| \| \| \| \| \| \| \| \| \|	The cacheFd is now moved to the uplink data structure and will only be handled by the uplink thread. The integrity checker now supports checking all blocks of an image. This will be triggered automatically whenever a check for a single block failed. Also, if a crc check on startup fails, the image won't be discarded anymore, but rather a full check will be initiated. Furthermore, when calling image_updateCacheMap() on an image that was previously complete, the cache map will now be re-initialized, and a new uplink connection created.
*	[SERVER] Try to re-open cacheFd if writing fails	Simon Rettberg	2018-06-25	1	-1/+44
\| \| \| \| \| \| \|	In scenarios where the proxy is using an NFS server as storage (for whatever crazy reason) or when the cacheFd goes bad through e.g. a switchroot, try to re-open it instead of just disabling caching forever.
*	[SERVER] Make sure image has read fd before reading	Simon Rettberg	2018-06-13	1	-29/+52
\|
*	[SERVER] Don't spam log in vmdkLegacyMode for unknown images	Simon Rettberg	2018-05-02	1	-3/+7
\|
*	[SERVER] Fix deadlock on shutdown (via image_tryFreeAll)	Simon Rettberg	2018-04-24	1	-4/+8
\| \| \| \| \|	imageListLock was locked on twice in the call stack, which is bad if you're using non-recursive locks.
*	[SERVER] Mark spammy replication messages as DEBUG2 instead of 1	Simon Rettberg	2018-04-11	1	-3/+3
\|