git-annex

Author	SHA1	Message	Date
Joey Hess	61b48b69ba	fix build on windows	2021-12-09 13:39:16 -04:00
Joey Hess	ef3ab0769e	close pid lock only once no threads use it This fixes a FD leak when annex.pidlock is set and -J is used. Also, it fixes bugs where the pid lock file got deleted because one thread was done with it, while another thread was still holding it open. The LockPool now has two distinct types of resources, one is per-LockHandle and is used for file Handles, which get closed when the associated LockHandle is closed. The other one is per lock file, and gets closed when no more LockHandles use that lock file, including other shared locks of the same file. That latter kind is used for the pid lock file, so it's opened by the first thread to use a lock, and closed when the last thread closes a lock. In practice, this means that eg git-annex get of several files opens and closes the pidlock file a few times per file. While with -J5 it will open the pidlock file, process a number of files, until all the threads happen to finish together, at which point the pidlock file gets closed, and then that repeats. So in either case, another process still gets a chance to take the pidlock. registerPostRelease has a rather intricate dance, there are fine-grained STM locks, a STM lock of the pidfile itself, and the actual pidlock file on disk that are all resolved in stages by it. Sponsored-by: Dartmouth College's Datalad project	2021-12-06 15:01:39 -04:00
Joey Hess	774c7dab2f	Merge branch 'master' into pidlockfinegrained	2021-12-06 13:00:40 -04:00
Joey Hess	ae4c56b28a	Revert "fix too early close of shared lock file" This reverts commit `66b2536ea0`. I misunderstood commit `ac56a5c2a0` and caused a FD leak when pid locking is not used. A LockHandle contains an action that will close the underlying lock file, and that action is run when it is closed. In the case of a shared lock, the lock file is opened once for each LockHandle, and only the one for the LockHandle that is being closed will be closed.	2021-12-06 12:51:28 -04:00
Joey Hess	e5ca67ea1c	fine-grained locking when annex.pidlock is enabled This locking has been missing from the beginning of annex.pidlock. It used to be possble, when two threads are doing conflicting things, for both to run at the same time despite using locking. Seems likely that nothing actually had a problem, but it was possible, and this eliminates that possible source of failure. Sponsored-by: Dartmouth College's Datalad project	2021-12-03 17:20:21 -04:00
Joey Hess	6988c2e740	fix build on windows broken by `ed0afbc36b` Sponsored-by: Dartmouth College's Datalad project	2021-12-03 14:08:12 -04:00
Joey Hess	ed0afbc36b	avoid concurrent threads trying to take pid lock at same time Seem there are several races that happen when 2 threads run PidLock.tryLock at the same time. One involves checkSaneLock of the side lock file, which may be deleted by another process that is dropping the lock, causing checkSaneLock to fail. And even with the deletion disabled, it can still fail, Probably due to linkToLock failing when a second thread overwrites the lock file. The same can happen when 2 processes do, but then one process just fails to take the lock, which is fine. But with 2 threads, some actions where failing even though the process as a whole had the pid lock held. Utility.LockPool.PidLock already maintains a STM lock, and since it uses LockShared, 2 threads can hold the pidlock at the same time, and when the first thread drops the lock, it will remain held by the second thread, and so the pid lock file should not get deleted until the last thread to hold it drops the lock. Which is the right behavior, and why a LockShared STM lock is used in the first place. The problem is that each time it takes the STM lock, it then also calls PidLock.tryLock. So that was getting called repeatedly and concurrently. Fixed by noticing when the shared lock is already held, and stop calling PidLock.tryLock again, just use the pid lock that already exists then. Also, LockFile.PidLock.tryLock was deleting the pid lock when it failed to take the lock, which was entirely wrong. It should only drop the side lock. Sponsored-by: Dartmouth College's Datalad project	2021-12-01 17:14:39 -04:00
Joey Hess	66b2536ea0	fix too early close of shared lock file This fixes a reversion introduced in commit `ac56a5c2a0`. I didn't notice there that it was handling the case of a shared lock file that was still open elsewhere by not running the close action. This was especially deadly when annex.pidlock is set, as it caused early deletion of the pid lock file. Sponsored-by: Dartmouth College's Datalad project	2021-12-01 17:06:28 -04:00
Joey Hess	e505c03bcc	more RawFilePath conversion nukeFile replaced with removeWhenExistsWith removeLink, which allows using RawFilePath. Utility.Directory cannot use RawFilePath since setup does not depend on posix. This commit was sponsored by Graham Spencer on Patreon.	2020-10-29 10:50:29 -04:00
Joey Hess	b68f214312	Display a message when git-annex has to wait for a pid lock file held by another process	2020-08-26 13:05:34 -04:00
Joey Hess	ac56a5c2a0	Fix a lock file descriptor leak that could occur when running commands like git-annex add with -J Bug was introduced as part of a different FD leak fix in version 6.20160318.	2020-07-21 15:30:47 -04:00
Joey Hess	9127fe4821	add DebugLocks build flag Using the method described in https://www.fpcomplete.com/blog/2018/05/pinpointing-deadlocks-in-haskell but my own code to implement it, and with callstacks added. This work is supported by the NIH-funded NICEMAN (ReproNim TR&D3) project.	2018-11-19 15:02:43 -04:00
Joey Hess	7db37ddde0	Fix transfer log file locking problem when running concurrent transfers. orElse is great, but was not the right thing to use here because waitTakeLock could retry for other reasons than the lock being held, which made tryTakeLock fail when it shouldn't. Instead, move the code to tryTakeLock and implement waitTakeLock using tryTakeLock and retry. (Also, in runTransfer, when checkSaneLock fails, dropLock to avoid leaking a lock handle.) This commit was supported by the NSF-funded DataLad project.	2017-05-25 17:40:23 -04:00
Joey Hess	35aad6ee7f	fix build with old ghc	2016-03-05 15:18:52 -04:00
Joey Hess	ca18baecdb	fix windows build more	2016-03-05 12:32:06 -04:00
Joey Hess	6237bffae5	fix windows build	2016-03-03 13:08:47 -04:00
Joey Hess	c1e439f8cc	fix windows build	2016-03-03 12:11:12 -04:00
Joey Hess	3334130368	Fix shared lock file FD leak. This fixes behavior in this situation: l1 <- lockShared Nothing "lck" l2 <- lockShared Nothing "lck" dropLock l1 dropLock l2 Before, the lock was dropped upon the second dropLock call, but the fd remained open, and would never be closed while the program was running. Fixed by a rather round-about method, but it should work well enough. It would have been simpler to open open the shared lock once, and not open it again in the second call to lockShared. But, that's difficult to do atomically. This also affects Windows and PID locks, not just posix locks. In the case of pid locks, multiple calls to waitLock within the same process are allowed because the side lock is locked using a posix lock, and so multiple exclusive locks can be taken in the same process. So, this change fixes a similar problem with pid locks. l1 <- waitLock (Seconds 1) "lck" l2 <- waitLock (Seconds 1) "lck" dropLock l1 dropLock l2 Here the l2 side lock fd remained open but not locked, although the pid lock file was removed. After this change, the second dropLock will close both fds to the side lock, and delete the pidlock.	2016-03-01 15:31:39 -04:00
Joey Hess	26c499492f	comment	2016-03-01 13:47:49 -04:00
Joey Hess	ad888a6b76	Fix bug preventing moving files to/from a repository with annex.pidlock set.	2016-03-01 12:51:54 -04:00
Joey Hess	da5d25a844	clean build warning on windows	2015-12-28 13:06:36 -04:00
Joey Hess	77b490bfba	add timeout for pid lock waiting	2015-11-12 17:12:54 -04:00
Joey Hess	7bd9e33b84	refactor	2015-11-12 16:35:15 -04:00
Joey Hess	0f25a7365a	module for PidLocks in LockPool	2015-11-12 16:31:34 -04:00
Joey Hess	e7552e4cee	make LockPool's LockHandle be able to support multiple different types of file locks	2015-11-12 16:28:11 -04:00
Joey Hess	710d1eeeac	module for pid lock files with atomic stale lock file takeover when possible	2015-11-12 15:39:49 -04:00
Joey Hess	4d50958ed7	add lockContentShared Also, rename lockContent to lockContentExclusive inAnnexSafe should perhaps be eliminated, and instead use `lockContentShared inAnnex`. However, I'm waiting on that, as there are only 2 call sites for inAnnexSafe and it's fiddly.	2015-10-08 14:29:35 -04:00
Joey Hess	c8fad345f2	add tryLockShared	2015-10-08 13:40:23 -04:00
Joey Hess	88e4fe6093	remove unused imports	2015-08-03 15:58:12 -04:00
Joey Hess	6ca08f02a4	remove unused imports	2015-08-03 15:49:35 -04:00
Joey Hess	ade6ed2d71	AMP hack	2015-05-31 16:54:07 -04:00
Joey Hess	6c3cea7699	need more polymorphism	2015-05-22 13:50:37 -04:00
Joey Hess	696ed9b3f9	fix windows build	2015-05-22 13:44:24 -04:00
Joey Hess	9de5cd2966	fix crash in stale transfer lockfile cleanup code Need to differentiate between the lockfile not being locked, and it not existing.	2015-05-19 23:35:24 -04:00
Joey Hess	846e7db245	unused imports	2015-05-19 15:05:30 -04:00
Joey Hess	ecb0d5c087	use lock pools throughout git-annex The one exception is in Utility.Daemon. As long as a process only daemonizes once, which seems reasonable, and as long as it avoids calling checkDaemon once it's already running as a daemon, the fcntl locking gotchas won't be a problem there. Annex.LockFile has it's own separate lock pool layer, which has been renamed to LockCache. This is a persistent cache of locks that persist until closed. This is not quite done; lockContent stil needs to be converted.	2015-05-19 14:09:52 -04:00
Joey Hess	6915b71c57	lock pools to work around non-concurrency/composition safety of POSIX fcntl	2015-05-18 15:57:17 -04:00

37 commits