git-annex

Author	SHA1	Message	Date
Joey Hess	2aa3e73de1	remove now unused toCygPath	2016-01-13 12:36:53 -04:00
Joey Hess	d90895ac94	Merge remote-tracking branch 'pkitslaar/master'	2016-01-13 12:36:30 -04:00
Joey Hess	1f6f9a8d34	When annex.http-headers is used to set the User-Agent header, avoid sending User-Agent: git-annex	2016-01-11 12:10:38 -04:00
Pieter Kitslaar	6cd134ade1	Added new toMSYS2Path function for use with rsync on Windows.	2016-01-11 11:18:58 +01:00
Joey Hess	aa4f353e5d	clarify absPathFrom The repo path is typically relative, not absolute, so providing it to absPathFrom doesn't yield an absolute path. This is not a bug, just unclear documentation. Indeed, there seems to be no reason to simplifyPath here, which absPathFrom does, so instead just combine the repo path and the TopFilePath. Also, removed an export of the TopFilePath constructor; asTopFilePath is provided to construct one as-is.	2016-01-05 17:33:48 -04:00
Joey Hess	b6b34f4916	automatic conflict resolution for v6 unlocked files Several tricky parts: * When the conflict is just between the same key being locked and unlocked, the unlocked version wins, and the file is not renamed in this case. * Need to update associated file map when conflict resolution renames an unlocked file. * git merge runs the smudge filter on the conflicting file, and actually overwrites the file with the same content it had before, and so invalidates its inode cache. This makes it difficult to know when it's safe to remove such files as conflict cruft, without going so far as to compare their entire contents. Dealt with this by preventing the smudge filter from populating the file when a merge is run. However, that also prevents the smudge filter being run for non-conflicting files, so eg moving a file won't put its new content into place. * Ideally, if a merge or a merge conflict resolution renames an unlocked file, the file in the work tree can just be moved, rather than copying the content to a new worktree file. This is attempted to be done in merge conflict resolution, but due to git merge's behavior of running smudge filters, what actually seems to happen is the old worktree file with the content is deleted and rewritten as a pointer file, so doesn't get reused. So, this is probably not as efficient as it optimally could be. If that becomes a problem, could look into running the merge in a separate worktree and updating the real worktree more efficiently, similarly to the direct mode merge. However, the direct mode merge had a lot of bugs, and I'd rather not use that more error-prone method unless really needed.	2015-12-29 15:41:09 -04:00
Joey Hess	da5d25a844	clean build warning on windows	2015-12-28 13:06:36 -04:00
Joey Hess	4224fae71f	optimise read and write for Keys database (untested) Writes are optimised by queueing up multiple writes when possible. The queue is flushed after the Annex monad action finishes. That makes it happen on program termination, and also whenever a nested Annex monad action finishes. Reads are optimised by checking once (per AnnexState) if the database exists. If the database doesn't exist yet, all reads return mempty. Reads also cause queued writes to be flushed, so reads will always be consistent with writes (as long as they're made inside the same Annex monad). A future optimisation path would be to determine when that's not necessary, which is probably most of the time, and avoid flushing unnecessarily. Design notes for this commit: - separate reads from writes - reuse a handle which is left open until program exit or until the MVar goes out of scope (and autoclosed then) - writes are queued - queue is flushed periodically - immediate queue flush before any read - auto-flush queue when database handle is garbage collected - flush queue on exit from Annex monad (Note that this may happen repeatedly for a single database connection; or a connection may be reused for multiple Annex monad actions, possibly even concurrent ones.) - if database does not exist (or is empty) the handle is not opened by reads; reads instead return empty results - writes open the handle if it was not open previously	2015-12-23 19:18:52 -04:00
Joey Hess	d82b110da8	Merge branch 'master' into smudge	2015-12-21 17:12:46 -04:00
Joey Hess	b6ac443b60	fix build warnings under ghc 7.10 Caused by AMP.. Since I've finally upgraded my dev laptop to 7.10, I may start missing imports that are not needed with it but are with older versions..	2015-12-19 17:42:45 -04:00
Joey Hess	029111b89a	Merge branch 'master' into smudge	2015-12-16 13:07:46 -04:00
Joey Hess	25bc6ea6d8	bring back some deleted functions that git-repair uses	2015-12-15 20:42:35 -04:00
Joey Hess	96dd0f4ebe	improve temp dir security http://bugs.debian.org/807341 * Fix insecure temporary permissions when git-annex repair is used in a corrupted git repository. Other calls to withTmpDir didn't leak any potentially private data, but repair clones the git repository to a temp directory which is made using the user's umask. Thus, it might expose a git repo that is otherwise locked down. * Fix potential denial of service attack when creating temp dirs. Since withTmpDir used easily predictable temporary directory names, an attacker could create foo.0, foo.1, etc and as long as it managed to keep ahead of it, could prevent it from ever returning. I'd rate this as a low utility DOS attack. Most attackers in a position to do this could just fill up the disk /tmp is on to prevent anything from writing temp files. And few parts of git-annex use withTmpDir anyway, so DOS potential is quite low. Examined all callers of withTmpDir and satisfied myself that switching to mkdtmp and so getting a mode 700 temp dir wouldn't break any of them. Note that withTmpDirIn continues to not force temp dir to 700. But it's only used for temp directories inside .git/annex/wherever/ so that is not a problem. Also re-audited all other uses of temp files and dirs in git-annex.	2015-12-15 20:21:48 -04:00
Joey Hess	ce73a96e4e	use InodeCache when dropping a key to see if a pointer file can be safely reset The Keys database can hold multiple inode caches for a given key. One for the annex object, and one for each pointer file, which may not be hard linked to it. Inode caches for a key are recorded when its content is added to the annex, but only if it has known pointer files. This is to avoid the overhead of maintaining the database when not needed. When the smudge filter outputs a file's content, the inode cache is not updated, because git's smudge interface doesn't let us write the file. So, dropping will fall back to doing an expensive verification then. Ideally, git's interface would be improved, and then the inode cache could be updated then too.	2015-12-09 17:54:54 -04:00
Joey Hess	969d54f914	cleanup	2015-12-06 16:36:35 -04:00
Joey Hess	4591569607	avoid looping trying to make temp dir when the name is too long Only loop when directory creation fails due to the directory existing already.	2015-12-06 16:29:36 -04:00
Joey Hess	a0fcb8ec93	generalize catchHardwareFault to catchIOErrorType	2015-12-06 16:26:38 -04:00
Joey Hess	394b66be13	import Data.Time.Format to ensure its Read instance for LocalTime is available Seems that Utility.SafeCommand loaded something that indirectly got that instance loaded on unix, but not on Windows recently.	2015-11-21 13:36:30 -04:00
Joey Hess	04e150abb3	use intercalate instead of MissingH's join The two functions are identical.	2015-11-17 17:27:24 -04:00
Joey Hess	689bdae03a	reorg quickcheck to a separate module	2015-11-17 15:49:22 -04:00
Joey Hess	1244eb3770	refactor	2015-11-16 20:27:01 -04:00
Joey Hess	7943442dff	Display progress meter in -J mode when copying from a local git repo, to a local git repo, and from a remote git repo. Had everything available, just didn't combine the progress meter with the other places progress is sent to update it. (And to a remote repo already did show progress.) Most special remotes should already display progress meters with -J, same as without it. One exception to this is the web, since it relies on wget/curl progress display without -J. Still todo..	2015-11-16 19:32:30 -04:00
Joey Hess	c670a0642c	fix warning	2015-11-16 15:37:27 -04:00
Joey Hess	e2b4861bff	store abspath to the lock file Avoids problems if the program chdirs	2015-11-16 15:25:04 -04:00
Joey Hess	b0626230b7	fix use of hifalutin terminology	2015-11-16 14:37:31 -04:00
Joey Hess	be86081ff4	avoid crashing in checkDaemon when fcntl locking is not supported Instead, just assume the daemon isn't running. Since the pid file locking fails on such a filesystem, we know it's not running.	2015-11-16 14:34:30 -04:00
Joey Hess	2e44da5c46	clean up side lock files when we're done with them There's a potential race, but it's detected and just results in the other process failing to take the side lock, so possibly retrying one second later on. The race window is quite narrow so the extra delay is minor. Left the side lock files mode 666 because an interruption can leave a side lock file created by another user for a shared repository. When this happens, the non-owning user can't delete it (+t) but can still lock it, and so the code falls back to acting as it did before this commit.	2015-11-16 11:36:11 -04:00
Joey Hess	8efd3d71c8	starting to get a handle on how to detect that mad gleam in lustre's eye	2015-11-13 16:18:44 -04:00
Joey Hess	70bfe218f5	one more try to get sane behavior out of lustre	2015-11-13 15:51:45 -04:00
Joey Hess	389c6c7d37	fixed a fd double-close	2015-11-13 15:43:09 -04:00
Joey Hess	b0155d9093	also compare lock file contents to double-check link worked And it closes the tmp file before this. I don't know if this will help avoid lustre's craziness, but it can't hurt..	2015-11-13 15:20:52 -04:00
Joey Hess	1aba23ab4e	use /tmp for sidelock file when no /dev/shm	2015-11-13 14:49:30 -04:00
Joey Hess	60a9c7f5c6	require the side lock be held to take pidlock This is less portable, since currently sidelocks rely on /dev/shm. But, I've seen crazy lustre inconsistencies that make me not trust the link() method at all, so what can you do.	2015-11-13 14:44:53 -04:00
Joey Hess	85345abe8b	avoid over-long filenames for side lock files	2015-11-13 14:04:29 -04:00
Joey Hess	c2cbe5619b	add stat check I have a strace taken on a lustre filesystem on which link() returned 0, but didn't actually succeed, since the file already existed. One of the linux man pages recommended using link followed by checking like this. I was reading it yesterday, but cannot find it now.	2015-11-13 13:22:45 -04:00
Joey Hess	88d94e674c	clean up temp file	2015-11-13 12:52:24 -04:00
Joey Hess	e31a51c5bb	better lock dropping order	2015-11-13 12:36:37 -04:00
Joey Hess	cd22340c99	generalize to MonadIO	2015-11-12 18:03:49 -04:00
Joey Hess	aa4192aea6	pid locking configuration and abstraction layer for git-annex (not actually used anywhere yet)	2015-11-12 17:50:34 -04:00
Joey Hess	77b490bfba	add timeout for pid lock waiting	2015-11-12 17:12:54 -04:00
Joey Hess	7bd9e33b84	refactor	2015-11-12 16:35:15 -04:00
Joey Hess	0f25a7365a	module for PidLocks in LockPool	2015-11-12 16:31:34 -04:00
Joey Hess	e7552e4cee	make LockPool's LockHandle be able to support multiple different types of file locks	2015-11-12 16:28:11 -04:00
Joey Hess	710d1eeeac	module for pid lock files with atomic stale lock file takeover when possible	2015-11-12 15:39:49 -04:00
Joey Hess	08bb3b1b1d	quvi may output utf-8 encoded data when the configured locale doesn't support that; avoid crashing on such invalid encoding.	2015-11-09 12:19:23 -04:00
Joey Hess	8b09e9306a	merge from propellor	2015-10-28 00:18:01 -04:00
Joey Hess	268800d590	Symlink timestamp preservation code uses functions from unix-2.7.0 when available, which should be more portable.	2015-10-21 02:22:18 -04:00
Joey Hess	b9c6a56b0e	Use statvfs on OSX. Fixes a recent-ish build warning on about 64 bit vs non. This is the method used by the disk-free-space library, and I tested it to yield the same results on even 10 tb drives on OSX -- so it's getting 64 bit values.	2015-10-19 17:09:06 -04:00
Joey Hess	45c9440cf9	refactor	2015-10-15 10:34:19 -04:00
Joey Hess	18c7b993bd	comment typo	2015-10-12 16:32:52 -04:00

982 commits