git-annex

Author	SHA1	Message	Date
Joey Hess	24800b1bf1	Only look at reflogs for relevant branches, not for git-annex branches. This speeds it up quite a bit. May still be too slow in large repos.	2015-07-07 17:36:30 -04:00
Joey Hess	b11d2f5a8a	unused: --used-refspec can now be configured to look at refs in the reflog. This provides a way to not consider old versions of files to be unused after they have reached a specified age, when the old refs in the reflog expire. May be slow.	2015-07-07 17:13:50 -04:00
Joey Hess	f7dc20595e	refactor ls-tree params. All in one place to avoid bugs like `174da80ddc`.	2015-07-06 14:21:43 -04:00
Joey Hess	174da80ddc	bugfix: Pass --full-tree when using git ls-tree to get a list of files on the git-annex branch, so it works when run in a subdirectory. This bug affected git-annex unused, and potentially also transitions running code and other things.	2015-07-06 14:09:54 -04:00
Joey Hess	adba0595bd	use bloom filter in second pass of sync --all --content. This is needed because when preferred content matches on files, the second pass would otherwise want to drop all keys. Using a bloom filter avoids this, and in the case of a false positive, a key will be left undropped that preferred content would allow dropping. Chances of that happening are a mere 1 in 1 million.	2015-06-16 18:50:13 -04:00
Joey Hess	a0a8127956	instance Hashable Key for bloomfilter	2015-06-16 18:37:41 -04:00
Joey Hess	8b74aec3ea	Increased the default annex.bloomaccuracy from 1000 to 10000000. This makes git annex unused use around 48 MB more memory than it did before, but the massive increase in accuracy makes this worthwhile for all but the smallest systems. Also, I want to use the bloom filter for sync --all --content, to avoid dropping files that the preferred content doesn't want, and 1/1000 false positives would be far too many in that use case, even if it were acceptable for unused. Actual memory use numbers: 1000: 21.06user 3.42system 0:26.40elapsed 92%CPU (0avgtext+0avgdata 501552maxresident)k; 1000000: 21.41user 3.55system 0:26.84elapsed 93%CPU (0avgtext+0avgdata 549496maxresident)k; 10000000: 21.84user 3.52system 0:27.89elapsed 90%CPU (0avgtext+0avgdata 549920maxresident)k. Based on these numbers, 10 million seemed a better pick than 1 million.	2015-06-16 18:12:00 -04:00
Joey Hess	8c46ea22c2	Added new "anything" preferred content expression, which matches all versions of all files.	2015-06-16 17:03:34 -04:00
Joey Hess	0a998032ed	Fix bug that prevented enumerating locally present objects in repos tuned with annex.tune.objecthash1=true. Need to walk 1 level of subdirs less in this case. The git-annex branch traversal code didn't have a similar bug.	2015-06-11 15:15:05 -04:00
Joey Hess	de3bd11a2c	import --clean-duplicates: Fix bug that didn't count local or trusted repo's copy of a file as one of the necessary copies to allow removing it from the import location.	2015-06-03 13:15:38 -04:00
Joey Hess	d28e8fbfd5	get --incomplete: New option to resume any interrupted downloads.	2015-06-02 14:20:38 -04:00
Joey Hess	eb33569f9d	remove Params constructor from Utility.SafeCommand. This removes a bit of complexity, and should make things faster (avoids tokenizing Params string), and probably involve less garbage collection. In a few places, it was useful to use Params to avoid needing a list, but that is easily avoided. Problems noticed while doing this conversion: * Some uses of Params "oneword", which was entirely unnecessary overhead. * A few places that built up a list of parameters with ++ and then used Params to split it! Test suite passes.	2015-06-01 13:52:23 -04:00
Joey Hess	a6d54e49a0	sync, remotedaemon: Pass configured ssh-options even when annex.sshcaching is disabled.	2015-05-30 22:01:52 -04:00
Joey Hess	83b262f1b6	fix windows build	2015-05-22 13:54:54 -04:00
Joey Hess	167539a354	better memoize core.sharedrepository handling. It was memoized, but that was not used consistently. Move it to Types.GitConfig so it will auto-memoize.	2015-05-19 15:04:24 -04:00
Joey Hess	b47c9fd587	honor core.sharedRepository settings in lockContent. The content file may not be owned by the user running git-annex, in which case, setting the owner write bit was not enough to let lockContent act on the file. However, with some core.sharedRepository configs, the file should be writable by the user's group. So, the thing to do is to call thawContent on it.	2015-05-19 14:53:19 -04:00
Joey Hess	f4e2093760	fix inAnnexSafe result for direct file that is being dropped. It was returning Just False in this situation, which differed from indirect mode behavior. I don't think this led to any actual problems; things that checked if the file being dropped was present just failed to fail, and instead reported it wasn't present, possibly incorrectly. Hmm, it's possible that this could have made git annex fsck --from remote update the location log wrongly, if a remote was in direct mode, and was in the middle of trying to drop a key, and the drop later failed.	2015-05-19 14:26:07 -04:00
Joey Hess	1312e721ed	convert lockContent to use new LockPools. Also cleaned up the code, avoiding creating a lock file if we're going to open it for create later anyway. And, if there's an exception while preparing to lock the file, but not at the point of actually taking the lock, throw an exception, instead of silently not locking and pretending to succeed. And, on Windows, always use a lock file, even if the repo somehow got into indirect mode (maybe with cygwin git).	2015-05-19 14:12:23 -04:00
Joey Hess	ecb0d5c087	use lock pools throughout git-annex. The one exception is in Utility.Daemon. As long as a process only daemonizes once, which seems reasonable, and as long as it avoids calling checkDaemon once it's already running as a daemon, the fcntl locking gotchas won't be a problem there. Annex.LockFile has its own separate lock pool layer, which has been renamed to LockCache. This is a persistent cache of locks that persist until closed. This is not quite done; lockContent still needs to be converted.	2015-05-19 14:09:52 -04:00
Joey Hess	7ebf234616	Stale transfer lock and info files will be cleaned up automatically when get/unused/info commands are run. Deleting lock files is tricky, tricky stuff. I think I got it right!	2015-05-12 20:11:23 -04:00
Joey Hess	7299bbb639	don't clean up transfer lock file when retrying transfer. This affected callers that used forwardRetry; if the 1st attempt failed it would clean up the transfer lock before retrying.	2015-05-12 19:43:24 -04:00
Joey Hess	8c2dd7d8ee	Fix an unlikely race that could result in two transfers of the same key running at once. As discussed in bug report.	2015-05-12 19:39:28 -04:00
Joey Hess	e25ecab7dd	convert to using Utility.Lockfile for transfer lock files. Should be no behavior changes, just simplified code. The only actual difference is it doesn't truncate the lock file. I think that was a holdover from when transfer info was written to the lock file.	2015-05-12 19:36:16 -04:00
Joey Hess	61ccf95004	Avoid accumulating transfer failure log files unless the assistant is being used. Only the assistant uses these, and only the assistant cleans them up, so make only git annex transferkeys write them. There is one behavior change from this. If glacier is being used, and a manual git annex get --from glacier fails because the file isn't available yet, the assistant will no longer later see that failed transfer file and retry the get. Hope no-one depended on that old behavior.	2015-05-12 15:53:38 -04:00
Joey Hess	a812d598ef	Take space that will be used by running downloads into account when checking annex.diskreserve.	2015-05-12 15:20:22 -04:00
Joey Hess	e27b97d364	Merge branch 'master' into concurrentprogress Conflicts: Command/Fsck.hs Messages.hs Remote/Directory.hs Remote/Git.hs Remote/Helper/Special.hs Types/Remote.hs debian/changelog git-annex.cabal	2015-05-12 13:23:22 -04:00
Joey Hess	64a4553e0b	rename traverse to walk since Data.Traversable is imported by default in ghc 7.10	2015-05-10 16:43:09 -04:00
Joey Hess	08308dc9b3	fix build warning with ghc 7.10	2015-05-10 15:28:13 -04:00
Joey Hess	9f3e51dd51	move nubbing into function whose algo needs a nubbed list	2015-04-30 14:11:59 -04:00
Joey Hess	38c458b407	refactor	2015-04-30 14:02:56 -04:00
Joey Hess	5948c148fb	Make repo init more robust. The setDifferences that got added to initialize turns out to make a git commit, and before ensureCommit has been used. Thus, repo init can fail when the system has a broken hostname etc. Move the ensureCommit to the very first thing to avoid this kind of breakage.	2015-04-20 14:01:41 -04:00
Joey Hess	3a078ab357	When a key's size is unknown, still check the annex.diskreserve, and avoid getting content if the disk is too full. We can't check if there's enough disk space to download the content, but we can check if there's certainly not enough!	2015-04-17 21:29:15 -04:00
Joey Hess	86a2f9dc4d	Merge branch 'master' into concurrentprogress Conflicts: debian/changelog	2015-04-14 15:35:15 -04:00
Joey Hess	2b79e6fe08	a few hlints	2015-04-11 00:10:34 -04:00
Joey Hess	9971c82ead	refactor	2015-04-10 17:53:58 -04:00
Joey Hess	8077ccbd54	get, move, copy, mirror: Concurrent downloads and uploads are now supported! This works, and seems fairly robust. Clean get of 20 files at -J3. At -J10, there are some messages about ssh multiplexing, probably due to a race spinning up the ssh connection cacher. But, it manages to get all the files ok regardless. The progress bars are a scrambled mess though, due to bugs in ascii-progress, which I've already filed. Particularly this one: https://github.com/yamadapc/haskell-ascii-progress/issues/8	2015-04-10 17:08:07 -04:00
Joey Hess	0880c8319e	simplify and make more atomic	2015-04-10 15:16:17 -04:00
Joey Hess	ce0a82f493	contentlocation: New plumbing command.	2015-04-09 15:34:47 -04:00
Joey Hess	b99b8d5d4c	followup to bug I cannot reproduce, and analysis based presumptive fix	2015-04-09 14:03:44 -04:00
Joey Hess	42e46a8701	avoid using --literal-pathspecs with git older than 1.8.1, which added it. Windows is still building with an older git.	2015-04-06 13:46:11 -04:00
Joey Hess	1d57f142f1	Merge branch 'concurrentprogress'	2015-04-04 15:01:00 -04:00
Joey Hess	2343f99c85	well along the way to fully quiet --quiet. Came up with a generic way to filter out progress messages while keeping errors, for commands that use stderr for both. --json mode will disable command outputs too.	2015-04-04 14:34:03 -04:00
Joey Hess	ff2eeaf054	avoid progress bar for url download with --quiet	2015-04-03 20:38:56 -04:00
Joey Hess	bd110516c0	init: Improve fifo test to detect NFS systems that support fifos but not well enough for sshcaching. ssh tries to hard link a fifo, and if not, complains: muxserver_listen: link mux listener .git/annex/ssh/SHARD1@iabak.archiveteam.org.QK8zOCbtNebI7q54 => .git/annex/ssh/SHARD1@iabak.archiveteam.org: Operation not permitted	2015-04-03 14:57:10 -04:00
Joey Hess	0a6933771d	cleanup	2015-03-30 19:55:35 -04:00
Joey Hess	15d45186cc	use --literal-pathspecs globally, as a better way to avoid globbing. This might be overkill; I only know I need it in ls-files, but other git commands can also do their own globbing, it turns out, and I am pretty sure I never want them to when git-annex is using them as plumbing. Test suite still passes and it looks ok.	2015-03-30 19:44:13 -04:00
Joey Hess	5be536e523	Fix bug introduced in the last release that broke git-annex sync when git-annex was installed from the standalone tarball. This was introduced by commit `450ee53ab6`. However, the same problem could affect other calls to programPath, specifically some on the assistant. So, I fixed it at a deeper level.	2015-03-27 12:55:18 -04:00
Joey Hess	3af4691978	Improve error message when --in @date is used and there is no reflog for the git-annex branch.	2015-03-26 11:15:15 -04:00
Joey Hess	798da6cf2e	Added a post-update-annex hook, which is run after the git-annex branch is updated. Needed for git update-server-info. See https://github.com/datalad/datalad/issues/1#issuecomment-84094406	2015-03-20 14:52:58 -04:00
Joey Hess	cf903d5a3c	fixup annex link target calculation when submodules are used in filesystems not supporting symlinks	2015-03-04 16:08:41 -04:00


635 commits