git-annex

Author	SHA1	Message	Date
Joey Hess	1d5b70db9c	addurl: Support --json, particularly useful in --batch mode.	2016-01-13 14:25:30 -04:00
Joey Hess	423fffcd41	change keys database to use IKey type with more efficient serialization This breaks any existing keys database! IKey serializes more efficiently than SKey, although this limits the use of its Read/Show instances. This makes the keys database use less disk space, and so should be a win. Updated benchmark: benchmarking keys database/getAssociatedFiles from 1000 (hit) time 64.04 μs (63.95 μs .. 64.13 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.02 μs (63.96 μs .. 64.08 μs) std dev 218.2 ns (172.5 ns .. 299.3 ns) benchmarking keys database/getAssociatedFiles from 1000 (miss) time 52.53 μs (52.18 μs .. 53.21 μs) 0.999 R² (0.998 R² .. 1.000 R²) mean 52.31 μs (52.18 μs .. 52.91 μs) std dev 734.6 ns (206.2 ns .. 1.623 μs) benchmarking keys database/getAssociatedKey from 1000 (hit) time 64.60 μs (64.46 μs .. 64.77 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.74 μs (64.57 μs .. 65.20 μs) std dev 900.2 ns (389.7 ns .. 1.733 μs) benchmarking keys database/getAssociatedKey from 1000 (miss) time 52.46 μs (52.29 μs .. 52.68 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 52.63 μs (52.35 μs .. 53.37 μs) std dev 1.362 μs (562.7 ns .. 2.608 μs) variance introduced by outliers: 24% (moderately inflated) benchmarking keys database/addAssociatedFile to 1000 (old) time 487.3 μs (484.7 μs .. 490.1 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 490.9 μs (487.8 μs .. 496.5 μs) std dev 13.95 μs (6.841 μs .. 22.03 μs) variance introduced by outliers: 20% (moderately inflated) benchmarking keys database/addAssociatedFile to 1000 (new) time 6.633 ms (5.741 ms .. 7.751 ms) 0.905 R² (0.850 R² .. 0.965 R²) mean 8.252 ms (7.803 ms .. 8.602 ms) std dev 1.126 ms (900.3 μs .. 1.430 ms) variance introduced by outliers: 72% (severely inflated) benchmarking keys database/getAssociatedFiles from 10000 (hit) time 65.36 μs (64.71 μs .. 66.37 μs) 0.998 R² (0.995 R² .. 1.000 R²) mean 65.28 μs (64.72 μs .. 66.45 μs) std dev 2.576 μs (920.8 ns .. 4.122 μs) variance introduced by outliers: 42% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (miss) time 52.34 μs (52.25 μs .. 52.45 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 52.49 μs (52.42 μs .. 52.59 μs) std dev 255.4 ns (205.8 ns .. 312.9 ns) benchmarking keys database/getAssociatedKey from 10000 (hit) time 64.76 μs (64.67 μs .. 64.84 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.67 μs (64.62 μs .. 64.72 μs) std dev 177.3 ns (148.1 ns .. 217.1 ns) benchmarking keys database/getAssociatedKey from 10000 (miss) time 52.75 μs (52.66 μs .. 52.82 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 52.69 μs (52.63 μs .. 52.75 μs) std dev 210.6 ns (173.7 ns .. 265.9 ns) benchmarking keys database/addAssociatedFile to 10000 (old) time 489.7 μs (488.7 μs .. 490.7 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 490.4 μs (489.6 μs .. 492.2 μs) std dev 3.990 μs (2.435 μs .. 7.604 μs) benchmarking keys database/addAssociatedFile to 10000 (new) time 9.994 ms (9.186 ms .. 10.74 ms) 0.959 R² (0.928 R² .. 0.979 R²) mean 9.906 ms (9.343 ms .. 10.40 ms) std dev 1.384 ms (1.051 ms .. 2.100 ms) variance introduced by outliers: 69% (severely inflated)	2016-01-12 14:01:50 -04:00
Joey Hess	015a5e485f	add benchmarks of adding an associated file benchmarking keys database/addAssociatedFile to 1000 (old) time 516.1 μs (514.7 μs .. 517.4 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 514.0 μs (512.1 μs .. 515.2 μs) std dev 4.740 μs (2.972 μs .. 7.068 μs) benchmarking keys database/addAssociatedFile to 1000 (new) time 5.750 ms (4.857 ms .. 6.885 ms) 0.815 R² (0.698 R² .. 0.904 R²) mean 7.858 ms (7.311 ms .. 8.421 ms) std dev 1.684 ms (1.383 ms .. 2.027 ms) variance introduced by outliers: 88% (severely inflated) benchmarking keys database/addAssociatedFile to 10000 (old) time 515.7 μs (514.8 μs .. 516.5 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 515.4 μs (513.7 μs .. 516.6 μs) std dev 4.824 μs (2.957 μs .. 7.167 μs) benchmarking keys database/addAssociatedFile to 10000 (new) time 8.934 ms (7.779 ms .. 10.05 ms) 0.868 R² (0.751 R² .. 0.934 R²) mean 11.51 ms (10.66 ms .. 12.26 ms) std dev 2.174 ms (1.816 ms .. 2.747 ms) variance introduced by outliers: 82% (severely inflated)	2016-01-12 13:22:31 -04:00
Joey Hess	647b2f33af	refactor	2016-01-12 13:15:15 -04:00
Joey Hess	f9c5aa84e0	add database benchmark The benchmark shows that the database access is quite fast indeed! And, it scales linearly to the number of keys, with one exception, getAssociatedKey. Based on this benchmark, I don't think I need worry about optimising for cases where all files are locked and the database is mostly empty. In those cases, database access will be misses, and according to this benchmark, should add only 50 milliseconds to runtime. (NB: There may be some overhead to getting the database opened and locking the handle that this benchmark doesn't see.) joey@darkstar:~/src/git-annex>./git-annex benchmark setting up database with 1000 setting up database with 10000 benchmarking keys database/getAssociatedFiles from 1000 (hit) time 62.77 μs (62.70 μs .. 62.85 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 62.81 μs (62.76 μs .. 62.88 μs) std dev 201.6 ns (157.5 ns .. 259.5 ns) benchmarking keys database/getAssociatedFiles from 1000 (miss) time 50.02 μs (49.97 μs .. 50.07 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.09 μs (50.04 μs .. 50.17 μs) std dev 206.7 ns (133.8 ns .. 295.3 ns) benchmarking keys database/getAssociatedKey from 1000 (hit) time 211.2 μs (210.5 μs .. 212.3 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 211.0 μs (210.7 μs .. 212.0 μs) std dev 1.685 μs (334.4 ns .. 3.517 μs) benchmarking keys database/getAssociatedKey from 1000 (miss) time 173.5 μs (172.7 μs .. 174.2 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 173.7 μs (173.0 μs .. 175.5 μs) std dev 3.833 μs (1.858 μs .. 6.617 μs) variance introduced by outliers: 16% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (hit) time 64.01 μs (63.84 μs .. 64.18 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.85 μs (64.34 μs .. 66.02 μs) std dev 2.433 μs (547.6 ns .. 4.652 μs) variance introduced by outliers: 40% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (miss) time 50.33 μs (50.28 μs .. 50.39 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.32 μs (50.26 μs .. 50.38 μs) std dev 202.7 ns (167.6 ns .. 252.0 ns) benchmarking keys database/getAssociatedKey from 10000 (hit) time 1.142 ms (1.139 ms .. 1.146 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.142 ms (1.140 ms .. 1.144 ms) std dev 7.142 μs (4.994 μs .. 10.98 μs) benchmarking keys database/getAssociatedKey from 10000 (miss) time 1.094 ms (1.092 ms .. 1.096 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.095 ms (1.095 ms .. 1.097 ms) std dev 4.277 μs (2.591 μs .. 7.228 μs)	2016-01-12 13:07:03 -04:00
Joey Hess	d6fe7fdd7d	rekey: No longer copies over urls from the old to the new key. It makes sense for migrate to do that, but not for this low-level (and little used) plumbing command to.	2016-01-07 18:06:20 -04:00
Joey Hess	4b819bee2b	avoid confusing git with a modified ctime in clean filter Linking the file to the tmp dir was not necessary in the clean filter, and it caused the ctime to change, which caused git to think the file was changed. This caused git status to get slow as it kept re-cleaning unchanged files.	2016-01-07 17:48:04 -04:00
Joey Hess	3b960d1422	migrate and rekey v6 unlocked file support	2016-01-07 15:14:15 -04:00
Joey Hess	0b59fb423e	migrate: Copy over metadata to new key.	2016-01-07 14:21:12 -04:00
Joey Hess	66f3fb1ce2	unused: deal with v6 unlocked file that is implicitly ingested by git diff etc	2016-01-06 22:11:21 -04:00
Joey Hess	2e071a09b7	cleanup	2016-01-06 20:41:25 -04:00
Joey Hess	3320870bad	optimise `03cb2c8ece` put a cat-file into the fast bloomfilter generation path. Instead, add another bloom filter which diffs from the work tree to the index. Also, pull the sha of the changed object out of the diffs, and cat that object directly, rather than indirecting through the filename. Finally, removed some hacks that are unncessary thanks to the worktree to index diff.	2016-01-06 20:38:02 -04:00
Joey Hess	b26776d92f	fix parsing of v6 unlocked file The newline broke this ad-hoc parser; use the normal one.	2016-01-06 17:46:46 -04:00
Joey Hess	03cb2c8ece	unused: Bug fix when a new file was added to the annex, and then removed (but not git rmed). git still has the add staged in this case, so the content should not be unused and was wrongly treated as such. So, we need to look at both the file on disk to see if it's a annex link, and the file in the index too. lookupFile doesn't look in the index if the file is not present on disk.	2016-01-06 16:49:41 -04:00
Joey Hess	0c1cc7789f	fix test failure locking an unlocked not present file In v5, that was not possible, but it is in v6, and so the test was failing. Investigating, it turns out that locking was copying the pointer file content to the annex object despite the content not being present. So, add a check to prevent that.	2016-01-06 16:01:52 -04:00
Joey Hess	b96cfdc094	whereis --json: Make url list be included in machine-parseable form.	2016-01-06 12:33:32 -04:00
Joey Hess	b3d60ca285	use TopFilePath for associated files Fixes several bugs with updates of pointer files. When eg, running git annex drop --from localremote it was updating the pointer file in the local repository, not the remote. Also, fixes drop ../foo when run in a subdir, and probably lots of other problems. Test suite drops from ~30 to 11 failures now. TopFilePath is used to force thinking about what the filepath is relative to. The data stored in the sqlite db is still just a plain string, and TopFilePath is a newtype, so there's no overhead involved in using it in DataBase.Keys.	2016-01-05 17:22:19 -04:00
Joey Hess	121659576b	info --json: Improve json for "backend usage", using a nested object with fields for each backend instead of the previous weird nested lists. This may break existing parsers of this json output, if there were any.	2016-01-01 16:33:05 -04:00
Joey Hess	09a2fcb643	info: Fix "backend usage" numbers, which were counting present keys twice. Let's just count the referenced keys for that, and not present keys at all.	2016-01-01 16:13:16 -04:00
Joey Hess	43b1333216	switch to using main ingest code Fixes at least one bug, in populating existing worktree files that use the same key that's ingested.	2016-01-01 14:16:40 -04:00
Joey Hess	a2c056df65	convert isPointerFile from Annex to IO	2016-01-01 13:22:38 -04:00
Joey Hess	996ae9b172	don't disable smudge filter while merging The smudge filter does need to be run, because if the key is in the local annex already (due to renaming, or a copy of a file added, or a new file added and its content has already arrived), git merge smudges the file and this should provide its content. This does probably mean that in merge conflict resolution, git smudges the existing file, re-copying all its content to it, and then the file is deleted. So, not efficient.	2015-12-29 16:36:21 -04:00
Joey Hess	b6b34f4916	automatic conflict resolution for v6 unlocked files Several tricky parts: * When the conflict is just between the same key being locked and unlocked, the unlocked version wins, and the file is not renamed in this case. * Need to update associated file map when conflict resolution renames an unlocked file. * git merge runs the smudge filter on the conflicting file, and actually overwrites the file with the same content it had before, and so invalidates its inode cache. This makes it difficult to know when it's safe to remove such files as conflict cruft, without going so far as to compare their entire contents. Dealt with this by preventing the smudge filter from populating the file when a merge is run. However, that also prevents the smudge filter being run for non-conflicting files, so eg moving a file won't put its new content into place. * Ideally, if a merge or a merge conflict resolution renames an unlocked file, the file in the work tree can just be moved, rather than copying the content to a new worktree file. This is attempted to be done in merge conflict resolution, but due to git merge's behavior of running smudge filters, what actually seems to happen is the old worktree file with the content is deleted and rewritten as a pointer file, so doesn't get reused. So, this is probably not as efficient as it optimally could be. If that becomes a problem, could look into running the merge in a separate worktree and updating the real worktree more efficiently, similarly to the direct mode merge. However, the direct mode merge had a lot of bugs, and I'd rather not use that more error-prone method unless really needed.	2015-12-29 15:41:09 -04:00
Joey Hess	1499b9b79d	fix file perms after breaking hard link	2015-12-27 16:12:48 -04:00
Joey Hess	121f5d5b0c	annex.thin Decided it's too scary to make v6 unlocked files have 1 copy by default, but that should be available to those who need it. This is consistent with git-annex not dropping unused content without --force, etc. * Added annex.thin setting, which makes unlocked files in v6 repositories be hard linked to their content, instead of a copy. This saves disk space but means any modification of an unlocked file will lose the local (and possibly only) copy of the old version. * Enable annex.thin by default on upgrade from direct mode to v6, since direct mode made the same tradeoff. * fix: Adjusts unlocked files as configured by annex.thin.	2015-12-27 15:59:59 -04:00
Joey Hess	f776ac0a11	add unlocked flag for git-annex-shell recvkey The direct flag is also set when sending unlocked content, to support old versions of git-annex-shell. At some point, the direct flag will be removed, and only the unlocked flag will be used.	2015-12-26 13:59:27 -04:00
Joey Hess	87f0708f88	persistent-sqlite is now a hard build dependency, since v6 repository mode needs it.	2015-12-26 13:00:52 -04:00
Joey Hess	7c02f070b1	lost some bookkeeping info I forgot to convert this to use Annex.Ingest, todo later.	2015-12-24 13:15:26 -04:00
Joey Hess	39048e4568	Merge branch 'master' into smudge	2015-12-22 18:10:40 -04:00
Joey Hess	d8a8c77a8f	move cleanOldKey into ingest	2015-12-22 16:55:49 -04:00
Joey Hess	4f60234690	finish v6 support for assistant Seems to basically work now!	2015-12-22 15:23:27 -04:00
Joey Hess	4392140946	make linkAnnex detect when the file changes as it's being copied/linked in This fixes a race where the modified file ended up in annex/objects, and the InodeCache stored in the database was for the modified version, so git-annex didn't know it had gotten modified. The race could occur when the smudge filter was running; now it gets the InodeCache before generating the Key, which avoids the race.	2015-12-22 15:20:03 -04:00
Joey Hess	8e9608d7f0	refactoring no behavior changes	2015-12-22 13:42:58 -04:00
Joey Hess	2dce8081a6	addurl: Added --with-files option.	2015-12-22 12:20:39 -04:00
Joey Hess	03f2ae0423	refactor	2015-12-22 11:58:59 -04:00
Joey Hess	4cf9efb51a	remove (v6) associated file in unannex	2015-12-21 18:00:48 -04:00
Joey Hess	d82b110da8	Merge branch 'master' into smudge	2015-12-21 17:12:46 -04:00
Joey Hess	a8b398c1fa	addurl: Added --batch option.	2015-12-21 12:57:13 -04:00
Joey Hess	35827e2705	status: On crippled filesystems, was displaying M for all annexed files that were present. Probably caused by a change to what git status displays in this situation. Fixed by treating files git thinks are modified the same as typechanged files.	2015-12-19 13:36:40 -04:00
Joey Hess	6b717032c5	v6: fix locking modified file when the content is not present	2015-12-16 15:35:42 -04:00
Joey Hess	2d343224dc	fix add of file that was locked but has been replaced by a new, unlocked file (v6)	2015-12-16 14:53:41 -04:00
Joey Hess	7d0e79b9e1	Use git-annex init --version=6 to get v6 for now Not ready to make it default because of the direct mode upgrade needing to all happen at once.	2015-12-15 17:17:13 -04:00
Joey Hess	b9588fe69e	in v6 mode, unannex does not interact badly with pre-commit hook So can be used in a tree with staged changes, no problems. Much nicer.	2015-12-15 16:18:39 -04:00
Joey Hess	99f1d7991d	recent fsck changes caused ugly message when object was not present	2015-12-15 16:10:48 -04:00
Joey Hess	cdd27b8920	reorg	2015-12-15 15:34:28 -04:00
Joey Hess	0ddcaae9c1	changes for v6 broke fsck in direct mode	2015-12-15 14:27:20 -04:00
Joey Hess	8a660a7b14	add: In v6 mode, acts on modified files. Same as was done in direct mode, except in v6 mode add always adds files locked, so	2015-12-15 14:17:00 -04:00
Joey Hess	d245a80518	avoid pre-commit check having to do with v5 unlocked files when in v6 mode	2015-12-15 14:09:36 -04:00
Joey Hess	a983a3a7a2	rename stuff for v5 unlocked files to indicate it's old	2015-12-15 14:08:07 -04:00
Joey Hess	a4a813fb07	add: no need to make pass for old unlocked files in v6	2015-12-15 14:03:25 -04:00
Joey Hess	71e2050f8f	have clean filter check if the filename was already in use by an old key The annex object for it may have been modified due to hard link, and that should be cleaned up when the new version is added. If another associated file has the old key's content, that's linked into the annex object. Otherwise, update location log to reflect that content has been lost.	2015-12-15 13:06:52 -04:00
Joey Hess	42caf42857	avoid smudge filter returning invalid content 1. git add file 2. git commit 3. modify file 4. git commit 5. git reset HEAD^ Before this fix, that resulted in git saying the file was modified. And indeed, it didn't have the content it should in the just checked out ref, because step 3 modified the object file for the old key.	2015-12-11 18:01:50 -04:00
Joey Hess	e7183d83d3	fsck for v6 unlocked files This only adds 1 stat to each file fscked for locked files, so added overhead is minimal. For unlocked files it has to access the database to see if a file is modified.	2015-12-11 16:07:54 -04:00
Joey Hess	7790e059b2	finish v6 git-annex lock This was a doozy!	2015-12-11 15:28:34 -04:00
Joey Hess	50e83b606c	only make 1 hardlink max between pointer file and annex object If multiple files point to the same annex object, the user may want to modify them independently, so don't use a hard link. Also, check diskreserve when copying.	2015-12-11 14:00:21 -04:00
Joey Hess	c608a752a5	Merge branch 'master' into smudge	2015-12-11 13:50:31 -04:00
Joey Hess	abd66c7089	fsck: Failed to honor annex.diskreserve when checking a remote.	2015-12-11 13:50:27 -04:00
Joey Hess	c910b4e255	wip	2015-12-11 10:42:18 -04:00
Joey Hess	e2c8dc6778	v6 git-annex unlock Note that the implementation uses replaceFile, so that the actual replacement of the work tree file is atomic. This seems a good property to have! It would be possible for unlock in v6 mode to be run on files that do not have their content present. However, that would be a behavior change from before, and I don't see any immediate need to support it, so I didn't implement it.	2015-12-10 16:12:48 -04:00
Joey Hess	06a8256bf6	always format pointer file with a trailing newline Before the smudge filter added a trailing newline, but other things that wrote formatPointer to a file did not. also some new pointer staging code to use later	2015-12-10 16:06:58 -04:00
Joey Hess	ce73a96e4e	use InodeCache when dropping a key to see if a pointer file can be safely reset The Keys database can hold multiple inode caches for a given key. One for the annex object, and one for each pointer file, which may not be hard linked to it. Inode caches for a key are recorded when its content is added to the annex, but only if it has known pointer files. This is to avoid the overhead of maintaining the database when not needed. When the smudge filter outputs a file's content, the inode cache is not updated, because git's smudge interface doesn't let us write the file. So, dropping will fall back to doing an expensive verification then. Ideally, git's interface would be improved, and then the inode cache could be updated then too.	2015-12-09 17:54:54 -04:00
Joey Hess	5e8c628d2e	add inode cache to the db Renamed the db to keys, since it is various info about a Keys. Dropping a key will update its pointer files, as long as their content can be verified to be unmodified. This falls back to checksum verification, but I want it to use an InodeCache of the key, for speed. But, I have not made anything populate that cache yet.	2015-12-09 17:00:37 -04:00
Joey Hess	3311c48631	move InodeSentinal from direct mode code to its own module Will be used outside of direct mode for v6 unlocked files, and is already used outside of direct mode when adding files to annex.	2015-12-09 15:52:11 -04:00
Joey Hess	ba39f993f5	avoid clean filter trying to annex a pointer file	2015-12-09 15:24:32 -04:00
Joey Hess	751120c171	avoid pre-commit hook messing up new-style unlocked files in v6 repo	2015-12-09 15:18:54 -04:00
Joey Hess	05b598a057	stash DbHandle in Annex state	2015-12-09 14:55:47 -04:00
Joey Hess	78a6b8ce05	refactor and improve pointer file handling code	2015-12-09 14:27:43 -04:00
Joey Hess	712c9fc590	require "annex/objects/" before key in pointer files This removes ambiguity, because while someone might have "WORM--foo" in a file that's not intended to be a git-annex pointer file, "annex/objects/WORM--foo" is less likely. Also, `664cc987e8` had a caveat about symlink targets being parsed as pointer files, and now the same parser is used for both. I did not include any hash directories before the key in the pointer file, as they're not needed. However, if they were included, the parser would still work ok.	2015-12-07 15:45:08 -04:00
Joey Hess	664cc987e8	support pointer files Backend.lookupFile is changed to always fall back to catKey when operating on a file that's not a symlink. catKey is changed to understand pointer files, as well as annex symlinks. Before, catKey needed a file mode witness, to be sure it was looking at a symlink. That was complicated stuff. Now, it doesn't actually care if a file in git is a symlink or not; in either case asking git for the content of the file will get the pointer to the key. This does mean that git-annex will treat a link foo -> WORM--bar as a git-annex file, and also treats a regular file containing annex/objects/WORM--bar as a git-annex file. Calling catKey could make git-annex commands need to do more work than before. This would especially be the case if a repo contained many regular files, and only a few annexed files, as now git-annex will need to ask git about the contents of the regular files.	2015-12-07 15:35:36 -04:00
Joey Hess	2cbcb4f1a8	update associated files database on smudge and clean	2015-12-07 14:41:22 -04:00
Joey Hess	fb6ebdaae7	refactor	2015-12-04 17:18:26 -04:00
Joey Hess	e8ca01cbc0	comments	2015-12-04 16:46:00 -04:00
Joey Hess	e7f75b079d	don't let git-annex direct be run in a v6 repo	2015-12-04 16:33:09 -04:00
Joey Hess	ccc49861ca	add v6; keep v5 working for now and manual upgrade Since all places where a repo is used in direct mode need to have git-annex upgraded before the repo can safely be converted to v6, the upgrade needs to be manual for now. I suppose that at some point I'll want to drop all the direct mode support code. At that point, will stop supporting v5, and will need to auto-upgrade any remaining v5 repos. If possible, I'd like to carry the direct mode support for say, a year or so, to give people plenty of time to upgrade and avoid disruption.	2015-12-04 16:14:48 -04:00
Joey Hess	723e4e31a1	merge clean into smudge command The git filter config can be used to map the single git-annex command to the 2 actions, and this avoids "git annex clean" being used for this thing, it might have a better use for that name later.	2015-12-04 15:32:47 -04:00
Joey Hess	99b2a524a0	clean filter should update location log when adding new content to annex	2015-12-04 14:20:32 -04:00
Joey Hess	ad06f8ceed	avoid commit and messages for smudge filter	2015-12-04 14:20:22 -04:00
Joey Hess	fdfda7b7bb	annex.largefiles support for clean filter	2015-12-04 14:10:18 -04:00
Joey Hess	d349693269	smudge filter working	2015-12-04 14:03:10 -04:00
Joey Hess	2c6454a2e2	basic clean filter working	2015-12-04 13:39:14 -04:00
Joey Hess	20ca89dfa3	skeleton smudge/clean filters	2015-12-04 13:03:39 -04:00
Joey Hess	37a5e2d419	dropunused: Make more robust when trying to drop an object that has already been dropped. Before it crashed trying to lock the not-present content and prevented dropping anything else. Instead, succeed.	2015-12-03 15:58:00 -04:00
Joey Hess	f16e235983	addurl, importfeed: Changed to honor annex.largefiles settings, when the content of the url is downloaded. (Not when using --fast or --relaxed.) importfeed just calls addurl functions, so inherits this from it. Note that addurl still generates a temp file, and uses that key to download the file. It just adds it to the work tree at the end when the file is small.	2015-12-02 15:12:33 -04:00
Joey Hess	dc8099872a	import: Changed to honor annex.largefiles settings.	2015-12-02 14:49:03 -04:00
Joey Hess	c2674308c0	map: Improve display of git remotes with non-ssh urls, including http and gcrypt.	2015-11-18 15:08:55 -04:00
Joey Hess	cecf3894ff	note where map is left in --fast mode	2015-11-18 14:17:52 -04:00
Joey Hess	e97fce35a6	Display progress meter in -J mode when downloading from the web. Including in addurl, and get --from web, but also in S3 and External special remotes when a web url is known for content in those remotes.	2015-11-16 21:00:54 -04:00
Joey Hess	4b02af57b6	display a message in the unlikely scenario of fsking a dead repository	2015-11-10 14:44:58 -04:00
Joey Hess	cd7929034a	fsck: When fscking a dead repo, avoid incorrect "fixing location log" message. keyLocations doesn't return locations in dead repos, but if we're fscking a dead repo, we want to look at what locations are actually logged for it.	2015-11-10 13:59:04 -04:00
Joey Hess	53db9d0b5c	work around git check-ignore --batch bad exit status bug, and bring back import -J	2015-11-06 15:39:51 -04:00
Joey Hess	7938b87864	add: Fix error recovery rollback to not move the injested file content out of the annex back to the file, because other files may point to that same content. Instead, copy the injected file content out to recover. That was not a data loss, but it came close!	2015-11-06 15:28:20 -04:00
Joey Hess	8ea594f565	missed adding allowConcurrentOutput here	2015-11-06 13:41:26 -04:00
Joey Hess	362ab39aad	import -J fails at the end, disable util it can be fixed	2015-11-05 18:48:46 -04:00
Joey Hess	7dc90f2225	import: Avoid very ugly error messages when the directory files are imported to is not a directort, but perhaps an annexed file.	2015-11-05 18:46:05 -04:00
Joey Hess	5db7d435e7	-J for add/addurl/import	2015-11-05 18:24:15 -04:00
Joey Hess	c4d45ef83d	drop -Jn	2015-11-04 17:13:20 -04:00
Joey Hess	3d0f41518d	parallel fsck (yes, these changes are all it takes now!)	2015-11-04 16:28:14 -04:00
Joey Hess	c0c595345c	arrange for regional output manager to run when -J is enabled Commands that want to use it have to run their seek action inside allowConcurrentOutput. Which seems reasonable; perhaps some future command will want to support the -J flag but not use regions. The region state moved from Annex to MessageState. This makes sense organizationally, and note that some uses of onLocal use a different Annex state, but pass the MessageState into it, which is what is needed.	2015-11-04 16:22:43 -04:00
Joey Hess	640dba43b6	enableremote: List uuids and descriptions of remotes that can be enabled, and accept either the uuid or the description in leu if the name.	2015-10-26 14:55:40 -04:00
Joey Hess	1f65de4085	improve layout and comment	2015-10-15 15:10:14 -04:00
Joey Hess	fa9333e99f	use action, not sideAction sideAction is for things not generally related to the current action being performed. And, it adds a newline after the side action. This was not the right thing to use for stuff like "checksum", where doing a checksum is part of the git annex get process, and indeed we want it to display "(checksum...) ok"	2015-10-11 13:29:44 -04:00
Joey Hess	3b89d5a20c	implement lockContent for ssh remotes	2015-10-09 16:55:41 -04:00
Joey Hess	e392ec112f	also generate a drop safety proof for move --from remote	2015-10-09 16:16:03 -04:00
Joey Hess	b944da832b	tests and verified that the bug is fixed, in all the cases I identified	2015-10-09 15:59:42 -04:00
Joey Hess	6a72045707	fix local dropping to not require extra locking of copies, but only that the local copy be locked for removal	2015-10-09 15:48:02 -04:00
Joey Hess	b021321aae	rename constructor	2015-10-09 15:01:33 -04:00
Joey Hess	45e1a7c361	verify local copy of content with locking	2015-10-09 14:57:32 -04:00
Joey Hess	a5e74e9e64	display drop safety proofs in debug mode	2015-10-09 13:47:19 -04:00
Joey Hess	cf79dffa4c	improve drop proof code	2015-10-09 11:09:46 -04:00
Joey Hess	c75c79864d	support invalidating existing VerifiedCopys	2015-10-08 17:58:32 -04:00
Joey Hess	90f7c4b6a2	add VerifiedCopy data type There should be no behavior changes in this commit, it just adds a more expressive data type and adjusts code that had been passing around a [UUID] or sometimes a Maybe Remote to instead use [VerifiedCopy]. Although, since some functions were taking two different [UUID] lists, there's some potential for me to have gotten it horribly wrong.	2015-10-08 16:55:11 -04:00
Joey Hess	b1abe59193	add removeKey action to Remote Not implemented for any remotes yet; probably the git remote is the only one that will ever implement it.	2015-10-08 15:01:38 -04:00
Joey Hess	5240a9f315	git-annex-shell: Added lockcontent command, to prevent dropping of key's content.	2015-10-08 14:47:46 -04:00
Joey Hess	4d50958ed7	add lockContentShared Also, rename lockContent to lockContentExclusive inAnnexSafe should perhaps be eliminated, and instead use `lockContentShared inAnnex`. However, I'm waiting on that, as there are only 2 call sites for inAnnexSafe and it's fiddly.	2015-10-08 14:29:35 -04:00
Joey Hess	1ac79e6fe5	copy --auto was checking the wrong repo's preferred content. (--from was checking what --to should, and vice-versa.) Fixed this bug, which was introduced in version 5.20150727.	2015-10-06 17:29:44 -04:00
Joey Hess	60d382a840	avoid using print action, which is reserved for debugging	2015-10-06 15:26:42 -04:00
Joey Hess	2def1d0a23	other 80% of avoding verification when hard linking to objects in shared repo In `c6632ee5c8`, it actually only handled uploading objects to a shared repository. To avoid verification when downloading objects from a shared repository, was a lot harder. On the plus side, if the process of downloading a file from a remote is able to verify its content on the side, the remote can indicate this now, and avoid the extra post-download verification. As of yet, I don't have any remotes (except Git) using this ability. Some more work would be needed to support it in special remotes. It would make sense for tahoe to implicitly verify things downloaded from it; as long as you trust your tahoe server (which typically runs locally), there's cryptographic integrity. OTOH, despite bup being based on shas, a bup repo under an attacker's control could have the git ref used for an object changed, and so a bup repo shouldn't implicitly verify. Indeed, tahoe seems unique in being trustworthy enough to implicitly verify.	2015-10-02 14:35:12 -04:00
Joey Hess	2fb3722ce9	Do verification of checksums of annex objects downloaded from remotes. * When annex objects are received into git repositories, their checksums are verified then too. * To get the old, faster, behavior of not verifying checksums, set annex.verify=false, or remote.<name>.annex-verify=false. * setkey, rekey: These commands also now verify that the provided file matches the key, unless annex.verify=false. * reinject: Already verified content; this can now be disabled by setting annex.verify=false. recvkey and reinject already did verification, so removed now duplicate code from them. fsck still does its own verification, which is ok since it does not use getViaTmp, so verification doesn't happen twice when using fsck --from.	2015-10-01 15:56:39 -04:00
Joey Hess	b72d3fbeba	rename function	2015-10-01 14:18:57 -04:00
Joey Hess	cad3349001	rename fsckKey to verifyKeyContent No behavior changes.	2015-10-01 13:29:17 -04:00
Joey Hess	f2b6ebd502	status: Show added but not yet committed files. Seems easy, but git ls-files can't list the right subset of files. So, I wrote a whole new parser for git status output, and converted the status command to use that. There are a few other small behavior changes. The order changed. Unlocked files show as T. In indirect mode, deleted files were not shown before, and that's fixed. Regular files checked directly into git and modified were not shown before, and are now.	2015-09-22 17:32:28 -04:00
Joey Hess	178826c4cb	cleanup	2015-09-22 15:55:31 -04:00
Joey Hess	9e48c04d15	info: Don't allow use in a non-git-annex repository, since it uses the git-annex branch and would create it if it were missing. I made the change to allow in 2014 without any rationalle or associated request that I can find.	2015-09-16 12:25:43 -04:00
Joey Hess	9cfb96c53d	Special remotes configured with autoenable=true will be automatically enabled when git-annex init is run.	2015-09-14 14:49:48 -04:00
Joey Hess	3f47d1b351	Improve bash completion, so it completes names of remotes and backends in appropriate places. Not necessarily everywhere, but a lot of the most often used places. Re the use of .Internal, see https://github.com/pcapriotti/optparse-applicative/issues/155	2015-09-14 13:19:04 -04:00
Joey Hess	1cc1f9f4e5	sync: Add --no-commit, --no-pull, --no-push options to turn off parts of the sync process, as well as supporting --commit, --pull, --push, and --no-content options to specify the (current) default behavior.	2015-09-13 13:15:35 -04:00
Øyvind A. Holm	188f1b3dc6	Command/Unused.hs: Change --unused-refspec back to --used-refspec Fix typo in commit `160d4b9` ("convert Unused, and remove some dead code for old style option parsing", 2015-07-10), the "git-annex unused --used-refspec" option was incorrectly changed to --unused-refspec.	2015-09-09 14:31:21 -04:00
Joey Hess	2b326594a9	info: Support querying info of individual files in direct mode.	2015-09-09 14:15:56 -04:00
Joey Hess	0b7a8b72bb	Fix building without database. Ben Boeckel had a patch, but.. Actually, that was not the only place that used ScheduleIncremental when built w/o database. Since the data type doesn't need database stuff, I've instead fixed this build problem by exposing the ScheduleIncremental constructor to database-less builds.	2015-08-23 15:39:29 -07:00
Ben Boeckel	a4edec9b40	ImportFeed: fix signature of mkquvi without libquvi	2015-08-23 15:32:10 -07:00
Joey Hess	7be58b5e11	make sync --no-content be accepted It's the default, but this is a step toward changing that default later..	2015-08-20 17:21:14 -04:00
Joey Hess	0f5d6c09ac	importfeed --relaxed: Avoid hitting the urls of items in the feed.	2015-08-19 12:24:55 -04:00
Joey Hess	858104078a	make whereis show urls when web remote does not have content This is needed when external special remotes register an url for a key.	2015-08-17 11:35:34 -04:00
Joey Hess	87b4229b23	sync: Support --jobs * sync: Support --jobs * sync --content: Avoid unnecessary second pull from remotes when no file transfers are made.	2015-08-14 13:49:55 -04:00
Joey Hess	127c3db162	add some debugs to get timings Note that I had one in Annex.Action.startup too, but it resulted in a weird message printed by ssh, "channel 2: bad ext data". I don't know why, but it only happened when transferinfo was run, so I wonder if `983a95f021` introduced a fragility somehow.	2015-08-13 16:13:16 -04:00
Joey Hess	0f66f766b0	metadata: Fix reversion introduced in 5.20150727 that caused display of metadata to not work.	2015-08-11 13:19:01 -04:00
Joey Hess	367d1352da	git-annex-shell: Don't let configlist auto-init repository when in readonly mode. This was potentially a hole in the readonly mode armor even before my last commit. If the user could push a git-annex branch to a repo, they could get git-annex-shell to initialize the repo. After my last commit, the user didn't even need to be allowed to push a branch to init the repo, so this hole certianly needs to be closed now.	2015-08-05 14:09:25 -04:00
Joey Hess	c5b8484c2e	Simplify setup process for a ssh remote. Now it suffices to run git remote add, followed by git-annex sync. Now the remote is automatically initialized for use by git-annex, where before the git-annex branch had to manually be pushed before using git-annex sync. Note that this involved changes to git-annex-shell, so if the remote is using an old version, the manual push is still needed. Implementation required git-annex-shell be changed, so configlist can autoinit a repository even when no git-annex branch has been pushed yet. Unfortunate because we'll have to wait for it to get deployed to servers before being able to rely on this change in the documentation. Did consider making git-annex sync push the git-annex branch to repos that didn't have a uuid, but this seemed difficult to do without complicating it in messy ways. It would be cleaner to split a command out from configlist to handle the initialization. But this is difficult without sacrificing backwards compatability, for users of old git-annex versions which would not use the new command.	2015-08-05 13:49:58 -04:00
Joey Hess	b1b757bffa	unused import	2015-08-05 13:25:03 -04:00
Joey Hess	6e82584305	reorg	2015-08-04 14:45:16 -04:00
Joey Hess	c812018cc5	proxy: Fix removal of files deleted by the proxied command. Git.Ref.headSha doesn't really work in direct mode as there's not a head, so it was actually diffing against the empty tree and so not removing any deleted files. Get the sha of the current branch instead, which is the same thing Command.Sync does.	2015-08-04 14:43:42 -04:00
Joey Hess	6c15cdfcb8	proxy: Fix proxy git commit of non-annexed files in direct mode. * proxy: Fix proxy git commit of non-annexed files in direct mode. * proxy: If a non-proxied git command, such as git revert would normally fail because of unstaged files in the work tree, make the proxied command fail the same way.	2015-08-04 14:01:59 -04:00
Joey Hess	9dfe03dbcd	Improve shutdown due to --time-limit, especially for fsck * Perform a clean shutdown when --time-limit is reached. This includes running queued git commands, and cleanup actions normally run when a command is finished. * fsck: Commit incremental fsck database when --time-limit is reached. Previously, some of the last files fscked did not make it into the database when using --time-limit. Note that this changes Annex.addCleanup hooks, to run after --time-limit expires. Fsck was using such a hook to clean up after a --incremental-schedule, and that shouldn't run when --time-limit exipires it. So, instead, moved that cleanup code to be run by cleanupIncremental. Resulted in some data type juggling.	2015-07-31 16:01:54 -04:00
Joey Hess	b7a5d9c3e1	The last release accidentially removed a number of options from the copy command. (-J, file matching options, etc). These have been added back.	2015-07-30 13:33:35 -04:00
Joey Hess	1fb9ab342b	Support building without persistent database on for systems that lack TH. This removes support for incremental fsck.	2015-07-25 17:37:09 -04:00
Joey Hess	49d102f98b	addurl now accepts --prefix and --suffix options to adjust the filenames used	2015-07-21 12:50:05 -04:00
Joey Hess	f95a8c8672	importfeed: Look at not only permalinks, but now also guids to identify previously downloaded files. I've seen rss feeds that have no permalinks, only guids (which are sometimes in the form of permalinks, argh/sigh). I had previously avoided trusting guids to be globally unique, because my survey of rss feeds that I subscribe to shows a lot of pretty bad "guids" like "2 at http://serialpodcast.org" or even worse "oth20150401-hq". Worry was that two podcasts that are generating guids so badly, that there's no guarantee they're actually globally unique. But, I'm seeing too many url changes that result in redundant files, so let's try this. If feeds are so broken that guids overlap, they could just as well incorrectly call them permalinks too.	2015-07-20 14:56:57 -04:00
Joey Hess	3c134ee21a	sync --content: Fix bug that caused files to be uploaded to eg, more archive remotes than wanted copies, only to later be dropped to satisfy the preferred content settings.	2015-07-20 14:24:13 -04:00
Joey Hess	730cc3feb5	wire tasty's option parser into the main program option parser This makes bash completion work for git-annex test, and is generally cleaner.	2015-07-13 13:20:10 -04:00
Joey Hess	fd086c5752	convert WebApp; avoid duplicate arg parsing for no repo mode	2015-07-13 11:53:42 -04:00
Joey Hess	b95a48fe45	converted Assistant and Watch; avoid duplicate arg parsing for no repo mode	2015-07-13 11:42:42 -04:00
Joey Hess	084f8d9ac7	convert Import	2015-07-13 11:15:21 -04:00
Joey Hess	9c1a42873c	converted ImportFeed	2015-07-13 11:06:41 -04:00
Joey Hess	aff4b9a3dc	converted addurl	2015-07-13 10:57:49 -04:00
Joey Hess	3639b4f8ea	converted Log	2015-07-13 10:44:51 -04:00
Joey Hess	3d8a8e9792	convert Dead, and allow multiple --key options	2015-07-13 10:26:54 -04:00
Joey Hess	adec382bc2	converted MetaData, eliminating a global value from Annex state .. beautiful	2015-07-12 09:25:43 -04:00
Joey Hess	fdcb54d4f2	converted ContentLocation, ExampleKey, LookupKey	2015-07-11 20:43:45 -04:00
Joey Hess	c6375a9158	converted Status	2015-07-11 10:42:46 -04:00
Joey Hess	215f636bb3	converted Info	2015-07-11 10:41:52 -04:00
Joey Hess	9ad20c2869	converted Forget and TestRemote	2015-07-11 00:42:32 -04:00
Joey Hess	c70c841d30	converted Mirror	2015-07-10 21:05:34 -04:00
Joey Hess	e4b3701dfe	convert List	2015-07-10 20:46:48 -04:00
Joey Hess	a0b2fcc663	converted ViCfg (trivially)	2015-07-10 20:40:30 -04:00
Joey Hess	02f6d9b33a	convert FindRef	2015-07-10 20:38:11 -04:00
Joey Hess	c94e219fea	convert Whereis	2015-07-10 16:32:33 -04:00
Joey Hess	8eb0a440b9	convert Expire	2015-07-10 16:26:23 -04:00
Joey Hess	decfababe9	convert DropUnused	2015-07-10 16:15:31 -04:00
Joey Hess	160d4b9fe0	convert Unused, and remove some dead code for old style option parsing	2015-07-10 16:05:56 -04:00
Joey Hess	4064dd4c82	convert version command, and make --raw work when not in a git repo	2015-07-10 13:49:46 -04:00
Joey Hess	6a4f2087be	finished converting all the main options	2015-07-10 13:23:06 -04:00
Joey Hess	5cc882a35e	implement withGlobalOptions, and convert Find	2015-07-10 12:47:35 -04:00
Joey Hess	820b92abab	wip Current status: * building again, but several commands are commented out * still need to implement global options, file matching options, etc	2015-07-09 19:03:24 -04:00
Joey Hess	a7f58634b8	wip	2015-07-09 16:05:45 -04:00
Joey Hess	8ad927dbc6	converted copy and move Got a little tricky..	2015-07-09 15:23:14 -04:00
Joey Hess	032e6485fa	use Alternative for parsing KeyOptions	2015-07-09 12:44:03 -04:00
Joey Hess	94e703e8b8	use Alternative when parsing mutually exclusive fsck options	2015-07-09 12:26:25 -04:00
Joey Hess	8a9d2a6e9d	let optparse-applicative handle the usage display when run w/o command or bad command Still generating the list of commands myself, to get it sorted into sections and with short synopses.	2015-07-09 11:52:47 -04:00
Joey Hess	c1c64ec76c	formatting	2015-07-09 10:42:28 -04:00
Joey Hess	d8d1499229	finalOpt is the same as optional	2015-07-09 01:02:27 -04:00
Joey Hess	60806dd191	wip	2015-07-08 17:59:06 -04:00
Joey Hess	6a88c7c101	converted fsck's options to optparse-applicative Global options and seeking and key options are still to be done.	2015-07-08 16:58:54 -04:00
Joey Hess	7d9072d1db	remove git-annex help options display Common options will be displayed in the --help for a command by optparse-applicative.	2015-07-08 15:41:54 -04:00
Joey Hess	92d8f80bff	support cmdnorepo actions, also using getopt-applicative there	2015-07-08 15:39:05 -04:00
Joey Hess	6e5c1f8db3	convert all commands to work with optparse-applicative Still no options though.	2015-07-08 15:08:02 -04:00
Joey Hess	3125da54f6	display cmdparamdesc in optparse-applicative usage messages Since optparse-applicative display "FOO" as "[FOO]", the paramOptional modifier which wrapped it in square brackets was removed from most places.	2015-07-08 13:39:11 -04:00
Joey Hess	a2ba701056	started converting to use optparse-applicative This is a work in progress. It compiles and is able to do basic command dispatch, including git autocorrection, while using optparse-applicative for the core commandline parsing. * Many commands are temporarily disabled before conversion. * Options are not wired in yet. * cmdnorepo actions don't work yet. Also, removed the [Command] list, which was only used in one place.	2015-07-08 13:36:25 -04:00
Joey Hess	24800b1bf1	Only look at reflogs for relevant branches, not for git-annex branches This speeds it up quite a bit.. May still be too slow in large repos.	2015-07-07 17:36:30 -04:00
Joey Hess	600717417f	update comment	2015-07-07 17:23:02 -04:00
Joey Hess	b11d2f5a8a	unused: --used-refspec can now be configured to look at refs in the reflog. This provides a way to not consider old versions of files to be unused after they have reached a specified age, when the old refs in the reflog expire. May be slow.	2015-07-07 17:13:50 -04:00
Joey Hess	a51b98cdd5	sync: When annex.autocommit=false, avoid making any commit of local changes, while still merging with remote to the extent possible.	2015-07-07 16:36:11 -04:00
Joey Hess	81ad277a85	add: Stage symlinks the same as git add would, even if they are not a link to annexed content.	2015-07-07 16:15:30 -04:00
Joey Hess	60fc25e9d3	merge: Avoid creating the synced/master branch. The branch needs to be created when merging from the remote in sync, since we diff between it and the remote's sync branch. But git annex merge should not be creating sync branches.	2015-07-07 15:53:27 -04:00
Joey Hess	625303226d	import: Fix failure of cross-device import on Windows. As well as import, 2 other places ran "mv" manually, so changed them to use moveFile as well.	2015-07-07 14:48:23 -04:00
Joey Hess	1529add61a	Brought back the setkey plumbing command that was removed in 2011, since we found a use case for it. Note that the command's syntax was changed for consistency.	2015-07-02 17:44:25 -04:00
Joey Hess	1f35141aff	assistant --autostart: First any daemons that are already running, which might be left over from a previous login session and so unable to use the ssh agent of a new login session.	2015-07-02 14:16:53 -04:00
Joey Hess	833cf5fff9	log: Fix reversion introduced in version 5.20150528 that broke this command.	2015-07-02 12:53:38 -04:00
Joey Hess	f77d485915	move comment	2015-06-16 19:07:14 -04:00
Joey Hess	be9d9cb5ad	avoid building unused bloomfilter when run without --all	2015-06-16 19:04:20 -04:00
Joey Hess	5b801fcad9	on second thought, sync --content --unused is probably not useful, remove	2015-06-16 19:01:06 -04:00
Joey Hess	adba0595bd	use bloom filter in second pass of sync --all --content This is needed because when preferred content matches on files, the second pass would otherwise want to drop all keys. Using a bloom filter avoids this, and in the case of a false positive, a key will be left undropped that preferred content would allow dropping. Chances of that happening are a mere 1 in 1 million.	2015-06-16 18:50:13 -04:00
Joey Hess	a0a8127956	instance Hashable Key for bloomfilter	2015-06-16 18:37:41 -04:00
Joey Hess	8b74aec3ea	Increased the default annex.bloomaccuracy from 1000 to 10000000 This makes git annex unused use around 48 mb more memory than it did before, but the massive increase in accuracy makes this worthwhile for all but the smallest systems. Also, I want to use the bloom filter for sync --all --content, to avoid dropping files that the preferred content doesn't want, and 1/1000 false positives would be far too many in that use case, even if it were acceptable for unused. Actual memory use numbers: 1000: 21.06user 3.42system 0:26.40elapsed 92%CPU (0avgtext+0avgdata 501552maxresident)k 1000000: 21.41user 3.55system 0:26.84elapsed 93%CPU (0avgtext+0avgdata 549496maxresident)k 10000000: 21.84user 3.52system 0:27.89elapsed 90%CPU (0avgtext+0avgdata 549920maxresident)k Based on these numbers, 10 million seemed a better pick than 1 million.	2015-06-16 18:12:00 -04:00
Joey Hess	29c03145e6	sync: Add support for --all and --unused.	2015-06-16 16:50:03 -04:00
Joey Hess	99a1113461	switch code to using associated files	2015-06-16 15:07:03 -04:00
Joey Hess	67f7f1b1cb	info: Added json output for "backend usage", "numcopies stats", "repositories containing these files", and "transfers in progress".	2015-06-16 13:50:28 -04:00
Joey Hess	829007d629	Improve url parsing to handle some urls containing illegal [] characters in their paths. Ie, "https://archive.org/download/zoom-2/Zoom - Release 2 (1996)(Active Software)[!].iso"	2015-06-14 13:39:44 -04:00
Joey Hess	5123a512d6	add a hint about marking a key dead	2015-06-09 15:12:40 -04:00
Joey Hess	f8ab3bc449	dead --key: Can be used to mark a key as dead.	2015-06-09 14:52:05 -04:00
Joey Hess	6eefc5db65	fsck: Ignore keys that are known to be dead when running in --all mode or a in a bare repo. Otherwise, still reports files with lost contents, even if the content is dead.	2015-06-09 14:08:57 -04:00
Joey Hess	de3bd11a2c	import --clean-duplicates: Fix bug that didn't count local or trusted repo's copy of a file as one of the necessary copies to allow removing it from the import location.	2015-06-03 13:15:38 -04:00
Joey Hess	d28e8fbfd5	get --incomplete: New option to resume any interrupted downloads.	2015-06-02 14:20:38 -04:00
Joey Hess	eb33569f9d	remove Params constructor from Utility.SafeCommand This removes a bit of complexity, and should make things faster (avoids tokenizing Params string), and probably involve less garbage collection. In a few places, it was useful to use Params to avoid needing a list, but that is easily avoided. Problems noticed while doing this conversion: * Some uses of Params "oneword" which was entirely unnecessary overhead. * A few places that built up a list of parameters with ++ and then used Params to split it! Test suite passes.	2015-06-01 13:52:23 -04:00
Joey Hess	36b9c9ca5f	fromkey, registerurl: Improve handling of urls that happen to also be parsable as strange keys.	2015-05-30 02:08:49 -04:00
Joey Hess	77c43a388e	fromkey, registerurl: Allow urls to be specified instead of keys, and generate URL keys. This is especially useful because the caller doesn't need to generate valid url keys, which involves some escaping of characters, and may involve taking a md5sum of the url if it's too long.	2015-05-22 22:41:36 -04:00
Joey Hess	823bb8031b	add annex.used-refspec	2015-05-14 15:44:08 -04:00
Joey Hess	86699ff861	unused: Add --used option, which can specify a set of refs to consider used, rather than the default of considering all refs used.	2015-05-14 15:31:38 -04:00
Joey Hess	61ccf95004	Avoid accumulating transfer failure log files unless the assistant is being used. Only the assistant uses these, and only the assistant cleans them up, so make only git annex transferkeys write them, There is one behavior change from this. If glacier is being used, and a manual git annex get --from glacier fails because the file isn't available yet, the assistant will no longer later see that failed transfer file and retry the get. Hope no-one depended on that old behavior.	2015-05-12 15:53:38 -04:00
Joey Hess	a812d598ef	Take space that will be used by running downloads into account when checking annex.diskreserve.	2015-05-12 15:20:22 -04:00
Joey Hess	e27b97d364	Merge branch 'master' into concurrentprogress Conflicts: Command/Fsck.hs Messages.hs Remote/Directory.hs Remote/Git.hs Remote/Helper/Special.hs Types/Remote.hs debian/changelog git-annex.cabal	2015-05-12 13:23:22 -04:00
Joey Hess	4b2e3210ea	drop: Now supports --all, --unused, and --key.	2015-05-12 13:00:06 -04:00
Joey Hess	db5d831d07	import: Refuse to import files that are within the work tree, as that does not make sense and could cause data loss.	2015-05-11 12:57:47 -04:00
Joey Hess	26ac0753c1	more FlexibleContexts	2015-05-10 15:54:58 -04:00
Joey Hess	6c2d5b5e41	more time-1.5 fixes	2015-05-10 15:36:58 -04:00
Joey Hess	6cf62a9bde	support time-1.5.0 This no longer uses old-locale's defaultTimeLocale, but provides one of its own. Factored out a Logs.TimeStamp.	2015-05-10 15:21:35 -04:00
Joey Hess	f395ca7a41	Improve quvi 0.4 output parsing to handle cases wher there is no known filename extension. This is currently the case when using quvi with youtube. In this case, the extension ".m" will be used.	2015-05-08 13:39:00 -04:00
Joey Hess	469242ac4d	fsck: Ignore error recording the fsck in the activity log, which can happen when running fsck in a read-only repository. Closes: #698559 (fsck can still need to write to the repository if it find problems, but a successful fsck can be done read-only)	2015-05-06 14:45:20 -04:00
Joey Hess	eeb0359a2e	contentlocation, examinekey, lookupkey: Added --batch mode option.	2015-05-06 13:44:53 -04:00
Joey Hess	437d9db41d	assistant: Added --autostop to complement --autostart.	2015-05-01 13:53:45 -04:00
Joey Hess	efb37e7c78	Improve behavior when a git-annex command is told to operate on a file that doesn't exist. It will now continue to other files specified after that on the command line, and only error out at the end.	2015-04-30 15:28:17 -04:00
Joey Hess	9f3e51dd51	move nubbing into function whose algo needs a nubbed list	2015-04-30 14:11:59 -04:00
Joey Hess	607eed0de2	improve messages	2015-04-30 14:10:28 -04:00
Joey Hess	ac6b492711	import: Before removing a duplicate file in --deduplicate or --clean-duplicates mode, verify that enough copies of its content still exist.	2015-04-30 14:04:36 -04:00
Joey Hess	38c458b407	refactor	2015-04-30 14:02:56 -04:00
Joey Hess	9e10b5ca9c	nub list after adding ocal uuid to it This potentially fixes a numcopies counting bug when dropping --from a remote, and the local repository is trusted. The local repo would end up in the list twice, so it would verify one less copy than it was supposed to.	2015-04-30 13:13:28 -04:00
Joey Hess	d8ad1d5503	import: Don't stop entire import when one file fails due to being gitignored or conflicting with something in the work tree.	2015-04-29 13:56:41 -04:00
Joey Hess	2e54251c18	import: Check for gitignored files before moving them into the tree. (Needs git 1.8.4 or newer.)	2015-04-29 13:46:12 -04:00
Joey Hess	cfbeb1e7b7	Fix bogus failure of fsck --fast.	2015-04-27 17:40:21 -04:00
Joey Hess	ca2cf63e34	clean up imports	2015-04-19 10:47:07 -04:00
Joey Hess	addc82dab7	removed all uses of undefined from code base It's a code smell, can lead to hard to diagnose error messages.	2015-04-19 00:38:29 -04:00
Joey Hess	978906c616	forgot to add these	2015-04-18 16:09:09 -04:00
Joey Hess	ef2202fd94	required: New command, like wanted, but for required content. Also refactored some code to reduce duplication.	2015-04-18 16:04:35 -04:00
Joey Hess	8d685768d3	fsck --from remote: Avoid downloading a key if it would go over the annex.diskreserve limit.	2015-04-18 14:23:42 -04:00
Joey Hess	8489057e8d	fsck --from remote: When bad content is found in the remote, and the local repo does not have a copy of the content, preserve the bad content in .git/annex/bad/ to avoid further data loss.	2015-04-18 14:13:07 -04:00
Joey Hess	a2902cdaaf	add filename to progress bar, and display ok/failed at end This needed plumbing an AssociatedFile through retrieveKeyFileCheap.	2015-04-14 16:35:10 -04:00
Joey Hess	86a2f9dc4d	Merge branch 'master' into concurrentprogress Conflicts: debian/changelog	2015-04-14 15:35:15 -04:00
Joey Hess	2be4834822	info: Added --bytes option.	2015-04-12 14:08:40 -04:00
Joey Hess	b14ec45aa8	info dir: Added information about repositories that contain files in the specified directory. This is a nearly free feature; it piggybacks on the location log lookups done for the numcopies stats. So, the only extra overhead is updating the map of repository sizes. However, I had to switch to Data.Map.Strict, which needs containers 0.5. If backporting to wheezy, will probably need to revert this commit.	2015-04-12 12:54:41 -04:00
Joey Hess	8077ccbd54	get, move, copy, mirror: Concurrent downloads and uploads are now supported! This works, and seems fairly robust. Clean get of 20 files at -J3. At -J10, there are some messages about ssh multiplexing, probably due to a race spinning up the ssh connection cacher. But, it manages to get all the files ok regardless. The progress bars are a scrambled mess though, due to bugs in ascii-progress, which I've already filed. Particularly this one: https://github.com/yamadapc/haskell-ascii-progress/issues/8	2015-04-10 17:08:07 -04:00
Joey Hess	40aae2dd11	unused import	2015-04-10 13:16:04 -04:00

... 3 4 5 6 7 ...

1789 commits