git-annex

Author	SHA1	Message	Date
Joey Hess	d3ba9fe5c8	matchexpression: New plumbing command to check if a preferred content expression matches some data.	2016-01-25 16:16:18 -04:00
Joey Hess	f9c5aa84e0	add database benchmark The benchmark shows that the database access is quite fast indeed! And, it scales linearly to the number of keys, with one exception, getAssociatedKey. Based on this benchmark, I don't think I need worry about optimising for cases where all files are locked and the database is mostly empty. In those cases, database access will be misses, and according to this benchmark, should add only 50 milliseconds to runtime. (NB: There may be some overhead to getting the database opened and locking the handle that this benchmark doesn't see.) joey@darkstar:~/src/git-annex>./git-annex benchmark setting up database with 1000 setting up database with 10000 benchmarking keys database/getAssociatedFiles from 1000 (hit) time 62.77 μs (62.70 μs .. 62.85 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 62.81 μs (62.76 μs .. 62.88 μs) std dev 201.6 ns (157.5 ns .. 259.5 ns) benchmarking keys database/getAssociatedFiles from 1000 (miss) time 50.02 μs (49.97 μs .. 50.07 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.09 μs (50.04 μs .. 50.17 μs) std dev 206.7 ns (133.8 ns .. 295.3 ns) benchmarking keys database/getAssociatedKey from 1000 (hit) time 211.2 μs (210.5 μs .. 212.3 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 211.0 μs (210.7 μs .. 212.0 μs) std dev 1.685 μs (334.4 ns .. 3.517 μs) benchmarking keys database/getAssociatedKey from 1000 (miss) time 173.5 μs (172.7 μs .. 174.2 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 173.7 μs (173.0 μs .. 175.5 μs) std dev 3.833 μs (1.858 μs .. 6.617 μs) variance introduced by outliers: 16% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (hit) time 64.01 μs (63.84 μs .. 64.18 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.85 μs (64.34 μs .. 66.02 μs) std dev 2.433 μs (547.6 ns .. 4.652 μs) variance introduced by outliers: 40% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (miss) time 50.33 μs (50.28 μs .. 50.39 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.32 μs (50.26 μs .. 50.38 μs) std dev 202.7 ns (167.6 ns .. 252.0 ns) benchmarking keys database/getAssociatedKey from 10000 (hit) time 1.142 ms (1.139 ms .. 1.146 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.142 ms (1.140 ms .. 1.144 ms) std dev 7.142 μs (4.994 μs .. 10.98 μs) benchmarking keys database/getAssociatedKey from 10000 (miss) time 1.094 ms (1.092 ms .. 1.096 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.095 ms (1.095 ms .. 1.097 ms) std dev 4.277 μs (2.591 μs .. 7.228 μs)	2016-01-12 13:07:03 -04:00
Joey Hess	121f5d5b0c	annex.thin Decided it's too scary to make v6 unlocked files have 1 copy by default, but that should be available to those who need it. This is consistent with git-annex not dropping unused content without --force, etc. * Added annex.thin setting, which makes unlocked files in v6 repositories be hard linked to their content, instead of a copy. This saves disk space but means any modification of an unlocked file will lose the local (and possibly only) copy of the old version. * Enable annex.thin by default on upgrade from direct mode to v6, since direct mode made the same tradeoff. * fix: Adjusts unlocked files as configured by annex.thin.	2015-12-27 15:59:59 -04:00
Joey Hess	723e4e31a1	merge clean into smudge command The git filter config can be used to map the single git-annex command to the 2 actions, and this avoids "git annex clean" being used for this thing, it might have a better use for that name later.	2015-12-04 15:32:47 -04:00
Joey Hess	20ca89dfa3	skeleton smudge/clean filters	2015-12-04 13:03:39 -04:00
Joey Hess	f16e235983	addurl, importfeed: Changed to honor annex.largefiles settings, when the content of the url is downloaded. (Not when using --fast or --relaxed.) importfeed just calls addurl functions, so inherits this from it. Note that addurl still generates a temp file, and uses that key to download the file. It just adds it to the work tree at the end when the file is small.	2015-12-02 15:12:33 -04:00
Joey Hess	dc8099872a	import: Changed to honor annex.largefiles settings.	2015-12-02 14:49:03 -04:00
Joey Hess	5ec67335f4	improve annex.largefiles documentation	2015-12-02 14:26:49 -04:00
Joey Hess	7fce3a0f81	more warnings about networked filesystems	2015-11-13 15:55:16 -04:00
Joey Hess	aa4192aea6	pid locking configuration and abstraction layer for git-annex (not actually used anywhere yet)	2015-11-12 17:50:34 -04:00
Joey Hess	2fb3722ce9	Do verification of checksums of annex objects downloaded from remotes. * When annex objects are received into git repositories, their checksums are verified then too. * To get the old, faster, behavior of not verifying checksums, set annex.verify=false, or remote.<name>.annex-verify=false. * setkey, rekey: These commands also now verify that the provided file matches the key, unless annex.verify=false. * reinject: Already verified content; this can now be disabled by setting annex.verify=false. recvkey and reinject already did verification, so removed now duplicate code from them. fsck still does its own verification, which is ok since it does not use getViaTmp, so verification doesn't happen twice when using fsck --from.	2015-10-01 15:56:39 -04:00
Joey Hess	ffa8221517	annex.hardlink extended to also try to use hard links when copying from the repository to a remote. Also, it used to only check that one of the repos was not in direct mode; now when either repo is direct mode, annex.hardlink won't have an effect.	2015-09-14 12:13:38 -04:00
Yaroslav Halchenko	72129503a9	DOC: refer to corresponding manpage not to non-existing PREFERRED CONTENT section	2015-09-02 12:05:08 -07:00
Øyvind A. Holm	67f7de5986	doc/*.mdwn: Minor fixes (typos, letter case)	2015-07-26 04:21:06 +02:00
Joey Hess	386b8c394e	got bash completion working for "git annex" not just "git-annex" This needs a patch to git to cause the git-annex completion to be auto-loaded when completing "git annex <tab>". Otherwise, it will only load when "git-annex" is tab completed. Once loaded, it works for both uses. I've submitted the git patch to the git mailing list.	2015-07-16 13:32:23 -04:00
Joey Hess	42948e960f	typo	2015-07-13 13:25:49 -04:00
Joey Hess	b4d22e6d49	doc updates	2015-07-10 13:49:37 -04:00
Joey Hess	a51b98cdd5	sync: When annex.autocommit=false, avoid making any commit of local changes, while still merging with remote to the extent possible.	2015-07-07 16:36:11 -04:00
Joey Hess	1529add61a	Brought back the setkey plumbing command that was removed in 2011, since we found a use case for it. Note that the command's syntax was changed for consistency.	2015-07-02 17:44:25 -04:00
Joey Hess	a099dc3f6a	comment and warning	2015-07-02 15:21:25 -04:00
anarcat	0d2151beb7	explicitly describe exit status in the standard section	2015-06-23 16:56:03 +00:00
Joey Hess	8b74aec3ea	Increased the default annex.bloomaccuracy from 1000 to 10000000 This makes git annex unused use around 48 mb more memory than it did before, but the massive increase in accuracy makes this worthwhile for all but the smallest systems. Also, I want to use the bloom filter for sync --all --content, to avoid dropping files that the preferred content doesn't want, and 1/1000 false positives would be far too many in that use case, even if it were acceptable for unused. Actual memory use numbers: 1000: 21.06user 3.42system 0:26.40elapsed 92%CPU (0avgtext+0avgdata 501552maxresident)k 1000000: 21.41user 3.55system 0:26.84elapsed 93%CPU (0avgtext+0avgdata 549496maxresident)k 10000000: 21.84user 3.52system 0:27.89elapsed 90%CPU (0avgtext+0avgdata 549920maxresident)k Based on these numbers, 10 million seemed a better pick than 1 million.	2015-06-16 18:12:00 -04:00
Joey Hess	f8ab3bc449	dead --key: Can be used to mark a key as dead.	2015-06-09 14:52:05 -04:00
Antoine Beaupré	1393797373	add and fix refs in man mainpage	2015-05-29 12:12:11 -04:00
Joey Hess	823bb8031b	add annex.used-refspec	2015-05-14 15:44:08 -04:00
Joey Hess	ef2202fd94	required: New command, like wanted, but for required content. Also refactored some code to reduce duplication.	2015-04-18 16:04:35 -04:00
Joey Hess	ce0a82f493	contentlocation: New plumbing command.	2015-04-09 15:34:47 -04:00
Joey Hess	9445556c97	rethought distributed fsck; instead add activity.log and expire command This is much more space efficient!	2015-04-05 12:50:02 -04:00
Joey Hess	20fb91a7ad	WIP on making --quiet silence progress, and infra for concurrent progress bars	2015-04-03 16:48:30 -04:00
Øyvind A. Holm	490e97ec10	Various typo fixes in doc/*.mdwn	2015-04-02 01:50:17 +02:00
Joey Hess	9e25cbde20	importfeed: Avoid downloading a redundant item from a feed whose guid has been downloaded before, even when the url has changed. To support this, always store itemid in metadata; before this was only done when annex.genmetadata was set.	2015-03-31 13:30:13 -04:00
Joey Hess	cd6b62f35e	--auto is no longer a global option; only get, drop, and copy accept it. Not a behavior change unless you were passing it to a command that ignored it.	2015-03-25 17:06:14 -04:00
Joey Hess	0b029570a7	finished splitting out man pages for all commands	2015-03-25 12:09:49 -04:00
Joey Hess	0850e8eaf9	separated man pages for all the maintenance commands	2015-03-24 15:23:59 -04:00
Joey Hess	f10282807e	separated man pages for all the setup commands while at the gate in ATL	2015-03-23 18:20:42 -04:00
Joey Hess	3cc7c03721	Man pages for individual commands now available, and can be opened using "git annex help <command>"	2015-03-23 17:50:03 -04:00
Joey Hess	daec4b007a	splitting up the man page Common command man pages all split out and often expanded. A few sections split out into their own pages. Still need to do all the other commands..	2015-03-23 15:36:10 -04:00
Joey Hess	c233f98564	migrate: --force will force migration of keys already using the destination backend. Useful in rare cases.	2015-03-23 12:11:16 -04:00
Joey Hess	798da6cf2e	Added a post-update-annex hook, which is run after the git-annex branch is updated. Needed for git update-server-info. See https://github.com/datalad/datalad/issues/1#issuecomment-84094406	2015-03-20 14:52:58 -04:00
Joey Hess	e6158130c6	checkpresentkey: New plumbing command to check if a key can be verified to be present on a remote.	2015-03-20 11:44:46 -04:00
Joey Hess	50ef4105e3	readpresentkey: New plumbing command for checking location log.	2015-03-20 11:22:27 -04:00
Joey Hess	abfe3c09b2	registerurl: New plumbing command for mass-adding urls to keys.	2015-03-15 14:37:33 -04:00
Joey Hess	b24bb6b435	fromkey: Add stdin mode.	2015-03-15 14:07:43 -04:00
Joey Hess	fa180c1ba1	fromkey --force: Skip test that the key has its content in the annex.	2015-03-15 13:51:58 -04:00
Joey Hess	504dda82a4	addurl: Added --raw option, which bypasses special handling of quvi, bittorrent etc urls.	2015-03-05 14:46:08 -04:00
Joey Hess	022461d773	add a link	2015-02-25 15:49:18 -04:00
Joey Hess	68725d27e5	wording	2015-02-25 14:31:17 -04:00
Joey Hess	8066a1c3cc	The file matching options are now only accepted by commands that can actually use them.	2015-02-06 17:16:41 -04:00
Joey Hess	dfab5e6ff4	import: Support file matching options such as --exclude, --include, --smallerthan, --largerthan	2015-02-06 15:58:06 -04:00
Joey Hess	febb1c2082	groupwanted: New command to set the groupwanted preferred content expression.	2015-02-06 15:12:42 -04:00

484 commits