git-annex

Author	SHA1	Message	Date
Joey Hess	7c1df36d63	annex.addsmallfiles: New option controlling what is done when adding files not matching annex.largefiles.	2016-01-28 14:04:32 -04:00
Joey Hess	d3ba9fe5c8	matchexpression: New plumbing command to check if a preferred content expression matches some data.	2016-01-25 16:16:18 -04:00
Joey Hess	f9c5aa84e0	add database benchmark The benchmark shows that the database access is quite fast indeed! And, it scales linearly to the number of keys, with one exception, getAssociatedKey. Based on this benchmark, I don't think I need worry about optimising for cases where all files are locked and the database is mostly empty. In those cases, database access will be misses, and according to this benchmark, should add only 50 milliseconds to runtime. (NB: There may be some overhead to getting the database opened and locking the handle that this benchmark doesn't see.) joey@darkstar:~/src/git-annex>./git-annex benchmark setting up database with 1000 setting up database with 10000 benchmarking keys database/getAssociatedFiles from 1000 (hit) time 62.77 μs (62.70 μs .. 62.85 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 62.81 μs (62.76 μs .. 62.88 μs) std dev 201.6 ns (157.5 ns .. 259.5 ns) benchmarking keys database/getAssociatedFiles from 1000 (miss) time 50.02 μs (49.97 μs .. 50.07 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.09 μs (50.04 μs .. 50.17 μs) std dev 206.7 ns (133.8 ns .. 295.3 ns) benchmarking keys database/getAssociatedKey from 1000 (hit) time 211.2 μs (210.5 μs .. 212.3 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 211.0 μs (210.7 μs .. 212.0 μs) std dev 1.685 μs (334.4 ns .. 3.517 μs) benchmarking keys database/getAssociatedKey from 1000 (miss) time 173.5 μs (172.7 μs .. 174.2 μs) 1.000 R² (0.999 R² .. 1.000 R²) mean 173.7 μs (173.0 μs .. 175.5 μs) std dev 3.833 μs (1.858 μs .. 6.617 μs) variance introduced by outliers: 16% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (hit) time 64.01 μs (63.84 μs .. 64.18 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 64.85 μs (64.34 μs .. 66.02 μs) std dev 2.433 μs (547.6 ns .. 4.652 μs) variance introduced by outliers: 40% (moderately inflated) benchmarking keys database/getAssociatedFiles from 10000 (miss) time 50.33 μs (50.28 μs .. 50.39 μs) 1.000 R² (1.000 R² .. 1.000 R²) mean 50.32 μs (50.26 μs .. 50.38 μs) std dev 202.7 ns (167.6 ns .. 252.0 ns) benchmarking keys database/getAssociatedKey from 10000 (hit) time 1.142 ms (1.139 ms .. 1.146 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.142 ms (1.140 ms .. 1.144 ms) std dev 7.142 μs (4.994 μs .. 10.98 μs) benchmarking keys database/getAssociatedKey from 10000 (miss) time 1.094 ms (1.092 ms .. 1.096 ms) 1.000 R² (1.000 R² .. 1.000 R²) mean 1.095 ms (1.095 ms .. 1.097 ms) std dev 4.277 μs (2.591 μs .. 7.228 μs)	2016-01-12 13:07:03 -04:00
Joey Hess	121f5d5b0c	annex.thin Decided it's too scary to make v6 unlocked files have 1 copy by default, but that should be available to those who need it. This is consistent with git-annex not dropping unused content without --force, etc. * Added annex.thin setting, which makes unlocked files in v6 repositories be hard linked to their content, instead of a copy. This saves disk space but means any modification of an unlocked file will lose the local (and possibly only) copy of the old version. * Enable annex.thin by default on upgrade from direct mode to v6, since direct mode made the same tradeoff. * fix: Adjusts unlocked files as configured by annex.thin.	2015-12-27 15:59:59 -04:00
Joey Hess	723e4e31a1	merge clean into smudge command The git filter config can be used to map the single git-annex command to the 2 actions, and this avoids "git annex clean" being used for this thing, it might have a better use for that name later.	2015-12-04 15:32:47 -04:00
Joey Hess	20ca89dfa3	skeleton smudge/clean filters	2015-12-04 13:03:39 -04:00
Joey Hess	f16e235983	addurl, importfeed: Changed to honor annex.largefiles settings, when the content of the url is downloaded. (Not when using --fast or --relaxed.) importfeed just calls addurl functions, so inherits this from it. Note that addurl still generates a temp file, and uses that key to download the file. It just adds it to the work tree at the end when the file is small.	2015-12-02 15:12:33 -04:00
Joey Hess	dc8099872a	import: Changed to honor annex.largefiles settings.	2015-12-02 14:49:03 -04:00
Joey Hess	5ec67335f4	improve annex.largefiles documentation	2015-12-02 14:26:49 -04:00
Joey Hess	7fce3a0f81	more warnings about networked filesystems	2015-11-13 15:55:16 -04:00
Joey Hess	aa4192aea6	pid locking configuration and abstraction layer for git-annex (not actually used anywhere yet)	2015-11-12 17:50:34 -04:00
Joey Hess	2fb3722ce9	Do verification of checksums of annex objects downloaded from remotes. * When annex objects are received into git repositories, their checksums are verified then too. * To get the old, faster, behavior of not verifying checksums, set annex.verify=false, or remote.<name>.annex-verify=false. * setkey, rekey: These commands also now verify that the provided file matches the key, unless annex.verify=false. * reinject: Already verified content; this can now be disabled by setting annex.verify=false. recvkey and reinject already did verification, so removed now duplicate code from them. fsck still does its own verification, which is ok since it does not use getViaTmp, so verification doesn't happen twice when using fsck --from.	2015-10-01 15:56:39 -04:00
Joey Hess	ffa8221517	annex.hardlink extended to also try to use hard links when copying from the repository to a remote. Also, it used to only check that one of the repos was not in direct mode; now when either repo is direct mode, annex.hardlink won't have an effect.	2015-09-14 12:13:38 -04:00
Yaroslav Halchenko	72129503a9	DOC: refer to corresponding manpage not to non-existing PREFERRED CONTENT section	2015-09-02 12:05:08 -07:00
Øyvind A. Holm	67f7de5986	doc/*.mdwn: Minor fixes (typos, letter case)	2015-07-26 04:21:06 +02:00
Joey Hess	386b8c394e	got bash completion working for "git annex" not just "git-annex" This needs a patch to git to cause the git-annex completion to be auto-loaded when completing "git annex <tab>". Otherwise, it will only load when "git-annex" is tab completed. Once loaded, it works for both uses. I've submitted the git patch to the git mailing list.	2015-07-16 13:32:23 -04:00
Joey Hess	42948e960f	typo	2015-07-13 13:25:49 -04:00
Joey Hess	b4d22e6d49	doc updates	2015-07-10 13:49:37 -04:00
Joey Hess	a51b98cdd5	sync: When annex.autocommit=false, avoid making any commit of local changes, while still merging with remote to the extent possible.	2015-07-07 16:36:11 -04:00
Joey Hess	1529add61a	Brought back the setkey plumbing command that was removed in 2011, since we found a use case for it. Note that the command's syntax was changed for consistency.	2015-07-02 17:44:25 -04:00
Joey Hess	a099dc3f6a	comment and warning	2015-07-02 15:21:25 -04:00
anarcat	0d2151beb7	explicitly describe exit status in the standard section	2015-06-23 16:56:03 +00:00
Joey Hess	8b74aec3ea	Increased the default annex.bloomaccuracy from 1000 to 10000000 This makes git annex unused use around 48 mb more memory than it did before, but the massive increase in accuracy makes this worthwhile for all but the smallest systems. Also, I want to use the bloom filter for sync --all --content, to avoid dropping files that the preferred content doesn't want, and 1/1000 false positives would be far too many in that use case, even if it were acceptable for unused. Actual memory use numbers: 1000: 21.06user 3.42system 0:26.40elapsed 92%CPU (0avgtext+0avgdata 501552maxresident)k 1000000: 21.41user 3.55system 0:26.84elapsed 93%CPU (0avgtext+0avgdata 549496maxresident)k 10000000: 21.84user 3.52system 0:27.89elapsed 90%CPU (0avgtext+0avgdata 549920maxresident)k Based on these numbers, 10 million seemed a better pick than 1 million.	2015-06-16 18:12:00 -04:00
Joey Hess	f8ab3bc449	dead --key: Can be used to mark a key as dead.	2015-06-09 14:52:05 -04:00
Antoine Beaupré	1393797373	add and fix refs in man mainpage	2015-05-29 12:12:11 -04:00
Joey Hess	823bb8031b	add annex.used-refspec	2015-05-14 15:44:08 -04:00
Joey Hess	ef2202fd94	required: New command, like wanted, but for required content. Also refactored some code to reduce duplication.	2015-04-18 16:04:35 -04:00
Joey Hess	ce0a82f493	contentlocation: New plumbing command.	2015-04-09 15:34:47 -04:00
Joey Hess	9445556c97	rethought distributed fsck; instead add activity.log and expire command This is much more space efficient!	2015-04-05 12:50:02 -04:00
Joey Hess	20fb91a7ad	WIP on making --quiet silence progress, and infra for concurrent progress bars	2015-04-03 16:48:30 -04:00
Øyvind A. Holm	490e97ec10	Various typo fixes in doc/*.mdwn	2015-04-02 01:50:17 +02:00
Joey Hess	9e25cbde20	importfeed: Avoid downloading a redundant item from a feed whose guid has been downloaded before, even when the url has changed. To support this, always store itemid in metadata; before this was only done when annex.genmetadata was set.	2015-03-31 13:30:13 -04:00
Joey Hess	cd6b62f35e	--auto is no longer a global option; only get, drop, and copy accept it. Not a behavior change unless you were passing it to a command that ignored it.	2015-03-25 17:06:14 -04:00
Joey Hess	0b029570a7	finished splitting out man pages for all commands	2015-03-25 12:09:49 -04:00
Joey Hess	0850e8eaf9	separated man pages for all the maintenance commands	2015-03-24 15:23:59 -04:00
Joey Hess	f10282807e	separated man pages for all the setup commands while at the gate in ATL	2015-03-23 18:20:42 -04:00
Joey Hess	3cc7c03721	Man pages for individual commands now available, and can be opened using "git annex help <command>"	2015-03-23 17:50:03 -04:00
Joey Hess	daec4b007a	splitting up the man page Common command man pages all split out and often expanded. A few sections split out into their own pages. Still need to do all the other commands..	2015-03-23 15:36:10 -04:00
Joey Hess	c233f98564	migrate: --force will force migration of keys already using the destination backend. Useful in rare cases.	2015-03-23 12:11:16 -04:00
Joey Hess	798da6cf2e	Added a post-update-annex hook, which is run after the git-annex branch is updated. Needed for git update-server-info. See https://github.com/datalad/datalad/issues/1#issuecomment-84094406	2015-03-20 14:52:58 -04:00
Joey Hess	e6158130c6	checkpresentkey: New plumbing command to check if a key can be verified to be present on a remote.	2015-03-20 11:44:46 -04:00
Joey Hess	50ef4105e3	readpresentkey: New plumbing command for checking location log.	2015-03-20 11:22:27 -04:00
Joey Hess	abfe3c09b2	registerurl: New plumbing command for mass-adding urls to keys.	2015-03-15 14:37:33 -04:00
Joey Hess	b24bb6b435	fromkey: Add stdin mode.	2015-03-15 14:07:43 -04:00
Joey Hess	fa180c1ba1	fromkey --force: Skip test that the key has its content in the annex.	2015-03-15 13:51:58 -04:00
Joey Hess	504dda82a4	addurl: Added --raw option, which bypasses special handling of quvi, bittorrent etc urls.	2015-03-05 14:46:08 -04:00
Joey Hess	022461d773	add a link	2015-02-25 15:49:18 -04:00
Joey Hess	68725d27e5	wording	2015-02-25 14:31:17 -04:00
Joey Hess	8066a1c3cc	The file matching options are now only accepted by commands that can actually use them.	2015-02-06 17:16:41 -04:00
Joey Hess	dfab5e6ff4	import: Support file matching options such as --exclude, --include, --smallerthan, --largerthan	2015-02-06 15:58:06 -04:00
Joey Hess	febb1c2082	groupwanted: New command to set the groupwanted preferred content expression.	2015-02-06 15:12:42 -04:00
Joey Hess	ba3825441c	rework Differences data type Eliminated complexity and future proofed. The most important change is that all functions over Difference are now total; any Difference that can be expressed should be handled. Avoids needs for sanity checking of inputs, and version skew with the future. Also, the difference.log now serializes a [Difference], not a Differences. This saves space and keeps it simpler. Note that [Difference] might contain conflicting differences (eg, [Version5, Version6]). In this case, one of them needs to consistently win over the others, probably based on Ord.	2015-01-28 13:50:02 -04:00
Joey Hess	70736d2b41	Repository tuning parameters can now be passed when initializing a repository for the first time. * init: Repository tuning parameters can now be passed when initializing a repository for the first time. For details, see http://git-annex.branchable.com/tuning/ * merge: Refuse to merge changes from a git-annex branch of a repo that has been tuned in incompatible ways.	2015-01-27 17:38:06 -04:00
Joey Hess	587f6a919b	addurl: When a Content-Disposition header suggests a filename to use, addurl will consider using it, if it's reasonable and doesn't conflict with an existing file. (--file overrides this)	2015-01-22 14:52:52 -04:00
Joey Hess	afc5153157	update my email address and homepage url	2015-01-21 12:50:09 -04:00
Joey Hess	4fcf65cda3	devblog	2015-01-13 18:36:17 -04:00
Joey Hess	534c29deae	implemented old Richih wishlist about remote/uuid info * info: Can now display info about a given uuid. * Added to remote/uuid info: Count of the number of keys present on the remote, and their size. This is rather expensive to calculate, so comes last and --fast will disable it. * Git remote info now includes the date of the last sync with the remote.	2015-01-13 18:13:14 -04:00
Joey Hess	43dc7f678f	setpresentkey: A new plumbing-level command.	2014-12-29 15:16:40 -04:00
Joey Hess	aba3e11776	sync: Now supports remote groups, the same way git remote update does.	2014-12-29 13:42:58 -04:00
Jean Jordaan	c011fe156d	Get rid of mysterious "_why_"	2014-12-20 14:18:24 +02:00
Joey Hess	0e44d95964	update for torrents	2014-12-18 00:57:41 -04:00
Joey Hess	a7690de016	Added bittorrent special remote addurl behavior change: When downloading an url ending in .torrent, it will download files from bittorrent, instead of the old behavior of adding the torrent file to the repository. Added Recommends on aria2 and bittornado | bittorrent. This commit was sponsored by Asbjørn Sloth Tønnesen.	2014-12-16 23:22:46 -04:00
Joey Hess	6ecd3ff421	diffdriver: New git-annex command, to make git external diff drivers work with annexed files. Closes https://github.com/datalad/datalad/issues/18	2014-11-24 16:14:06 -04:00
Joey Hess	13260ccc3a	undo command This commit was sponsored by Andrew Cant.	2014-11-14 14:41:07 -04:00
Joey Hess	d22d650f59	proxy command is closer to plumbing than a general use command	2014-11-13 13:59:00 -04:00
Joey Hess	864086a956	proxy: for all your direct mode repository munging needs This allows bypassing the direct mode guard in a safe way to do all sorts of things including git revert, git mv, git checkout ... This commit was sponsored by the WikiMedia Foundation.	2014-11-12 15:51:46 -04:00
Joey Hess	a0297915c1	add per-remote-type info Now `git annex info $remote` shows info specific to the type of the remote, for example, it shows the rsync url. Remote types that support encryption or chunking also include that in their info. This commit was sponsored by Ævar Arnfjörð Bjarmason.	2014-10-21 14:36:09 -04:00
Joey Hess	aafaa363e3	info: When passed the name or uuid of a remote, displays info about that remote. No per-remote-type info yet. This commit was sponsored by Stanley Yamane.	2014-10-21 14:35:07 -04:00
Joey Hess	4a9e70c705	info: When run on a single annexed file, displays some info about the file, including its key and size.	2014-10-21 13:24:15 -04:00
Joey Hess	fe994e58e5	clarify that sync only commits changes to files already added to the repo	2014-09-18 15:43:20 -04:00
Joey Hess	3c49a5d29c	typo	2014-09-12 12:29:04 -04:00
Joey Hess	b874f84086	New annex.hardlink setting. Closes: #758593 * New annex.hardlink setting. Closes: #758593 * init: Automatically detect when a repository was cloned with --shared, and set annex.hardlink=true, as well as marking the repository as untrusted. Had to reorganize Logs.Trust a bit to avoid a cycle between it and Annex.Init.	2014-09-05 13:44:09 -04:00
Joey Hess	4b3f03ef38	clarify that --all doesn't operate on a single file	2014-08-19 12:11:19 -04:00
Yaroslav Halchenko	2d2d0a4d75	doc/ minor typos/trailing whitespaces + extension on get options	2014-08-19 01:22:24 -04:00
Joey Hess	93f20541f5	testremote --fast	2014-08-03 18:08:34 -04:00
Joey Hess	1ee24a0366	testremote now tests with and without encryption	2014-08-01 17:52:40 -04:00
Joey Hess	20d7295386	improve testremote command, adding chunk size testing And also a --size parameter to configure the basic object size.	2014-08-01 16:50:24 -04:00
Joey Hess	9720ee9e56	testremote: New command to test uploads/downloads to a remote. This only performs some basic tests so far; no testing of chunking or resuming. Also, the existing encryption type of the remote is used; it would be good later to derive an encrypted and a non-encrypted version of the remote and test them both. This commit was sponsored by Joseph Liu.	2014-08-01 15:10:01 -04:00
Joey Hess	c03e1c5648	add new section for testing commands	2014-08-01 12:49:26 -04:00
Joey Hess	522a0922b8	sync: Fix git sync with local git remotes even when they don't have an annex.uuid set. Catch an exception when ensureInitialized is run in a non-initted repository. In this case, just read the git config, so that the Git.Repo object is not LocalUnknown, which is what is used to represent remotes on eg, drives that are not connected. The assistant already got this right, and like with the assistant, this causes an implicit git-annex init of the local remote on the second sync, once the git-annex branch has been pushed to it. See this comment for more analysis: http://git-annex.branchable.com/todo/Recovering_from_a_bad_sync/#comment-64e469a2c1969829ee149cbb41b1c138 This commit was sponsored by jscit.	2014-07-15 14:27:43 -04:00
Joey Hess	618b1d2a38	improve documentation	2014-07-14 14:37:14 -04:00
Joey Hess	cb66ca3a76	resolvemerge: New plumbing command that runs the automatic merge conflict resolver.	2014-07-11 16:45:18 -04:00
Joey Hess	f22a77890e	improve documentation about sync	2014-07-11 14:08:40 -04:00
Joey Hess	d0c1a22e7c	import metadata from feeds When annex.genmetadata is set, metadata from the feed is added to files that are imported from it. Reused the same feedtitle and itemtitle, feedauthor, itemauthor, etc names that are used in --template. Also added title and author, which are the item title/author if available, falling back to the feed title/author. These are more likely to be common metadata fields. (There is a small bit of duplication here, but once git gets around to packing the object, it will compress it away.) The itempubdate field is not included in the metadata as a string; instead it is used to generate year and month fields, same as is done when adding files with annex.genmetadata set. This commit was sponsored by Amitai Schlair, who coincidentally is responsible for ikiwiki generating nice feed metadata!	2014-07-03 14:15:00 -04:00
Joey Hess	fd23e819c5	clarify what mtime field in find --format is	2014-06-30 19:03:23 -04:00
Fraser Tweedale	4eb72392b4	execute remote.<name>.annex-shell on remote, if set It is useful to be able to specify an alternative git-annex-shell program to execute on the remote, e.g., to run a version not on the PATH. Use remote.<name>.annex-shell if specified, instead of the default "git-annex-shell" i.e., first so-named executable on the PATH.	2014-05-16 15:46:43 -04:00
Joey Hess	00986d19f4	group: When no groups are specified to set, lists the current groups of a repository.	2014-05-16 14:43:40 -04:00
Robie Basak	4184566627	ddar special remote	2014-05-15 16:32:44 -04:00
Joey Hess	ecc3dc8433	findref: New command, like find but shows files in a specified git ref.	2014-04-17 18:41:24 -04:00
Joey Hess	915d038bec	reinit: New command that can initialize a new repository using the configuration of a previously known repository. Useful if a repository got deleted and you want to clone it back the way it was.	2014-04-15 20:13:35 -04:00
Joey Hess	03ce5cd8d2	found a way to make uninit always fast To do so, I slightly changed the behavior of unannex. Now in fast mode, it only makes a hard link when the annexed file's link count is 1. This avoids unannexing 2 files with the same content in fast mode from hard linking them together. (One will end up hard linked to the annex, which the docs warn about.) With that change, uninit can simply always run unannex in fast mode. Since .git/annex/objects is being blown away anyway, there's no worry in this case about a hard link pointing into it causing an annexed object to be modified.	2014-04-15 14:23:08 -04:00
Joey Hess	49005d5c28	document uninit --fast and also why it's not the default	2014-04-15 14:12:42 -04:00
Joey Hess	b82582caf1	improve desc	2014-04-14 14:14:54 -04:00
Joey Hess	836f2cc816	importfeed: Filename template can now contain an itempubdate variable. Needs feed 0.3.9.2.	2014-04-07 16:55:04 -04:00
Joey Hess	43909723b3	added git-annex remotedaemon So far, handling connecting to git-annex-shell notifychanges, and pulling immediately when a change is pushed to a remote. A little bit buggy (crashes after the first pull), but it already works! This commit was sponsored by Mark Sheppard.	2014-04-06 19:10:23 -04:00
Joey Hess	18970377df	typo	2014-03-31 20:15:01 -04:00
Joey Hess	dcf4136340	improve metadata documentation	2014-03-26 16:55:29 -04:00
Joey Hess	2f538dd65c	add --include-dotfiles: New option, perhaps useful for backups.	2014-03-26 14:52:07 -04:00
Joey Hess	fb8a32cc7f	notifications on drop	2014-03-22 15:01:48 -04:00
Joey Hess	e426fac273	add desktop notifications Motivation: Hook scripts for nautilus or other file managers need to provide the user with feedback that a file is being downloaded. This commit was sponsored by THM Schoemaker.	2014-03-22 14:12:19 -04:00
Joey Hess	8bcd67b9d8	metadata: Add --get (from bremner)	2014-03-15 17:29:40 -04:00
Joey Hess	417aea25be	vicfg: Allows editing preferred content expressions for groups. This is stored in the git-annex branch, but not yet actually hooked up and used.	2014-03-15 16:17:01 -04:00
Joey Hess	7ea0b82cf9	note that webapp starts the assistant if it's not already running	2014-03-12 15:40:20 -04:00
Joey Hess	a3fe8270ca	annex.startupscan can be set to false to disable the assistant's startup scan.	2014-03-05 17:44:14 -04:00
Joey Hess	d0fce426c4	pre-commit-annex hook script to automatically extract metadata from lots of types of files Using the extract(1) program to do the heavy lifting. Decided to make git-annex run pre-commit-annex when committing. Since git-annex pre-commit also runs it, it'll be run when git commit is run too, via the pre-commit hook. This basically gives back the pre-commit hook that git-annex took away. The implementation avoids repeatedly looking for the hook script when the assistant is running and committing repeatedly; only checks if the hook is available once. To make the script simpler, made git-annex metadata -s field?=value only set a field when it's not already got a value. This commit was sponsored by bak.	2014-03-02 20:11:58 -04:00
Joey Hess	4643a0120c	doc improvements	2014-03-02 15:46:58 -04:00
Joey Hess	7d9486a709	vadd: Allow listing multiple desired values for a field.	2014-03-02 15:36:45 -04:00
Joey Hess	c2e8c21ca6	view, vfilter: Add support for filtering tags and values out of a view, using !tag and field!=value. Note that negated globs are not supported. Would have complicated the code to add them, without changing the data type serialization in a non-backwards-compatible way. This commit was sponsored by Denver Gingerich.	2014-03-02 14:53:19 -04:00
Joey Hess	6a355686ff	annex.listen can be configured, instead of using --listen	2014-03-01 00:31:17 -04:00
Joey Hess	0bc8dabb54	docs for remote webapp, securely	2014-02-28 22:39:06 -04:00
Joey Hess	aa39457f5f	update	2014-02-25 17:26:04 -04:00
Joey Hess	fb4e1ebfbe	metadata: Support --json	2014-02-23 13:58:16 -04:00
Joey Hess	7498c5dd96	annex.genmetadata can be set to make git-annex automatically set metadata (year and month) when adding files	2014-02-23 00:08:29 -04:00
Joey Hess	079b35a1a8	views: add automatically constructed file location metadata When constructing views, metadata is available about the location of the file in the view's reference branch. Allows incorporating parts of the directory hierarchy in a view. For example `git annex view tag=* podcasts/=` makes a view in the form tag/showname. Performance impact: I benchmarked git annex view tag= in the conference proceedings repo to take 6.459s before this change, and 6.544s after. FWIW, I considered making the syntax for this be podcasts/, which might be easier for the user to learn. However, I think it's not as good: The user has to then juggle two different syntaxes, and podcasts/* will be expanded by the shell so they also need to quote it, while podcasts/=* is unlikely to be expanded by the shell. * It would allow for things like podcasts// and *.mp3 which do not map well into views. This commit was sponsored by Aurélien Pinceaux.	2014-02-22 16:27:53 -04:00
Joey Hess	2a65f07621	note case insensitive matching	2014-02-21 18:36:36 -04:00
Joey Hess	24f8136504	--metadata field=value can now use globs to match, and matches case insensitively, the same as git annex view field=value does. Also refactored glob code into its own module.	2014-02-21 18:34:34 -04:00
Joey Hess	1428390300	tweak wording	2014-02-20 16:00:41 -04:00
Joey Hess	d209566dfa	Revert "Fix command to match fsck description" This reverts commit `9e8370d1b9`. No, --incremental and --more are not needed when using --incremental-schedule. The --incremental-schedule option implies the other ones.	2014-02-20 15:36:59 -04:00
Joey Hess	134fdefb8c	fsck: When run with --all or --unused, while .gitattributes annex.numcopies cannot be honored since it's operating on keys instead of files, make it honor the global numcopies setting, and the annex.numcopies git config setting.	2014-02-20 14:45:17 -04:00
Joey Hess	dd7b99c860	add tip about metadata driven views (and more flexible view filtering) While writing this documentation, I realized that there needed to be a way to stay in a view like tag=* while adding a filter like tag=work that applies to the same field. So, there are really two ways a view can be refined. It can have a new "field=explicitvalue" filter added to it, which does not change the "shape" of the view, but narrows the files it shows. Or, it can have a new view added, which adds another level of subdirectories. So, added a vfilter command, which takes explicit values to add to the filter, and rejects changes that would change the shape of the view. And, made vadd only accept changes that change the shape of the view. And, changed the View data type slightly; now components that can match multiple metadata values can be visible, or not visible. This commit was sponsored by Stelian Iancu.	2014-02-19 16:29:56 -04:00
Joey Hess	d8ce6cac36	metadata: add --tag and --untag shorthand options	2014-02-19 15:04:12 -04:00
Joey Hess	e7672f197e	new section for metadata	2014-02-19 14:55:34 -04:00
Joey Hess	39ebfa1a2e	pre-commit: Update metadata when committing changes to annexed files within a view. So the user can now switch to a view and then move files around within it to manage metadata. For example, moving a file into a new directory when in the tags=* view adds a tag to it. Implementation is fairly efficient. One diff-index, which is no more expensive than the first stage of a git commit, followed by possibly some cat-file --batch traffic to find the key (when deleting a file). Very similar to what's done in direct mode when committing. And like direct mode when updating the WC after a merge, it has to buffer the diff-tree values in order to make 2 passes over them. When not in a view, pre-commit now does one extra git symbolic-ref, which is tiny overhead. This commit was sponsored by Andrew Eskridge.	2014-02-19 14:17:58 -04:00
Joey Hess	1a53c87057	vpop N	2014-02-18 21:57:21 -04:00
Joey Hess	67a5f02a0b	add vcycle command	2014-02-18 20:16:28 -04:00
Joey Hess	f603692a72	add vadd command	2014-02-18 20:02:09 -04:00
Joey Hess	67fd06af76	add git annex view command (And a vpop command, which is still a bit buggy.) Still need to do vadd and vrm, though this also adds their documentation. Currently not very happy with the view log data serialization. I had to lose the TDFA regexps temporarily, so I can have Read/Show instances of View. I expect the view log format will change in some incompatible way later, probably adding last known refs for the parent branch to View or something like that. Anyway, it basically works, although it's a bit slow looking up the metadata. The actual git branch construction is about as fast as it can be using the current git plumbing. This commit was sponsored by Peter Hogg.	2014-02-18 18:22:20 -04:00
stp	9e8370d1b9	Fix command to match fsck description	2014-02-17 15:53:46 +00:00
Joey Hess	2075cdeb59	limiting files based on metadata Note that there is currently no caching, so --metadata foo=bar --metadata tag=blah will currently read the log 2x per file.	2014-02-13 02:24:30 -04:00
Joey Hess	0e9a72b356	metadata command can now operate on many files at once	2014-02-13 01:49:38 -04:00
Joey Hess	9f7e76130e	add metadata command to get/set metadata Adds metadata log, and command. Note that unsetting field values seems to currently be broken. And in general this has had all of 2 minutes worth of testing. This commit was sponsored by Julien Lefrique.	2014-02-12 21:30:33 -04:00
Joey Hess	b9e6cb07ad	remove dropkey example	2014-02-08 15:25:58 -04:00
Joey Hess	a44e01c29c	--in can now refer to files that were located in a repository at some past date. For example, --in="here@{yesterday}"	2014-02-06 12:43:56 -04:00
Joey Hess	1858c1f44a	Document in man page that sshcaching uses ssh ControlMaster. Closes: #737476	2014-02-02 19:27:47 -04:00
Joey Hess	089c0109a2	Added ways to configure rsync options to be used only when uploading or downloading from a remote. Useful to eg limit upload bandwidth.	2014-02-02 16:06:34 -04:00
Joey Hess	ec7443eb06	All commands that support --all also support a --key option, which limits them to acting on a single key.	2014-01-26 14:59:47 -04:00
Joey Hess	5fc2d760ea	Optimise non-bare http remotes; no longer does a 404 to the wrong url every time before trying the right url. Needs annex-bare to be set to false, which is done when initially probing the uuid of a http remote.	2014-01-26 13:03:25 -04:00
Joey Hess	b93e485ef1	added annex.secure-erase-command config option.	2014-01-24 12:58:52 -04:00
Joey Hess	3da0064657	assistant unused file handling Make sanity checker run git annex unused daily, and queue up transfers of unused files to any remotes that will have them. The transfer retrying code works for us here, so eg when a backup disk remote is plugged in, any transfers to it are done. Once the unused files reach a remote, they'll be removed locally as unwanted. If the setup does not cause unused files to go to a remote, they'll pile up, and the sanity checker detects this using some heuristics that are pretty good -- 1000 unused files, or 10% of disk used by unused files, or more disk wasted by unused files than is left free. Once it detects this, it pops up an alert in the webapp, with a button to take action. TODO: Webapp UI to configure this, and also the ability to launch an immediate cleanup of all unused files. This commit was sponsored by Simon Michael.	2014-01-22 22:53:18 -04:00
Joey Hess	f2713a3bb9	benchmarked numcopies .gitattributes in preferred content Checking .gitattributes adds a full minute to a git annex find looking for files that don't have enough copies. 2:25 increases to 3:27. I feel this is too much of a slowdown to justify making it the default. So, exposed two versions of the preferred content expression, a slow one and a fast but approximate one. I'm using the approximate one in the default preferred content expressions to avoid slowing down the assistant.	2014-01-21 18:49:25 -04:00
Joey Hess	d1bf61464f	expose tasty test suite's option parser	2014-01-21 00:08:43 -04:00
Joey Hess	3159da2693	Add and use numcopiesneeded preferred content expression. * Add numcopiesneeded preferred content expression. * Client, transfer, incremental backup, and archive repositories now want to get content that does not yet have enough copies. This means the assistant will make copies of files that don't yet meet the configured numcopies, even to places that would not normally want the file. For example, if numcopies is 4, and there are 2 client repos and 2 transfer repos, and 2 removable backup drives, the file will be sent to both transfer repos in order to make 4 copies. Once a removable drive gets a copy of the file, it will be dropped from one transfer repo or the other (but not both). Another example, numcopies is 3 and there is a client that has a backup removable drive and two small archive repos. Normally once one of the small archives has a file, it will not be put into the other one. But, to satisfy numcopies, the assistant will duplicate it into the other small archive too, if the backup repo is not available to receive the file. I notice that these examples are fairly unlikely setups .. the old behavior was not too bad, but it's nice to finally have it really correct. .. Almost. I have skipped checking the annex.numcopies .gitattributes out of fear it will be too slow. This commit was sponsored by Florian Schlegel.	2014-01-20 17:35:29 -04:00
Joey Hess	d66535f065	global numcopies setting * numcopies: New command, sets global numcopies value that is seen by all clones of a repository. * The annex.numcopies git config setting is deprecated. Once the numcopies command is used to set the global number of copies, any annex.numcopies git configs will be ignored. * assistant: Make the prefs page set the global numcopies. This global numcopies setting is needed to let preferred content expressions operate on numcopies. It's also convenient, because typically if you want git-annex to preserve N copies of files in a repo, you want it to do that no matter which repo it's running in. Making it global avoids needing to warn the user about gotchas involving inconsistent annex.numcopies settings. (See changes to doc/numcopies.mdwn.) Added a new variety of git-annex branch log file, that holds only 1 value. Will probably be useful for other stuff later. This commit was sponsored by Nicolas Pouillard.	2014-01-20 16:47:56 -04:00
Joey Hess	b6ba0bd556	sync --content: New option that makes the content of annexed files be transferred. Similar to the assistant, this honors any configured preferred content expressions. I am not entirely happy with the implementation. It would be nicer if the seek function returned a list of actions which included the individual file gets and copies and drops, rather than the current list of calls to syncContent. This would allow getting rid of the somewhat redundant display of "sync file [ok|failed]" after the get/put display. But, to do that, withFilesInGit would need to somehow be able to construct such a mixed action list. And it would be less efficient than the current implementation, which is able to reuse several values between eg get and drop. Note that currently this does not try to satisfy numcopies when getting/putting files (numcopies are of course checked when dropping files!) This makes it like the assistant, and unlike get --auto and copy --auto, which do duplicate files when numcopies is not yet satisfied. I don't know if this is the right decision; it only seemed to make sense to have this parallel the assistant as far as possible to start with, since I know the assistant works. This commit was sponsored by Øyvind Andersen Holm.	2014-01-19 17:49:54 -04:00
Joey Hess	85185b8f50	Allow --all to be mixed with matching options like --copies and --in (but not --include and --exclude).	2014-01-18 14:58:56 -04:00
Joey Hess	a135bbd5a2	note that --all can't be mixed with eg --copies	2014-01-18 13:52:35 -04:00
Joey Hess	939eb666fe	clarify sync	2014-01-18 13:26:47 -04:00
Yaroslav Halchenko	0bf41b335b	Minor git-annex.mdwn tune ups (trailing spaces, typos, more consistency in tense) Conflicts: doc/git-annex.mdwn -- I have managed to work on an old copy, so overlapped a bit	2014-01-18 13:06:15 -04:00
Joey Hess	c20f31a1ad	add GETAVAILABILITY to external special remote protocol And some reworking of types, and added an annex-availability git config setting.	2014-01-13 14:41:10 -04:00
Joey Hess	85272d8a98	Added tahoe special remote. Known problems: 1. Tries to tahoe start when daemon is already running. 2. If multiple tahoe remotes are set up on the same computer, they will have the same node.url configured by default, and this confuses tahoe commands. This commit was sponsored by LeastAuthority.com	2014-01-08 16:14:41 -04:00

585 commits