git-annex

Author	SHA1	Message	Date
Joey Hess	40df26757a	copy: avoid updating location log when no copy is performed git annex copy --to remote often does not need to copy a file, but it was still updating the location log in this case.	2012-09-24 19:58:34 -04:00
Joey Hess	df07ccf404	make the assistant retry failed transfers When a transfer fails, the progress info can be used to intelligently retry it. If the transfer managed to make some progress, but did not fully complete, then there's a good chance that a retry will finish it (or at least make more progress).	2012-09-23 13:27:13 -04:00
Joey Hess	715a9a2f8e	keep logs of failed transfers, and requeue them when doing a non-full scan of a remote	2012-08-23 15:24:15 -04:00
Joey Hess	7225c2bfc0	record transfer information on local git remotes In order to record a semi-useful filename associated with the key, this required plumbing the filename all the way through to the remotes' storeKey and retrieveKeyFile. Note that there is potential for deadlock here, narrowly avoided. Suppose the repos are A and B. A sends file foo to B, and at the same time, B gets file foo from A. So, A locks its upload transfer info file, and then locks B's download transfer info file. At the same time, B is taking the two locks in the opposite order. This is only not a deadlock because the lock code does not wait, and aborts. So one of A or B's transfers will be aborted and the other transfer will continue. Whew!	2012-07-01 17:15:11 -04:00
Joey Hess	e5fd8b67b7	get, move, copy: Now refuse to do anything when the requested file transfer is already in progress by another process. Note this is per-remote, so trying to get the same file from multiple remotes can still let duplicate downloads run. (And uploading the same file to multiple remotes is not duplicate at all of course.) get, move, and copy are the only git-annex subcommands that transfer files, but there's still git-annex-shell recvkey and sendkey to deal with too. I considered modifying retrieveKeyFile or getViaTmp, but they are called by other code that does not involve expensive file transfers (migrate) or that does file transfers that should not be checked by this (fsck --from).	2012-07-01 17:15:11 -04:00
Joey Hess	942d8f7298	hlint	2012-06-12 11:32:06 -04:00
Joey Hess	60ab3d84e1	added ifM and nuked 11 lines of code no behavior changes	2012-03-14 17:43:34 -04:00
Joey Hess	2fd294d06f	move --from, copy --from: 10 times faster scanning remote on local disk Rather than go through the location log to see which files are present on the remote, it simply looks at the disk contents directly. I benchmarked this speeding up scanning 834 files, from an annex on my phone's SSD, from 11.39 seconds to 1.31 seconds. (No files actually moved.) Also benchmarked 8139 files, from an annex on spinning storage, speeding up from 103.17 to 13.39 seconds. Note that benchmarking with an encrypted annex on flash actually showed a minor slowdown with this optimisation -- from 13.93 to 14.50 seconds. Seems the overhead of doing the crypto needed to get the filenames to directly check can be higher than the overhead of looking up data in the location log. (Which says good things about how well the location log and git have been optimised!) It may make sense to make encrypted local remotes not have hasKeyCheap set; further benchmarking is called for.	2012-02-26 14:59:48 -04:00
Joey Hess	61dbad505d	fsck --from remote --fast Avoids expensive file transfers, at the expense of checking file size and/or contents. Required some reworking of the remote code.	2012-01-20 13:23:11 -04:00
Joey Hess	06b0cb6224	add tmp flag parameter to retrieveKeyFile	2012-01-19 16:07:36 -04:00
Joey Hess	90319afa41	fsck --from Fscking a remote is now supported. It's done by retrieving the contents of the specified files from the remote, and checking them, so can be an expensive operation. (Several optimisations are possible, to speed it up, of course.. This is the slow and stupid remote fsck to start with.) Still, if the remote is a special remote, or a git repository that you cannot run fsck in locally, it's nice to have the ability to fsck it. If you have any directory special remotes, now would be a good time to fsck them, in case you were hit by the data loss bug fixed in the previous release!	2012-01-19 15:24:05 -04:00
Joey Hess	1f8a1058c9	tweak	2012-01-06 10:57:57 -04:00
Joey Hess	df21cbfdd2	look up --to and --from remote names only once This will speed up commands like move and drop.	2012-01-06 04:06:13 -04:00
Joey Hess	0a36f92a31	more command-specific options Made --from and --to command-specific options. Added generic storage for values of command-specific options, which allows removing some of the special case fields in AnnexState. (Also added generic storage for command-specific flags, although there are not yet any.) Note that this storage uses a Map, so repeatedly looking up the same value is slightly more expensive than looking up an AnnexState field. But, the value can be looked up once in the seek stage, transformed as necessary, and passed in a closure to the start stage, and this avoids that overhead. Still, I'm hesitant to use this for things like force or fast flags. It's probably best to reserve it for flags that are only used by a few commands, or options like --from and --to that it's important only be allowed to be used with commands that implement them, to avoid user confusion.	2012-01-06 03:16:42 -04:00
Joey Hess	4a02c2ea62	type alias cleanup	2011-12-31 04:11:58 -04:00
Joey Hess	95e748cbd4	inverted logic	2011-12-09 13:38:28 -04:00
Joey Hess	3f5f28b487	factor out a stopUnless code melt for lunch	2011-12-09 12:23:45 -04:00
Joey Hess	0f0169fa99	comment update	2011-11-20 22:49:53 -04:00
Joey Hess	1b90918cec	avoid error message when doing get --from on file not present on remote	2011-11-18 17:26:37 -04:00
Joey Hess	637b5feb45	lint	2011-11-11 01:52:58 -04:00
Joey Hess	b327227ba5	better limiting of start actions to only run whenAnnexed Mostly only refactoring, but this does remove one redundant stat of the symlink by copy.	2011-11-10 23:45:14 -04:00
Joey Hess	4389782628	tweak	2011-11-10 22:37:52 -04:00
Joey Hess	2de1e2c2ce	Optimized copy --from and get --from to avoid checking the location log for files that are already present. This can be a significant speedup when running in large trees that are only missing a few files; it makes copy --from just as fast as get.	2011-11-10 21:32:42 -04:00
Joey Hess	d3e1a3619f	safer inannex checking git-annex-shell inannex now returns always 0, 1, or 100 (the last when it's unclear if content is currently in the index due to it currently being moved or dropped). (Actual locking code still not yet written.)	2011-11-09 18:33:15 -04:00
Joey Hess	8ce7e73f74	reorg to allow taking content lock The lock will only persist during the perform stage, so the content must be removed from the annex then, rather than in the cleanup stage. (No lock is actually taken yet.)	2011-11-09 16:54:18 -04:00
Joey Hess	f97c783283	clean up check selection code This new approach allows filtering out checks from the default set that are not appropriate for a command, rather than having to list every check that is appropriate. It also reduces some boilerplate. Haskell does not define Eq for functions, so I had to go a long way around with each check having a unique id. Meh.	2011-10-29 15:19:05 -04:00
Joey Hess	6c31e3a8c3	drop --from is now supported to remove file content from a remote.	2011-10-28 17:26:38 -04:00
Joey Hess	5b74b130a3	refactored and generalized pre-command sanity checking	2011-10-27 16:31:35 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00
Joey Hess	b505ba83e8	minor syntax changes	2011-10-11 14:43:45 -04:00
Joey Hess	6a6ea06cee	rename	2011-10-05 16:02:51 -04:00
Joey Hess	cfe21e85e7	rename	2011-10-04 00:59:08 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	35145202d2	remove command type definitions These were a mistake, they make the type signatures harder to read and less flexible. The CommandSeek, CommandStart, CommandPerform, and CommandCleanup types were a good idea, but composing them with the parameters expected is going too far.	2011-09-15 16:50:49 -04:00
Joey Hess	e47d1fd43e	add error for move --auto It probably does not make sense to enable auto mode for move. I cannot think of a situation where it would make sense to try to use it. A hypothetical auto mode for move would only differ from a normal move in one case -- when both repositories have a file, move deletes it from one, and this reduces the number of copies. So an auto mode would either only let move work in that situation, or avoid removing the file in that situation, depending on the number of copies. This would be complex to implement, and is perhaps not a very obvious behavior. The error is a good thing to have, so users don't expect it to do something it does not.	2011-09-15 15:33:20 -04:00
Joey Hess	9fe3c6d211	clean up params in usage display	2011-09-15 14:33:37 -04:00
Joey Hess	00153eed48	unify elipsis handling And add a simple dots-based progress display, currently only used in v2 upgrade.	2011-07-19 14:07:23 -04:00
Joey Hess	6c396a256c	finished hlint pass	2011-07-15 12:47:14 -04:00
Joey Hess	cdbcd6f495	add web special remote Generalized LocationLog to PresenceLog, and use a presence log to record urls for the web special remote.	2011-07-01 15:30:42 -04:00
Joey Hess	7ee636f6dd	avoid unnecessary read of trust.log	2011-06-23 13:39:04 -04:00
Joey Hess	80302d0b46	improve bare repo handing Many more commands can work in bare repos now, thanks to the git-annex branch.	2011-06-22 18:32:41 -04:00
Joey Hess	971ab27e78	better types allowed breaking module dep loop	2011-06-01 19:11:27 -04:00
Joey Hess	a8fb97d2ce	Add --trust, --untrust, and --semitrust options.	2011-06-01 17:57:31 -04:00
Joey Hess	d006586cd0	add a message in potenatially confusing copy --fast failure situation	2011-05-16 13:27:19 -04:00
Joey Hess	56bc3e95ca	refactor some boilerplate	2011-05-15 02:02:46 -04:00
Joey Hess	76911a446a	Avoid using absolute paths when staging location log, as that can confuse git when a remote's path contains a symlink. Closes: #621386 This was a real PITA to fix, since location logs can be staged in both the current repo, as well as in local remote's repos, in which case the cwd will not be in the repo. And git add needs different params in both cases, when absolute paths are not used. In passing, git annex fsck now stages location log fixes.	2011-04-25 14:54:24 -04:00
Joey Hess	bc51387e6d	Periodically flush git command queue, to avoid boating memory usage too much. Since the queue is flushed in between subcommand actions being run, there should be no issues with actions that expect to queue up some stuff and have it run after they do other stuff. So I didn't have to audit for such assumptions.	2011-04-07 13:59:31 -04:00
Joey Hess	ed7fc4fce9	Bugfix: copy --to --fast never really copied, fixed.	2011-04-01 12:34:06 -04:00
Joey Hess	4868b64868	Provide a less expensive version of `git annex copy --to`, enabled via --fast. This assumes that location tracking information is correct, rather than contacting the remote for every file.	2011-03-27 18:34:30 -04:00
Joey Hess	a70035e981	converted move to use Remote Drop old Remotes.hs, now unused!	2011-03-27 17:24:20 -04:00
Joey Hess	140a351fc5	avoid version check before running version and upgrade commands There are two types of commands; those that access the repository and those that don't. Sorted.	2011-03-19 18:58:49 -04:00
Joey Hess	49b7f59183	test suite passes again doesn't test remote functionality.. but that may be working too now	2011-03-15 22:53:14 -04:00
Joey Hess	9d49fe2c17	first pass at using new keys It compiles. It sorta works. Several subcommands are FIXME marked and broken, because things that used to accept separate --backend and --key params need to be changed to accept just a --key that encodes all the key info, now that there is metadata in keys.	2011-03-15 21:34:13 -04:00
Joey Hess	9f20aee219	avoid logging to location log when in a bare repo This assumes that changes to content in bare repos are made from some non-bare repo, and that the location log is updated on that side. That's true for move --from and move --to. It's not true for dropkey and setkey and recvkey. But those are plumbing level commands, so I guess it's ok to assume that someone running those in a bare repo knows what they're doing. And git-annex-shell is used to run those, and if the bare repo is non-local, it needs to be able to use them even though they cannot update the location log. So this seems unavoidable.	2011-03-03 15:22:53 -04:00
Joey Hess	fcdc4797a9	use ShellParam type So, I have a type checked safe handling of filenames starting with dashes, throughout the code.	2011-02-28 16:18:55 -04:00
Joey Hess	dee9655237	bugfix to move --to Due to recent changes, the remotes config was not read before the remote to act on was picked.	2011-01-27 15:45:22 -04:00
Joey Hess	b7903eb2d1	move partitioning out of keyPossibilities And a bug fix in passing.	2011-01-26 16:44:14 -04:00
Joey Hess	7b2da21ab7	avoid moving if src and dest are the same	2011-01-26 15:59:10 -04:00
Joey Hess	6a97b10fcb	rework config storage Moved away from a map of flags to storing config directly in the AnnexState structure. Got rid of most accessor functions in Annex. This allowed supporting multiple --exclude flags.	2011-01-26 00:17:38 -04:00
Joey Hess	e7b557ef5d	got rid of Core module Most of it was to do with managing annexed Content, so put there	2011-01-16 16:05:05 -04:00
Joey Hess	e43d4730c5	bugfix: Running `copy --to` when both local and remote had the key dropped it from local.	2011-01-07 02:14:22 -04:00
Joey Hess	f1b747e6d9	bugfix: Running `move --to` with a remote whose UUID was not yet known * bugfix: Running `move --to` with a remote whose UUID was not yet known could result in git-annex not recording on the local side where the file was moved to. This could not result in data loss, or even a significant problem, since the remote did record that it had the file. * Also, add a general guard to detect attempts to record information about repositories with missing UUIDs.	2011-01-04 17:45:27 -04:00
Joey Hess	700aed13cf	git-annex-shell now exclusively used for all remote access	2010-12-31 19:09:17 -04:00
Joey Hess	30e0065ab9	tuple makes it clearer	2010-12-31 15:52:59 -04:00
Joey Hess	eac433a84a	use git-annex-shell configlist	2010-12-31 15:46:33 -04:00
Joey Hess	f38aa3e83a	unfinished switch to using git-annex-shell	2010-12-30 20:31:52 -04:00
Joey Hess	a89a6f2114	refactor in preparation for adding a git-annex-shell command	2010-12-30 15:06:26 -04:00
Joey Hess	6a5be9d53c	rename some stuff and prepare to break out more into Command/*	2010-12-30 14:19:16 -04:00
Joey Hess	e64ffc212e	support trusted repositories that are not configured as remotes	2010-12-29 16:58:44 -04:00
Joey Hess	d475aac375	refactor	2010-12-29 16:21:38 -04:00
Joey Hess	e97d13e29b	Add copy subcommand.	2010-11-27 17:02:53 -04:00
Joey Hess	eeae910242	finished hlinting	2010-11-22 17:51:55 -04:00
Joey Hess	da0de293d1	refactor param seeking	2010-11-11 18:54:52 -04:00
Joey Hess	fb824f7eb0	use -- before filenames when running git add, git rm, etc	2010-11-10 14:15:21 -04:00
Joey Hess	070e8530c1	refactoring, no code changes really	2010-11-08 15:15:21 -04:00
Joey Hess	0eae5b806c	broke subcommands out into separate modules	2010-11-02 19:04:24 -04:00

1 2 3 4

177 commits