git-annex

Author	SHA1	Message	Date
Joey Hess	2fd294d06f	move --from, copy --from: 10 times faster scanning remote on local disk Rather than go through the location log to see which files are present on the remote, it simply looks at the disk contents directly. I benchmarked this speeding up scanning 834 files, from an annex on my phone's SSD, from 11.39 seconds to 1.31 seconds. (No files actually moved.) Also benchmarked 8139 files, from an annex on spinning storage, speeding up from 103.17 to 13.39 seconds. Note that benchmarking with an encrypted annex on flash actually showed a minor slowdown with this optimisation -- from 13.93 to 14.50 seconds. Seems the overhead of doing the crypto needed to get the filenames to directly check can be higher than the overhead of looking up data in the location log. (Which says good things about how well the location log and git have been optimised!) It may make sense to make encrypted local remotes not have hasKeyCheap set; further benchmarking is called for.	2012-02-26 14:59:48 -04:00
Joey Hess	61dbad505d	fsck --from remote --fast Avoids expensive file transfers, at the expense of checking file size and/or contents. Required some reworking of the remote code.	2012-01-20 13:23:11 -04:00
Joey Hess	06b0cb6224	add tmp flag parameter to retrieveKeyFile	2012-01-19 16:07:36 -04:00
Joey Hess	90319afa41	fsck --from Fscking a remote is now supported. It's done by retrieving the contents of the specified files from the remote, and checking them, so can be an expensive operation. (Several optimisations are possible, to speed it up, of course.. This is the slow and stupid remote fsck to start with.) Still, if the remote is a special remote, or a git repository that you cannot run fsck in locally, it's nice to have the ability to fsck it. If you have any directory special remotes, now would be a good time to fsck them, in case you were hit by the data loss bug fixed in the previous release!	2012-01-19 15:24:05 -04:00
Joey Hess	1f8a1058c9	tweak	2012-01-06 10:57:57 -04:00
Joey Hess	df21cbfdd2	look up --to and --from remote names only once This will speed up commands like move and drop.	2012-01-06 04:06:13 -04:00
Joey Hess	0a36f92a31	more command-specific options Made --from and --to command-specific options. Added generic storage for values of command-specific options, which allows removing some of the special case fields in AnnexState. (Also added generic storage for command-specific flags, although there are not yet any.) Note that this storage uses a Map, so repeatedly looking up the same value is slightly more expensive than looking up an AnnexState field. But, the value can be looked up once in the seek stage, transformed as necessary, and passed in a closure to the start stage, and this avoids that overhead. Still, I'm hesitant to use this for things like force or fast flags. It's probably best to reserve it for flags that are only used by a few commands, or options like --from and --to that it's important only be allowed to be used with commands that implement them, to avoid user confusion.	2012-01-06 03:16:42 -04:00
Joey Hess	4a02c2ea62	type alias cleanup	2011-12-31 04:11:58 -04:00
Joey Hess	95e748cbd4	inverted logic	2011-12-09 13:38:28 -04:00
Joey Hess	3f5f28b487	factor out a stopUnless code melt for lunch	2011-12-09 12:23:45 -04:00
Joey Hess	0f0169fa99	comment update	2011-11-20 22:49:53 -04:00
Joey Hess	1b90918cec	avoid error message when doing get --from on file not present on remote	2011-11-18 17:26:37 -04:00
Joey Hess	637b5feb45	lint	2011-11-11 01:52:58 -04:00
Joey Hess	b327227ba5	better limiting of start actions to only run whenAnnexed Mostly only refactoring, but this does remove one redundant stat of the symlink by copy.	2011-11-10 23:45:14 -04:00
Joey Hess	4389782628	tweak	2011-11-10 22:37:52 -04:00
Joey Hess	2de1e2c2ce	Optimized copy --from and get --from to avoid checking the location log for files that are already present. This can be a significant speedup when running in large trees that are only missing a few files; it makes copy --from just as fast as get.	2011-11-10 21:32:42 -04:00
Joey Hess	d3e1a3619f	safer inannex checking git-annex-shell inannex now always returns 0, 1, or 100 (the last when it's unclear if content is currently in the index due to it currently being moved or dropped). (Actual locking code still not yet written.)	2011-11-09 18:33:15 -04:00
Joey Hess	8ce7e73f74	reorg to allow taking content lock The lock will only persist during the perform stage, so the content must be removed from the annex then, rather than in the cleanup stage. (No lock is actually taken yet.)	2011-11-09 16:54:18 -04:00
Joey Hess	f97c783283	clean up check selection code This new approach allows filtering out checks from the default set that are not appropriate for a command, rather than having to list every check that is appropriate. It also reduces some boilerplate. Haskell does not define Eq for functions, so I had to go a long way around with each check having a unique id. Meh.	2011-10-29 15:19:05 -04:00
Joey Hess	6c31e3a8c3	drop --from is now supported to remove file content from a remote.	2011-10-28 17:26:38 -04:00
Joey Hess	5b74b130a3	refactored and generalized pre-command sanity checking	2011-10-27 16:31:35 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00
Joey Hess	b505ba83e8	minor syntax changes	2011-10-11 14:43:45 -04:00
Joey Hess	6a6ea06cee	rename	2011-10-05 16:02:51 -04:00
Joey Hess	cfe21e85e7	rename	2011-10-04 00:59:08 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	35145202d2	remove command type definitions These were a mistake, they make the type signatures harder to read and less flexible. The CommandSeek, CommandStart, CommandPerform, and CommandCleanup types were a good idea, but composing them with the parameters expected is going too far.	2011-09-15 16:50:49 -04:00
Joey Hess	e47d1fd43e	add error for move --auto It probably does not make sense to enable auto mode for move. I cannot think of a situation where it would make sense to try to use it. A hypothetical auto mode for move would only differ from a normal move in one case -- when both repositories have a file, move deletes it from one, and this reduces the number of copies. So an auto mode would either only let move work in that situation, or avoid removing the file in that situation, depending on the number of copies. This would be complex to implement, and is perhaps not a very obvious behavior. The error is a good thing to have, so users don't expect it to do something it does not.	2011-09-15 15:33:20 -04:00
Joey Hess	9fe3c6d211	clean up params in usage display	2011-09-15 14:33:37 -04:00
Joey Hess	00153eed48	unify ellipsis handling And add a simple dots-based progress display, currently only used in v2 upgrade.	2011-07-19 14:07:23 -04:00
Joey Hess	6c396a256c	finished hlint pass	2011-07-15 12:47:14 -04:00
Joey Hess	cdbcd6f495	add web special remote Generalized LocationLog to PresenceLog, and use a presence log to record urls for the web special remote.	2011-07-01 15:30:42 -04:00
Joey Hess	7ee636f6dd	avoid unnecessary read of trust.log	2011-06-23 13:39:04 -04:00
Joey Hess	80302d0b46	improve bare repo handling Many more commands can work in bare repos now, thanks to the git-annex branch.	2011-06-22 18:32:41 -04:00
Joey Hess	971ab27e78	better types allowed breaking module dep loop	2011-06-01 19:11:27 -04:00
Joey Hess	a8fb97d2ce	Add --trust, --untrust, and --semitrust options.	2011-06-01 17:57:31 -04:00
Joey Hess	d006586cd0	add a message in potentially confusing copy --fast failure situation	2011-05-16 13:27:19 -04:00
Joey Hess	56bc3e95ca	refactor some boilerplate	2011-05-15 02:02:46 -04:00
Joey Hess	76911a446a	Avoid using absolute paths when staging location log, as that can confuse git when a remote's path contains a symlink. Closes: #621386 This was a real PITA to fix, since location logs can be staged in both the current repo, as well as in local remote's repos, in which case the cwd will not be in the repo. And git add needs different params in both cases, when absolute paths are not used. In passing, git annex fsck now stages location log fixes.	2011-04-25 14:54:24 -04:00
Joey Hess	bc51387e6d	Periodically flush git command queue, to avoid bloating memory usage too much. Since the queue is flushed in between subcommand actions being run, there should be no issues with actions that expect to queue up some stuff and have it run after they do other stuff. So I didn't have to audit for such assumptions.	2011-04-07 13:59:31 -04:00
Joey Hess	ed7fc4fce9	Bugfix: copy --to --fast never really copied, fixed.	2011-04-01 12:34:06 -04:00
Joey Hess	4868b64868	Provide a less expensive version of `git annex copy --to`, enabled via --fast. This assumes that location tracking information is correct, rather than contacting the remote for every file.	2011-03-27 18:34:30 -04:00
Joey Hess	a70035e981	converted move to use Remote Drop old Remotes.hs, now unused!	2011-03-27 17:24:20 -04:00
Joey Hess	140a351fc5	avoid version check before running version and upgrade commands There are two types of commands; those that access the repository and those that don't. Sorted.	2011-03-19 18:58:49 -04:00
Joey Hess	49b7f59183	test suite passes again doesn't test remote functionality.. but that may be working too now	2011-03-15 22:53:14 -04:00
Joey Hess	9d49fe2c17	first pass at using new keys It compiles. It sorta works. Several subcommands are FIXME marked and broken, because things that used to accept separate --backend and --key params need to be changed to accept just a --key that encodes all the key info, now that there is metadata in keys.	2011-03-15 21:34:13 -04:00
Joey Hess	9f20aee219	avoid logging to location log when in a bare repo This assumes that changes to content in bare repos are made from some non-bare repo, and that the location log is updated on that side. That's true for move --from and move --to. It's not true for dropkey and setkey and recvkey. But those are plumbing level commands, so I guess it's ok to assume that someone running those in a bare repo knows what they're doing. And git-annex-shell is used to run those, and if the bare repo is non-local, it needs to be able to use them even though they cannot update the location log. So this seems unavoidable.	2011-03-03 15:22:53 -04:00
Joey Hess	fcdc4797a9	use ShellParam type So, I have a type checked safe handling of filenames starting with dashes, throughout the code.	2011-02-28 16:18:55 -04:00
Joey Hess	dee9655237	bugfix to move --to Due to recent changes, the remotes config was not read before the remote to act on was picked.	2011-01-27 15:45:22 -04:00


70 commits