git-annex

Author	SHA1	Message	Date
Joey Hess	86e8532c0a	allM has slightly better memory use	2014-07-26 22:34:40 -04:00
Joey Hess	67975bf50d	fix fallback to other chunk size when first does not have it	2014-07-26 22:25:50 -04:00
Joey Hess	1400cbb032	Support for remotes that are chunkable and encryptable. I'd have liked to keep these two concepts entirely separate, but they are entangled: Storing a key in an encrypted and chunked remote needs to generate chunk keys, encrypt the keys, chunk the data, encrypt the chunks, and send them to the remote. Similar for retrieval, etc. So, here's an implementation of all of that. The total win here is that every remote was implementing encrypted storage and retrieval, and now it can move into this single place. I expect this to result in several hundred lines of code being removed from git-annex eventually! This commit was sponsored by Henrik Ahlgren.	2014-07-26 20:14:31 -04:00
Joey Hess	d4d68f57e5	finish up basic chunked remote groundwork Chunk retrieval and reassembly, removal, and checking if all necessary chunks are present. This commit was sponsored by Damien Raude-Morvan.	2014-07-26 20:11:41 -04:00
Joey Hess	cf83697c33	reorg	2014-07-26 12:04:35 -04:00
Joey Hess	e4cb50db33	Merge branch 'master' into newchunks	2014-07-26 12:02:48 -04:00
Joey Hess	005aded3e0	Fix cost calculation for non-encrypted remotes. Encryptable types of remotes that were not actually encrypted still had the encryptedRemoteCostAdj applied to their configured cost, which was a bug.	2014-07-25 17:29:59 -04:00
Joey Hess	ab4cce4114	core implementation of new style chunking Not yet used by any special remotes, but should not be too hard to add it to most of them. storeChunks is the hairy bit! It's loosely based on Remote.Directory.storeLegacyChunked. The object is read in using a lazy bytestring, which is streamed through, creating chunks as needed, without ever buffering more than 1 chunk in memory. Getting the progress meter update to work right was also fun, since progress meter values are absolute. Finessed by constructing an offset meter. This commit was sponsored by Richard Collins.	2014-07-25 16:20:32 -04:00
Joey Hess	ceea04e77f	move meteredWriteFileChunks out of legacy	2014-07-24 16:42:35 -04:00
Joey Hess	e2c44bf656	implement chunk logs Slightly tricky as they are not normal UUIDBased logs, but are instead maps from (uuid, chunksize) to chunkcount. This commit was sponsored by Frank Thomas.	2014-07-24 16:23:36 -04:00
Joey Hess	bbdb2c04d5	improve chunk data types	2014-07-24 15:08:07 -04:00
Joey Hess	9e2d49d441	prepare for new style chunking Moved old legacy chunking code, and cleaned up the directory and webdav remotes use of it, so when no chunking is configured, that code is not used. The config for new style chunking will be chunk=1M instead of chunksize=1M. There should be no behavior changes from this commit. This commit was sponsored by Andreas Laas.	2014-07-24 14:49:22 -04:00
Joey Hess	26ee27915a	refactor locking	2014-07-10 00:32:23 -04:00
Joey Hess	c34b5e09f8	factor out getRemoteGitConfig	2014-05-16 16:08:20 -04:00
Fraser Tweedale	4eb72392b4	execute remote.<name>.annex-shell on remote, if set It is useful to be able to specify an alternative git-annex-shell program to execute on the remote, e.g., to run a version not on the PATH. Use remote.<name>.annex-shell if specified, instead of the default "git-annex-shell" i.e., first so-named executable on the PATH.	2014-05-16 15:46:43 -04:00
Joey Hess	f00cb21037	Bring back rsync -p, but only when git-annex is running on a non-crippled file system. This is a better approach to fix #700282 while not unnecessarily losing file permissions on non-crippled systems.	2014-04-17 14:31:42 -04:00
Joey Hess	b63276309e	clean up cleanup action enumeration	2014-03-13 19:06:26 -04:00
Joey Hess	fa24ba2520	plumb creds from webapp to initremote Avoids abusing setting environment variables, which was always a hack and won't work on windows.	2014-02-11 14:07:56 -04:00
Joey Hess	089c0109a2	Added ways to configure rsync options to be used only when uploading or downloading from a remote. Useful to eg limit upload bandwidth.	2014-02-02 16:06:34 -04:00
Joey Hess	891c85cd88	use locking on Windows This is all the easy cases, where there was already a separate lock file.	2014-01-28 14:42:03 -04:00
Joey Hess	1ca111620d	reorg	2014-01-26 16:32:55 -04:00
Joey Hess	c20f31a1ad	add GETAVAILABILITY to external special remote protocol And some reworking of types, and added an annex-availability git config setting.	2014-01-13 14:41:10 -04:00
Joey Hess	f7727d2df1	Remotes can now be made read-only, by setting remote.<name>.annex-readonly	2014-01-02 13:12:32 -04:00
Joey Hess	58db042033	map: Work when there are gcrypt remotes.	2013-11-04 14:14:44 -04:00
Joey Hess	5756636486	directory, webdav: Fix bug introduced in version 4.20131002 that caused the chunkcount file to not be written. Work around repositories without such a file, so files can still be retrieved from them.	2013-10-26 15:03:12 -04:00
Joey Hess	06ea92282f	fix inverted logic when determining whether to write a chunkcount file late-night hlint bit me on this one.. Reviewed `c1990702e9` and the rest of it seems ok	2013-10-26 14:08:29 -04:00
Joey Hess	571fe4999b	remove __WINDOWS__ ifdef	2013-10-06 17:23:30 -04:00
Joey Hess	4e1e625fa6	fix transferring to gcrypt repo from direct mode repo recvkey was told it was receiving a HMAC key from a direct mode repo, and that confused it into rejecting the transfer, since it has no way to verify a key using that backend, since there is no HMAC backend. I considered making recvkey skip verification in the case of an unknown backend. However, that could lead to bad results; a key can legitimately be in the annex with a backend that the remote git-annex-shell doesn't know about. Better to keep it rejecting if it cannot verify. Instead, made the gcrypt special remote not set the direct mode flag when sending (and receiving) files. Also, added some recvkey messages when its checks fail, since otherwise all that is shown is a confusing error message from rsync when the remote git-annex-shell exits nonzero.	2013-10-01 14:38:46 -04:00
Joey Hess	c1990702e9	hlint	2013-09-25 23:19:01 -04:00
Joey Hess	f9e438c1bc	factor out more ssh stuff from git remote This has the dual benefits of making Remote.Git shorter, and letting Remote.GCrypt use these utilities.	2013-09-24 13:37:41 -04:00
Joey Hess	7c1a9cdeb9	partially complete gcrypt remote (local send done; rest not) This is a git-remote-gcrypt encrypted special remote. Only sending files in to the remote works, and only for local repositories. Most of the work so far has involved making initremote work. A particular problem is that remote setup in this case needs to generate its own uuid, derived from the gcrypt-id. That required some larger changes in the code to support. For ssh remotes, this will probably just reuse Remote.Rsync's code, so should be easy enough. And for downloading from a web remote, I will need to factor out the part of Remote.Git that does that. One particular thing that will need work is supporting hot-swapping a local gcrypt remote. I think it needs to store the gcrypt-id in the git config of the local remote, so that it can check it every time, and compare with the cached annex-uuid for the remote. If there is a mismatch, it can change both the cached annex-uuid and the gcrypt-id. That should work, and I laid some groundwork for it by already reading the remote's config when it's local. (Also needed for other reasons.) This commit was sponsored by Daniel Callahan.	2013-09-07 18:38:00 -04:00
Joey Hess	89eecd4b3b	rename constructor for clarity	2013-09-05 11:12:01 -04:00
guilhem	ac9807c887	Leverage an ambiguities between Ciphers Cipher is now a datatype data Cipher = Cipher String | MacOnlyCipher String which makes more precise its interpretation MAC-only vs. MAC + used to derive a key for symmetric crypto.	2013-09-05 11:09:08 -04:00
Joey Hess	2b9f3cc175	tabs	2013-09-04 22:47:53 -04:00
Joey Hess	a51f1a4ee4	unimportant tweak fix something my internal haskell parser does a double take at	2013-09-04 22:39:25 -04:00
Joey Hess	930e6d22d6	replace an over-explained Bool with a data type This also highlights several places where a Read/Show or similar for the new data type could avoid redundant strings.	2013-09-04 22:18:33 -04:00
guilhem	3999a860eb	Encryption defaults to 'hybrid' When a keyid= is specified while encryption= is absent.	2013-09-04 21:34:33 -04:00
guilhem	8293ed619f	Allow public-key encryption of file content. With the initremote parameters "encryption=pubkey keyid=788A3F4C". /!\ Adding or removing a key has NO effect on files that have already been copied to the remote. Hence using keyid+= and keyid-= with such remotes should be used with care, and make little sense unless the point is to replace a (sub-)key by another. /!\ Also, a test case has been added to ensure that the cipher and file contents are encrypted as specified by the chosen encryption scheme.	2013-09-03 14:34:16 -04:00
guilhem	53ce59021a	Allow revocation of OpenPGP keys. /!\ It is to be noted that revoking a key does NOT necessarily prevent the owner of its private part from accessing data on the remote /!\ The only sound use of `keyid-=` is probably to replace a (sub-)key by another, where the private part of both is owned by the same person/entity: git annex enableremote myremote keyid-=2512E3C7 keyid+=788A3F4C Reference: http://git-annex.branchable.com/bugs/Using_a_revoked_GPG_key/ * Other change introduced by this patch: New keys now need to be added with option `keyid+=`, and the scheme specified (upon initremote only) with `encryption=`. The motivation for this change is to open for new schemes, e.g., strict asymmetric encryption. git annex initremote myremote encryption=hybrid keyid=2512E3C7 git annex enableremote myremote keyid+=788A3F4C	2013-08-29 14:31:33 -04:00
Joey Hess	a3224ce35b	avoid more build warnings on Windows	2013-08-04 14:05:36 -04:00
Joey Hess	06db8e0bd9	squash compiler warnings on Windows	2013-08-04 13:18:05 -04:00
Joey Hess	667a832de9	print encryption setup message before action	2013-05-18 19:36:55 -04:00
Joey Hess	abe8d549df	fix permission damage (thanks, Windows)	2013-05-11 23:54:25 -04:00
Joey Hess	3c7e30a295	git-annex now builds on Windows (doesn't work)	2013-05-11 15:03:00 -05:00
Joey Hess	8a2d1988d3	expose Control.Monad.join I think I've been looking for that function for some time. Ie, I remember wanting to collapse Just Nothing to Nothing.	2013-04-22 20:24:53 -04:00
Joey Hess	b9904b0c42	fix tab damage	2013-04-13 19:26:59 -04:00
guilhem	a1eded8641	Allow rsync to use other remote shells. Introduced a new per-remote option 'annex-rsync-transport' to specify the remote shell that is to be used with rsync. In case the value is 'ssh', connections are cached unless 'sshcaching' is unset.	2013-04-13 19:26:24 -04:00
Joey Hess	9e11699c76	connect existing meters to the transfer log for downloads Most remotes have meters in their implementations of retrieveKeyFile already. Simply hooking these up to the transfer log makes that information available. Easy peasy. This is particularly valuable information for encrypted remotes, which otherwise bypass the assistant's polling of temp files, and so don't have good progress bars yet. Still some work to do here (see progressbars.mdwn changes), but this is entirely an improvement from the lack of progress bars for encrypted downloads.	2013-04-11 17:32:31 -04:00
Joey Hess	c511eb048f	changelog & minor style fixes	2013-04-06 16:14:57 -04:00
guilhem	00fc21bfec	Generate ciphers with a better entropy. Unless highRandomQuality=false (or --fast) is set, use Libgcrypt's 'GCRY_VERY_STRONG_RANDOM' level by default for cipher generation, like it's done for OpenPGP key generation. On the assistant side, the random quality is left to the old (lower) level, in order not to scare the user with an endless page load due to the blocking PRNG waiting for IO actions.	2013-04-06 16:09:51 -04:00
guilhem	55f0f858ee	Allow other MAC algorithms in the Remote Config.	2013-03-29 18:04:52 -04:00
Joey Hess	cf07a2c412	webapp: Progress bar fixes for many types of special remotes. There was confusion in different parts of the progress bar code about whether an update contained the total number of bytes transferred, or the number of bytes transferred since the last update. One way this bug showed up was progress bars that seemed to stick at zero for a long time. In order to fix it comprehensively, I add a new BytesProcessed data type, that is explicitly a total quantity of bytes, not a delta. Note that this doesn't necessarily fix every problem with progress bars. Particularly, buffering can now cause progress bars to seem to run ahead of transfers, reaching 100% when data is still being uploaded.	2013-03-28 17:04:37 -04:00
Joey Hess	19c0a0d5b1	split cost out into its own module Added a function to insert a new cost into a list, which could be used to adjust costs after a drag and drop.	2013-03-13 16:30:34 -04:00
Joey Hess	0c13d3065e	git subcommand cleanup Pass subcommand as a regular param, which allows passing git parameters like -c before it. This was already done in the piping set of functions, but not the command running set.	2013-03-03 13:39:07 -04:00
Joey Hess	24c6eae1b5	show errors	2013-01-02 13:50:16 -04:00
Joey Hess	4008590c68	type based git config handling for remotes Still a couple of places that use git config ad-hoc, but this is most of it done.	2013-01-01 13:58:14 -04:00
Joey Hess	0b6c889012	webapp: S3 and Glacier forms now have a select list of all currently-supported AWS regions.	2012-12-01 14:11:37 -04:00
Joey Hess	020a25abe1	avoid unnecessary Maybe	2012-11-30 00:55:59 -04:00
Joey Hess	a5111a6d85	Amazon Glacier special remote; 100% working	2012-11-20 16:43:58 -04:00
Joey Hess	9221e62d87	Allow controlling whether login credentials for S3 and webdav are committed to the repository, by setting embedcreds=yes|no when running initremote.	2012-11-19 17:32:58 -04:00
Joey Hess	5f977cc725	directory special remote: Made more efficient and robust. Files are now written to a tmp directory in the remote, and once all chunks are written, etc, it's moved into the final place atomically. For now, checkpresent still checks every single chunk of a file, because the old method could leave partially transferred files with some chunks present and others not.	2012-11-19 13:18:23 -04:00
Joey Hess	7df1e71fe3	S3: Added progress display for uploading and downloading.	2012-11-18 22:49:07 -04:00
Joey Hess	c8751be151	simplify	2012-11-18 18:27:53 -04:00
Joey Hess	81379bb29c	better streaming while encrypting/decrypting Both the directory and webdav special remotes used to have to buffer the whole file contents before it could be decrypted, as they read from chunks. Now the chunks are streamed through gpg with no buffering.	2012-11-18 15:27:44 -04:00
Joey Hess	1fe76b57d6	webdav now checks presence of and receives chunked content Note that receiving encrypted chunked content currently involves buffering. (So does doing so with the directory special remote.)	2012-11-16 23:16:18 -04:00
Joey Hess	92d5d81c2c	generic chunked content helper However, directory still uses its optimized chunked file writer, as it uses less memory than the generic one in the helper.	2012-11-16 17:58:08 -04:00
Joey Hess	2172cc586e	where indenting	2012-11-11 00:51:07 -04:00
Joey Hess	e4bf74a965	store S3 creds in a 600 mode file inside the local git repo	2012-09-26 14:42:32 -04:00
Joey Hess	226781c047	unify types	2012-09-21 14:50:14 -04:00
Joey Hess	aff09a1f33	add a progress callback to storeKey, and threaded it all the way through Transfer info files are updated when the callback is called, updating the number of bytes transferred. Left unused p variables at every place the callback should be used. Which is rather a lot..	2012-09-19 16:08:37 -04:00
Joey Hess	760e028dca	pass associatedfile and remoteuuid to git-annex-shell This almost works. Along the way, I noticed that the --uuid parameter was being accidentally passed after the --, so that has never been actually used by git-annex-shell to verify it's running in the expected repository. Oops. Fixed.	2012-07-02 10:57:51 -04:00
Joey Hess	7225c2bfc0	record transfer information on local git remotes In order to record a semi-useful filename associated with the key, this required plumbing the filename all the way through to the remotes' storeKey and retrieveKeyFile. Note that there is potential for deadlock here, narrowly avoided. Suppose the repos are A and B. A sends file foo to B, and at the same time, B gets file foo from A. So, A locks its upload transfer info file, and then locks B's download transfer info file. At the same time, B is taking the two locks in the opposite order. This is only not a deadlock because the lock code does not wait, and aborts. So one of A or B's transfers will be aborted and the other transfer will continue. Whew!	2012-07-01 17:15:11 -04:00
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	bd592d1450	refactor	2012-04-29 14:33:07 -04:00
Joey Hess	1c16f616df	Added shared cipher mode to encryptable special remotes. This option avoids gpg key distribution, at the expense of flexibility, and with the requirement that all clones of the git repository be equally trusted.	2012-04-29 14:02:43 -04:00
Joey Hess	ed79596b75	noop	2012-04-21 23:32:33 -04:00
Joey Hess	bee420bd2d	in which I discover void void :: Functor f => f a -> f () -- ah, of course that's useful :)	2012-04-21 23:06:19 -04:00
Joey Hess	b98b69e8c6	honor core.sharedRepository when making all the other files in the annex Lock files, directories, etc.	2012-04-21 19:36:03 -04:00
Joey Hess	4eb5112681	rationalize getConfig getConfig got a remote-specific config, and this confusing name caused it to be used a couple of places that only were interested in global configs. Rename to getRemoteConfig and make getConfig only get global configs. There are no behavior changes here, but remote.<name>.annex-web-options never actually worked (and per-remote web options is a very unlikely to be useful case so I didn't make it work), so fix the documentation for it.	2012-03-22 17:32:47 -04:00
Joey Hess	c0c9991c9f	nukes another 15 lines thanks to ifM	2012-03-15 20:39:25 -04:00
Joey Hess	52e88f3ebf	add remote start and stop hooks Locking is used, so that, if there are multiple git-annex processes using a remote concurrently, the stop hook is only run by the last process that uses it.	2012-03-04 19:12:58 -04:00
Joey Hess	7ba79cfb8c	thread through original key to retrieveEnctypted Allows showing progress bar for this last case of the directory special remote.	2012-03-04 03:36:39 -04:00
Joey Hess	eb9001044f	order user provided params after connection caching params So the user can override them.	2012-01-20 17:32:32 -04:00
Joey Hess	47250a153a	ssh connection caching Ssh connection caching is now enabled automatically by git-annex. Only one ssh connection is made to each host per git-annex run, which can speed some things up a lot, as well as avoiding repeated password prompts. Concurrent git-annex processes also share ssh connections. Cached ssh connections are shut down when git-annex exits. Note: The rsync special remote does not yet participate in the ssh connection caching.	2012-01-20 17:14:56 -04:00
Joey Hess	61dbad505d	fsck --from remote --fast Avoids expensive file transfers, at the expense of checking file size and/or contents. Required some reworking of the remote code.	2012-01-20 13:23:11 -04:00
Joey Hess	06b0cb6224	add tmp flag parameter to retrieveKeyFile	2012-01-19 16:07:36 -04:00
Joey Hess	16e7178f20	reorg	2012-01-10 15:29:10 -04:00
Joey Hess	4a02c2ea62	type alias cleanup	2011-12-31 04:11:58 -04:00
Joey Hess	ef28b3fef7	split out Git/Command.hs	2011-12-14 15:56:11 -04:00
Joey Hess	02f1bd2bf4	split more stuff out of Git.hs	2011-12-14 15:43:13 -04:00
Joey Hess	13fff71f20	split out three modules from Git Constructors and configuration make sense in separate modules. A separate Git.Types is needed to avoid cycles.	2011-12-13 15:06:49 -04:00
Joey Hess	e3f1568e0f	Fix caching of decrypted ciphers, which failed when drop had to check multiple different encrypted special remotes.	2011-12-08 16:01:46 -04:00
Joey Hess	637b5feb45	lint	2011-11-11 01:52:58 -04:00
Joey Hess	56b8194470	cleanup	2011-11-09 01:33:20 -04:00
Joey Hess	bf460a0a98	reorder repo parameters last Many functions took the repo as their first parameter. Changing it consistently to be the last parameter allows doing some useful things with currying, that reduce boilerplate. In particular, g <- gitRepo is almost never needed now, instead use inRepo to run an IO action in the repo, and fromRepo to get a value from the repo. This also provides more opportunities to use monadic and applicative combinators.	2011-11-08 16:27:20 -04:00
Joey Hess	b11a63a860	clean up read/show abuse Avoid ever using read to parse a non-haskell formatted input string. show :: Key is arguably still show abuse, but displaying Keys as filenames is just too useful to give up.	2011-11-08 00:17:54 -04:00
Joey Hess	63a292324d	add a UUID type Should have done this a long time ago.	2011-11-07 15:59:16 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00
Joey Hess	9fa9214106	A remote can have an annexUrl configured, that is used by git-annex instead of its usual url. (Similar to pushUrl.)	2011-10-14 18:18:28 -04:00
Joey Hess	b505ba83e8	minor syntax changes	2011-10-11 14:43:45 -04:00
Joey Hess	6a6ea06cee	rename	2011-10-05 16:02:51 -04:00
Joey Hess	cfe21e85e7	rename	2011-10-04 00:59:08 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	203148363f	split groups of related functions out of Utility	2011-08-22 16:14:12 -04:00
Joey Hess	737b5d14c9	moved files around	2011-08-20 16:11:42 -04:00
Joey Hess	ec746c511f	note about why curl -# is used I'd rather use wget really, but as git-annex uses libcurl elsewhere, it seems best to stick with curl. And making this configurable seems overboard.	2011-08-20 12:52:29 -04:00
Joey Hess	a55faff08f	reorg Remote/*	2011-08-16 20:49:54 -04:00
Joey Hess	4545a0e78c	split out generic url stuff into a helper library from Remote.Web	2011-08-16 20:49:44 -04:00


309 commits