git-annex

Author	SHA1	Message	Date
Joey Hess	5c3d9d6caa	show multipart configuration in git annex info s3remote	2014-11-03 16:07:41 -04:00
Joey Hess	a3ec6ed73b	Merge branch 'master' into s3-aws-multipart	2014-11-03 16:05:03 -04:00
Joey Hess	8faeb25076	finish multipart support using unreleased update to aws lib to yield etags Untested and not even compiled yet. Testing should include checks that file content streams through without buffering in memory. Note that CL.consume causes all the etags to be buffered in memory. This is probably nearly unavoidable, since a request has to be constructed that contains the list of etags in its body. (While it might be possible to stream generation of the body, that would entail making a http request that dribbles out parts of the body as the multipart uploads complete, which is not likely to work well.. To limit this being a problem, it's best for partsize to be set to some suitably large value, like 1gb. Then a full terabyte file will need only 1024 etags to be stored, which will probably use around 1 mb of memory.	2014-11-03 16:04:55 -04:00
Joey Hess	39dd5a2ac3	improve uuid mismatch message	2014-10-28 15:54:44 -04:00
Joey Hess	6e89d070bc	WIP multipart S3 upload I'm a little stuck on getting the list of etags of the parts. This seems to require taking the md5 of each part locally, which doesn't get along well with lazily streaming in the part from the file. It would need to read the file twice, or lose laziness and buffer a whole part -- but parts might be quite large. This seems to be a problem with the API provided; S3 is supposed to return an etag, but that is not exposed. I have filed a bug: https://github.com/aristidb/aws/issues/141	2014-10-28 14:17:30 -04:00
Joey Hess	8ed1a0afee	fix build	2014-10-23 16:52:05 -04:00
Joey Hess	8edf7a0fc3	fix build	2014-10-23 16:51:10 -04:00
Joey Hess	171e677a3c	update for aws 0.10's better handling of DNE for HEAD Kept support for older aws, since Debian has 0.9.2 still.	2014-10-23 16:32:18 -04:00
Joey Hess	fa1318479e	rename isIA to configIA Already done on s3-aws branch, so reduce divergence.	2014-10-23 15:56:35 -04:00
Joey Hess	6acc6863c5	fix build	2014-10-23 15:54:00 -04:00
Joey Hess	7489f516bc	one last build fix, yes it builds now	2014-10-23 15:50:41 -04:00
Joey Hess	76ee815e89	needs type families	2014-10-23 15:48:37 -04:00
Joey Hess	f0989cf0bd	fix build	2014-10-23 15:41:57 -04:00
Joey Hess	8b48bdfdc8	enable frankfurt The aws library supports the AWS4-HMAC-SHA256 that it requires.	2014-10-23 11:02:24 -04:00
Joey Hess	4eefc12295	Merge branch 'master' into s3-aws	2014-10-23 11:02:14 -04:00
Joey Hess	e687c61d04	add new frankfurt region to list in webapp But commented out for now, because: The authorization mechanism you have provided is not supported. Please use AWS4-HMAC-SHA256	2014-10-23 11:02:02 -04:00
Joey Hess	35551d0ed0	Merge branch 'master' into s3-aws Conflicts: Remote/S3.hs	2014-10-22 17:14:38 -04:00
Joey Hess	5c15d6d3cc	show in info whether a remote uses hybrid encryption or not	2014-10-22 14:39:59 -04:00
Joey Hess	3006b79c86	include creds info for glacier and webdav That and S3 are all that uses creds currently, except that external remotes can use creds. I have not handled showing info about external remote creds because they can have 0, 1, or more separate cred pairs, and there's no way for info to enumerate them or know how they're used. So it seems ok to leave out creds info for external remotes.	2014-10-22 13:56:14 -04:00
Joey Hess	1b90838bbd	add internet archive item url to info	2014-10-21 15:34:32 -04:00
Joey Hess	9280fe4cbe	include creds location in info This is intended to let the user easily tell if a remote's creds are coming from info embedded in the repository, or instead from the environment, or perhaps are locally stored in a creds file. This commit was sponsored by Frédéric Schütz.	2014-10-21 15:09:40 -04:00
Joey Hess	a0297915c1	add per-remote-type info Now `git annex info $remote` shows info specific to the type of the remote, for example, it shows the rsync url. Remote types that support encryption or chunking also include that in their info. This commit was sponsored by Ævar Arnfjörð Bjarmason.	2014-10-21 14:36:09 -04:00
Joey Hess	fced322834	glacier: Fix pipe setup when calling glacier-cli to retrieve an object.	2014-10-20 15:11:01 -04:00
Joey Hess	ef3804bdb3	S3: Fix embedcreds=yes handling for the Internet Archive. Before, embedcreds=yes did not cause the creds to be stored in remote.log, but also prevented them being locally cached.	2014-10-12 13:15:52 -04:00
Joey Hess	9fd95d9025	indent with tabs not spaces Found these with: git grep "^ " $(find -type f -name \*.hs) \|grep -v ': where' Unfortunately there is some inline hamlet that cannot use tabs for indentation. Also, Assistant/WebApp/Bootstrap3.hs is a copy of a module and so I'm leaving it as-is.	2014-10-09 15:09:26 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	0ed33c8b74	deal with old repositories with non-encrypted creds See `2f3c3aa01f` for backstory about how a repo could be in this state. When decryption fails, the repo must be using non-encrypted creds. Note that creds are encrypted/decrypted using the encryption cipher which is stored in the repo, so the decryption cannot fail due to missing gpg keys etc. (For !shared encryptiom, the cipher is iteself encrypted using some gpg key(s), and the decryption of the cipher happens earlier, so not affected by this change. Print a warning message for !shared repos, and continue on using the cipher. Wrote a page explaining what users hit by this bug should do. This commit was sponsored by Samuel Tardieu.	2014-09-18 17:58:03 -04:00
Joey Hess	2f3c3aa01f	glacier, S3: Fix bug that caused embedded creds to not be encypted using the remote's key. encryptionSetup must be called before setRemoteCredPair. Otherwise, the RemoteConfig doesn't have the cipher in it, and so no cipher is used to encrypt the embedded creds. This is a security fix for non-shared encryption methods! For encryption=shared, there's no security problem, just an inconsistentency in whether the embedded creds are encrypted. This is very important to get right, so used some types to help ensure that setRemoteCredPair is only run after encryptionSetup. Note that the external special remote bypasses the type safety, since creds can be set after the initial remote config, if the external special remote program requests it. Also note that IA remotes never use encryption, so encryptionSetup is not run for them at all, and again the type safety is bypassed. This leaves two open questions: 1. What to do about S3 and glacier remotes that were set up using encryption=pubkey/hybrid with embedcreds? Such a git repo has a security hole embedded in it, and this needs to be communicated to the user. Is the changelog enough? 2. enableremote won't work in such a repo, because git-annex will try to decrypt the embedded creds, which are not encrypted, so fails. This needs to be dealt with, especially for ecryption=shared repos, which are not really broken, just inconsistently configured. Noticing that problem for encryption=shared is what led to commit `fbdeeeed5f`, which tried to fix the problem by not decrypting the embedded creds. This commit was sponsored by Josh Taylor.	2014-09-18 17:26:12 -04:00
Joey Hess	d84eab8a8a	Revert "S3, Glacier, WebDAV: Fix bug that prevented accessing the creds when the repository was configured with encryption=shared embedcreds=yes." This reverts commit `fbdeeeed5f`. I can find no basis for that commit and think that I made it in error. setRemoteCredPair always encrypts using the cipher from remoteCipher, even when the cipher is shared.	2014-09-18 15:21:47 -04:00
Joey Hess	f7847ae98d	Merge branch 'master' into s3-aws Conflicts: Utility/Url.hs debian/changelog git-annex.cabal	2014-09-18 14:36:20 -04:00
Joey Hess	9964584c34	WebDav: Fix enableremote crash when the remote already exists. (Bug introduced in version 5.20140817.)	2014-09-17 13:04:55 -04:00
Joey Hess	a97c9e43b7	The annex-rsync-transport configuration is now also used when checking if a key is present on a rsync remote, and when dropping a key from the remote.	2014-09-11 13:21:35 -04:00
Joey Hess	b874f84086	New annex.hardlink setting. Closes: #758593 * New annex.hardlink setting. Closes: #758593 * init: Automatically detect when a repository was cloned with --shared, and set annex.hardlink=true, as well as marking the repository as untrusted. Had to reorganize Logs.Trust a bit to avoid a cycle between it and Annex.Init.	2014-09-05 13:44:09 -04:00
Joey Hess	6eb5c3f479	Do not preserve permissions and acls when copying files from one local git repository to another. Timestamps are still preserved as long as cp --preserve=timestamps is supported. This avoids cp -a overriding the default mode acls that the user might have set in a git repository. With GNU cp, this behavior change should not be a breaking change, because git-anex also uses rsync sometimes in the same situation, and has only ever preserved timestamps when using rsync. Systems without GNU cp will no longer use cp -a, but instead just cp. So, timestamps will no longer be preserved. Preserving timestamps when copying between repos is not guaranteed anyway. Closes: #729757	2014-08-26 17:10:25 -07:00
Joey Hess	aebcc395ff	use types to enforce that removeAnnex can only be called inside lockContent This fixed one bug where it needed to be and wasn't (in Assistant.Unused). And also found one place where lockContent was used unnecessarily (by drop --from remote). A few other places like uninit probably don't really need to lockContent, but it doesn't hurt to do call it anyway. This commit was sponsored by David Wagner.	2014-08-20 20:13:47 -04:00
Joey Hess	1994771215	more lock file refactoring Also fixes a test suite failures introduced in recent commits, where inAnnexSafe failed in indirect mode, since it tried to open the lock file ReadWrite. This is why the new checkLocked opens it ReadOnly. This commit was sponsored by Chad Horohoe.	2014-08-20 18:58:14 -04:00
Joey Hess	d279180266	reorganize and refactor lock code Added a convenience Utility.LockFile that is not a windows/posix portability shim, but still manages to cut down on the boilerplate around locking. This commit was sponsored by Johan Herland.	2014-08-20 16:45:58 -04:00
Joey Hess	96dc423e39	When accessing a local remote, shut down git-cat-file processes afterwards, to ensure that remotes on removable media can be unmounted. Closes: #758630 This does mean that eg, copying multiple files to a local remote will become slightly slower, since it now restarts git-cat-file after each copy. Should not be significant slowdown. The reason git-cat-file is run on the remote at all is to update its location log. In order to add an item to it, it needs to get the current content of the log. Finding a way to avoid needing to do that would be a good path to avoiding this slowdown if it does become a problem somehow. This commit was sponsored by Evan Deaubl.	2014-08-20 12:07:57 -04:00
Joey Hess	83dc82c232	forgot some lifts	2014-08-20 11:51:47 -04:00
Joey Hess	092041fab0	Ensure that all lock fds are close-on-exec, fixing various problems with them being inherited by child processes such as git commands. (With the exception of daemon pid locking.) This fixes at part of #758630. I reproduced the assistant locking eg, a removable drive's annex journal lock file and forking a long-running git-cat-file process that inherited that lock. This did not affect Windows. Considered doing a portable Utility.LockFile layer, but git-annex uses posix locks in several special ways that have no direct Windows equivilant, and it seems like it would mostly be a complication. This commit was sponsored by Protonet.	2014-08-20 11:37:02 -04:00
Joey Hess	ef01ff1e77	Merge branch 'master' into s3-aws Conflicts: git-annex.cabal	2014-08-15 17:30:40 -04:00
Joey Hess	fbdeeeed5f	S3, Glacier, WebDAV: Fix bug that prevented accessing the creds when the repository was configured with encryption=shared embedcreds=yes. Since encryption=shared, the encryption key is stored in the git repo, so there is no point at all in encrypting the creds, also stored in the git repo with that key. So `initremote` doesn't. The creds are simply stored base-64 encoded. However, it then tried to always decrypt creds when encryption was used..	2014-08-12 15:35:29 -04:00
Joey Hess	6adbd50cd9	testremote: Add testing of behavior when remote is not available Added a mkUnavailable method, which a Remote can use to generate a version of itself that is not available. Implemented for several, but not yet all remotes. This allows testing that checkPresent properly throws an exceptions when it cannot check if a key is present or not. It also allows testing that the other methods don't throw exceptions in these circumstances. This immediately found several bugs, which this commit also fixes! * git remotes using ssh accidentially had checkPresent return an exception, rather than throwing it * The chunking code accidentially returned False rather than propigating an exception when there were no chunks and checkPresent threw an exception for the non-chunked key. This commit was sponsored by Carlo Matteo Capocasa.	2014-08-10 15:02:59 -04:00
Joey Hess	5fc54cb182	auto-create IA buckets Needs my patch to aws which will hopefully be accepted soon.	2014-08-09 22:17:40 -04:00
Joey Hess	445f04472c	better memoization	2014-08-09 22:13:03 -04:00
Joey Hess	5ee72b1bae	fix meter update	2014-08-09 16:49:31 -04:00
Joey Hess	3659cb9efb	S3: finish converting to aws library Implemented the Retriever. Unfortunately, it is a fileRetriever and not a byteRetriever. It should be possible to convert this to a byteRetiever, but I got stuck: The conduit sink needs to process individual chunks, but a byteRetriever needs to pass a single L.ByteString to its callback for processing. I looked into using unsafeInerlaveIO to build up the bytestring lazily, but the sink is already operating under conduit's inversion of control, and does not run directly in IO anyway. On the plus side, no more memory leak..	2014-08-09 15:58:01 -04:00
Joey Hess	57872b457b	pass metadata headers and storage class to S3 when putting objects	2014-08-09 14:44:53 -04:00
Joey Hess	1ba1e37be3	remove dead code	2014-08-09 14:30:28 -04:00
Joey Hess	4f007ace87	S3: convert to aws for store, remove, checkPresent Fixes the memory leak on store.. the second oldest open git-annex bug! Only retrieve remains to be converted. This commit was sponsored by Scott Robinson.	2014-08-09 14:26:19 -04:00

... 2 3 4 5 6 ...

792 commits