git-annex

Author	SHA1	Message	Date
Joey Hess	2192c54877	more robust fallback when a file is available from multiple torrents and some torrent files cannot be downloaded	2014-12-17 14:38:04 -04:00
Joey Hess	bf9df3fc7e	fix fencepost error and aria resume after partial download of multi-file torrent	2014-12-17 14:21:48 -04:00
Joey Hess	3a7d0be120	remove excess directory	2014-12-17 14:17:19 -04:00
Joey Hess	d5cbbe1b9a	fix torrentUrlNum when there is no #n	2014-12-17 14:07:05 -04:00
Joey Hess	7e422269a6	move dummy uuids to Annex.UUID	2014-12-17 13:57:52 -04:00
Joey Hess	af05ac3ec2	add aria2 progress parsing	2014-12-17 13:40:04 -04:00
Joey Hess	a7690de016	Added bittorrent special remote. addurl behavior change: When downloading an url ending in .torrent, it will download files from bittorrent, instead of the old behavior of adding the torrent file to the repository. Added Recommends on aria2 and bittornado | bittorrent. This commit was sponsored by Asbjørn Sloth Tønnesen.	2014-12-16 23:22:46 -04:00
Joey Hess	65bce2c80d	reformat	2014-12-16 15:26:13 -04:00
Joey Hess	67c05daf5e	sanitize filepaths provided by checkUrl	2014-12-11 20:08:49 -04:00
Joey Hess	8a17bcb0be	simplify external special remote implementation	2014-12-11 17:44:27 -04:00
Joey Hess	bce7e0dd96	use subdir for addurl when it creates multiple files The --file parameter specifies the subdir in this mode.	2014-12-11 16:09:56 -04:00
Joey Hess	2cd84fcc8b	Expand checkurl to support recommended filename, and multi-file-urls This commit was sponsored by an anonymous bitcoiner.	2014-12-11 15:33:42 -04:00
Joey Hess	7ae16bb6f7	Revert "let url claims optionally include a suggested filename" This reverts commit `85df9c30e9`. Putting filename in the claim was a bad idea.	2014-12-11 14:09:57 -04:00
Joey Hess	85df9c30e9	let url claims optionally include a suggested filename	2014-12-11 12:47:57 -04:00
Joey Hess	aafb121068	unmangled mangled urls from the log before passing to external special remote	2014-12-08 19:27:40 -04:00
Joey Hess	30bf112185	Urls can now be claimed by remotes. This will allow creating, for example, an external special remote that handles magnet: and *.torrent urls.	2014-12-08 19:15:07 -04:00
Joey Hess	ee27298b91	implement CLAIMURL for external special remote	2014-12-08 13:57:13 -04:00
Joey Hess	cb6e16947d	add stub claimUrl	2014-12-08 13:40:15 -04:00
Joey Hess	8093008ef4	External special remote protocol now includes commands for setting and getting the urls associated with a key.	2014-12-08 13:32:46 -04:00
Joey Hess	911ba8d972	Merge branch 's3-aws'	2014-12-03 14:10:52 -04:00
Joey Hess	55fa1789dd	Don't show "(gpg)" when decrypting the remote encryption cipher, since this could be taken to read that's the only time git-annex runs gpg, which is not the case.	2014-12-02 13:50:45 -04:00
Joey Hess	0a891fcfc5	support S3 front-end used by globalways.net This threw an unusual exception w/o an error message when probing to see if the bucket exists yet. So rather than relying on tryS3, catch all exceptions. This does mean that it might get an exception for some transient network error, think this means the bucket DNE yet, and try to create it, and then fail when it already exists.	2014-11-05 12:42:12 -04:00
Joey Hess	93feefae05	Revert "work around minimum part size problem" This reverts commit `a42022d8ff`. I misunderstood the cause of the problem.	2014-11-04 16:21:55 -04:00
Joey Hess	a42022d8ff	work around minimum part size problem When uploading the last part of a file, which was 640229 bytes, S3 rejected that part: "Your proposed upload is smaller than the minimum allowed size" I don't know what the minimum is, but the fix is just to include the last part into the previous part. Since this can result in a part that's double-sized, use half-sized parts normally.	2014-11-04 16:06:13 -04:00
Joey Hess	ad2125e24a	fix a couple type errors and the progress bar	2014-11-04 15:39:48 -04:00
Joey Hess	fccdd61eec	fix memory leak Unfortunately, I don't fully understand why it was leaking using the old method of a lazy bytestring. I just know that it was leaking, despite neither hGetUntilMetered nor byteStringPopper seeming to leak by themselves. The new method avoids the lazy bytestring, and simply reads chunks from the handle and streams them out to the http socket.	2014-11-04 15:22:08 -04:00
Joey Hess	29871e320c	combine 2 checks	2014-11-04 14:47:18 -04:00
Joey Hess	0f78f197eb	casts; now fully working.. but still leaking Still seems to buffer the whole partsize in memory, but I'm pretty sure my code is not what's doing it. See https://github.com/aristidb/aws/issues/142	2014-11-03 21:12:15 -04:00
Joey Hess	f0551578d6	this should avoid leaking memory	2014-11-03 20:49:30 -04:00
Joey Hess	4230b56b79	logic error	2014-11-03 20:15:33 -04:00
Joey Hess	62de9a39bf	WIP 3	2014-11-03 20:04:42 -04:00
Joey Hess	d16382e99f	WIP 2	2014-11-03 19:50:33 -04:00
Joey Hess	5360417436	WIP try sending using RequestBodyStreamChunked May not work; if it does this is gonna be the simplest way to get good memory size and progress reporting.	2014-11-03 19:18:46 -04:00
Joey Hess	8f61bfad51	link to memory leak bug	2014-11-03 17:55:05 -04:00
Joey Hess	711b18a6eb	improve info display for multipart	2014-11-03 17:24:53 -04:00
Joey Hess	2c53f331bd	fix build	2014-11-03 17:23:46 -04:00
Joey Hess	6a965cf8d7	adjust version check I assume 0.10.6 will have the fix for the bug I reported, which got fixed in master already..	2014-11-03 16:23:00 -04:00
Joey Hess	5c3d9d6caa	show multipart configuration in git annex info s3remote	2014-11-03 16:07:41 -04:00
Joey Hess	a3ec6ed73b	Merge branch 'master' into s3-aws-multipart	2014-11-03 16:05:03 -04:00
Joey Hess	8faeb25076	finish multipart support using unreleased update to aws lib to yield etags Untested and not even compiled yet. Testing should include checks that file content streams through without buffering in memory. Note that CL.consume causes all the etags to be buffered in memory. This is probably nearly unavoidable, since a request has to be constructed that contains the list of etags in its body. (While it might be possible to stream generation of the body, that would entail making a http request that dribbles out parts of the body as the multipart uploads complete, which is not likely to work well.) To limit this being a problem, it's best for partsize to be set to some suitably large value, like 1gb. Then a full terabyte file will need only 1024 etags to be stored, which will probably use around 1 mb of memory.	2014-11-03 16:04:55 -04:00
Joey Hess	39dd5a2ac3	improve uuid mismatch message	2014-10-28 15:54:44 -04:00
Joey Hess	6e89d070bc	WIP multipart S3 upload I'm a little stuck on getting the list of etags of the parts. This seems to require taking the md5 of each part locally, which doesn't get along well with lazily streaming in the part from the file. It would need to read the file twice, or lose laziness and buffer a whole part -- but parts might be quite large. This seems to be a problem with the API provided; S3 is supposed to return an etag, but that is not exposed. I have filed a bug: https://github.com/aristidb/aws/issues/141	2014-10-28 14:17:30 -04:00
Joey Hess	8ed1a0afee	fix build	2014-10-23 16:52:05 -04:00
Joey Hess	8edf7a0fc3	fix build	2014-10-23 16:51:10 -04:00
Joey Hess	171e677a3c	update for aws 0.10's better handling of DNE for HEAD Kept support for older aws, since Debian has 0.9.2 still.	2014-10-23 16:32:18 -04:00
Joey Hess	fa1318479e	rename isIA to configIA Already done on s3-aws branch, so reduce divergence.	2014-10-23 15:56:35 -04:00
Joey Hess	6acc6863c5	fix build	2014-10-23 15:54:00 -04:00
Joey Hess	7489f516bc	one last build fix, yes it builds now	2014-10-23 15:50:41 -04:00
Joey Hess	76ee815e89	needs type families	2014-10-23 15:48:37 -04:00
Joey Hess	f0989cf0bd	fix build	2014-10-23 15:41:57 -04:00
Joey Hess	8b48bdfdc8	enable frankfurt The aws library supports the AWS4-HMAC-SHA256 that it requires.	2014-10-23 11:02:24 -04:00
Joey Hess	4eefc12295	Merge branch 'master' into s3-aws	2014-10-23 11:02:14 -04:00
Joey Hess	e687c61d04	add new frankfurt region to list in webapp But commented out for now, because: The authorization mechanism you have provided is not supported. Please use AWS4-HMAC-SHA256	2014-10-23 11:02:02 -04:00
Joey Hess	35551d0ed0	Merge branch 'master' into s3-aws Conflicts: Remote/S3.hs	2014-10-22 17:14:38 -04:00
Joey Hess	5c15d6d3cc	show in info whether a remote uses hybrid encryption or not	2014-10-22 14:39:59 -04:00
Joey Hess	3006b79c86	include creds info for glacier and webdav That and S3 are all that uses creds currently, except that external remotes can use creds. I have not handled showing info about external remote creds because they can have 0, 1, or more separate cred pairs, and there's no way for info to enumerate them or know how they're used. So it seems ok to leave out creds info for external remotes.	2014-10-22 13:56:14 -04:00
Joey Hess	1b90838bbd	add internet archive item url to info	2014-10-21 15:34:32 -04:00
Joey Hess	9280fe4cbe	include creds location in info This is intended to let the user easily tell if a remote's creds are coming from info embedded in the repository, or instead from the environment, or perhaps are locally stored in a creds file. This commit was sponsored by Frédéric Schütz.	2014-10-21 15:09:40 -04:00
Joey Hess	a0297915c1	add per-remote-type info Now `git annex info $remote` shows info specific to the type of the remote, for example, it shows the rsync url. Remote types that support encryption or chunking also include that in their info. This commit was sponsored by Ævar Arnfjörð Bjarmason.	2014-10-21 14:36:09 -04:00
Joey Hess	fced322834	glacier: Fix pipe setup when calling glacier-cli to retrieve an object.	2014-10-20 15:11:01 -04:00
Joey Hess	ef3804bdb3	S3: Fix embedcreds=yes handling for the Internet Archive. Before, embedcreds=yes did not cause the creds to be stored in remote.log, but also prevented them being locally cached.	2014-10-12 13:15:52 -04:00
Joey Hess	9fd95d9025	indent with tabs not spaces Found these with: git grep "^ " $(find -type f -name \*.hs) |grep -v ': where' Unfortunately there is some inline hamlet that cannot use tabs for indentation. Also, Assistant/WebApp/Bootstrap3.hs is a copy of a module and so I'm leaving it as-is.	2014-10-09 15:09:26 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	0ed33c8b74	deal with old repositories with non-encrypted creds See `2f3c3aa01f` for backstory about how a repo could be in this state. When decryption fails, the repo must be using non-encrypted creds. Note that creds are encrypted/decrypted using the encryption cipher which is stored in the repo, so the decryption cannot fail due to missing gpg keys etc. (For !shared encryption, the cipher is itself encrypted using some gpg key(s), and the decryption of the cipher happens earlier, so not affected by this change.) Print a warning message for !shared repos, and continue on using the cipher. Wrote a page explaining what users hit by this bug should do. This commit was sponsored by Samuel Tardieu.	2014-09-18 17:58:03 -04:00
Joey Hess	2f3c3aa01f	glacier, S3: Fix bug that caused embedded creds to not be encrypted using the remote's key. encryptionSetup must be called before setRemoteCredPair. Otherwise, the RemoteConfig doesn't have the cipher in it, and so no cipher is used to encrypt the embedded creds. This is a security fix for non-shared encryption methods! For encryption=shared, there's no security problem, just an inconsistency in whether the embedded creds are encrypted. This is very important to get right, so used some types to help ensure that setRemoteCredPair is only run after encryptionSetup. Note that the external special remote bypasses the type safety, since creds can be set after the initial remote config, if the external special remote program requests it. Also note that IA remotes never use encryption, so encryptionSetup is not run for them at all, and again the type safety is bypassed. This leaves two open questions: 1. What to do about S3 and glacier remotes that were set up using encryption=pubkey/hybrid with embedcreds? Such a git repo has a security hole embedded in it, and this needs to be communicated to the user. Is the changelog enough? 2. enableremote won't work in such a repo, because git-annex will try to decrypt the embedded creds, which are not encrypted, so fails. This needs to be dealt with, especially for encryption=shared repos, which are not really broken, just inconsistently configured. Noticing that problem for encryption=shared is what led to commit `fbdeeeed5f`, which tried to fix the problem by not decrypting the embedded creds. This commit was sponsored by Josh Taylor.	2014-09-18 17:26:12 -04:00
Joey Hess	d84eab8a8a	Revert "S3, Glacier, WebDAV: Fix bug that prevented accessing the creds when the repository was configured with encryption=shared embedcreds=yes." This reverts commit `fbdeeeed5f`. I can find no basis for that commit and think that I made it in error. setRemoteCredPair always encrypts using the cipher from remoteCipher, even when the cipher is shared.	2014-09-18 15:21:47 -04:00
Joey Hess	f7847ae98d	Merge branch 'master' into s3-aws Conflicts: Utility/Url.hs debian/changelog git-annex.cabal	2014-09-18 14:36:20 -04:00
Joey Hess	9964584c34	WebDav: Fix enableremote crash when the remote already exists. (Bug introduced in version 5.20140817.)	2014-09-17 13:04:55 -04:00
Joey Hess	a97c9e43b7	The annex-rsync-transport configuration is now also used when checking if a key is present on a rsync remote, and when dropping a key from the remote.	2014-09-11 13:21:35 -04:00
Joey Hess	b874f84086	New annex.hardlink setting. Closes: #758593 * New annex.hardlink setting. Closes: #758593 * init: Automatically detect when a repository was cloned with --shared, and set annex.hardlink=true, as well as marking the repository as untrusted. Had to reorganize Logs.Trust a bit to avoid a cycle between it and Annex.Init.	2014-09-05 13:44:09 -04:00
Joey Hess	6eb5c3f479	Do not preserve permissions and acls when copying files from one local git repository to another. Timestamps are still preserved as long as cp --preserve=timestamps is supported. This avoids cp -a overriding the default mode acls that the user might have set in a git repository. With GNU cp, this behavior change should not be a breaking change, because git-annex also uses rsync sometimes in the same situation, and has only ever preserved timestamps when using rsync. Systems without GNU cp will no longer use cp -a, but instead just cp. So, timestamps will no longer be preserved. Preserving timestamps when copying between repos is not guaranteed anyway. Closes: #729757	2014-08-26 17:10:25 -07:00
Joey Hess	aebcc395ff	use types to enforce that removeAnnex can only be called inside lockContent This fixed one bug where it needed to be and wasn't (in Assistant.Unused). And also found one place where lockContent was used unnecessarily (by drop --from remote). A few other places like uninit probably don't really need to lockContent, but it doesn't hurt to call it anyway. This commit was sponsored by David Wagner.	2014-08-20 20:13:47 -04:00
Joey Hess	1994771215	more lock file refactoring Also fixes test suite failures introduced in recent commits, where inAnnexSafe failed in indirect mode, since it tried to open the lock file ReadWrite. This is why the new checkLocked opens it ReadOnly. This commit was sponsored by Chad Horohoe.	2014-08-20 18:58:14 -04:00
Joey Hess	d279180266	reorganize and refactor lock code Added a convenience Utility.LockFile that is not a windows/posix portability shim, but still manages to cut down on the boilerplate around locking. This commit was sponsored by Johan Herland.	2014-08-20 16:45:58 -04:00
Joey Hess	96dc423e39	When accessing a local remote, shut down git-cat-file processes afterwards, to ensure that remotes on removable media can be unmounted. Closes: #758630 This does mean that eg, copying multiple files to a local remote will become slightly slower, since it now restarts git-cat-file after each copy. Should not be significant slowdown. The reason git-cat-file is run on the remote at all is to update its location log. In order to add an item to it, it needs to get the current content of the log. Finding a way to avoid needing to do that would be a good path to avoiding this slowdown if it does become a problem somehow. This commit was sponsored by Evan Deaubl.	2014-08-20 12:07:57 -04:00
Joey Hess	83dc82c232	forgot some lifts	2014-08-20 11:51:47 -04:00
Joey Hess	092041fab0	Ensure that all lock fds are close-on-exec, fixing various problems with them being inherited by child processes such as git commands. (With the exception of daemon pid locking.) This fixes part of #758630. I reproduced the assistant locking eg, a removable drive's annex journal lock file and forking a long-running git-cat-file process that inherited that lock. This did not affect Windows. Considered doing a portable Utility.LockFile layer, but git-annex uses posix locks in several special ways that have no direct Windows equivalent, and it seems like it would mostly be a complication. This commit was sponsored by Protonet.	2014-08-20 11:37:02 -04:00
Joey Hess	ef01ff1e77	Merge branch 'master' into s3-aws Conflicts: git-annex.cabal	2014-08-15 17:30:40 -04:00
Joey Hess	fbdeeeed5f	S3, Glacier, WebDAV: Fix bug that prevented accessing the creds when the repository was configured with encryption=shared embedcreds=yes. Since encryption=shared, the encryption key is stored in the git repo, so there is no point at all in encrypting the creds, also stored in the git repo with that key. So `initremote` doesn't. The creds are simply stored base-64 encoded. However, it then tried to always decrypt creds when encryption was used..	2014-08-12 15:35:29 -04:00
Joey Hess	6adbd50cd9	testremote: Add testing of behavior when remote is not available Added a mkUnavailable method, which a Remote can use to generate a version of itself that is not available. Implemented for several, but not yet all remotes. This allows testing that checkPresent properly throws an exception when it cannot check if a key is present or not. It also allows testing that the other methods don't throw exceptions in these circumstances. This immediately found several bugs, which this commit also fixes! * git remotes using ssh accidentally had checkPresent return an exception, rather than throwing it * The chunking code accidentally returned False rather than propagating an exception when there were no chunks and checkPresent threw an exception for the non-chunked key. This commit was sponsored by Carlo Matteo Capocasa.	2014-08-10 15:02:59 -04:00
Joey Hess	5fc54cb182	auto-create IA buckets Needs my patch to aws which will hopefully be accepted soon.	2014-08-09 22:17:40 -04:00
Joey Hess	445f04472c	better memoization	2014-08-09 22:13:03 -04:00
Joey Hess	5ee72b1bae	fix meter update	2014-08-09 16:49:31 -04:00
Joey Hess	3659cb9efb	S3: finish converting to aws library Implemented the Retriever. Unfortunately, it is a fileRetriever and not a byteRetriever. It should be possible to convert this to a byteRetriever, but I got stuck: The conduit sink needs to process individual chunks, but a byteRetriever needs to pass a single L.ByteString to its callback for processing. I looked into using unsafeInterleaveIO to build up the bytestring lazily, but the sink is already operating under conduit's inversion of control, and does not run directly in IO anyway. On the plus side, no more memory leak..	2014-08-09 15:58:01 -04:00
Joey Hess	57872b457b	pass metadata headers and storage class to S3 when putting objects	2014-08-09 14:44:53 -04:00
Joey Hess	1ba1e37be3	remove dead code	2014-08-09 14:30:28 -04:00
Joey Hess	4f007ace87	S3: convert to aws for store, remove, checkPresent Fixes the memory leak on store.. the second oldest open git-annex bug! Only retrieve remains to be converted. This commit was sponsored by Scott Robinson.	2014-08-09 14:26:19 -04:00
Joey Hess	8eac9eab03	Merge branch 'master' into s3-aws	2014-08-09 13:40:21 -04:00
Joey Hess	f69a9274f9	avoid printing really ugly webdav exceptions The responseheaders can sometimes include the entire input request, which is several pages of garbage.	2014-08-09 01:38:13 -04:00
Joey Hess	809ee40d76	wording	2014-08-08 21:42:46 -04:00
Joey Hess	ccfb433ab3	cleanup	2014-08-08 20:51:22 -04:00
Joey Hess	cf82b0e1ec	cleanup	2014-08-08 20:33:03 -04:00
Joey Hess	4f1ba9a23d	fix checkPresent error handling for non-present local git repos guardUsable r (error "foo") returned an error, rather than throwing it	2014-08-08 19:18:08 -04:00
Joey Hess	6fcca2f13e	WIP converting S3 special remote from hS3 to aws library Currently, initremote works, but not the other operations. They should be fairly easy to add from this base. Also, https://github.com/aristidb/aws/issues/119 blocks internet archive support. Note that since http-conduit is used, this also adds https support to S3. Although git-annex encrypts everything anyway, so that may not be extremely useful. It is not enabled by default, because existing S3 special remotes have port=80 in their config. Setting port=443 will enable it. This commit was sponsored by Daniel Brockman.	2014-08-08 19:00:53 -04:00
Joey Hess	1dd3232e8e	check for 200 response	2014-08-08 17:17:36 -04:00
Joey Hess	0260ee43e6	fix removeKey when not present	2014-08-08 14:57:05 -04:00
Joey Hess	6cb9e5c32f	show missing url= parameter error sooner	2014-08-08 14:19:08 -04:00
Joey Hess	c3f8512475	WebDAV: Avoid buffering whole file in memory when downloading. httpBodyRetriever will later also be used by S3 This commit was sponsored by Ethan Aubin.	2014-08-08 13:40:55 -04:00
Joey Hess	fc17cf852e	further break out legacy chunking code	2014-08-08 13:17:24 -04:00
Joey Hess	c784ef4586	unify exception handling into Utility.Exception Removed old extensible-exceptions, only needed for very old ghc. Made webdav use Utility.Exception, to work after some changes in DAV's exception handling. Removed Annex.Exception. Mostly this was trivial, but note that tryAnnex is replaced with tryNonAsync and catchAnnex replaced with catchNonAsync. In theory that could be a behavior change, since the former caught all exceptions, and the latter don't catch async exceptions. However, in practice, nothing in the Annex monad uses async exceptions. Grepping for throwTo and killThread only finds stuff in the assistant, which does not seem related. Command.Add.undo is changed to accept a SomeException, and things that use it for rollback now catch non-async exceptions, rather than only IOExceptions.	2014-08-07 22:03:29 -04:00
Joey Hess	2dd8dab314	WebDAV: Avoid buffering whole file in memory when uploading. The httpStorer will later also be used by S3. This commit was sponsored by Torbjørn Thorsen.	2014-08-07 19:32:23 -04:00
Joey Hess	fc4b3cdcce	webdav: reuse http connection when operating on the chunks of a file For both new and legacy chunks. Massive speed up! This commit was sponsored by Dominik Wagenknecht.	2014-08-07 18:33:14 -04:00
Joey Hess	0b1b85d9ea	use DAV monad This speeds up the webdav special remote somewhat, since it often now groups actions together in a single http connection when eg, storing a file. Legacy chunks are still supported, but have not been sped up. This depends on a as-yet unreleased version of DAV. This commit was sponsored by Thomas Hochstein.	2014-08-07 17:32:57 -04:00
Joey Hess	aacb0b2823	convert WebDAV to new special remote interface, adding new-style chunking support Reusing http connection when operating on chunks is not done yet, I had to submit some patches to DAV to support that. However, this is no slower than old-style chunking was. Note that it's a fileRetriever and a fileStorer, despite DAV using bytestrings that would allow streaming. As a result, upload/download of encrypted files is made a bit more expensive, since it spools them to temp files. This was needed to get the progress meters to work. There are probably ways to avoid that.. But it turns out that the current DAV interface buffers the whole file content in memory, and I have sent in a patch to DAV to improve its interfaces. Using the new interfaces, it's certainly going to need to be a fileStorer, in order to read the file size from the file (getting the size of a bytestring would destroy laziness). It should be possible to use the new interface to make it be a byteRetriever, so I'll change that when I get to it. This commit was sponsored by Andreas Olsson.	2014-08-06 16:57:06 -04:00
Joey Hess	8025decc7f	run Preparer to get Remover and CheckPresent actions This will allow special remotes to eg, open a http connection and reuse it, while checking if chunks are present, or removing chunks. S3 and WebDAV both need this to support chunks with reasonable speed. Note that a special remote might want to cache a http connection across multiple requests. A simple case of this is that CheckPresent is typically called before Store or Remove. A remote using this interface can certainly use a Preparer that eg, uses a MVar to cache a http connection. However, it's up to the remote to then deal with things like stale or stalled http connections when eg, doing a series of downloads from a remote and other places. There could be long delays between calls to a remote, which could lead to eg, http connection stalls; the machine might even move to a new network, etc. It might be nice to improve this interface later to allow the simple case without needing to handle the full complex case. One way to do it would be to have a `Transaction SpecialRemote cache`, where SpecialRemote contains methods for Storer, Retriever, Remover, and CheckPresent, that all expect to be passed a `cache`.	2014-08-06 14:28:36 -04:00
Joey Hess	b4cf22a388	pushed checkPresent exception handling out of Remote implementations I tend to prefer moving toward explicit exception handling, not away from it, but in this case, I think there are good reasons to let checkPresent throw exceptions: 1. They can all be caught in one place (Remote.hasKey), and we know every possible exception is caught there now, which we didn't before. 2. It simplified the code of the Remotes. I think it makes sense for Remotes to be able to be implemented without needing to worry about catching exceptions inside them. (Mostly.) 3. Types.StoreRetrieve.Preparer can only work on things that return a Bool, which all the other relevant remote methods already did. I do not see a good way to generalize that type; my previous attempts failed miserably.	2014-08-06 13:45:19 -04:00
Joey Hess	22c7a7a41a	make local gcrypt storeKey be atomic Reuse Remote.Directory's code.	2014-08-04 09:35:57 -04:00
Joey Hess	00c1468160	gcrypt: fix removal of key that does not exist Generalized code from Remote.Directory and reused it. Test suite now passes for local gcrypt repos.	2014-08-04 09:01:40 -04:00
Joey Hess	6f4592966d	make testremote work with gcrypt repos This involved making Remote.Gcrypt.gen expect a Repo with a regular, non-gcrypt path. Since that is what's stored as the Remote's gitrepo, testremote can then modify it and feed it back into gen.	2014-08-04 08:42:04 -04:00
Joey Hess	d3778e631b	remove write bit when storing to local gcrypt repo Same as is done by rsync, and for regular git repos.	2014-08-03 20:25:44 -04:00
Joey Hess	d12becfdde	fix removal from local gcrypt repo that had files stored using rsync When files are stored using rsync, they have their write bit removed; so does the directory they're put in. The local repo code did not turn these bits back on, so failed to remove.	2014-08-03 20:21:46 -04:00
Joey Hess	8601f8f571	when not using rsync (for local gcrypt repo), display own progress meter	2014-08-03 20:19:04 -04:00
Joey Hess	1cd2273035	finally properly fixed ssh zombie leak The leak was caused by the thread that sshd'd to send transferinfo not waiting on its ssh. Doh.	2014-08-03 20:14:20 -04:00
Joey Hess	b35f7983ff	convert gcrypt to new regime, including chunking Some reorg of Remote.Rsync code to export the things gcrypt needs.	2014-08-03 17:31:10 -04:00
Joey Hess	f5f961215b	finish making rsync support chunking This breaks gcrypt, which relies on some internals of the rsync remote. To fix next..	2014-08-03 16:54:57 -04:00
Joey Hess	6c450aad1d	move ugly rsync zombie workaround This reaping of any processes came to cause me problems when redoing the rsync special remote -- a gpg process that was running gets waited on and the place that then checks its return code fails. I cannot reproduce any zombies when using the rsync special remote. But I still can when using a normal git remote, accessed over ssh. There is 1 zombie per file downloaded without this horrible hack enabled. So, move the hack to only be used in that case.	2014-08-03 16:53:29 -04:00
Joey Hess	b3fe23b552	remove redundant progress meter display code specialRemote handles all meter display, so this is redundant.	2014-08-03 16:18:40 -04:00
Joey Hess	4b16989e98	roll ChunkedEncryptable into Special and improve interface Allow disabling progress displays, for eg, rsync.	2014-08-03 15:40:01 -04:00
Joey Hess	00f92a7e59	whitespace	2014-08-03 01:21:38 -04:00
Joey Hess	d05b7b9182	better byteRetriever Make the byteRetriever be passed the callback that consumes the bytestring. This way, there's no worries about the lazy bytestring not all being read when the resource that's creating it is closed. Which in turn lets bup, ddar, and S3 each switch from using an unnecessary fileRetriever to a byteRetriever. So, more efficient on chunks and encrypted files. The only remaining fileRetrievers are hook and external, which really do retrieve to files.	2014-08-03 01:12:24 -04:00
Joey Hess	19b71cfb8f	convert ddar to new ChunkedEncryptable API (but do not support chunking) Since ddar de-duplicates, I assume there is no benefit from chunking. This has not been tested!	2014-08-02 18:58:48 -04:00
Joey Hess	b261df735d	convert bup to new ChunkedEncryptable API (but do not support chunking) bup already splits files and does rolling deltas, so there is no reason to use chunking here. The new API made it easier to add progress support for storeKey, so that's done. Unfortunately, bup-split still outputs its own progress with -q, so a little ugly, but not too bad. Made dropping remove the branch for an object, for two reasons: 1. The new API calls removeKey to roll back a storeKey when the content changed unexpectedly. 2. So that testremote will be happy. Also, fixed a bug that caused a crash when removing the branch for an object in rollback.	2014-08-02 18:48:49 -04:00
Joey Hess	7f5cd868d7	hook: use ChunkedEncryptable	2014-08-02 17:25:16 -04:00
Joey Hess	0eb1f057c4	convert glacier to new ChunkedEncryptable API (but do not support chunking) Chunking would complicate the assistant's code that checks when a pending retrieval of a key from glacier is done. It would perhaps be nice to support it to allow resuming, but not right now. Converting to the new API still simplifies the code.	2014-08-02 16:59:07 -04:00
Joey Hess	32e4368377	S3: support chunking The assistant defaults to 1MiB chunk size for new S3 special remotes. Which will work around a couple of bugs: http://git-annex.branchable.com/bugs/S3_memory_leaks/ http://git-annex.branchable.com/bugs/S3_upload_not_using_multipart/	2014-08-02 15:51:58 -04:00
Joey Hess	c3750901d8	specialize Preparer a bit, so resourcePrepare can be added The forall a. in Preparer made resourcePrepare not seem to be usable, so I specialized a to Bool. Which works for both Preparer Storer and Preparer Retriever, but wouldn't let the Preparer be used for hasKey as it currently stands.	2014-08-02 15:34:09 -04:00
Joey Hess	de0da0aece	minor optimisation	2014-08-01 17:18:39 -04:00
Joey Hess	3991327d09	testremote: Test retrieveKeyFile resume And fixed a bug found by these tests; retrieveKeyFile would fail when the dest file was already complete. This commit was sponsored by Bradley Unterrheiner.	2014-08-01 17:16:20 -04:00
Joey Hess	9636cfd9e1	fix a fencepost bug when resuming chunked store at end Discovered thanks to testremote command!	2014-08-01 16:29:39 -04:00
Joey Hess	8fce4e4bd7	fix chunk=0 Found by testremote	2014-08-01 15:36:11 -04:00
Joey Hess	b5ac627fee	WebDAV: Dropped support for DAV before 0.6.1. 0.6.1 is in testing, and stable does not have DAV at all, so I can dispense with this compatibility code	2014-07-30 11:20:35 -04:00
Joey Hess	89416ba2d9	only chunk stable keys The content of unstable keys can potentially be different in different repos, so eg, resuming a chunked upload started by another repo would corrupt data.	2014-07-30 10:34:39 -04:00
Joey Hess	a963d790d3	update progress after each chunk, at least This way, when the remote implementation neglects to update progress, there will still be a somewhat useful progress display, as long as chunks are used.	2014-07-29 20:31:16 -04:00
Joey Hess	444944c7a9	fix cleanup of FileContents once done with them when retrieving	2014-07-29 20:27:13 -04:00
Joey Hess	53b87a859e	optimise case of remote that retrieves FileContent, when chunks and encryption are not being used No need to read whole FileContent only to write it back out to a file in this case. Can just rename! Yay. Also incidentally, fixed an attempt to open a file for write that was already opened for write, which caused a crash and deadlock.	2014-07-29 20:10:14 -04:00
Joey Hess	c0dc134cde	support chunking for all external special remotes! Removing code and at the same time adding great features, including upload/download resuming. This commit was sponsored by Romain Lenglet.	2014-07-29 18:50:20 -04:00
Joey Hess	bc9e4697b9	better type for Retriever Putting a callback in the Retriever type allows for the callback to remove the retrieved file when it's done with it. I did not really want to make Retriever be fixed to Annex Bool, but when I tried to use Annex a, I got into some type of type mess.	2014-07-29 18:41:41 -04:00
Joey Hess	47e522979c	allow Retriever action to update the progress meter Needed for eg, Remote.External. Generally, any Retriever that stores content in a file is responsible for updating the meter, while ones that produce a lazy bytestring cannot update the meter, so are not asked to.	2014-07-29 17:18:49 -04:00
Joey Hess	1d263e1e7e	lift types from IO to Annex Some remotes like External need to run store and retrieve actions in Annex, not IO. In order to do that lift, I had to dive pretty deep into the utilities, making Utility.Gpg and Utility.Tmp be partly converted to using MonadIO, and Control.Monad.Catch for exception handling. There should be no behavior changes in this commit. This commit was sponsored by Michael Barabanov.	2014-07-29 16:28:44 -04:00
Joey Hess	f5af470875	add ContentSource type, for remotes that act on files rather than ByteStrings Note that currently nothing cleans up a ContentSource's file, when eg, retrieving chunks.	2014-07-29 15:16:12 -04:00
Joey Hess	216fdbd6bd	fix non-checked hasKeyChunks	2014-07-29 15:07:32 -04:00
Joey Hess	58f727afdd	resume interrupted chunked uploads Leverage the new chunked remotes to automatically resume uploads. Sort of like rsync, although of course not as efficient since this needs to start at a chunk boundary. But, unlike rsync, this method will work for S3, WebDAV, external special remotes, etc, etc. Only directory special remotes so far, but many more soon! This implementation will also allow starting an upload from one repository, interrupting it, and then resuming the upload to the same remote from an entirely different repository. Note that I added a comment that storeKey should atomically move the content into place once it's all received. This was already an undocumented requirement -- it's necessary for hasKey to work reliably. This resume code just uses hasKey to find the first chunk that's missing. Note that if there are two uploads of the same key to the same chunked remote, one might resume at the point the other had gotten to, but both will then redundantly upload. As before. In the non-resume case, this adds one hasKey call per storeKey, and only if the remote is configured to use chunks. Future work: Try to eliminate that hasKey. Notice that eg, `git annex copy --to` checks if the key is present before sending it, so is already running hasKey.. which could perhaps be cached and reused. However, this additional overhead is not very large compared with transferring an entire large file, and the ability to resume is certainly worth it. There is an optimisation in place for small files, that avoids trying to resume if the whole file fits within one chunk. This commit was sponsored by Georg Bauer.	2014-07-28 14:35:52 -04:00
Joey Hess	153ace4524	fix handling of removal of keys that are not present	2014-07-28 14:14:01 -04:00
Joey Hess	80cc554c82	add ChunkMethod type and make Logs.Chunk use it, rather than assuming fixed size chunks (so eg, rolling hash chunks can be supported later) If a newer git-annex starts logging something else in the chunk log, it won't be used by this version, but it will be preserved when updating the log.	2014-07-28 13:19:08 -04:00
Joey Hess	9d4a766cd7	resume interrupted chunked downloads Leverage the new chunked remotes to automatically resume downloads. Sort of like rsync, although of course not as efficient since this needs to start at a chunk boundary. But, unlike rsync, this method will work for S3, WebDAV, external special remotes, etc, etc. Only directory special remotes so far, but many more soon! This implementation will also properly handle starting a download from one remote, interrupting, and resuming from another one, and so on. (Resuming interrupted chunked uploads is similarly doable, although slightly more expensive.) This commit was sponsored by Thomas Djärv.	2014-07-27 18:56:32 -04:00
Joey Hess	2996f0eb05	use existing chunks even when chunk=0 When chunk=0, always try the unchunked key first. This avoids the overhead of needing to read the git-annex branch to find the chunkcount. However, if the unchunked key is not present, go on and try the chunks. Also, when removing a chunked key, update the chunkcounts even when chunk=0.	2014-07-27 02:13:51 -04:00
Joey Hess	7afb057d60	reorg	2014-07-27 01:24:34 -04:00
Joey Hess	bffd0e34b3	comment typo	2014-07-27 01:22:51 -04:00
Joey Hess	c3af4897c0	faster storeChunks No need to process each L.ByteString chunk, instead ask it to split. Doesn't seem to have really sped things up much, but it also made the code simpler. Note that this does (and already did) buffer in memory. It seems that only the directory special remote could take advantage of streaming chunks to files w/o buffering, so probably won't add an interface to allow for that.	2014-07-27 01:18:38 -04:00
Joey Hess	f3e47b16a5	better Preparer interface This will allow things like WebDAV to open a single persistent connection and reuse it for all the chunked data. The crazy types allow for some nice code reuse.	2014-07-27 00:30:04 -04:00
Joey Hess	9a8c4bb21f	improve exception handling Push it down from needing to be done in every Storer, to being checked once inside ChunkedEncryptable. Also, catch exceptions from PrepareStorer and PrepareRetriever, just in case..	2014-07-26 23:26:10 -04:00
Joey Hess	867fd116a7	better exception display	2014-07-26 23:01:44 -04:00
Joey Hess	0d89b65bfc	fix key checking when a directory special remote's directory is missing The best thing to do in this case is return Left, so that anything that tries to access it will fail.	2014-07-26 22:52:47 -04:00
Joey Hess	93be3296fc	fix another fallback bug	2014-07-26 22:47:52 -04:00
Joey Hess	86e8532c0a	allM has slightly better memory use	2014-07-26 22:34:40 -04:00
Joey Hess	67975bf50d	fix fallback to other chunk size when first does not have it	2014-07-26 22:25:50 -04:00
Joey Hess	adb6ca62ca	fix build	2014-07-26 20:21:36 -04:00
Joey Hess	34c6fdf5e3	fix build	2014-07-26 20:21:10 -04:00
Joey Hess	b2922c1d6d	convert directory special remote to using ChunkedEncryptable And clean up legacy chunking code, which is in its own module now. So much cleaner! This commit was sponsored by Henrik Ahlgren	2014-07-26 20:19:24 -04:00
Joey Hess	1400cbb032	Support for remotes that are chunkable and encryptable. I'd have liked to keep these two concepts entirely separate, but they are entangled: Storing a key in an encrypted and chunked remote needs to generate chunk keys, encrypt the keys, chunk the data, encrypt the chunks, and send them to the remote. Similar for retrieval, etc. So, here's an implementation of all of that. The total win here is that every remote was implementing encrypted storage and retrieval, and now it can move into this single place. I expect this to result in several hundred lines of code being removed from git-annex eventually! This commit was sponsored by Henrik Ahlgren.	2014-07-26 20:14:31 -04:00
Joey Hess	d4d68f57e5	finish up basic chunked remote groundwork Chunk retrieval and reassembly, removal, and checking if all necessary chunks are present. This commit was sponsored by Damien Raude-Morvan.	2014-07-26 20:11:41 -04:00
Joey Hess	cf83697c33	reorg	2014-07-26 12:04:35 -04:00
Joey Hess	e4cb50db33	Merge branch 'master' into newchunks	2014-07-26 12:02:48 -04:00
Joey Hess	005aded3e0	Fix cost calculation for non-encrypted remotes. Encryptable types of remotes that were not actually encrypted still had the encryptedRemoteCostAdj applied to their configured cost, which was a bug.	2014-07-25 17:29:59 -04:00
Joey Hess	9e8a4a0950	support new style chunking in directory special remote Only when storing non-encrypted so far, not retrieving or checking if a key is present or removing. This commit was sponsored by Renaud Casenave-Péré.	2014-07-25 16:21:01 -04:00
Joey Hess	ab4cce4114	core implementation of new style chunking Not yet used by any special remotes, but should not be too hard to add it to most of them. storeChunks is the hairy bit! It's loosely based on Remote.Directory.storeLegacyChunked. The object is read in using a lazy bytestring, which is streamed through, creating chunks as needed, without ever buffering more than 1 chunk in memory. Getting the progress meter update to work right was also fun, since progress meter values are absolute. Finessed by constructing an offset meter. This commit was sponsored by Richard Collins.	2014-07-25 16:20:32 -04:00
Joey Hess	ceea04e77f	move meteredWriteFileChunks out of legacy	2014-07-24 16:42:35 -04:00
Joey Hess	e2c44bf656	implement chunk logs Slightly tricky as they are not normal UUIDBased logs, but are instead maps from (uuid, chunksize) to chunkcount. This commit was sponsored by Frank Thomas.	2014-07-24 16:23:36 -04:00
Joey Hess	bbdb2c04d5	improve chunk data types	2014-07-24 15:08:07 -04:00
Joey Hess	9e2d49d441	prepare for new style chunking Moved old legacy chunking code, and cleaned up the directory and webdav remotes use of it, so when no chunking is configured, that code is not used. The config for new style chunking will be chunk=1M instead of chunksize=1M. There should be no behavior changes from this commit. This commit was sponsored by Andreas Laas.	2014-07-24 14:49:22 -04:00
Joey Hess	ec5ed2af9d	Set gcrypt-publish-participants when setting up a gcrypt repository, to avoid unnecessary passphrase prompts. This is a security/usability tradeoff. To avoid exposing the gpg key ids who can decrypt the repository, users can unset gcrypt-publish-participants. The gcrypt-publish-participants option is available in my fork of git-remote-gcrypt. This commit was sponsored by Christopher Kernahan.	2014-07-15 17:33:14 -04:00
Joey Hess	cdf61071bc	optimise handling of unavailable repos The exception handling resulted in git config --list being run twice for unavailable repos. This dials it back down to running it only once.	2014-07-15 14:45:27 -04:00
Joey Hess	bd514eb65a	catch exception when repo is really not available	2014-07-15 14:39:31 -04:00
Joey Hess	522a0922b8	sync: Fix git sync with local git remotes even when they don't have an annex.uuid set. Catch an exception when ensureInitialized is run in a non-initted repository. In this case, just read the git config, so that the Git.Repo object is not LocalUnknown, which is what is used to represent remotes on eg, drives that are not connected. The assistant already got this right, and like with the assistant, this causes an implicit git-annex init of the local remote on the second sync, once the git-annex branch has been pushed to it. See this comment for more analysis: http://git-annex.branchable.com/todo/Recovering_from_a_bad_sync/#comment-64e469a2c1969829ee149cbb41b1c138 This commit was sponsored by jscit.	2014-07-15 14:27:43 -04:00
Joey Hess	604740b720	S3: Deal with AWS ACL configurations that do not allow creating or checking the location of a bucket, but only reading and writing content to it.	2014-07-11 15:21:43 -04:00
Joey Hess	26ee27915a	refactor locking	2014-07-10 00:32:23 -04:00
Joey Hess	a44fd2c019	export CreateProcess fields from Utility.Process update code to avoid cwd and env redefinition warnings	2014-06-10 19:20:14 -04:00
Joey Hess	2f84659d51	fix build with old versions of bytestring	2014-06-06 14:04:35 -04:00
Joey Hess	0c2a14e4aa	fix dodgy use of Char8 I don't know if this was a bug, but I don't know if it was not a bug either. See also, http://git-annex.branchable.com/bugs/Truncated_file_transferred_via_S3/ where the file is not truncated, but mangled..	2014-05-27 20:31:25 -04:00
Joey Hess	c07343e4f7	initremote/enableremote: Basic support for using with regular git remotes initremote stores the location of an already existing git remote, and enableremote sets up a remote using its stored location.	2014-05-22 13:42:17 -04:00
Joey Hess	c34b5e09f8	factor out getRemoteGitConfig	2014-05-16 16:08:20 -04:00
Fraser Tweedale	4eb72392b4	execute remote.<name>.annex-shell on remote, if set It is useful to be able to specify an alternative git-annex-shell program to execute on the remote, e.g., to run a version not on the PATH. Use remote.<name>.annex-shell if specified, instead of the default "git-annex-shell" i.e., first so-named executable on the PATH.	2014-05-16 15:46:43 -04:00
Joey Hess	0b899fa2f1	show a much longer message when annex-ignore is automatically set, to help the user fix their problem	2014-05-16 12:58:50 -04:00
Joey Hess	b1cddea7e4	remove odd character that snuck in somehow and broke build	2014-05-15 16:36:19 -04:00
Robie Basak	4184566627	ddar special remote	2014-05-15 16:32:44 -04:00
Joey Hess	f00cb21037	Bring back rsync -p, but only when git-annex is running on a non-crippled file system. This is a better approach to fix #700282 while not unnecessarily losing file permissions on non-crippled systems.	2014-04-17 14:31:42 -04:00
Joey Hess	5af30678c7	factored out Utility.SimpleProtocol from the external special remote implementation	2014-04-05 13:29:28 -04:00
Joey Hess	3b8d5f03bb	Fix glacier repo creation bug Version 5.20140227 broke creation of glacier repositories, not including the datacenter and vault in their configuration. This bug is fixed, but glacier repositories set up with the broken version of git-annex need to have the datacenter and vault set in order to be usable. This can be done using git annex enableremote to add the missing settings. For details, see http://git-annex.branchable.com/bugs/problems_with_glacier/	2014-03-27 14:30:36 -04:00
Alberto Berti	0f7c2dd39b	Fix tahoe remote to work with latest tahoe (v. 1.10.0)	2014-03-26 00:31:02 +01:00
Joey Hess	e426fac273	add desktop notifications Motivation: Hook scripts for nautilus or other file managers need to provide the user with feedback that a file is being downloaded. This commit was sponsored by THM Schoemaker.	2014-03-22 14:12:19 -04:00
Joey Hess	40b599eff2	rsync special remote: Fix slashes when used on Windows.	2014-03-18 13:02:10 -04:00
Joey Hess	b63276309e	clean up cleanup action enumeration	2014-03-13 19:06:26 -04:00
Joey Hess	4d06037fdd	Fix zombie leak and general inefficiency when copying files to a local git repo. Benchmarking this with 1000 small files being copied, the time reduced from 15.98s to 14.64s -- an 8% improvement in the non-data-transfer overhead of git-annex copy.	2014-03-06 17:13:27 -04:00
Joey Hess	aa377ed567	webdav: When built with a new enough haskell DAV (0.6), disable the http response timeout, which was only 5 seconds.	2014-03-05 13:51:54 -04:00
Joey Hess	1f98d6fb00	glacier: Pass --region to glacier checkpresent. I suppose this is not necessary when it has a local cache, so I didn't notice it was missing.	2014-03-04 23:22:24 -04:00
Joey Hess	a1432bce2f	Put non-object tmp files in .git/annex/misctmp, leaving .git/annex/tmp for only partially transferred objects. This allows eg, putting .git/annex/tmp on a ram disk, if the disk IO of temp object files is too annoying (and if you don't want to keep partially transferred objects across reboots). .git/annex/misctmp must be on the same filesystem as the git work tree, since files are moved to there in a way that will not work cross-device, as well as symlinked into there. I first wanted to put the tmp objects in .git/annex/objects/tmp, but that would pose transition problems on upgrade when partially transferred objects existed. git annex info does not currently show the size of .git/annex/misctmp, since it should stay small. It would also be ok to make something clean it out, periodically.	2014-02-26 16:52:56 -04:00
Joey Hess	2aeb0750f9	more DAV url fixes for windows	2014-02-25 16:16:14 -04:00
Joey Hess	b1931d1cc1	add protocol-level debugging for dav	2014-02-25 15:58:44 -04:00
Joey Hess	2b66aaa763	Windows webdav: Fix DOS path separator bug. Use posix </> etc for urls.	2014-02-25 15:26:33 -04:00
Joey Hess	360ecb9f35	fix bare repo optimisation on Windows	2014-02-25 13:47:09 -04:00
Joey Hess	06142f4943	fix #740010 properly	2014-02-25 01:55:01 -04:00
Joey Hess	003fc2b7e1	add UrlOptions sum type	2014-02-24 22:00:25 -04:00
Joey Hess	c69d6eb035	Make annex.web-options be used in several places that call curl.	2014-02-24 21:29:37 -04:00
Joey Hess	d5a2b498f6	webdav: When built with DAV 0.6.0, use the new DAV monad to avoid locking files, which is not needed by git-annex's use of webdav, and does not work on Box.com.	2014-02-24 18:21:51 -04:00
Joey Hess	45e7040142	webapp: Fix creation of box.com, S3, and Glacier repositories, broken in 5.20140221.	2014-02-24 15:29:17 -04:00
Joey Hess	ded4ab5704	Fix handling of rsync remote urls containing a username, including rsync.net. This breakage seems to have been caused way back in `a1eded86`, but I am pretty sure rsync.net support has not been entirely broken since last April. AFAICS, the generated .ssh/config has not changed since then -- it has never included a Username setting line. So, I am puzzled at when this reversion was introduced. Note that the breakage only affected checkpresent and remove. Upload and download use the ssh connection caching, which includes a -l username.	2014-02-21 13:20:57 -04:00
Joey Hess	7d288d83c9	glacier: Do not try to run glacier vault create when an existing glacier remote is enabled.	2014-02-20 15:56:26 -04:00
Joey Hess	4e0be2792b	remove Read instance for Ref Removed instance, got it all to build using fromRef. (With a few things that really need to show something using a ref for debugging stubbed out.) Then added back Read instance, and made Logs.View use it for serialization. This changes the view log format.	2014-02-19 01:19:57 -04:00
Joey Hess	7b19c7d25b	cleanup thanks to Utility.PID	2014-02-11 15:39:51 -04:00
Joey Hess	fa24ba2520	plumb creds from webapp to initremote Avoids abusing setting environment variables, which was always a hack and won't work on windows.	2014-02-11 14:07:56 -04:00
Joey Hess	e885080d06	Add progress display for transfers to/from external special remotes.	2014-02-10 21:33:22 -04:00
Joey Hess	08afe3a1f6	fix failing test case on Windows ensure file being modified is all read before it's opened for write	2014-02-03 10:20:18 -04:00
Joey Hess	1572c460e8	avoid using openFile when withFile can be used Potentially fixes some FD leak if an action on an opened file handle fails for some reason. There have been some hard to reproduce reports of git-annex leaking FDs, and this may solve them.	2014-02-03 10:19:06 -04:00
Joey Hess	089c0109a2	Added ways to configure rsync options to be used only when uploading or downloading from a remote. Useful to eg limit upload bandwidth.	2014-02-02 16:06:34 -04:00
Joey Hess	070ed4a766	change a few renameFile's to rename AFAIK, none of these ever operate on directories, but nor do I want to explicitly check if they're files and fail if not.	2014-01-29 15:21:02 -04:00
Joey Hess	891c85cd88	use locking on Windows This is all the easy cases, where there was already a separate lock file.	2014-01-28 14:42:03 -04:00
Joey Hess	74b101d1dd	reorg	2014-01-26 16:36:31 -04:00
Joey Hess	1ca111620d	reorg	2014-01-26 16:32:55 -04:00
Joey Hess	5fc2d760ea	Optimise non-bare http remotes; no longer does a 404 to the wrong url every time before trying the right url. Needs annex-bare to be set to false, which is done when initially probing the uuid of a http remote.	2014-01-26 13:03:25 -04:00
Joey Hess	b40df4f0d0	reorganize numcopies code (no behavior changes) Move stuff into Logs.NumCopies. Add a NumCopies newtype. Better names for various serialization classes that are specific to one thing or another.	2014-01-21 16:08:59 -04:00
Joey Hess	b6ba0bd556	sync --content: New option that makes the content of annexed files be transferred. Similar to the assistant, this honors any configured preferred content expressions. I am not entirely happy with the implementation. It would be nicer if the seek function returned a list of actions which included the individual file gets and copies and drops, rather than the current list of calls to syncContent. This would allow getting rid of the somewhat redundant display of "sync file [ok\|failed]" after the get/put display. But, to do that, withFilesInGit would need to somehow be able to construct such a mixed action list. And it would be less efficient than the current implementation, which is able to reuse several values between eg get and drop. Note that currently this does not try to satisfy numcopies when getting/putting files (numcopies are of course checked when dropping files!) This makes it like the assistant, and unlike get --auto and copy --auto, which do duplicate files when numcopies is not yet satisfied. I don't know if this is the right decision; it only seemed to make sense to have this parallel the assistant as far as possible to start with, since I know the assistant works. This commit was sponsored by Øyvind Andersen Holm.	2014-01-19 17:49:54 -04:00
Joey Hess	0d544649d0	catch exception checking if url exists when network is disconnected Leads to better failure message (or possibly fallback to another remote).	2014-01-16 21:24:17 -04:00
Joey Hess	207ac67aaa	avoid needing a build-dep on hxt for Data.AssocList	2014-01-14 16:42:10 -04:00
Joey Hess	d07f2d7865	Fix a long-standing bug that could cause the wrong index file to be used when committing to the git-annex branch, if GIT_INDEX_FILE is set in the environment. This typically resulted in git-annex branch log files being committed to the master branch and later showing up in the work tree. (These log files can be safely removed.)	2014-01-14 15:36:33 -04:00
Joey Hess	c20f31a1ad	add GETAVAILABILITY to external special remote protocol And some reworking of types, and added an annex-availability git config setting.	2014-01-13 14:41:10 -04:00
Joey Hess	57edce8ad9	external special remote protocol: Added GETGITDIR.	2014-01-13 14:00:09 -04:00
Joey Hess	d8fb366cf7	forgot to delay for 1 second in busy wait loop	2014-01-08 19:58:47 -04:00
Joey Hess	215ea66471	only run tahoe start once	2014-01-08 19:17:18 -04:00
Joey Hess	93161d0dea	copyright year	2014-01-08 16:29:15 -04:00
Joey Hess	85272d8a98	Added tahoe special remote. Known problems: 1. Tries to tahoe start when daemon is already running. 2. If multiple tahoe remotes are set up on the same computer, they will have the same node.url configured by default, and this confuses tahoe commands. This commit was sponsored by LeastAuthority.com	2014-01-08 16:14:41 -04:00
Joey Hess	5e23dfabd6	add DEBUG	2014-01-07 13:23:58 -04:00
Joey Hess	614986b19a	show PATH on failure	2014-01-07 12:59:26 -04:00
Joey Hess	3e68c1c2fd	add remote state logs This allows a remote to store a piece of arbitrary state associated with a key. This is needed to support Tahoe, where the file-cap is calculated from the data stored in it, and used to retrieve a key later. Glacier also would be much improved by using this. GETSTATE and SETSTATE are added to the external special remote protocol. Note that the state is left as-is even when a key is removed from a remote. It's up to the remote to decide when it wants to clear the state. The remote state log, $KEY.log.rmt, is a UUID-based log. However, rather than using the old UUID-based log format, I created a new variant of that format. The new variant is more space efficient (since it lacks the "timestamp=" hack), easier to parse (and the parser doesn't mess with whitespace in the value), and avoids compatibility cruft in the old one. This seemed worth cleaning up for these new files, since there could be a lot of them, while before UUID-based logs were only used for a few log files at the top of the git-annex branch. The transition code has also been updated to handle these new UUID-based logs. This commit was sponsored by Daniel Hofer.	2014-01-03 16:35:57 -04:00
Joey Hess	f7727d2df1	Remotes can now be made read-only, by setting remote.<name>.annex-readonly	2014-01-02 13:12:32 -04:00
Joey Hess	8e3032df2d	added GETWANTED, SETWANTED for Tobias's flickr remote This was unexpectedly difficult because of a dependency cycle. To parse a preferred content expression involves several things that need to operate on the list of remotes. Which needs Remote.External. The only way to avoid this cycle (I tried breaking it at several points) was to skip parsing the expression in SETWANTED. That's sorta ok, because git-annex already has to deal with unparsable preferred content expressions being stored, in order to handle eg, upgrades. But I'm still not very happy that I cannot check it. I feel this is a strong indication that I need to beware of further bloating the special remote protocol interface.	2014-01-01 20:12:20 -04:00
Joey Hess	ed1fcab6d7	external special remote protocol: Added GETUUID.	2013-12-31 13:50:18 -04:00
Joey Hess	054e4f17e2	implement PREPARE-FAILURE for Tobias	2013-12-29 13:39:25 -04:00
Joey Hess	aa97a33dde	better error messages when external special remote exits unexpectedly or is not in PATH	2013-12-27 17:14:44 -04:00
Joey Hess	445b7b41b9	add credential storage support for external special remotes & update example	2013-12-27 16:01:43 -04:00
Joey Hess	551573570f	better protocol error message, indicate if the command was able to be parsed or was misplaced	2013-12-27 14:03:35 -04:00
Joey Hess	21342cae63	flush handle after writing message	2013-12-27 13:22:06 -04:00
Joey Hess	fa6f404a5f	fix deadlock when state TMVar is empty	2013-12-27 13:17:22 -04:00
Joey Hess	9125a25738	defer SETSTATE and GETSTATE for now TAHOE-LAFS may use these eventually, but that's TBD and none of git-annex's own special remotes need that, except for the web special remote's urls.	2013-12-27 13:07:56 -04:00
Joey Hess	a7f3724e21	implement GETCONFIG and SETCONFIG Changed protocol spec to make SETCONFIG only store it persistently when run during INITREMOTE. I see no reason to support storing it persistently at other times, and doing so would unnecessarily complicate the code. Also, letting that be done would probably result in use for storing data that doesn't really belong there, and special remote authors who don't understand how the union merging works would probably be surprised by the results.	2013-12-27 12:37:23 -04:00
Joey Hess	91c9e98168	support encryption	2013-12-27 12:21:55 -04:00
Joey Hess	5d8ff64dc1	make --debug show transcript of special remote protocol messages	2013-12-27 03:10:00 -04:00
Joey Hess	3289155e28	don't send PREPARE before INITREMOTE That complicated special remote programs, because they had to avoid making PREPARE fail if some configuration is missing, because the remote might not be initialized yet. Instead, complicate git-annex slightly by only sending PREPARE immediately before some other request other than INITREMOTE (or PREPARE of course).	2013-12-27 02:49:10 -04:00
Joey Hess	6d504b57e7	make some requests optional, simplify and future-proof protocol more	2013-12-27 02:11:06 -04:00
Joey Hess	6c565ec905	external special remotes mostly implemented (untested) This has not been tested at all. It compiles! The only known missing things are support for encryption, and for get/set of special remote configuration, and of key state. (The latter needs separate work to add a new per-key log file to store that state.) Only thing I don't much like is that initremote needs to be passed both type=external and externaltype=foo. It would be better to have just type=foo Most of this is quite straightforward code, that largely wrote itself given the types. The only tricky parts were: * Need to lock the remote when using it to eg make a request, because in theory git-annex could have multiple threads that each try to use a remote at the same time. I don't think that git-annex ever does that currently, but better safe than sorry. * Rather than starting up every external special remote program when git-annex starts, they are started only on demand, when first used. This will avoid slowdown, especially when running fast git-annex query commands. Once started, they keep running until git-annex stops, currently, which may not be ideal, but it's hard to know a better time to stop them. * Bit of a chicken and egg problem with caching the cost of the remote, because setting annex-cost in the git config needs the remote to already be set up. Managed to finesse that. This commit was sponsored by Lukas Anzinger.	2013-12-26 18:23:13 -04:00
Joey Hess	8803e36814	future-proofing	2013-12-25 20:04:31 -04:00
Joey Hess	1dc930063a	basic data types and serialization for external special remote protocol This is mostly straightforward, but did turn out quite nicely strongly typed, and with a quite nice automatic tokenization and parsing of received messages. Made a few minor changes to the protocol to clear up ambiguities and make it easier to parse. Note particularly that setting remote configuration is moved to a separate command, which allows a remote to set arbitrary data.	2013-12-25 17:54:57 -04:00
Joey Hess	011b8bc7ec	pull in Win32-extras, to be able to get current process id in Windows Fixed up a number of things that had worked around there not being a way to get that. Most notably, transfer info files on windows now include the process id, since no locking is currently done. This means the file format varies between windows and unix.	2013-12-11 00:15:10 -04:00
Joey Hess	e425a966ed	Deal with box.com changing the url of their webdav endpoint. Use new url when making new remotes. Transparently rewrite old url to new for existing remotes.	2013-12-02 16:01:20 -04:00
Joey Hess	0a63ed563f	rsync special remote: Fix fallback mode for rsync remotes that use hashDirMixed. Closes: #731142	2013-12-02 12:53:39 -04:00
Joey Hess	58db042033	map: Work when there are gcrypt remotes.	2013-11-04 14:14:44 -04:00
Joey Hess	2203690822	really fix gcrypt for `7be69a2491` Fixed all the other ones, but forgot to fix gcrypt!	2013-11-02 20:10:54 -04:00
Joey Hess	b2cca95d1c	clean import list	2013-11-02 19:55:18 -04:00
Joey Hess	a04fe350b8	fix build	2013-11-02 19:54:59 -04:00
Joey Hess	7be69a2491	gcrypt, bup: Fix bug that prevented using these special remotes with encryption=pubkey. I think both of these are all that's affected, but I went ahead and fixed all the remotes that set their config to M.empty to instead store the actual config. Who knows what will expect it to be actually present in future, the Remote instance of getGpgEncParams came to..	2013-11-02 16:37:28 -04:00
Joey Hess	7ed8e87a34	assistant: Support repairing git remotes that are locally accessible (eg, on removable drives) gcrypt remotes are not yet handled. This commit was sponsored by Sören Brunk.	2013-10-27 15:38:59 -04:00
Joey Hess	5756636486	directory, webdav: Fix bug introduced in version 4.20131002 that caused the chunkcount file to not be written. Work around repositories without such a file, so files can still be retrieved from them.	2013-10-26 15:03:12 -04:00
Joey Hess	06ea92282f	fix inverted logic when determining whether to write a chunkcount file late-night hlint bit me on this one.. Reviewed `c1990702e9` and the rest of it seems ok	2013-10-26 14:08:29 -04:00
Joey Hess	c76c94a0da	S3: Try to ensure bucket name is valid for archive.org.	2013-10-16 16:35:47 -04:00
Joey Hess	a6e9386d39	fix remote fsck to run in remote	2013-10-14 15:05:29 -04:00
Joey Hess	c78aaed317	ye olde inverted logic	2013-10-14 12:26:46 -04:00
Joey Hess	1ffb3bb0ba	add remote fsck interface Currently only implemented for local git remotes. May try to add support to git-annex-shell for ssh remotes later. Could conceivably also be supported by some special remote, although that seems unlikely. Cronner uses this when available, and when not falls back to fsck --fast --from remote git annex fsck --from does not itself use this interface. To do so, I would need to pass --fast and all other options that influence fsck on to the git annex fsck that it runs inside the remote. And that seems like a lot of work for a result that would be no better than cd remote; git annex fsck This may need to be revisited if git-annex-shell gets support, since it may be the case that the user cannot ssh to the server to run git-annex fsck there, but can run git-annex-shell there. This commit was sponsored by Damien Diederen.	2013-10-11 16:03:18 -04:00
Joey Hess	747f5b123c	url size fixes addurl: Improve message when adding url with wrong size to existing file. Before the message suggested the url didn't exist. Fixed handling of URL keys that have no recorded size. Before, if the key has no size, the url also had to not declare any size, which was unlikely and wrong, or it was taken to not exist. This probably would mostly affect keys that were added to the annex with addurl --relaxed.	2013-10-11 13:05:00 -04:00
Joey Hess	571fe4999b	remove __WINDOWS__ ifdef	2013-10-06 17:23:30 -04:00
Joey Hess	0ede6b7def	typoe and debug info	2013-10-01 19:10:45 -04:00
Joey Hess	bddfbef8be	git-annex-shell gcryptsetup command This was the least-bad alternative to get dedicated key gcrypt repos working in the assistant.	2013-10-01 17:20:51 -04:00
Joey Hess	1536ebfe47	Disable receive.denyNonFastForwards when setting up a gcrypt special remote gcrypt needs to be able to fast-forward the master branch. If a git repository is set up with git init --shared --bare, it gets that set, and pushing to it will then fail, even when it's up-to-date.	2013-10-01 15:23:48 -04:00
Joey Hess	101099f7b5	fix probing for local gcrypt repos	2013-10-01 14:38:20 -04:00
Joey Hess	995e1e3c5d	fix transferring to gcrypt repo from direct mode repo recvkey was told it was receiving a HMAC key from a direct mode repo, and that confused it into rejecting the transfer, since it has no way to verify a key using that backend, since there is no HMAC backend. I considered making recvkey skip verification in the case of an unknown backend. However, that could lead to bad results; a key can legitimately be in the annex with a backend that the remote git-annex-shell doesn't know about. Better to keep it rejecting if it cannot verify. Instead, made the gcrypt special remote not set the direct mode flag when sending (and receiving) files. Also, added some recvkey messages when its checks fail, since otherwise all that is shown is a confusing error message from rsync when the remote git-annex-shell exits nonzero.	2013-10-01 14:19:24 -04:00
Joey Hess	12f6b9693a	Send a git-annex user-agent when downloading urls. Overridable with --user-agent option. Not yet done for S3 or WebDAV due to limitations of libraries used -- neither allows a user-agent header to be specified. This commit sponsored by Michael Zehrer.	2013-09-28 14:35:21 -04:00
Joey Hess	c6032b0dab	clean up some ugly code	2013-09-27 19:52:36 -04:00
Joey Hess	e864c8d033	blind enabling gcrypt repos on rsync.net This pulls off quite a nice trick: When given a path on rsync.net, it determines if it is an encrypted git repository that the user has the key to decrypt, and merges with it. This works even when the local repository had no idea that the gcrypt remote exists! (As previously done with local drives.) This commit sponsored by Pedro Côrte-Real	2013-09-27 16:21:56 -04:00
Joey Hess	e0b99f3960	support ssh://host/~/dir When generating the path for rsync, /~/ is not valid, so change to just host:dir Note that git remotes specified in host:dir form are internally converted to the ssh:// url form, so this was especially needed..	2013-09-26 15:02:27 -04:00
Joey Hess	c1990702e9	hlint	2013-09-25 23:19:01 -04:00
Joey Hess	3192b059b5	add back lost check that git-annex-shell supports gcrypt	2013-09-24 17:51:12 -04:00
Joey Hess	4c954661a1	git-annex-shell: Added support for operating inside gcrypt repositories. * Note that the layout of gcrypt repositories has changed, and if you created one you must manually upgrade it. See http://git-annex.branchable.com/upgrades/gcrypt/	2013-09-24 17:25:47 -04:00
Joey Hess	f9e438c1bc	factor out more ssh stuff from git remote This has the dual benefits of making Remote.Git shorter, and letting Remote.GCrypt use these utilities.	2013-09-24 13:37:41 -04:00
Joey Hess	7390f08ef9	Use cryptohash rather than SHA for hashing. This is a massive win on OSX, which doesn't have a sha256sum normally. Only use external hash commands when the file is > 1 mb, since cryptohash is quite close to them in speed. SHA is still used to calculate HMACs. I don't quite understand cryptohash's API for those. Used the following benchmark to arrive at the 1 mb number. 1 mb file: benchmarking sha256/internal mean: 13.86696 ms, lb 13.83010 ms, ub 13.93453 ms, ci 0.950 std dev: 249.3235 us, lb 162.0448 us, ub 458.1744 us, ci 0.950 found 5 outliers among 100 samples (5.0%) 4 (4.0%) high mild 1 (1.0%) high severe variance introduced by outliers: 10.415% variance is moderately inflated by outliers benchmarking sha256/external mean: 14.20670 ms, lb 14.17237 ms, ub 14.27004 ms, ci 0.950 std dev: 230.5448 us, lb 150.7310 us, ub 427.6068 us, ci 0.950 found 3 outliers among 100 samples (3.0%) 2 (2.0%) high mild 1 (1.0%) high severe 2 mb file: benchmarking sha256/internal mean: 26.44270 ms, lb 26.23701 ms, ub 26.63414 ms, ci 0.950 std dev: 1.012303 ms, lb 925.8921 us, ub 1.122267 ms, ci 0.950 variance introduced by outliers: 35.540% variance is moderately inflated by outliers benchmarking sha256/external mean: 26.84521 ms, lb 26.77644 ms, ub 26.91433 ms, ci 0.950 std dev: 347.7867 us, lb 210.6283 us, ub 571.3351 us, ci 0.950 found 6 outliers among 100 samples (6.0%) import Crypto.Hash import Data.ByteString.Lazy as L import Criterion.Main import Common testfile :: FilePath testfile = "/run/shm/data" -- on ram disk main = defaultMain [ bgroup "sha256" [ bench "internal" $ whnfIO internal , bench "external" $ whnfIO external ] ] sha256 :: L.ByteString -> Digest SHA256 sha256 = hashlazy internal :: IO String internal = show . sha256 <$> L.readFile testfile external :: IO String external = do s <- readProcess "sha256sum" [testfile] return $ fst $ separate (== ' ') s	2013-09-22 20:06:02 -04:00
Joey Hess	e8e209f4e5	better probing for gcrypt repositories using new --check option Now can tell if a repo uses gcrypt or not, and whether it's decryptable with the current gpg keys. This closes the hole that undecryptable gcrypt repos could have before been combined into the repo in encrypted mode.	2013-09-19 12:53:24 -04:00
Joey Hess	8062f6337f	webapp: support adding existing gcrypt special remotes from removable drives When adding a removable drive, it's now detected if the drive contains a gcrypt special remote, and that's all handled nicely. This includes fetching the git-annex branch from the gcrypt repo in order to find out how to set up the special remote. Note that gcrypt repos that are not git-annex special remotes are not supported. It will attempt to detect such a gcrypt repo and refuse to use it. (But this is hard to do and may fail; see https://github.com/blake2-ppc/git-remote-gcrypt/issues/6) The problem with supporting regular gcrypt repos is that we don't know what the gcrypt.participants setting is intended to be for the repo. So even if we can decrypt it, if we push changes to it they might not be visible to other participants. Anyway, encrypted sneakernet (or mailnet) is now fully possible with the git-annex assistant! Assuming that the gpg key distribution is handled somehow, which the assistant doesn't yet help with. This commit was sponsored by Navishkar Rao.	2013-09-18 15:55:31 -04:00
Joey Hess	6c35038643	gcrypt: Ensure that signing key is set to one of the participants keys. Otherwise gcrypt will fail to pull, since it requires this to be the case. This needs a patched gcrypt, which is in my forked version.	2013-09-17 16:06:29 -04:00
Joey Hess	5fe49b98f8	Support hot-swapping of removable drives containing gcrypt repositories. To support this, a core.gcrypt-id is stored by git-annex inside the git config of a local gcrypt repository, when setting it up. That is compared with the remote's cached gcrypt-id. When different, a drive has been changed. git-annex then looks up the remote config for the uuid mapped from the core.gcrypt-id, and tweaks the configuration appropriately. When there is no known config for the uuid, it will refuse to use the remote.	2013-09-12 15:54:35 -04:00
Joey Hess	b64f5baf2d	sync: support gcrypt	2013-09-09 10:02:15 -04:00
Joey Hess	ecbb326e9d	Allow building without quvi support.	2013-09-09 02:16:22 -04:00
Joey Hess	00fb5705ff	ignore gcrypt remotes w/o an annex-uuid	2013-09-08 15:19:14 -04:00
Joey Hess	3e079cdcd1	gcrypt: now supports rsync Use rsync for gcrypt remotes that are not local to the disk. (Note that I have punted on supporting http transport for now, it doesn't seem likely to be very useful.) This was mostly quite easy, it just uses the rsync special remote to handle the transfers. The git repository url is converted to a RsyncOptions structure, which required parsing it separately, since the rsync special remote only supports rsync urls, which use a different format. Note that annexed objects are now stored at the top of the gcrypt repo, rather than inside annex/objects. This simplified the rsync support, since it doesn't have to arrange to create that directory. And git-annex is not going to be run directly within gcrypt repos -- or if in some strange scenario it was, it would make sense for it to not see the encrypted objects. This commit was sponsored by Sheila Miguez	2013-09-08 14:54:28 -04:00
Joey Hess	9477a07cbf	local gcrypt fully working!	2013-09-08 13:00:48 -04:00
Joey Hess	7c1a9cdeb9	partially complete gcrypt remote (local send done; rest not) This is a git-remote-gcrypt encrypted special remote. Only sending files in to the remote works, and only for local repositories. Most of the work so far has involved making initremote work. A particular problem is that remote setup in this case needs to generate its own uuid, derived from the gcrypt-id. That required some larger changes in the code to support. For ssh remotes, this will probably just reuse Remote.Rsync's code, so should be easy enough. And for downloading from a web remote, I will need to factor out the part of Remote.Git that does that. One particular thing that will need work is supporting hot-swapping a local gcrypt remote. I think it needs to store the gcrypt-id in the git config of the local remote, so that it can check it every time, and compare with the cached annex-uuid for the remote. If there is a mismatch, it can change both the cached annex-uuid and the gcrypt-id. That should work, and I laid some groundwork for it by already reading the remote's config when it's local. (Also needed for other reasons.) This commit was sponsored by Daniel Callahan.	2013-09-07 18:38:00 -04:00
Joey Hess	a48a4e2f8a	automatically derive an annex-uuid from a gcrypt-uuids	2013-09-05 16:02:39 -04:00
Joey Hess	89eecd4b3b	rename constructor for clarity	2013-09-05 11:12:01 -04:00
guilhem	ac9807c887	Resolve an ambiguity between Ciphers. Cipher is now a datatype, data Cipher = Cipher String \| MacOnlyCipher String, which makes its interpretation more precise: MAC-only vs. MAC + used to derive a key for symmetric crypto.	2013-09-05 11:09:08 -04:00
Joey Hess	2b9f3cc175	tabs	2013-09-04 22:47:53 -04:00
Joey Hess	a51f1a4ee4	unimportant tweak fix something my internal haskell parser does a double take at	2013-09-04 22:39:25 -04:00
Joey Hess	930e6d22d6	replace an over-explained Bool with a data type This also highlights several places where a Read/Show or similar for the new data type could avoid redundant strings.	2013-09-04 22:18:33 -04:00
guilhem	3999a860eb	Encryption defaults to 'hybrid' When a keyid= is specified while encryption= is absent.	2013-09-04 21:34:33 -04:00
Joey Hess	1587fd42a3	fix build (seems getGpgEncOpts got renamed to getGpgEncParams)	2013-09-04 18:00:02 -04:00
guilhem	8293ed619f	Allow public-key encryption of file content. With the initremote parameters "encryption=pubkey keyid=788A3F4C". /!\ Adding or removing a key has NO effect on files that have already been copied to the remote. Hence keyid+= and keyid-= should be used with care on such remotes, and make little sense unless the point is to replace a (sub-)key by another. /!\ Also, a test case has been added to ensure that the cipher and file contents are encrypted as specified by the chosen encryption scheme.	2013-09-03 14:34:16 -04:00
guilhem	53ce59021a	Allow revocation of OpenPGP keys. /!\ It is to be noted that revoking a key does NOT necessarily prevent the owner of its private part from accessing data on the remote /!\ The only sound use of `keyid-=` is probably to replace a (sub-)key by another, where the private part of both is owned by the same person/entity: git annex enableremote myremote keyid-=2512E3C7 keyid+=788A3F4C Reference: http://git-annex.branchable.com/bugs/Using_a_revoked_GPG_key/ * Other change introduced by this patch: New keys now need to be added with option `keyid+=`, and the scheme specified (upon initremote only) with `encryption=`. The motivation for this change is to open for new schemes, e.g., strict asymmetric encryption. git annex initremote myremote encryption=hybrid keyid=2512E3C7 git annex enableremote myremote keyid+=788A3F4C	2013-08-29 14:31:33 -04:00
Joey Hess	f8ebce9396	better cases	2013-08-22 23:36:35 -04:00
Joey Hess	c0d8064018	unimportant typo (u and u' happened to be the same)	2013-08-22 23:27:12 -04:00
Joey Hess	46b6d75274	Youtube support! (And 53 other video hosts) When quvi is installed, git-annex addurl automatically uses it to detect when a page is a video, and downloads the video file. web special remote: Also support using quvi, for getting files, or checking if files exist in the web. This commit was sponsored by Mark Hepburn. Thanks!	2013-08-22 18:50:43 -04:00
Joey Hess	a3224ce35b	avoid more build warnings on Windows	2013-08-04 14:05:36 -04:00
Joey Hess	38022f4f49	Windows: Fixed permissions problem that prevented removing files from directory special remote. Directory special remotes now fully usable.	2013-08-04 13:43:48 -04:00
Joey Hess	06db8e0bd9	squash compiler warnings on Windows	2013-08-04 13:18:05 -04:00
Joey Hess	93f2371e09	get rid of __WINDOWS__, use mingw32_HOST_OS The latter is harder for me to remember, but avoids build failures in code used by the configure program.	2013-08-02 12:27:32 -04:00
Joey Hess	ca9ac8770f	directory special remote: Fix checking that there is enough disk space to hold an object, was broken when using encryption.	2013-07-20 16:30:49 -04:00
Joey Hess	d2f40d3d76	Fix checking when content is present in a non-bare repository accessed via http. I thought at first this was a Windows specific problem, but it's not; this affects checking any non-bare repository exported via http. Which is a potentially important use case! The actual bug was that the case where Right False was returned by the first url short-circuited later checks. But the whole method used felt like code I'd no longer write, and the use of undefined was particularly disgusting. So I rewrote it. Also added an action display. This commit was sponsored by Eric Hanchrow. Thanks!	2013-07-18 14:20:57 -04:00
Joey Hess	ea6fdc745f	fix build on windows	2013-07-09 16:25:15 -04:00
Joey Hess	7e7b2daddf	Windows: Fix url to object when using a http remote. annexLocations uses OS-native directory separators, but for an url, it needs to use / even on Windows. This is an ugly workaround. Could parameterize a lot of stuff in annexLocations to fix it better. I suspect this is probably the only place it's needed though.	2013-07-07 13:35:56 -04:00
Oliver Matthews	acd1b88741	Strip leading /~/ from bup relatively pathed bup remotes	2013-06-21 09:28:43 +01:00
Joey Hess	8be3e9baa2	Merge branch 'glacier' Conflicts: debian/changelog	2013-06-11 10:34:55 -04:00
Joey Hess	a64106dcef	Supports indirect mode on encfs in paranoia mode, and other filesystems that do not support hard links, but do support symlinks and other POSIX filesystem features.	2013-06-10 13:11:33 -04:00
Joey Hess	88d2d59f83	glacier: Better handling of the glacier inventory, which avoids duplicate uploads to the same glacier repository by `git annex copy`. The checkpresent hook can return either True or False, or fail with a message if it cannot successfully check the remote. Currently for glacier, when --trust-glacier is not set, it always returns False. Crucially, in the case when a file is in glacier, this is telling git-annex it's not there, so copy re-uploads it. This is not desirable; it breaks using glacier-cli to retrieve that file later, and it wastes money/bandwidth. What if instead, when the glacier inventory is missing a file, it returns False. And when the glacier inventory has a file, unless --trust-glacier is set, it fails. The result would be: * `git annex copy --to glacier` would only send things not listed in inventory. If a file is listed in the inventory, `copy` would complain that `--trust-glacier` is not set, and not re-upload the file. * `git annex drop` would only trust that glacier has a file when --trust-glacier is set. Behavior unchanged. * `git annex move --to glacier`, when the file is not listed in inventory, would send the file, and delete it locally. Behavior unchanged. * `git annex move --to glacier`, when the file is listed in inventory, would only trust that glacier has the file when --trust-glacier is set * `git annex copy --from glacier` / `git annex get`, when the file is located in glacier, would trust the location log, and attempt to get the file from glacier.	2013-05-29 13:52:42 -04:00
Joey Hess	3b1aedea3d	Merge branch 'robustness'	2013-05-25 15:22:18 -04:00
Joey Hess	bf86b5ca16	improve robustness of fromDirect and replaceFile Made fromDirect check that a file in the tree has good content (and is not a broken symlink either) before copying it to another file that has the same key. Made replaceFile clean up the temp file if the action that creates it, or the file replacement action fails.	2013-05-25 15:06:02 -04:00
Joey Hess	e3c1586997	Improve error handling when getting uuid of http remotes to auto-ignore, like with ssh remotes.	2013-05-25 01:47:19 -04:00
Joey Hess	2dce874c77	hook special remote: Added combined hook program support.	2013-05-21 19:19:03 -04:00
Joey Hess	796c2f6bc8	remove unnecessary bracketIO	2013-05-19 18:15:29 -04:00
Joey Hess	667a832de9	print encryption setup message before action	2013-05-18 19:36:55 -04:00
Joey Hess	03eec12cff	fix	2013-05-14 13:58:17 -04:00
Joey Hess	17952a893e	fix imports	2013-05-14 13:53:29 -04:00
Joey Hess	1496342c9e	typo	2013-05-14 13:52:30 -04:00
Joey Hess	40a9d8e097	avoid running background transferinfo when ssh connection caching is not supported	2013-05-14 13:51:14 -04:00
Joey Hess	03a0f17fbb	deal with Cygwin rsync paths issue	2013-05-14 13:24:15 -04:00
Joey Hess	25a8d4b11c	rename module	2013-05-12 19:19:28 -04:00
Joey Hess	abe8d549df	fix permission damage (thanks, Windows)	2013-05-11 23:54:25 -04:00
Joey Hess	3c7e30a295	git-annex now builds on Windows (doesn't work)	2013-05-11 15:03:00 -05:00
Joey Hess	763cbda14f	fixup #if 0 stubs to use #ifndef mingw32_HOST_OS That's needed in files used to build the configure program. For the other files, I'm keeping my __WINDOWS__ define, as I find that much easier to type. I may search and replace it to use the mingw32_HOST_OS thing later.	2013-05-10 16:57:21 -05:00
Joey Hess	6c74a42cc6	stub out POSIX stuff	2013-05-10 16:29:59 -05:00
Joey Hess	f92eaf6315	rsync special remotes: When sending from a crippled filesystem, use the destination's default file permissions, as the local ones can be arbitrarily broken. (Ie, ----rwxr-x for files on Android)	2013-05-09 13:55:18 -04:00
Joey Hess	a0f6dab8de	When initializing a directory special remote with a relative path, the path is made absolute. Using a relative path would work, until the user changed to some other directory in the repo and tried to access the remote from there..	2013-05-06 17:15:36 -04:00
Joey Hess	543a78bae0	Support building with DAV 0.4.	2013-04-30 14:10:55 -04:00
Joey Hess	883b17af01	Store an annex-uuid file in the bucket when setting up a new S3 remote.	2013-04-27 17:01:24 -04:00
Joey Hess	c3498042fd	webapp: Now automatically fills in any creds used by an existing remote when creating a new remote of the same type. Done for Internet Archive, S3, Glacier, and Box.com remotes.	2013-04-27 15:16:06 -04:00
Joey Hess	3c7f4d2bd1	Automatically register public urls for files uploaded to the Internet Archive.	2013-04-25 17:28:25 -04:00
Joey Hess	e3ea36174b	webapp: Display some additional information about a repository on its edit page.	2013-04-25 16:42:17 -04:00
Joey Hess	3e396a3b89	S3: Dropping content from the Internet Archive doesn't work, but their API indicates it does. Always refuse to drop from there.	2013-04-25 15:20:31 -04:00
Joey Hess	8284b310a7	support enabling IA repositories	2013-04-25 13:14:49 -04:00
Joey Hess	4b1cf3d731	Detect when the remote is broken like bitbucket is, and exits 0 when it fails to run git-annex-shell.	2013-04-23 20:06:02 -04:00
Joey Hess	8a2d1988d3	expose Control.Monad.join I think I've been looking for that function for some time. Ie, I remember wanting to collapse Just Nothing to Nothing.	2013-04-22 20:24:53 -04:00
Joey Hess	8861e270be	sync, assistant: Sync with remotes that have annex-ignore set This is so git remotes on servers without git-annex installed can be used to keep clients' git repos in sync. This is a behavior change, but since annex-sync can be set to disable syncing with a remote, I think it's acceptable.	2013-04-22 14:57:09 -04:00
Joey Hess	b9904b0c42	fix tab damage	2013-04-13 19:26:59 -04:00
guilhem	a1eded8641	Allow rsync to use other remote shells. Introduced a new per-remote option 'annex-rsync-transport' to specify the remote shell that is to be used with rsync. In case the value is 'ssh', connections are cached unless 'sshcaching' is unset.	2013-04-13 19:26:24 -04:00
Joey Hess	9e11699c76	connect existing meters to the transfer log for downloads Most remotes have meters in their implementations of retrieveKeyFile already. Simply hooking these up to the transfer log makes that information available. Easy peasy. This is particularly valuable information for encrypted remotes, which otherwise bypass the assistant's polling of temp files, and so don't have good progress bars yet. Still some work to do here (see progressbars.mdwn changes), but this is entirely an improvement from the lack of progress bars for encrypted downloads.	2013-04-11 17:32:31 -04:00
Joey Hess	c511eb048f	changelog & minor style fixes	2013-04-06 16:14:57 -04:00
guilhem	00fc21bfec	Generate ciphers with a better entropy. Unless highRandomQuality=false (or --fast) is set, use Libgcrypt's 'GCRY_VERY_STRONG_RANDOM' level by default for cipher generation, like it's done for OpenPGP key generation. On the assistant side, the random quality is left to the old (lower) level, in order not to scare the user with an endless page load due to the blocking PRNG waiting for IO actions.	2013-04-06 16:09:51 -04:00
Joey Hess	f1b0a4b404	Use lower case hash directories for storing files on crippled filesystems, same as is already done for bare repositories. * since this is a crippled filesystem anyway, git-annex doesn't use symlinks on it * so there's no reason to use the mixed case hash directories that we're stuck using to avoid breaking everyone's symlinks to the content * so we can do what is already done for all bare repos, and make non-bare repos on crippled filesystems use the all-lower case hash directories * which are, happily, all 3 letters long, so they cannot conflict with mixed case hash directories * so I was able to 100% fix this and even resuming `git annex add` in the test case will recover and it will all just work.	2013-04-04 15:46:33 -04:00
Joey Hess	8a5b397ac4	hlint	2013-04-03 03:52:41 -04:00
guilhem	55f0f858ee	Allow other MAC algorithms in the Remote Config.	2013-03-29 18:04:52 -04:00
Joey Hess	cf07a2c412	webapp: Progress bar fixes for many types of special remotes. There was confusion in different parts of the progress bar code about whether an update contained the total number of bytes transferred, or the number of bytes transferred since the last update. One way this bug showed up was progress bars that seemed to stick at zero for a long time. In order to fix it comprehensively, I add a new BytesProcessed data type, that is explicitly a total quantity of bytes, not a delta. Note that this doesn't necessarily fix every problem with progress bars. Particularly, buffering can now cause progress bars to seem to run ahead of transfers, reaching 100% when data is still being uploaded.	2013-03-28 17:04:37 -04:00
Joey Hess	449520a573	add globallyAvailable to remotes	2013-03-15 19:16:13 -04:00
Joey Hess	19c0a0d5b1	split cost out into its own module Added a function to insert a new cost into a list, which could be used to adjust costs after a drag and drop.	2013-03-13 16:30:34 -04:00
Joey Hess	f7de51e8b6	Bugfix: Fix bug in inode cache sentinel check, which broke copying to local repos if the repo being copied from had moved to a different filesystem or otherwise changed all its inodes	2013-03-12 16:41:54 -04:00
guilhem	d2bc0e9f3e	GnuPG options for symmetric encryption.	2013-03-11 09:48:38 -04:00
Joey Hess	69ab9701eb	copyToRemote should return True when the remote already has the key This got broken in commit `e9238e9588`. I observed a key that had been copied to a remote, but the location log was out of date, and due to this bug, git annex transferkey failed and so the file could not be dropped when it was moved to an archive directory.	2013-03-10 17:54:27 -04:00
Joey Hess	56830af8d8	simpler use of MIN_VERSION checks	2013-03-10 15:43:17 -04:00
Joey Hess	ff6ce2bc15	print a warning message when garbage is received from configlist	2013-03-04 23:27:18 -04:00
Joey Hess	0c13d3065e	git subcommand cleanup Pass subcommand as a regular param, which allows passing git parameters like -c before it. This was already done in the piping set of functions, but not the command running set.	2013-03-03 13:39:07 -04:00
Joey Hess	b117efc19b	deal with http-conduit changing a data type Pity that the library does not provide a function to extract the status code from the StatusCodeException, so when they had to add a new field, it breaks every single place that does it.	2013-02-27 00:07:28 -04:00
Joey Hess	a7a1bcd1d6	Avoid passing -p to rsync, to interoperate with crippled filesystems. In general, git-annex does not try to preserve file permissions. For example, they don't round trip through special remotes. So it's ok to not preserve them for git remotes either. On crippled filesystems, rsync has been observed failing after the file was transferred because it couldn't set some permission or other.	2013-02-22 15:23:29 -04:00
Joey Hess	96613e85a9	build fix	2013-02-15 13:48:25 -04:00
Joey Hess	9e69fca5bb	optimise sending to encrypted rsync With an encrypted rsync remote, the encrypted file can be renamed, rather than being copied, in crippled filesystem mode. This gets back to just as fast as non-crippled mode for this very common case.	2013-02-15 13:42:41 -04:00
Joey Hess	92b4a63a06	rsync special remote support for crippled filesystem mode Cannot make a hard link, have to copy. I did find a way to make it work without setting up a tree, just using --include and --exclude. But it needs the same hash directories to be used on both sides, which is normally not the case. Still, I hope one day I will convert non-bare repos to use the same hash dirs as everything else, and then this will get more efficient.	2013-02-15 13:33:36 -04:00
Joey Hess	47477b2807	crippled filesystem support, probing and initial support git annex init probes for crippled filesystems, and sets direct mode, as well as `annex.crippledfilesystem`. Avoid manipulating permissions of files on crippled filesystems. That would likely cause an exception to be thrown. Very basic support in Command.Add for crippled filesystems; avoids the lock down entirely since doing it needs both permissions and hard links. Will make this better soon.	2013-02-14 14:15:26 -04:00
Joey Hess	18a6935e42	safe recv-key in direct mode Checks the key's size and checksum. This is sorta expensive, but it avoids needing to add another round-trip to the protocol.	2013-01-11 16:03:45 -04:00
Joey Hess	fec55e742f	check for direct mode file change when copying from a local git remote Only missing direct mode transfer check now is git-annex shell recvkey.	2013-01-10 11:56:06 -04:00
Joey Hess	a6a5ed8121	check for direct mode file change when copying to a local git remote	2013-01-10 11:45:44 -04:00
Joey Hess	1bc49b7158	Special remotes now all rollback storage of keys that get modified during the transfer, which can happen in direct mode.	2013-01-09 18:42:29 -04:00
Joey Hess	909f67443f	Fix transferring files to special remotes in direct mode.	2013-01-06 14:29:01 -04:00
Joey Hess	24c6eae1b5	show errors	2013-01-02 13:50:16 -04:00
Joey Hess	4008590c68	type based git config handling for remotes Still a couple of places that use git config ad-hoc, but this is most of it done.	2013-01-01 13:58:14 -04:00
Joey Hess	ffdd08fd2e	Merge branch 'master' into desymlink	2012-12-13 00:46:10 -04:00
Joey Hess	0d50a6105b	whitespace fixes	2012-12-13 00:45:27 -04:00
Joey Hess	e7b8cb0063	direct mode committing	2012-12-12 19:20:38 -04:00
Joey Hess	b4c6da9cbd	Got object sending working in direct mode. However, I don't yet have a reliable way to deal with files being modified while they're being transferred. I have code that detects it on the sending side, but the receiver is still free to move the wrong content into its annex, and record that it has the content. So that's not acceptable, and I'll need to work on it some more. However, at this point I can use a direct mode repository as a remote and transfer files from and to it.	2012-12-08 17:03:39 -04:00
Joey Hess	5460414486	webdav: Avoid trying to set props, avoiding incompatibility with livedrive.com. Needs DAV version 0.3.	2012-12-01 17:12:41 -04:00
Joey Hess	757f041cd8	instrument webdav test	2012-12-01 14:32:50 -04:00
Joey Hess	0b6c889012	webapp: S3 and Glacier forms now have a select list of all currently-supported AWS regions.	2012-12-01 14:11:37 -04:00
Joey Hess	020a25abe1	avoid unnecessary Maybe	2012-11-30 00:55:59 -04:00
Joey Hess	ea5d7292e6	dropping from web	2012-11-29 17:01:07 -04:00
Joey Hess	3b35cde0e8	assistant: Retrieval from glacier now handled.	2012-11-29 15:23:33 -04:00
Joey Hess	8dd1d9aaf9	webapp: Defaults to sharing box.com account info with friends, allowing one-click enabling of the repository.	2012-11-28 13:31:49 -04:00
Joey Hess	5ff666ec99	rsync: Fix bug introduced in last release that broke encrypted rsync special remotes.	2012-11-27 16:29:31 -04:00
Joey Hess	356120652f	remove redundant showOutput calls. The meter code does that too.	2012-11-25 16:13:06 -04:00
Joey Hess	fb19d56476	progress bars for glacier downloads	2012-11-25 13:49:22 -04:00
Joey Hess	606c210378	progress bars for glacier uploads	2012-11-25 13:27:20 -04:00
Joey Hess	da6c738dad	adjust glacier remote cost to 1000 Higher than any other remote, this is mostly due to the long retrieval time, so it'd make sense to get a file from nearly any other remote. (Unless it's behind a very slow connection.)	2012-11-22 16:59:10 -04:00
Joey Hess	f53496830a	pass --quiet to checkpresent	2012-11-21 19:35:28 -04:00
Joey Hess	a5111a6d85	Amazon Glacier special remote; 100% working	2012-11-20 16:43:58 -04:00
Joey Hess	9221e62d87	Allow controlling whether login credentials for S3 and webdav are committed to the repository, by setting embedcreds=yes|no when running initremote.	2012-11-19 17:32:58 -04:00
Joey Hess	f7a7ec4ebf	new storage regime implemented for webdav	2012-11-19 14:08:39 -04:00
Joey Hess	7b71685a93	Bugfix: directory special remote could loop forever storing a key when a too small chunksize was configured. Ensure that each file has something written to it, even if the bytestring chunk size is greater than the configured chunksize. This means we may write a bit larger than the configured value, but only when the configured value is very small; ie, < 8 kb.	2012-11-19 13:30:58 -04:00
Joey Hess	5f977cc725	directory special remote: Made more efficient and robust. Files are now written to a tmp directory in the remote, and once all chunks are written, etc, it's moved into the final place atomically. For now, checkpresent still checks every single chunk of a file, because the old method could leave partially transferred files with some chunks present and others not.	2012-11-19 13:18:23 -04:00
Joey Hess	d3dfeeb3d9	remove annex/ from key locations used for webdav	2012-11-18 23:59:39 -04:00
Joey Hess	7df1e71fe3	S3: Added progress display for uploading and downloading.	2012-11-18 22:49:07 -04:00
Joey Hess	b0e08ae457	S3: upload progress display	2012-11-18 22:20:43 -04:00
Joey Hess	e2b7fc1ebd	refactor	2012-11-18 21:50:16 -04:00
Joey Hess	afa2f9c967	upload progress bars for webdav!	2012-11-18 20:30:05 -04:00
Joey Hess	c8751be151	simplify	2012-11-18 18:27:53 -04:00
Joey Hess	81379bb29c	better streaming while encrypting/decrypting Both the directory and webdav special remotes used to have to buffer the whole file contents before it could be decrypted, as they read from chunks. Now the chunks are streamed through gpg with no buffering.	2012-11-18 15:27:44 -04:00
Joey Hess	3607c92222	fix warning	2012-11-18 14:06:54 -04:00
Joey Hess	8a6941a216	fix build with xml-conduit newer than in debian The Element data type changed to use a map of attributes. Rather than ifdef, I'm avoiding directly using that data type.	2012-11-18 13:46:38 -04:00
Joey Hess	7addb89dc1	webapp: support box.com	2012-11-17 15:30:11 -04:00
Joey Hess	1fe76b57d6	webdav now checks presence of and receives chunked content Note that receiving encrypted chunked content currently involves buffering. (So does doing so with the directory special remote.)	2012-11-16 23:16:18 -04:00
Joey Hess	0b3126a30b	back to standard directory layout for webdav remotes This allows deleting all chunks for a file with a single http command, so it's a win after all. However, does not look in the mixed case hash directories, which were in the past used by the directory, etc remotes.	2012-11-16 18:14:07 -04:00
Joey Hess	a1869ad662	webdav now supports sending chunked content Not yet getting it though.	2012-11-16 17:58:58 -04:00
Joey Hess	92d5d81c2c	generic chunked content helper However, directory still uses its optimized chunked file writer, as it uses less memory than the generic one in the helper.	2012-11-16 17:58:08 -04:00
Joey Hess	0f782bd028	encrypted webdav working	2012-11-16 13:57:32 -04:00
Joey Hess	bb28c6114a	drop webdav compatibility with the directory special remote etc The benefit of using a compatible directory structure does not outweigh the cost in complexity of handling the multiple locations content can be stored in directory special remotes. And this also allows doing away with the parent directories, which can't be made unwritable in DAV, so have no benefit there. This will save 2 http calls per file store. But, kept the directory hashing, just in case.	2012-11-16 00:42:33 -04:00
Joey Hess	a4b86c63d6	webdav is fully working in non-encrypted mode	2012-11-16 00:09:22 -04:00
Joey Hess	3c039d329c	update to dav 0.1, and basic uploading is working!	2012-11-15 13:46:16 -04:00
Joey Hess	0cba0cb2dd	skeletal webdav special remote Doesn't actually store anything yet, but initremote works and tests the server.	2012-11-14 20:25:31 -04:00
Joey Hess	e250f6f11f	factor out Creds	2012-11-14 19:32:27 -04:00
Joey Hess	2172cc586e	where indenting	2012-11-11 00:51:07 -04:00
Joey Hess	6eca362c5d	indentation foo, and a new coding style page. no code changes	2012-10-28 21:27:15 -04:00
Joey Hess	9767562f65	rsync special remote: Include annex-rsync-options when running rsync to test a key's presence. Also, use the new withQuietOutput function to avoid running the shell to /dev/null stderr in two other places.	2012-10-28 13:51:14 -04:00
Joey Hess	7ee0ffaeb9	Use USER and HOME environment when set, and only fall back to getpwent, which doesn't work with LDAP or NIS.	2012-10-25 18:17:54 -04:00
Joey Hess	ee24f23ecb	fix build	2012-10-24 10:54:58 -04:00
Joey Hess	8b1235b022	bup: Don't pass - to bup-split to make it read stdin bup 0.25 does not accept that; and bup split reads from stdin by default if no file is given. I'm not sure what version of bup changed this. This only affected bup special remotes that were encrypted.	2012-10-23 16:01:02 -04:00
Joey Hess	452e6819d0	!! removal	2012-10-21 00:51:42 -04:00
Joey Hess	14b376d440	Merge branch 'safesemaphore' Conflicts: debian/changelog git-annex.cabal	2012-10-20 12:44:25 -04:00
Joey Hess	e290f1b903	Automatically detect when a ssh remote does not have git-annex-shell installed, and set annex-ignore. Aka solve the github problem. Note that it's possible the initial configlist will fail for some network reason etc, and then the fetch succeeds. In this case, a usable remote gets disabled. But it does print a message, and this only happens once per remote, so that seems ok.	2012-10-12 13:45:14 -04:00
Ben Gamari	179aeeaacc	Remote/Git: Use SampleVar from SafeSemaphore instead of base SampleVars from base are unsafe	2012-10-05 17:03:58 -04:00
Joey Hess	47314c0fad	fix last zombies in the assistant Made Git.LsFiles return cleanup actions, and everything waits on processes now, except of course for Seek.	2012-10-04 19:56:32 -04:00
Joey Hess	de3ea4adb6	remove now-unnecessary manual reaps	2012-10-04 18:58:57 -04:00
Joey Hess	f18a53eec0	change s3 creds caching Rather than store decrypted creds in the environment, store them in the creds cache file. This way, a single git-annex can have multiple S3 remotes using different creds.	2012-09-26 14:42:51 -04:00
Joey Hess	e4bf74a965	store S3 creds in a 600 mode file inside the local git repo	2012-09-26 14:42:32 -04:00
Joey Hess	df07ccf404	make the assistant retry failed transfers When a transfer fails, the progress info can be used to intelligently retry it. If the transfer managed to make some progress, but did not fully complete, then there's a good chance that a retry will finish it (or at least make more progress).	2012-09-23 13:27:13 -04:00
Joey Hess	c048add74d	hooked up git-annex-shell transferinfo Finally done with progressbars!	2012-09-21 23:25:06 -04:00
Joey Hess	1722c23f56	fix logic error introduced yesterday	2012-09-21 20:24:08 -04:00
Joey Hess	ff32ee5152	upload progress tracking for the directory special remote	2012-09-21 14:54:24 -04:00
Joey Hess	226781c047	unify types	2012-09-21 14:50:14 -04:00
Joey Hess	2ae38325d5	hook rsync special remote up to the progress reporting Easy! Note that with an encrypted remote, rsync will be sending a little more data than the key size, so displayed progress may get to 100% slightly quicker than it should. I doubt this is a big enough effect to worry about.	2012-09-20 13:51:51 -04:00
Joey Hess	19e35f7f0d	upload progress bar for git remote on same filesystem cp is used here, but we can just watch the size of the destination file This commit made from within the ruins of an old mill, overlooking a beautiful waterfall.	2012-09-20 13:35:53 -04:00
Joey Hess	e1037adebc	rsync progress interception Current implementation parses rsync's output a character at a time, which is hardly efficient. It could be sped up a lot by using hGetBufSome, but that would require going really lowlevel, down to raw C style buffers (good example of that here: http://users.aber.ac.uk/afc/stricthaskell.html) But rsync doesn't output very much, so currently it seems ok.	2012-09-19 16:55:08 -04:00
Joey Hess	aff09a1f33	add a progress callback to storeKey, and threaded it all the way through Transfer info files are updated when the callback is called, updating the number of bytes transferred. Left unused p variables at every place the callback should be used. Which is rather a lot..	2012-09-19 16:08:37 -04:00
Joey Hess	45a26175d6	renamed RsyncFile -> Rsync	2012-09-19 14:28:32 -04:00
Joey Hess	e9238e9588	avoid starting a download for a local transfer when the remote already has the key Turns out that recvkey already does this same check. This avoids a transfer file being created for the download that never happened, which in turn will avoid the assistant seeing that the download has finished, when no transfer actually took place.	2012-09-18 13:59:03 -04:00
Joey Hess	beaecce68b	git http:// remotes are readonly too	2012-08-26 15:53:31 -04:00
Joey Hess	271ea49978	add support for readonly remotes Currently only the web special remote is readonly, but it'd be possible to also have readonly drives, or other remotes. These are handled in the assistant by only downloading from them, and never trying to upload to them.	2012-08-26 15:39:02 -04:00
Joey Hess	f4ca592cd0	refactor	2012-08-26 14:34:30 -04:00
Joey Hess	78d3add86b	tweak field name	2012-08-26 14:26:43 -04:00
Joey Hess	b818337054	fix build warning	2012-08-16 16:48:27 -07:00
Joey Hess	cbca93cf7c	Merge branch 'master' into assistant Conflicts: debian/changelog	2012-08-16 16:36:32 -07:00
Joey Hess	ad4e152fd6	S3: Add fileprefix setting.	2012-08-09 13:54:54 -04:00
Joey Hess	94fcd0cf59	add routes to pause/start/cancel transfers This commit includes a paydown on technical debt incurred two years ago, when I didn't know that it was bad to make custom Read and Show instances for types. As the routes need Read and Show for Transfer, which includes a Key, and deriving my own Read instance of key was not practical, I had to finally clean that up. So the compact Key read and show functions are now file2key and key2file, and Read and Show are now derived instances. Changed all code that used the old instances, compiler checked. (There were a few places, particularly in Command.Unused, and the test suite where the Show instance continues to be used for legitimate comparisons; ie show key_x == show key_y (though really in a bloom filter))	2012-08-08 16:20:24 -04:00
Joey Hess	cb0f435d94	adding removable drive repos now basically works	2012-08-05 14:49:47 -04:00
Joey Hess	4ec9244f1a	add a path field to remotes Also broke out some helper functions around constructing remotes, to be used later.	2012-07-22 14:30:43 -04:00
Joey Hess	1db7d27a45	add back debug logging Make Utility.Process wrap the parts of System.Process that I use, and add debug logging to them. Also wrote some higher-level code that allows running an action with handles to a process's stdin or stdout (or both), and checking its exit status, all in a single function call. As a bonus, the debug logging now indicates whether the process is being run to read from it, feed it data, chat with it (writing and reading), or just call it for its side effect.	2012-07-19 00:46:52 -04:00
Joey Hess	d1da9cf221	switch from System.Cmd.Utils to System.Process Test suite now passes with -threaded! I traced back all the hangs with -threaded to System.Cmd.Utils. It seems it's just crappy/unsafe/outdated, and should not be used. System.Process seems to be the cool new thing, so converted all the code to use it instead. In the process, --debug stopped printing commands it runs. I may try to bring that back later. Note that even SafeSystem was switched to use System.Process. Since that was a modified version of code from System.Cmd.Utils, it needed to be converted too. I also got rid of nearly all calls to forkProcess, and all calls to executeFile, which I'm also doubtful about working well with -threaded.	2012-07-18 18:00:24 -04:00
Joey Hess	81b20a581a	avoid --no-inplace Not available on systems with shoddy getopts. Should not be necessary, as that's rsync's default.	2012-07-10 12:40:31 -06:00
Joey Hess	760e028dca	pass associatedfile and remoteuuid to git-annex-shell This almost works. Along the way, I noticed that the --uuid parameter was being accidentally passed after the --, so that has never been actually used by git-annex-shell to verify it's running in the expected repository. Oops. Fixed.	2012-07-02 10:57:51 -04:00
Joey Hess	7225c2bfc0	record transfer information on local git remotes In order to record a semi-useful filename associated with the key, this required plumbing the filename all the way through to the remotes' storeKey and retrieveKeyFile. Note that there is potential for deadlock here, narrowly avoided. Suppose the repos are A and B. A sends file foo to B, and at the same time, B gets file foo from A. So, A locks its upload transfer info file, and then locks B's download transfer info file. At the same time, B is taking the two locks in the opposite order. This is only not a deadlock because the lock code does not wait, and aborts. So one of A or B's transfers will be aborted and the other transfer will continue. Whew!	2012-07-01 17:15:11 -04:00
Joey Hess	29335bf326	pointlessness	2012-06-29 10:00:05 -04:00
Joey Hess	6aee7e5a8b	Better fix for unavailable local remotes Not including such remotes turned out to have other consequences, including annex-trustlevel git config being ignored. Instead, add guards before each operation that might try to operate on such a repo.	2012-06-26 22:27:30 -04:00
Joey Hess	7e62e57f8c	Avoid ugly failure mode when moving content from a local repository that is not available. Prelude.undefined error message was introduced by `bb4f31a0ee`. It seems best to filter out local repositories that cannot be accessed from the list of remotes, rather than keeping them in and making every thing that uses the list have to deal with remotes that may have an unknown location. Besides fixing the error message, this also makes unavailable local remotes' names not be shown in various messages, including in git annex status output. Also, move --to an unavailable local repository now avoids some ugly errors like "changeWorkingDirectory: does not exist".	2012-06-26 17:22:44 -04:00
Joey Hess	75b6ee81f9	avoid ByteString.Char8 where not needed Its truncation behavior is a red flag, so avoid using it in these places where only raw ByteStrings are used, without looking at the data inside.	2012-06-20 13:13:40 -04:00
Joey Hess	e0095b0bdc	fishy commit	2012-06-14 00:01:48 -04:00
Joey Hess	5809f33f8b	use createAnnexDirectory when setting up tmp dir	2012-06-05 20:25:32 -04:00
Joey Hess	13118136c0	Preserve parent environment when running hooks of the hook special remote.	2012-06-04 21:52:36 -04:00
Joey Hess	37ef39c929	suppress "(Recording state in git)" message when committing change to remote state This was shown redundantly for a tricky reason -- while it runs inside a doSideAction block that would appear to suppress it, the action being run is in a different state monad; for the remote, and so the suppression doesn't work. Always suppressing the message when committing to a local remote is ok to do though -- it mirrors the /dev/nulling of the git annex shell commit output. And it turns out that any time there is a git-annex branch state change to commit on the remote, the local repo has also had a similar change made, and so the message has been shown already.	2012-05-20 00:14:56 -04:00
Joey Hess	eb6cb1b87f	Add support for core.worktree, and fix support for GIT_WORK_TREE and GIT_DIR. The environment needs to override git-config. Changed when git config is read, and avoid rereading it once it's been read. chdir for both worktree settings.	2012-05-18 18:20:53 -04:00
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn whether a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	f7d8982672	Fix use of several config settings annex.ssh-options, annex.rsync-options, annex.bup-split-options. And adjust types to avoid the bugs that broke several config settings recently. Now "annex." prefixing is enforced at the type level.	2012-05-05 20:16:56 -04:00
Joey Hess	6d61067599	rsync shellescape disable option Rsync special remotes can be configured with shellescape=no to avoid shell quoting that is normally done when using rsync over ssh. This is known to be needed for certain rsync hosting providers (specifically hidrive.strato.com) that use rsync over ssh but do not pass it through the shell.	2012-05-02 13:08:33 -04:00
Joey Hess	bd592d1450	refactor	2012-04-29 14:33:07 -04:00
Joey Hess	1c16f616df	Added shared cipher mode to encryptable special remotes. This option avoids gpg key distribution, at the expense of flexibility, and with the requirement that all clones of the git repository be equally trusted.	2012-04-29 14:02:43 -04:00
Joey Hess	84ac8c58db	Add annex.httpheaders and annex.httpheader-command config settings Allow custom headers to be sent with all HTTP requests. (Requested by the Internet Archive)	2012-04-22 01:13:09 -04:00
Joey Hess	ed79596b75	noop	2012-04-21 23:32:33 -04:00
Joey Hess	bee420bd2d	in which I discover void void :: Functor f => f a -> f () -- ah, of course that's useful :)	2012-04-21 23:06:19 -04:00
Joey Hess	b98b69e8c6	honor core.sharedRepository when making all the other files in the annex Lock files, directories, etc.	2012-04-21 19:36:03 -04:00
Joey Hess	5cc76098ca	Directory special remotes now check annex.diskreserve.	2012-04-20 16:24:44 -04:00
Joey Hess	aa353d1400	use LANGUAGE CPP pragma, avoids running cpp on all the other sources	2012-04-17 18:37:40 -04:00
Joey Hess	626697b459	cabal file now autodetects whether S3 support is available.	2012-04-14 14:22:33 -04:00
Joey Hess	c924542e61	bup: Properly handle key names with spaces or other things that are not legal git refs. Continue using the key name as bup ref name, to preserve backwards compatibility, unless it is an illegal git ref. In that case, use a sha256 of the key name instead.	2012-04-11 12:45:49 -04:00
Joey Hess	4eb5112681	rationalize getConfig getConfig got a remote-specific config, and this confusing name caused it to be used a couple of places that only were interested in global configs. Rename to getRemoteConfig and make getConfig only get global configs. There are no behavior changes here, but remote.<name>.annex-web-options never actually worked (and per-remote web options is a very unlikely to be useful case so I didn't make it work), so fix the documentation for it.	2012-03-22 17:32:47 -04:00
Joey Hess	a362c46b70	fun with symbols Nothing at all on hackage is using <&&> or <||>. (Also, <&&> should short-circuit on failure.)	2012-03-17 00:38:40 -04:00
Joey Hess	c0c9991c9f	nukes another 15 lines thanks to ifM	2012-03-15 20:39:25 -04:00
Joey Hess	b27760aa68	Work around a bug in rsync (IMHO) introduced by openSUSE's SIP patch. openSUSE patches rsync with a patch adding SIP protocol support. https://gist.github.com/2026167 With this patch, running rsync with no hostname parameter is apparently supposed to list SIP hosts on the network. Practically, it does nothing and exits 0. git-annex uses rsync in a very special way to allow git-annex-shell to be run on the remote host, and so did not need to specify a hostname, or a file to transfer as a rsync parameter. So it sent ":", a degenerate case of "host:file". But the patch cannot differentiate ":" with no host parameter (a bug in the SIP patch surely). Results were that getting files failed, as rsync seemed to succeed, but the requested file failed to arrive. Also I think that sending files will make git-annex think a file has been transferred to the remote when really rsync does nothing. The workaround for this buggy rsync patch is to use "dummy:" as the hostname.	2012-03-12 22:53:43 -04:00
Joey Hess	52e88f3ebf	add remote start and stop hooks Locking is used, so that, if there are multiple git-annex processes using a remote concurrently, the stop hook is only run by the last process that uses it.	2012-03-04 19:12:58 -04:00
Joey Hess	3960825cef	better chunked file retrieval Avoids opening every chunk at once, instead streaming them in. Not done for encrypted file retrieval yet.	2012-03-04 11:48:23 -04:00
Joey Hess	7ba79cfb8c	thread through original key to retrieveEnctypted Allows showing progress bar for this last case of the directory special remote.	2012-03-04 03:36:39 -04:00
Joey Hess	4638314001	add progress display when receiving files That was actually really easy. But, when getting a file from an encrypted directory special remote, no meter can be shown, because the total file size is not known.	2012-03-04 03:25:41 -04:00
Joey Hess	9856c24a59	Add progress bar display to the directory special remote. So far I've only written progress bars for sending files, not yet receiving. No longer uses external cp at all. ByteString IO is fast enough.	2012-03-04 03:17:25 -04:00
Joey Hess	50c897c082	tweak	2012-03-03 20:02:48 -04:00
Joey Hess	3436aba6de	Directory special remotes now support chunking files written to them Avoiding writing files larger than a specified size is useful on certain things. For example, box.com has a file size limit of 100 mb. Could also be useful on really crappy removable media.	2012-03-03 18:05:55 -04:00
Joey Hess	c3fbe07d7a	do a cleanup commit after moving data from or to a git remote Added Annex.cleanup, which is a general purpose interface for adding actions to run at the end. Remotes with the old git-annex-shell will commit every time, and have no commit command, so hide stderr when running the commit command.	2012-02-25 18:02:49 -04:00
Joey Hess	cb631ce518	whereis: Prints the urls of files that the web special remote knows about.	2012-02-14 03:49:48 -04:00
Joey Hess	8fbc529d68	oops	2012-02-14 03:10:01 -04:00
Joey Hess	cbaebf538a	rework git check-attr interface Now gitattributes are looked up, efficiently, in only the places that really need them, using the same approach used for cat-file. The old CheckAttr code seemed very fragile, in the way it streamed files through git check-attr. I actually found that `cad8824852` was still deadlocking with ghc 7.4, at the end of adding a lot of files. This should fix that problem, and avoid future ones. The best part is that this removes withAttrFilesInGit and withNumCopies, which were complicated Seek methods, as well as simplifying the types for several other Seek methods that had a Backend tupled in.	2012-02-13 23:52:21 -04:00
Joey Hess	9030f68452	When checking that an url has a key, verify that the Content-Length, if available, matches the size of the key. If there's no Content-Length, or the key has no size, this check is not done, but it should happen most of the time, and protect against web content that has changed.	2012-02-10 19:23:41 -04:00
Joey Hess	57a747d081	S3: Fix irrefutable pattern failure when accessing encrypted S3 credentials.	2012-02-08 11:41:15 -04:00
Joey Hess	b9b72d22a9	refactor Wow, triple monadic lift!	2012-02-07 01:40:14 -04:00
Joey Hess	146c36ca54	IO exception rework ghc 7.4 complains about use of System.IO.Error to catch exceptions. Ok, use Control.Exception, with variants specialized to only catch IO exceptions.	2012-02-03 16:47:24 -04:00
Joey Hess	775958b4dc	faster local-local dropping Dropping a key from a local remote ran git-annex-shell unnecessarily. Now git-annex-shell is never used when acting on a local remote.	2012-01-28 16:00:20 -04:00
Joey Hess	b81d662cbf	Avoid repeated location log commits when a remote is receiving files. Done by adding a oneshot mode, in which location log changes are written to the journal, but not committed. Taking advantage of git-annex's existing ability to recover in this situation. This is used by git-annex-shell and other places where changes are made to a remote's location log.	2012-01-28 15:41:52 -04:00

1129 commits