git-annex

Author	SHA1	Message	Date
Joey Hess	c6032b0dab	clean up some ugly code	2013-09-27 19:52:36 -04:00
Joey Hess	e864c8d033	blind enabling gcrypt repos on rsync.net This pulls off quite a nice trick: When given a path on rsync.net, it determines if it is an encrypted git repository that the user has the key to decrypt, and merges with it. This is works even when the local repository had no idea that the gcrypt remote exists! (As previously done with local drives.) This commit sponsored by Pedro Côrte-Real	2013-09-27 16:21:56 -04:00
Joey Hess	e0b99f3960	support ssh://host/~/dir When generating the path for rsync, /~/ is not valid, so change to just host:dir Note that git remotes specified in host:dir form are internally converted to the ssh:// url form, so this was especially needed..	2013-09-26 15:02:27 -04:00
Joey Hess	c1990702e9	hlint	2013-09-25 23:19:01 -04:00
Joey Hess	3192b059b5	add back lost check that git-annex-shell supports gcrypt	2013-09-24 17:51:12 -04:00
Joey Hess	4c954661a1	git-annex-shell: Added support for operating inside gcrypt repositories. * Note that the layout of gcrypt repositories has changed, and if you created one you must manually upgrade it. See http://git-annex.branchable.com/upgrades/gcrypt/	2013-09-24 17:25:47 -04:00
Joey Hess	f9e438c1bc	factor out more ssh stuff from git remote This has the dual benefits of making Remote.Git shorter, and letting Remote.GCrypt use these utilities.	2013-09-24 13:37:41 -04:00
Joey Hess	7390f08ef9	Use cryptohash rather than SHA for hashing. This is a massive win on OSX, which doesn't have a sha256sum normally. Only use external hash commands when the file is > 1 mb, since cryptohash is quite close to them in speed. SHA is still used to calculate HMACs. I don't quite understand cryptohash's API for those. Used the following benchmark to arrive at the 1 mb number. 1 mb file: benchmarking sha256/internal mean: 13.86696 ms, lb 13.83010 ms, ub 13.93453 ms, ci 0.950 std dev: 249.3235 us, lb 162.0448 us, ub 458.1744 us, ci 0.950 found 5 outliers among 100 samples (5.0%) 4 (4.0%) high mild 1 (1.0%) high severe variance introduced by outliers: 10.415% variance is moderately inflated by outliers benchmarking sha256/external mean: 14.20670 ms, lb 14.17237 ms, ub 14.27004 ms, ci 0.950 std dev: 230.5448 us, lb 150.7310 us, ub 427.6068 us, ci 0.950 found 3 outliers among 100 samples (3.0%) 2 (2.0%) high mild 1 (1.0%) high severe 2 mb file: benchmarking sha256/internal mean: 26.44270 ms, lb 26.23701 ms, ub 26.63414 ms, ci 0.950 std dev: 1.012303 ms, lb 925.8921 us, ub 1.122267 ms, ci 0.950 variance introduced by outliers: 35.540% variance is moderately inflated by outliers benchmarking sha256/external mean: 26.84521 ms, lb 26.77644 ms, ub 26.91433 ms, ci 0.950 std dev: 347.7867 us, lb 210.6283 us, ub 571.3351 us, ci 0.950 found 6 outliers among 100 samples (6.0%) import Crypto.Hash import Data.ByteString.Lazy as L import Criterion.Main import Common testfile :: FilePath testfile = "/run/shm/data" -- on ram disk main = defaultMain [ bgroup "sha256" [ bench "internal" $ whnfIO internal , bench "external" $ whnfIO external ] ] sha256 :: L.ByteString -> Digest SHA256 sha256 = hashlazy internal :: IO String internal = show . sha256 <$> L.readFile testfile external :: IO String external = do s <- readProcess "sha256sum" [testfile] return $ fst $ separate (== ' ') s	2013-09-22 20:06:02 -04:00
Joey Hess	e8e209f4e5	better probing for gcrypt repositories using new --check option Now can tell if a repo uses gcrypt or not, and whether it's decryptable with the current gpg keys. This closes the hole that undecryptable gcrypt repos could have before been combined into the repo in encrypted mode.	2013-09-19 12:53:24 -04:00
Joey Hess	8062f6337f	webapp: support adding existing gcrypt special remotes from removable drives When adding a removable drive, it's now detected if the drive contains a gcrypt special remote, and that's all handled nicely. This includes fetching the git-annex branch from the gcrypt repo in order to find out how to set up the special remote. Note that gcrypt repos that are not git-annex special remotes are not supported. It will attempt to detect such a gcrypt repo and refuse to use it. (But this is hard to do any may fail; see https://github.com/blake2-ppc/git-remote-gcrypt/issues/6) The problem with supporting regular gcrypt repos is that we don't know what the gcrypt.participants setting is intended to be for the repo. So even if we can decrypt it, if we push changes to it they might not be visible to other participants. Anyway, encrypted sneakernet (or mailnet) is now fully possible with the git-annex assistant! Assuming that the gpg key distribution is handled somehow, which the assistant doesn't yet help with. This commit was sponsored by Navishkar Rao.	2013-09-18 15:55:31 -04:00
Joey Hess	6c35038643	gcrypt: Ensure that signing key is set to one of the participants keys. Otherwise gcrypt will fail to pull, since it requires this to be the case. This needs a patched gcrypt, which is in my forked version.	2013-09-17 16:06:29 -04:00
Joey Hess	5fe49b98f8	Support hot-swapping of removable drives containing gcrypt repositories. To support this, a core.gcrypt-id is stored by git-annex inside the git config of a local gcrypt repository, when setting it up. That is compared with the remote's cached gcrypt-id. When different, a drive has been changed. git-annex then looks up the remote config for the uuid mapped from the core.gcrypt-id, and tweaks the configuration appropriately. When there is no known config for the uuid, it will refuse to use the remote.	2013-09-12 15:54:35 -04:00
Joey Hess	b64f5baf2d	sync: support gcrypt	2013-09-09 10:02:15 -04:00
Joey Hess	ecbb326e9d	Allow building without quvi support.	2013-09-09 02:16:22 -04:00
Joey Hess	00fb5705ff	ignore gcrypt remotes w/o an annex-uuid	2013-09-08 15:19:14 -04:00
Joey Hess	3e079cdcd1	gcrypt: now supports rsync Use rsync for gcrypt remotes that are not local to the disk. (Note that I have punted on supporting http transport for now, it doesn't seem likely to be very useful.) This was mostly quite easy, it just uses the rsync special remote to handle the transfers. The git repository url is converted to a RsyncOptions structure, which required parsing it separately, since the rsync special remote only supports rsync urls, which use a different format. Note that annexed objects are now stored at the top of the gcrypt repo, rather than inside annex/objects. This simplified the rsync suport, since it doesn't have to arrange to create that directory. And git-annex is not going to be run directly within gcrypt repos -- or if in some strance scenario it was, it would make sense for it to not see the encrypted objects. This commit was sponsored by Sheila Miguez	2013-09-08 14:54:28 -04:00
Joey Hess	9477a07cbf	local gcrypt fully working!	2013-09-08 13:00:48 -04:00
Joey Hess	7c1a9cdeb9	partially complete gcrypt remote (local send done; rest not) This is a git-remote-gcrypt encrypted special remote. Only sending files in to the remote works, and only for local repositories. Most of the work so far has involved making initremote work. A particular problem is that remote setup in this case needs to generate its own uuid, derivied from the gcrypt-id. That required some larger changes in the code to support. For ssh remotes, this will probably just reuse Remote.Rsync's code, so should be easy enough. And for downloading from a web remote, I will need to factor out the part of Remote.Git that does that. One particular thing that will need work is supporting hot-swapping a local gcrypt remote. I think it needs to store the gcrypt-id in the git config of the local remote, so that it can check it every time, and compare with the cached annex-uuid for the remote. If there is a mismatch, it can change both the cached annex-uuid and the gcrypt-id. That should work, and I laid some groundwork for it by already reading the remote's config when it's local. (Also needed for other reasons.) This commit was sponsored by Daniel Callahan.	2013-09-07 18:38:00 -04:00
Joey Hess	a48a4e2f8a	automatically derive an annex-uuid from a gcrypt-uuids	2013-09-05 16:02:39 -04:00
Joey Hess	89eecd4b3b	rename constructor for clariy	2013-09-05 11:12:01 -04:00
guilhem	ac9807c887	Leverage an ambiguities between Ciphers Cipher is now a datatype data Cipher = Cipher String \| MacOnlyCipher String which makes more precise its interpretation MAC-only vs. MAC + used to derive a key for symmetric crypto.	2013-09-05 11:09:08 -04:00
Joey Hess	2b9f3cc175	tabs	2013-09-04 22:47:53 -04:00
Joey Hess	a51f1a4ee4	unimportant tweak fix something my internal haskell parser does a double take at	2013-09-04 22:39:25 -04:00
Joey Hess	930e6d22d6	replace an over-explained Bool with a data type This also highlights several places where a Read/Show or similar for the new data type could avoid redundant strings.	2013-09-04 22:18:33 -04:00
guilhem	3999a860eb	Encryption defaults to 'hybrid' When a keyid= is specified while encryption= is absent.	2013-09-04 21:34:33 -04:00
Joey Hess	1587fd42a3	fix build (seems getGpgEncOpts got renamed to getGpgEncParams)	2013-09-04 18:00:02 -04:00
guilhem	8293ed619f	Allow public-key encryption of file content. With the initremote parameters "encryption=pubkey keyid=788A3F4C". /!\ Adding or removing a key has NO effect on files that have already been copied to the remote. Hence using keyid+= and keyid-= with such remotes should be used with care, and make little sense unless the point is to replace a (sub-)key by another. /!\ Also, a test case has been added to ensure that the cipher and file contents are encrypted as specified by the chosen encryption scheme.	2013-09-03 14:34:16 -04:00
guilhem	53ce59021a	Allow revocation of OpenPGP keys. /!\ It is to be noted that revoking a key does NOT necessarily prevent the owner of its private part from accessing data on the remote /!\ The only sound use of `keyid-=` is probably to replace a (sub-)key by another, where the private part of both is owned by the same person/entity: git annex enableremote myremote keyid-=2512E3C7 keyid+=788A3F4C Reference: http://git-annex.branchable.com/bugs/Using_a_revoked_GPG_key/ * Other change introduced by this patch: New keys now need to be added with option `keyid+=`, and the scheme specified (upon initremote only) with `encryption=`. The motivation for this change is to open for new schemes, e.g., strict asymmetric encryption. git annex initremote myremote encryption=hybrid keyid=2512E3C7 git annex enableremote myremote keyid+=788A3F4C	2013-08-29 14:31:33 -04:00
Joey Hess	f8ebce9396	better cases	2013-08-22 23:36:35 -04:00
Joey Hess	c0d8064018	unimportant typo (u and u' happened to be the same)	2013-08-22 23:27:12 -04:00
Joey Hess	46b6d75274	Youtube support! (And 53 other video hosts) When quvi is installed, git-annex addurl automatically uses it to detect when an page is a video, and downloads the video file. web special remote: Also support using quvi, for getting files, or checking if files exist in the web. This commit was sponsored by Mark Hepburn. Thanks!	2013-08-22 18:50:43 -04:00
Joey Hess	a3224ce35b	avoid more build warnings on Windows	2013-08-04 14:05:36 -04:00
Joey Hess	38022f4f49	Windows: Fixed permissions problem that prevented removing files from directory special remote. Directory special remotes now fully usable.	2013-08-04 13:43:48 -04:00
Joey Hess	06db8e0bd9	squash compiler warnings on Windows	2013-08-04 13:18:05 -04:00
Joey Hess	93f2371e09	get rid of __WINDOWS__, use mingw32_HOST_OS The latter is harder for me to remember, but avoids build failures in code used by the configure program.	2013-08-02 12:27:32 -04:00
Joey Hess	ca9ac8770f	directory special remote: Fix checking that there is enough disk space to hold an object, was broken when using encryption.	2013-07-20 16:30:49 -04:00
Joey Hess	d2f40d3d76	Fix checking when content is present in a non-bare repository accessed via http. I thought at first this was a Windows specific problem, but it's not; this affects checking any non-bare repository exported via http. Which is a potentially important use case! The actual bug was the case where Right False was returned by the first url short-curcuited later checks. But the whole method used felt like code I'd no longer write, and the use of undefined was particularly disgusting. So I rewrote it. Also added an action display. This commit was sponsored by Eric Hanchrow. Thanks!	2013-07-18 14:20:57 -04:00
Joey Hess	ea6fdc745f	fix build on windows	2013-07-09 16:25:15 -04:00
Joey Hess	7e7b2daddf	Windows: Fix url to object when using a http remote. annexLocations uses OS-native directory separators, but for an url, it needs to use / even on Windows. This is an ugly workaround. Could parameterize a lot of stuff in annexLocations to fix it better. I suspect this is probably the only place it's needed though.	2013-07-07 13:35:56 -04:00
Oliver Matthews	acd1b88741	Strip leading /~/ from bup relatively pathed bup remotes	2013-06-21 09:28:43 +01:00
Joey Hess	8be3e9baa2	Merge branch 'glacier' Conflicts: debian/changelog	2013-06-11 10:34:55 -04:00
Joey Hess	a64106dcef	Supports indirect mode on encfs in paranoia mode, and other filesystems that do not support hard links, but do support symlinks and other POSIX filesystem features.	2013-06-10 13:11:33 -04:00
Joey Hess	88d2d59f83	glacier: Better handling of the glacier inventory, which avoids duplicate uploads to the same glacier repository by `git annex copy`. The checkpresent hook can return either True or, False, or fail with a message if it cannot successfully check the remote. Currently for glacier, when --trust-glacier is not set, it always returns False. Crucially, in the case when a file is in glacier, this is telling git-annex it's not there, so copy re-uploads it. This is not desirable; it breaks using glacier-cli to retreive that file later, and it wastes money/bandwidth. What if it instead, when the glacier inventory is missing a file, it returns False. And when the glacier inventory has a file, unless --trust-glacier is set, it fails. The result would be: * `git annex copy --to glacier` would only send things not listed in inventory. If a file is listed in the inventory, `copy` would complain that --trust-glacier` is not set, and not re-upload the file. * `git annex drop` would only trust that glacier has a file when --trust-glacier is set. Behavior unchanged. * `git annex move --to glacier`, when the file is not listed in inventory, would send the file, and delete it locally. Behavior unchanged. * `git annex move --to glacier`, when the file is listed in inventory, would only trust that glacier has the file when --trust-glacier is set * `git annex copy --from glacier` / `git annex get`, when the file is located in glacier, would trust the location log, and attempt to get the file from glacier.	2013-05-29 13:52:42 -04:00
Joey Hess	3b1aedea3d	Merge branch 'robustness'	2013-05-25 15:22:18 -04:00
Joey Hess	bf86b5ca16	improve robustness of fromDirect and replaceFile Made fromDirect check that a file in the tree has good content (and is not a broken symlink either) before copying it to another file that has the same key. Made replaceFile clean up the temp file if the action that creates it, or the file replacement action fails.	2013-05-25 15:06:02 -04:00
Joey Hess	e3c1586997	Improve error handling when getting uuid of http remotes to auto-ignore, like with ssh remotes.	2013-05-25 01:47:19 -04:00
Joey Hess	2dce874c77	hook special remote: Added combined hook program support.	2013-05-21 19:19:03 -04:00
Joey Hess	796c2f6bc8	remove unnecessary bracketIO	2013-05-19 18:15:29 -04:00
Joey Hess	667a832de9	print encryption setup message before action	2013-05-18 19:36:55 -04:00
Joey Hess	03eec12cff	fix	2013-05-14 13:58:17 -04:00
Joey Hess	17952a893e	fix imports	2013-05-14 13:53:29 -04:00
Joey Hess	1496342c9e	typo	2013-05-14 13:52:30 -04:00
Joey Hess	40a9d8e097	avoid running background transferinfo when ssh connection caching is not supported	2013-05-14 13:51:14 -04:00
Joey Hess	03a0f17fbb	deal with Cygwin rsync paths issue	2013-05-14 13:24:15 -04:00
Joey Hess	25a8d4b11c	rename module	2013-05-12 19:19:28 -04:00
Joey Hess	abe8d549df	fix permission damage (thanks, Windows)	2013-05-11 23:54:25 -04:00
Joey Hess	3c7e30a295	git-annex now builds on Windows (doesn't work)	2013-05-11 15:03:00 -05:00
Joey Hess	763cbda14f	fixup #if 0 stubs to use #ifndef mingw32_HOST_OS That's needed in files used to build the configure program. For the other files, I'm keeping my __WINDOWS__ define, as I find that much easier to type. I may search and replace it to use the mingw32_HOST_OS thing later.	2013-05-10 16:57:21 -05:00
Joey Hess	6c74a42cc6	stub out POSIX stuff	2013-05-10 16:29:59 -05:00
Joey Hess	f92eaf6315	rsync special remotes: When sending from a crippled filesystem, use the destination's default file permissions, as the local ones can be arbitrarily broken. (Ie, ----rwxr-x for files on Android)	2013-05-09 13:55:18 -04:00
Joey Hess	a0f6dab8de	When initializing a directory special remote with a relative path, the path is made absolute. Using a relative path would work, until the user changed to some other directory in the repo and tried to access the remote from there..	2013-05-06 17:15:36 -04:00
Joey Hess	543a78bae0	Support building with DAV 0.4.	2013-04-30 14:10:55 -04:00
Joey Hess	883b17af01	Store an annex-uuid file in the bucket when setting up a new S3 remote.	2013-04-27 17:01:24 -04:00
Joey Hess	c3498042fd	webapp: Now automatically fills in any creds used by an existing remote when creating a new remote of the same type. Done for Internet Archive, S3, Glacier, and Box.com remotes.	2013-04-27 15:16:06 -04:00
Joey Hess	3c7f4d2bd1	Automatically register public urls for files uploaded to the Internet Archive.	2013-04-25 17:28:25 -04:00
Joey Hess	e3ea36174b	webapp: Display some additional information about a repository on its edit page.	2013-04-25 16:42:17 -04:00
Joey Hess	3e396a3b89	S3: Dropping content from the Internet Archive doesn't work, but their API indicates it does. Always refuse to drop from there.	2013-04-25 15:20:31 -04:00
Joey Hess	8284b310a7	support enabling IA repositories	2013-04-25 13:14:49 -04:00
Joey Hess	4b1cf3d731	Detect when the remote is broken like bitbucket is, and exits 0 when it fails to run git-annex-shell.	2013-04-23 20:06:02 -04:00
Joey Hess	8a2d1988d3	expose Control.Monad.join I think I've been looking for that function for some time. Ie, I remember wanting to collapse Just Nothing to Nothing.	2013-04-22 20:24:53 -04:00
Joey Hess	8861e270be	sync, assistant: Sync with remotes that have annex-ignore set This is so git remotes on servers without git-annex installed can be used to keep clients' git repos in sync. This is a behavior change, but since annex-sync can be set to disable syncing with a remote, I think it's acceptable.	2013-04-22 14:57:09 -04:00
Joey Hess	b9904b0c42	fix tab damage	2013-04-13 19:26:59 -04:00
guilhem	a1eded8641	Allow rsync to use other remote shells. Introduced a new per-remote option 'annex-rsync-transport' to specify the remote shell that it to be used with rsync. In case the value is 'ssh', connections are cached unless 'sshcaching' is unset.	2013-04-13 19:26:24 -04:00
Joey Hess	9e11699c76	connect existing meters to the transfer log for downloads Most remotes have meters in their implementations of retrieveKeyFile already. Simply hooking these up to the transfer log makes that information available. Easy peasy. This is particularly valuable information for encrypted remotes, which otherwise bypass the assistant's polling of temp files, and so don't have good progress bars yet. Still some work to do here (see progressbars.mdwn changes), but this is entirely an improvement from the lack of progress bars for encrypted downloads.	2013-04-11 17:32:31 -04:00
Joey Hess	c511eb048f	changelog & minor style fixes	2013-04-06 16:14:57 -04:00
guilhem	00fc21bfec	Generate ciphers with a better entropy. Unless highRandomQuality=false (or --fast) is set, use Libgcypt's 'GCRY_VERY_STRONG_RANDOM' level by default for cipher generation, like it's done for OpenPGP key generation. On the assistant side, the random quality is left to the old (lower) level, in order not to scare the user with an enless page load due to the blocking PRNG waiting for IO actions.	2013-04-06 16:09:51 -04:00
Joey Hess	f1b0a4b404	Use lower case hash directories for storing files on crippled filesystems, same as is already done for bare repositories. * since this is a crippled filesystem anyway, git-annex doesn't use symlinks on it * so there's no reason to use the mixed case hash directories that we're stuck using to avoid breaking everyone's symlinks to the content * so we can do what is already done for all bare repos, and make non-bare repos on crippled filesystems use the all-lower case hash directories * which are, happily, all 3 letters long, so they cannot conflict with mixed case hash directories * so I was able to 100% fix this and even resuming `git annex add` in the test case will recover and it will all just work.	2013-04-04 15:46:33 -04:00
Joey Hess	8a5b397ac4	hlint	2013-04-03 03:52:41 -04:00
guilhem	55f0f858ee	Allow other MAC algorithms in the Remote Config.	2013-03-29 18:04:52 -04:00
Joey Hess	cf07a2c412	webapp: Progess bar fixes for many types of special remotes. There was confusion in different parts of the progress bar code about whether an update contained the total number of bytes transferred, or the number of bytes transferred since the last update. One way this bug showed up was progress bars that seemed to stick at zero for a long time. In order to fix it comprehensively, I add a new BytesProcessed data type, that is explicitly a total quantity of bytes, not a delta. Note that this doesn't necessarily fix every problem with progress bars. Particularly, buffering can now cause progress bars to seem to run ahead of transfers, reaching 100% when data is still being uploaded.	2013-03-28 17:04:37 -04:00
Joey Hess	449520a573	add globallyAvailable to remotes	2013-03-15 19:16:13 -04:00
Joey Hess	19c0a0d5b1	split cost out into its own module Added a function to insert a new cost into a list, which could be used to asjust costs after a drag and drop.	2013-03-13 16:30:34 -04:00
Joey Hess	f7de51e8b6	Bugfix: Fix bug in inode cache sentinal check, which broke copying to local repos if the repo being copied from had moved to a different filesystem or otherwise changed all its inodes'	2013-03-12 16:41:54 -04:00
guilhem	d2bc0e9f3e	GnuPG options for symmetric encryption.	2013-03-11 09:48:38 -04:00
Joey Hess	69ab9701eb	copyToRemote should return True when the remote already has the key This got broken in commit `e9238e9588`. I observed a key that had been copied to a remote, but the location log was out of date, and due to this bug, git annex transferkey failed and so the file could not be dropped when it was moved to an archive directory.	2013-03-10 17:54:27 -04:00
Joey Hess	56830af8d8	simpler use of MIN_VERSION checks	2013-03-10 15:43:17 -04:00
Joey Hess	ff6ce2bc15	print a warning message when garbage is received from configlist	2013-03-04 23:27:18 -04:00
Joey Hess	0c13d3065e	git subcommand cleanup Pass subcommand as a regular param, which allows passing git parameters like -c before it. This was already done in the pipeing set of functions, but not the command running set.	2013-03-03 13:39:07 -04:00
Joey Hess	b117efc19b	deal with http-conduit changing a data type Pity that the library does not provide a function to extract the status code from the StatusCodeException, so when they had to add a new field, it breaks every single place that does it.	2013-02-27 00:07:28 -04:00
Joey Hess	a7a1bcd1d6	Avoid passing -p to rsync, to interoperate with crippled filesystems. In general, git-annex does not try to preserve file permissions. For example, they don't round trip through special remotes. So it's ok to not preserve them for git remotes either. On crippled filesystems, rsync has been observed failing after the file was transferred because it couldn't set some permission or other.	2013-02-22 15:23:29 -04:00
Joey Hess	96613e85a9	build fix	2013-02-15 13:48:25 -04:00
Joey Hess	9e69fca5bb	optimise sending to encrypted rsync With an encrypted rsync remote, the encrpyted file can be renamed, rather than being copied, in crippled filesystem mode. This gets back to just as fast as non-crippled mode for this very common case.	2013-02-15 13:42:41 -04:00
Joey Hess	92b4a63a06	rsync special remote support for crippled filesystem mode Cannot make a hard link, have to copy. I did find a way to make it work without setting up a tree, just using --include and --exclude. But it needs the same hash directories to be used on both sides, which is normally not the case. Still, I hope one day I will convert non-bare repos to use the same hash dirs as everything else, and then this will get more efficient.	2013-02-15 13:33:36 -04:00
Joey Hess	47477b2807	crippled filesystem support, probing and initial support git annex init probes for crippled filesystems, and sets direct mode, as well as `annex.crippledfilesystem`. Avoid manipulating permissions of files on crippled filesystems. That would likely cause an exception to be thrown. Very basic support in Command.Add for cripped filesystems; avoids the lock down entirely since doing it needs both permissions and hard links. Will make this better soon.	2013-02-14 14:15:26 -04:00
Joey Hess	18a6935e42	safe recv-key in direct mode Checks the key's size and checksum. This is sorta expensive, but it avoids needing to add another round-trip to the protocol.	2013-01-11 16:03:45 -04:00
Joey Hess	fec55e742f	check for direct mode file change when copying from a local git remote Only missing direct mode transfer check now is git-annex shell recvkey.	2013-01-10 11:56:06 -04:00
Joey Hess	a6a5ed8121	check for direct mode file change when copying to a local git remote	2013-01-10 11:45:44 -04:00
Joey Hess	1bc49b7158	Special remotes now all rollback storage of keys that get modified during the transfer, which can happen in direct mode.	2013-01-09 18:42:29 -04:00
Joey Hess	909f67443f	Fix transferring files to special remotes in direct mode.	2013-01-06 14:29:01 -04:00
Joey Hess	24c6eae1b5	show errors	2013-01-02 13:50:16 -04:00
Joey Hess	4008590c68	type based git config handling for remotes Still a couple of places that use git config ad-hoc, but this is most of it done.	2013-01-01 13:58:14 -04:00
Joey Hess	ffdd08fd2e	Merge branch 'master' into desymlink	2012-12-13 00:46:10 -04:00
Joey Hess	0d50a6105b	whitespace fixes	2012-12-13 00:45:27 -04:00
Joey Hess	e7b8cb0063	direct mode committing	2012-12-12 19:20:38 -04:00
Joey Hess	b4c6da9cbd	Got object sending working in direct mode. However, I don't yet have a reliable way to deal with files being modified while they're being transferred. I have code that detects it on the sending side, but the receiver is still free to move the wrong content into its annex, and record that it has the content. So that's not acceptable, and I'll need to work on it some more. However, at this point I can use a direct mode repository as a remote and transfer files from and to it.	2012-12-08 17:03:39 -04:00
Joey Hess	5460414486	webdav: Avoid trying to set props, avoiding incompatability with livedrive.com. Needs DAV version 0.3.	2012-12-01 17:12:41 -04:00
Joey Hess	757f041cd8	instrument webdav test	2012-12-01 14:32:50 -04:00
Joey Hess	0b6c889012	webapp: S3 and Glacier forms now have a select list of all currently-supported AWS regions.	2012-12-01 14:11:37 -04:00
Joey Hess	020a25abe1	avoid unnecessary Maybe	2012-11-30 00:55:59 -04:00
Joey Hess	ea5d7292e6	dropping from web	2012-11-29 17:01:07 -04:00
Joey Hess	3b35cde0e8	assistant: Retrival from glacier now handled.	2012-11-29 15:23:33 -04:00
Joey Hess	8dd1d9aaf9	webapp: Defaults to sharing box.com account info with friends, allowing one-click enabling of the repository.	2012-11-28 13:31:49 -04:00
Joey Hess	5ff666ec99	rsync: Fix bug introduced in last release that broke encrypted rsync special remotes.	2012-11-27 16:29:31 -04:00
Joey Hess	356120652f	remove redundant showOutput calls. The meter code does that too.	2012-11-25 16:13:06 -04:00
Joey Hess	fb19d56476	progress bars for glacier downloads	2012-11-25 13:49:22 -04:00
Joey Hess	606c210378	progress bars for glacier uploads	2012-11-25 13:27:20 -04:00
Joey Hess	da6c738dad	adjust glacier remote cost to 1000 Higher than any other remote, this is mostly due to the long retrieval time, so it'd make sense to get a file from nearly any other remote. (Unless it's behind a very slow connection.)	2012-11-22 16:59:10 -04:00
Joey Hess	f53496830a	pass --quiet to checkpresent	2012-11-21 19:35:28 -04:00
Joey Hess	a5111a6d85	Amazon Glacier special remote; 100% working	2012-11-20 16:43:58 -04:00
Joey Hess	9221e62d87	Allow controlling whether login credentials for S3 and webdav are committed to the repository, by setting embedcreds=yes\|no when running initremote.	2012-11-19 17:32:58 -04:00
Joey Hess	f7a7ec4ebf	new storage regime implemented for webdav	2012-11-19 14:08:39 -04:00
Joey Hess	7b71685a93	Bugfix: directory special remote could loop forever storing a key when a too small chunksize was configured. Ensure that each file has something written to it, even if the bytestring chunk size is greater than the configured chunksize. This means we may write a bit larger than the configured value, but only when the configured value is very small; ie, < 8 kb.	2012-11-19 13:30:58 -04:00
Joey Hess	5f977cc725	directory special remote: Made more efficient and robust. Files are now written to a tmp directory in the remote, and once all chunks are written, etc, it's moved into the final place atomically. For now, checkpresent still checks every single chunk of a file, because the old method could leave partially transferred files with some chunks present and others not.	2012-11-19 13:18:23 -04:00
Joey Hess	d3dfeeb3d9	remove annex/ from key locations used for webdav	2012-11-18 23:59:39 -04:00
Joey Hess	7df1e71fe3	S3: Added progress display for uploading and downloading.	2012-11-18 22:49:07 -04:00
Joey Hess	b0e08ae457	S3: upload progress display	2012-11-18 22:20:43 -04:00
Joey Hess	e2b7fc1ebd	refactor	2012-11-18 21:50:16 -04:00
Joey Hess	afa2f9c967	upload progress bars for webdav!	2012-11-18 20:30:05 -04:00
Joey Hess	c8751be151	simplify	2012-11-18 18:27:53 -04:00
Joey Hess	81379bb29c	better streaming while encrypting/decrypting Both the directory and webdav special remotes used to have to buffer the whole file contents before it could be decrypted, as they read from chunks. Now the chunks are streamed through gpg with no buffering.	2012-11-18 15:27:44 -04:00
Joey Hess	3607c92222	fix warning	2012-11-18 14:06:54 -04:00
Joey Hess	8a6941a216	fix build with xml-conduit newer than in debian The Element data type changed to use a map of attributes. Rather than ifdef, I'm avoiding directly using that data type.	2012-11-18 13:46:38 -04:00
Joey Hess	7addb89dc1	webapp: support box.com	2012-11-17 15:30:11 -04:00
Joey Hess	1fe76b57d6	webdav now checks presence of and receives chunked content Note that receiving encrypted chunked content currently involves buffering. (So does doing so with the directory special remote.)	2012-11-16 23:16:18 -04:00
Joey Hess	0b3126a30b	back to standard directory layout for webdav remotes This allows deleting all chunks for a file with a single http command, so it's a win after all. However, does not look in the mixed case hash directories, which were in the past used by the directory, etc remotes.	2012-11-16 18:14:07 -04:00
Joey Hess	a1869ad662	webdav now supports sending chunked content Not yet getting it though.	2012-11-16 17:58:58 -04:00
Joey Hess	92d5d81c2c	generic chunked content helper However, directory still uses its optimzed chunked file writer, as it uses less memory than the generic one in the helper.	2012-11-16 17:58:08 -04:00
Joey Hess	0f782bd028	encrypted webdav working	2012-11-16 13:57:32 -04:00
Joey Hess	bb28c6114a	drop webdav compatability with the directory special remote etc The benefit of using a compatable directory structure does not outweigh the cost in complexity of handling the multiple locations content can be stored in directory special remotes. And this also allows doing away with the parent directories, which can't be made unwritable in DAV, so have no benefit there. This will save 2 http calls per file store. But, kept the directory hashing, just in case.	2012-11-16 00:42:33 -04:00
Joey Hess	a4b86c63d6	webdav is fully working in non-enctypted mode	2012-11-16 00:09:22 -04:00
Joey Hess	3c039d329c	update to dav 0.1, and basic uploading is working!	2012-11-15 13:46:16 -04:00
Joey Hess	0cba0cb2dd	skeltal webdav special remote Doesn't actually store anything yet, but initremote works and tests the server.	2012-11-14 20:25:31 -04:00
Joey Hess	e250f6f11f	factor out Creds	2012-11-14 19:32:27 -04:00
Joey Hess	2172cc586e	where indenting	2012-11-11 00:51:07 -04:00
Joey Hess	6eca362c5d	indentation foo, and a new coding style page. no code changes	2012-10-28 21:27:15 -04:00
Joey Hess	9767562f65	rsync special remote: Include annex-rsync-options when running rsync to test a key's presence. Also, use the new withQuietOutput function to avoid running the shell to /dev/null stderr in two other places.	2012-10-28 13:51:14 -04:00
Joey Hess	7ee0ffaeb9	Use USER and HOME environment when set, and only fall back to getpwent, which doesn't work with LDAP or NIS.	2012-10-25 18:17:54 -04:00
Joey Hess	ee24f23ecb	fix build	2012-10-24 10:54:58 -04:00
Joey Hess	8b1235b022	bup: Don't pass - to bup-split to make it read stdin bup 0.25 does not accept that; and bup split reads from stdin by default if no file is given. I'm not sure what version of bup changed this. This only affected bup special remotes that were encrypted.	2012-10-23 16:01:02 -04:00
Joey Hess	452e6819d0	!! removal	2012-10-21 00:51:42 -04:00
Joey Hess	14b376d440	Merge branch 'safesemaphore' Conflicts: debian/changelog git-annex.cabal	2012-10-20 12:44:25 -04:00
Joey Hess	e290f1b903	Automatically detect when a ssh remote does not have git-annex-shell installed, and set annex-ignore. Aka solve the github problem. Note that it's possible the initial configlist will fail for some network reason etc, and then the fetch succeeds. In this case, a usable remote gets disabled. But it does print a message, and this only happens once per remote, so that seems ok.	2012-10-12 13:45:14 -04:00
Ben Gamari	179aeeaacc	Remote/Git: Use SampleVar from SafeSemaphore instead of base SampleVars from base are unsafe	2012-10-05 17:03:58 -04:00
Joey Hess	47314c0fad	fix last zombies in the assistant Made Git.LsFiles return cleanup actions, and everything waits on processes now, except of course for Seek.	2012-10-04 19:56:32 -04:00
Joey Hess	de3ea4adb6	remove now-unnecessary manual reaps	2012-10-04 18:58:57 -04:00
Joey Hess	f18a53eec0	change s3 creds caching Rather than store decrypted creds in the environment, store them in the creds cache file. This way, a single git-annex can have multiple S3 remotes using different creds.	2012-09-26 14:42:51 -04:00
Joey Hess	e4bf74a965	store S3 creds in a 600 mode file inside the local git repo	2012-09-26 14:42:32 -04:00
Joey Hess	df07ccf404	make the assistant retry failed transfers When a transfer fails, the progress info can be used to intelligently retry it. If the transfer managed to make some progress, but did not fully complete, then there's a good chance that a retry will finish it (or at least make more progress).	2012-09-23 13:27:13 -04:00
Joey Hess	c048add74d	hooked up git-annex-shell transferinfo Finally done with progressbars!	2012-09-21 23:25:06 -04:00
Joey Hess	1722c23f56	fix logic error introduced yesterday	2012-09-21 20:24:08 -04:00
Joey Hess	ff32ee5152	upload progress tracking for the directory special remote	2012-09-21 14:54:24 -04:00
Joey Hess	226781c047	unify types	2012-09-21 14:50:14 -04:00
Joey Hess	2ae38325d5	hook rsync special remote up to the progress reporting Easy! Note that with an encrypted remote, rsync will be sending a little more data than the key size, so displayed progress may get to 100% slightly quicker than it should. I doubt this is a big enough effect to worry about.	2012-09-20 13:51:51 -04:00
Joey Hess	19e35f7f0d	upload progress bar for git remote on same filesystem cp is used here, but we can just watch the size of the destination file This commit made from within the ruins of an old mill, overlooking a beautiful waterfall.	2012-09-20 13:35:53 -04:00
Joey Hess	e1037adebc	rsync progress interception Current implementation parses rsync's output a character a time, which is hardly efficient. It could be sped up a lot by using hGetBufSome, but that would require going really lowlevel, down to raw C style buffers (good example of that here: http://users.aber.ac.uk/afc/stricthaskell.html) But rsync doesn't output very much, so currently it seems ok.	2012-09-19 16:55:08 -04:00
Joey Hess	aff09a1f33	add a progress callback to storeKey, and threaded it all the way through Transfer info files are updated when the callback is called, updating the number of bytes transferred. Left unused p variables at every place the callback should be used. Which is rather a lot..	2012-09-19 16:08:37 -04:00
Joey Hess	45a26175d6	renamed RsyncFile -> Rsync	2012-09-19 14:28:32 -04:00
Joey Hess	e9238e9588	avoid starting a download for a local transfer when the remote already has the key Turns out that recvkey already does this same check. This avoids a transfer file being created for the download that never happened, which in turn will avoid the assistant seeing that the download has finished, when no transfer actually took place.	2012-09-18 13:59:03 -04:00
Joey Hess	beaecce68b	git http:// remotes are readonly too	2012-08-26 15:53:31 -04:00
Joey Hess	271ea49978	add support for readonly remotes Currently only the web special remote is readonly, but it'd be possible to also have readonly drives, or other remotes. These are handled in the assistant by only downloading from them, and never trying to upload to them.	2012-08-26 15:39:02 -04:00
Joey Hess	f4ca592cd0	refactor	2012-08-26 14:34:30 -04:00
Joey Hess	78d3add86b	tweak field name	2012-08-26 14:26:43 -04:00
Joey Hess	b818337054	fix build warning	2012-08-16 16:48:27 -07:00
Joey Hess	cbca93cf7c	Merge branch 'master' into assistant Conflicts: debian/changelog	2012-08-16 16:36:32 -07:00
Joey Hess	ad4e152fd6	S3: Add fileprefix setting.	2012-08-09 13:54:54 -04:00
Joey Hess	94fcd0cf59	add routes to pause/start/cancel transfers This commit includes a paydown on technical debt incurred two years ago, when I didn't know that it was bad to make custom Read and Show instances for types. As the routes need Read and Show for Transfer, which includes a Key, and deriving my own Read instance of key was not practical, I had to finally clean that up. So the compact Key read and show functions are now file2key and key2file, and Read and Show are now derived instances. Changed all code that used the old instances, compiler checked. (There were a few places, particularly in Command.Unused, and the test suite where the Show instance continue to be used for legitimate comparisons; ie show key_x == show key_y (though really in a bloom filter))	2012-08-08 16:20:24 -04:00
Joey Hess	cb0f435d94	adding removable drive repos now basically works	2012-08-05 14:49:47 -04:00
Joey Hess	4ec9244f1a	add a path field to remotes Also broke out some helper functions around constructing remotes, to be used later.	2012-07-22 14:30:43 -04:00
Joey Hess	1db7d27a45	add back debug logging Make Utility.Process wrap the parts of System.Process that I use, and add debug logging to them. Also wrote some higher-level code that allows running an action with handles to a processes stdin or stdout (or both), and checking its exit status, all in a single function call. As a bonus, the debug logging now indicates whether the process is being run to read from it, feed it data, chat with it (writing and reading), or just call it for its side effect.	2012-07-19 00:46:52 -04:00
Joey Hess	d1da9cf221	switch from System.Cmd.Utils to System.Process Test suite now passes with -threaded! I traced back all the hangs with -threaded to System.Cmd.Utils. It seems it's just crappy/unsafe/outdated, and should not be used. System.Process seems to be the cool new thing, so converted all the code to use it instead. In the process, --debug stopped printing commands it runs. I may try to bring that back later. Note that even SafeSystem was switched to use System.Process. Since that was a modified version of code from System.Cmd.Utils, it needed to be converted too. I also got rid of nearly all calls to forkProcess, and all calls to executeFile, which I'm also doubtful about working well with -threaded.	2012-07-18 18:00:24 -04:00
Joey Hess	81b20a581a	avoid --no-inplace Not available on systems with shoddy getopts. Should not be necessary, as that's rsync's default.	2012-07-10 12:40:31 -06:00
Joey Hess	760e028dca	pass associatedfile and remoteuuid to git-annex-shell This almost works. Along the way, I noticed that the --uuid parameter was being accidentially passed after the --, so that has never been actually used by git-annex-shell to verify it's running in the expected repository. Oops. Fixed.	2012-07-02 10:57:51 -04:00
Joey Hess	7225c2bfc0	record transfer information on local git remotes In order to record a semi-useful filename associated with the key, this required plumbing the filename all the way through to the remotes' storeKey and retrieveKeyFile. Note that there is potential for deadlock here, narrowly avoided. Suppose the repos are A and B. A sends file foo to B, and at the same time, B gets file foo from A. So, A locks its upload transfer info file, and then locks B's download transfer info file. At the same time, B is taking the two locks in the opposite order. This is only not a deadlock because the lock code does not wait, and aborts. So one of A or B's transfers will be aborted and the other transfer will continue. Whew!	2012-07-01 17:15:11 -04:00
Joey Hess	29335bf326	pointlessness	2012-06-29 10:00:05 -04:00
Joey Hess	6aee7e5a8b	Better fix for unavailable local remotes Not including such remotes turned out to have other consequences, including annex-truselevel git config being ignored. Instead, add guards before each operation that might try to operate on such a repo.	2012-06-26 22:27:30 -04:00
Joey Hess	7e62e57f8c	Avoid ugly failure mode when moving content from a local repository that is not available. Prelude.undefined error message was introduced by `bb4f31a0ee`. It seems best to filter out local repositories that cannot be accessed from the list of remotes, rather than keeping them in and making every thing that uses the list have to deal with remotes that may have an unknown location. Besides fixing the error message, this also makes unavailable local remotes' names not be shown in various messages, including in git annex status output. Also, move --to an unavailable local repository now avoids some ugly errors like "changeWorkingDirectory: does not exist".	2012-06-26 17:22:44 -04:00
Joey Hess	75b6ee81f9	avoid ByteString.Char8 where not needed Its truncation behavior is a red flag, so avoid using it in these places where only raw ByteStrings are used, without looking at the data inside.	2012-06-20 13:13:40 -04:00
Joey Hess	e0095b0bdc	fishy commit	2012-06-14 00:01:48 -04:00
Joey Hess	5809f33f8b	use createAnnexDirectory when setting up tmp dir	2012-06-05 20:25:32 -04:00
Joey Hess	13118136c0	Preserve parent environment when running hooks of the hook special remote.	2012-06-04 21:52:36 -04:00
Joey Hess	37ef39c929	suppress "(Recording state in git)" message when committing change to remote state This was shown redundantly for a tricky reason -- while it runs inside a doSideAction block that would appear to supress it, the action being run is in a different state monad; for the remote, and so the suppression doesn't work. Always suppressing the message when committing to a local remote is ok do to though -- it mirrors the /dev/nulling of the git annex shell commit output. And it turns out that any time there is a git-annex branch state change to commit on the remote, the local repo has also had a similar change made, and so the message has been shown already.	2012-05-20 00:14:56 -04:00
Joey Hess	eb6cb1b87f	Add support for core.worktree, and fix support for GIT_WORK_TREE and GIT_DIR. The environment needs to override git-config. Changed when git config is read, and avoid rereading it once it's been read. chdir for both worktree settings.	2012-05-18 18:20:53 -04:00
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	f7d8982672	Fix use of several config settings annex.ssh-options, annex.rsync-options, annex.bup-split-options. And adjust types to avoid the bugs that broke several config settings recently. Now "annex." prefixing is enforced at the type level.	2012-05-05 20:16:56 -04:00
Joey Hess	6d61067599	rsync shellescape disable option Rsync special remotes can be configured with shellescape=no to avoid shell quoting that is normally done when using rsync over ssh. This is known to be needed for certian rsync hosting providers (specificially hidrive.strato.com) that use rsync over ssh but do not pass it through the shell.	2012-05-02 13:08:33 -04:00
Joey Hess	bd592d1450	refactor	2012-04-29 14:33:07 -04:00
Joey Hess	1c16f616df	Added shared cipher mode to encryptable special remotes. This option avoids gpg key distribution, at the expense of flexability, and with the requirement that all clones of the git repository be equally trusted.	2012-04-29 14:02:43 -04:00
Joey Hess	84ac8c58db	Add annex.httpheaders and annex.httpheader-command config settings Allow custom headers to be sent with all HTTP requests. (Requested by the Internet Archive)	2012-04-22 01:13:09 -04:00
Joey Hess	ed79596b75	noop	2012-04-21 23:32:33 -04:00
Joey Hess	bee420bd2d	in which I discover void void :: Functor f => f a -> f () -- ah, of course that's useful :)	2012-04-21 23:06:19 -04:00
Joey Hess	b98b69e8c6	honor core.sharedRepository when making all the other files in the annex Lock files, directories, etc.	2012-04-21 19:36:03 -04:00
Joey Hess	5cc76098ca	Directory special remotes now check annex.diskreserve.	2012-04-20 16:24:44 -04:00
Joey Hess	aa353d1400	use LANGUAGE CPP pragma, avoids running cpp on all the other sources	2012-04-17 18:37:40 -04:00
Joey Hess	626697b459	cabal file now autodetects whether S3 support is available.	2012-04-14 14:22:33 -04:00
Joey Hess	c924542e61	bup: Properly handle key names with spaces or other things that are not legal git refs. Continue using the key name as bup ref name, to preserve backwards compatability, unless it is an illegal git ref. In that case, use a sha256 of the key name instead.	2012-04-11 12:45:49 -04:00
Joey Hess	4eb5112681	rationalize getConfig getConfig got a remote-specific config, and this confusing name caused it to be used a couple of places that only were interested in global configs. Rename to getRemoteConfig and make getConfig only get global configs. There are no behavior changes here, but remote.<name>.annex-web-options never actually worked (and per-remote web options is a very unlikely to be useful case so I didn't make it work), so fix the documentation for it.	2012-03-22 17:32:47 -04:00
Joey Hess	a362c46b70	fun with symbols Nothing at all on hackage is using <&&> or <\|\|>. (Also, <&&> should short-circuit on failure.)	2012-03-17 00:38:40 -04:00
Joey Hess	c0c9991c9f	nukes another 15 lines thanks to ifM	2012-03-15 20:39:25 -04:00
Joey Hess	b27760aa68	Work around a bug in rsync (IMHO) introduced by openSUSE's SIP patch. openSUSE patches rsync with a patch adding SIP protocol support. https://gist.github.com/2026167 With this patch, running rsync with no hostname parameter is apparently supposed to list SIP hosts on the network. Practically, it does nothing and exits 0. git-annex uses rsync in a very special way to allow git-annex-shell to be run on the remote host, and so did not need to specify a hostname, or a file to transfer as a rsync parameter. So it sent ":", a degenerate case of "host:file". But the patch cannot differentiate ":" with no host parameter (a bug in the SIP patch surely). Results were that getting files failed, as rsync seemed to succeed, but the requested file failed to arrive. Also I think that sending files will make git-annex think a file has been transferred to the remote when really rsync does nothing. The workaround for this buggy rsync patch is to use "dummy:" as the hostname.	2012-03-12 22:53:43 -04:00
Joey Hess	52e88f3ebf	add remote start and stop hooks Locking is used, so that, if there are multiple git-annex processes using a remote concurrently, the stop hook is only run by the last process that uses it.	2012-03-04 19:12:58 -04:00
Joey Hess	3960825cef	better chunked file retrieval Avoids opening every chunk at once, instead streaming them in. Not done for encrypted file retrieval yet.	2012-03-04 11:48:23 -04:00
Joey Hess	7ba79cfb8c	thread through original key to retrieveEnctypted Allows showing progress bar for this last case of the directory special remote.	2012-03-04 03:36:39 -04:00
Joey Hess	4638314001	add progress display when receiving files That was actually really easy. But, when getting a file from an encrypted directory special remote, no meter can be shown, because the total file size is not known.	2012-03-04 03:25:41 -04:00
Joey Hess	9856c24a59	Add progress bar display to the directory special remote. So far I've only written progress bars for sending files, not yet receiving. No longer uses external cp at all. ByteString IO is fast enough.	2012-03-04 03:17:25 -04:00
Joey Hess	50c897c082	tweak	2012-03-03 20:02:48 -04:00
Joey Hess	3436aba6de	Directory special remotes now support chunking files written to them Avoiding writing files larger than a specified size is useful on certian things. For example, box.com has a file size limit of 100 mb. Could also be useful on really crappy removable media.	2012-03-03 18:05:55 -04:00
Joey Hess	c3fbe07d7a	do a cleanup commit after moving data from or to a git remote Added Annex.cleanup, which is a general purpose interface for adding actions to run at the end. Remotes with the old git-annex-shell will commit every time, and have no commit command, so hide stderr when running the commit command.	2012-02-25 18:02:49 -04:00
Joey Hess	cb631ce518	whereis: Prints the urls of files that the web special remote knows about.	2012-02-14 03:49:48 -04:00
Joey Hess	8fbc529d68	oops	2012-02-14 03:10:01 -04:00
Joey Hess	cbaebf538a	rework git check-attr interface Now gitattributes are looked up, efficiently, in only the places that really need them, using the same approach used for cat-file. The old CheckAttr code seemed very fragile, in the way it streamed files through git check-attr. I actually found that `cad8824852` was still deadlocking with ghc 7.4, at the end of adding a lot of files. This should fix that problem, and avoid future ones. The best part is that this removes withAttrFilesInGit and withNumCopies, which were complicated Seek methods, as well as simplfying the types for several other Seek methods that had a Backend tupled in.	2012-02-13 23:52:21 -04:00
Joey Hess	9030f68452	When checking that an url has a key, verify that the Content-Length, if available, matches the size of the key. If there's no Content-Length, or the key has no size, this check is not done, but it should happen most of the time, and protect against web content that has changed.	2012-02-10 19:23:41 -04:00
Joey Hess	57a747d081	S3: Fix irrefutable pattern failure when accessing encrypted S3 credentials.	2012-02-08 11:41:15 -04:00
Joey Hess	b9b72d22a9	refactor Wow, triple monadic lift!	2012-02-07 01:40:14 -04:00
Joey Hess	146c36ca54	IO exception rework ghc 7.4 comaplains about use of System.IO.Error to catch exceptions. Ok, use Control.Exception, with variants specialized to only catch IO exceptions.	2012-02-03 16:47:24 -04:00
Joey Hess	775958b4dc	faster local-local dropping Dropping a key from a local remote ran git-annex-shell unnecessarily. Now git-annex-shell is never used when acting on a local remote.	2012-01-28 16:00:20 -04:00
Joey Hess	b81d662cbf	Avoid repeated location log commits when a remote is receiving files. Done by adding a oneshot mode, in which location log changes are written to the journal, but not committed. Taking advantage of git-annex's existing ability to recover in this situation. This is used by git-annex-shell and other places where changes are made to a remote's location log.	2012-01-28 15:41:52 -04:00
Joey Hess	303666965a	Revert "Avoid creating ~/.bup when initializing a bup remote" This reverts commit `6da40100c9`. On closer examinaton, this change is wrong. The bup special remote can be configured with "buprepo=", which makes it use the default ~/.bup repo. This change makes it use a different temp dir each time, which I'm sure would not be appreciated by anyone with that configuration. Bup insisting in creating ~/.bup even when using a different repo does seem like a bug in something, but I'm leaning toward the bug being in bup itself.	2012-01-28 15:23:28 -04:00
Lauri Alanko	6da40100c9	Avoid creating ~/.bup when initializing a bup remote	2012-01-26 01:11:57 -04:00
Joey Hess	ce5637498f	remove Utility.Conditional and use IfElse This drops the >>! and >>? with the nice low fixity. IfElse does have undocumented >>=>>! and >>=>>? operators, but I deem that too fishy. Anyway, using whenM and unlessM is easier; I sometimes mixed the operators up.	2012-01-24 16:22:07 -04:00
Joey Hess	eb9001044f	order user provided params after connection caching params So the user can override them.	2012-01-20 17:32:32 -04:00
Joey Hess	47250a153a	ssh connection caching Ssh connection caching is now enabled automatically by git-annex. Only one ssh connection is made to each host per git-annex run, which can speed some things up a lot, as well as avoiding repeated password prompts. Concurrent git-annex processes also share ssh connections. Cached ssh connections are shut down when git-annex exits. Note: The rsync special remote does not yet participate in the ssh connection caching.	2012-01-20 17:14:56 -04:00
Joey Hess	61dbad505d	fsck --from remote --fast Avoids expensive file transfers, at the expense of checking file size and/or contents. Required some reworking of the remote code.	2012-01-20 13:23:11 -04:00
Joey Hess	effaa298fa	optimise fsck --from normal git remotes For a local git remote, can symlink the file. For a git remote using rsync, can preseed any local content. There are a few reasons to use fsck --from on a normal git remote. One is if it's using gitosis or similar, and you don't have shell access to run git annex locally. Another reason could be if you just want to fsck certian files of a bare remote.	2012-01-19 17:10:44 -04:00
Joey Hess	71cb04bb6d	optimize fsck --from directory special remote No need to copy anything, just symlink to the file.	2012-01-19 16:14:40 -04:00
Joey Hess	06b0cb6224	add tmp flag parameter to retrieveKeyFile	2012-01-19 16:07:36 -04:00
Joey Hess	94aa6b42b5	optimise fsck --from rsync special remote When a file is present locally, the remote's version can be rsynced to a copy of it, which will avoid wasting a lot of bandwidth.	2012-01-19 15:49:55 -04:00
Joey Hess	f161b5eb59	Fix data loss bug in directory special remote When moving a file to the remote failed, and partially transferred content was left behind in the directory, re-running the same move would think it succeeded and delete the local copy. I reproduced data loss when moving files to a partition that was almost full. Interrupting a transfer could have similar results. Easily fixed by using a temp file which is then moved atomically into place once the transfer completes. I've audited other calls to copyFileExternal, and other special remote file transfer code; everything else seems to use temp files correctly (rsync, git), or otherwise use atomic transfers (bup, S3).	2012-01-16 16:28:15 -04:00
Joey Hess	16e7178f20	reorg	2012-01-10 15:29:10 -04:00
Joey Hess	07cacbeee9	break module dependancy loop A PITA but worth it to clean up the trust configuration code.	2012-01-10 13:32:38 -04:00
Joey Hess	f534fcc7b1	remove S3stub stuff Let's keep that in a no-s3 branch, which can be merged into eg, debian-stable.	2012-01-05 23:14:10 -04:00
Joey Hess	c371c40a88	Don't list S3 as a remote type when built without S3 support.	2012-01-05 23:11:07 -04:00
Joey Hess	ee554542c1	after is a better name for observe_	2012-01-03 00:29:27 -04:00
Joey Hess	fc80b8d96b	factor observe_	2012-01-03 00:11:00 -04:00
Joey Hess	aa0882691b	Added remote.name.annex-web-options configuration setting, which can be used to provide parameters to whichever of wget or curl git-annex uses (depends on which is available, but most of their important options suitable for use here are the same).	2012-01-02 14:20:20 -04:00
Joey Hess	f0957426c5	skip local remotes that are not available (ie, not mounted) With --fast, unavailable local remotes are filtered out of the fast set. This way, if there are local remotes, --fast always acts only on them, and if none are mounted, acts on nothing. This consistency is better than --fast acting on different remotes depending on what's mounted.	2011-12-31 04:50:39 -04:00
Joey Hess	4a02c2ea62	type alias cleanup	2011-12-31 04:11:58 -04:00
Joey Hess	8a33573caf	better filtering out of special remotes	2011-12-31 03:27:37 -04:00
Joey Hess	20482712d0	Improve deletion of files from rsync special remotes. Closes: #652849 Rsync is only run once, with include / exclude rules used to specify exactly what to delete. This is faster, and avoids ugly error messages from rsync, and doesn't fail if the content already got deleted somehow.	2011-12-21 16:57:03 -04:00
Joey Hess	da0bdc1a57	Fix the hook special remote, which bitrotted a while ago.	2011-12-20 12:23:49 -04:00
Joey Hess	95d2391f58	more partial function removal Left a few Prelude.head's in where it was checked not null and too hard to remove, etc.	2011-12-15 18:19:36 -04:00
Joey Hess	09cd042775	Properly handle multiline git config values. A crash on parsing was fixed a while ago. This adds support for fully correctly parsing multiline git config values, using git config --null. Since git-annex-shell configlist uses normal git config output, I left in support for that too; the two forms of config output can be easily identified by the parser. Since configlist only prints the annex.uuid config, there's no risk of multiline values there, so no need to change it.	2011-12-15 12:48:27 -04:00
Joey Hess	ef28b3fef7	split out Git/Command.hs	2011-12-14 15:56:11 -04:00
Joey Hess	02f1bd2bf4	split more stuff out of Git.hs	2011-12-14 15:43:13 -04:00
Joey Hess	13fff71f20	split out three modules from Git Constructors and configuration make sense in separate modules. A separate Git.Types is needed to avoid cycles.	2011-12-13 15:06:49 -04:00
Joey Hess	98dfc0c9b0	split out Annex/BranchState.hs	2011-12-12 17:38:46 -04:00
Joey Hess	c7e65bbb12	optimiation avoids reading the config of a local remote twice in a row	2011-12-12 02:24:37 -04:00
Joey Hess	f44f715f51	ensure local remote is initialized when copying to it Needed due to this scenario: Bare repo origin is made, foo is cloned from it; foo is initalized; a file is added to foo's annex; git annex move --to origin Since the git-annex branch has not yet been pushed to origin, it doesn't auto-initialize. When the content is sent to it, it's stored, but the remote has NoUUID, and so nothing is logged in the location log. Then the content is removed from the local repo, and git-annex has lost track of it. git annex fsck in origin will find the lost content, but let's not let this happen. Content should only be sent to initalized remotes. This cannot happen for non-local remotes, since git-annex-shell always checks that the repo is initialized.	2011-12-10 19:54:20 -04:00
Joey Hess	9ba99a544b	update	2011-12-10 18:51:01 -04:00
Joey Hess	d64132a43a	hslint	2011-12-09 01:57:13 -04:00
Joey Hess	e3f1568e0f	Fix caching of decrypted ciphers, which failed when drop had to check multiple different encrypted special remotes.	2011-12-08 16:01:46 -04:00
Joey Hess	64672c6262	refactor	2011-12-03 09:10:23 -04:00
Joey Hess	e19dc85547	factor out untilTrue	2011-12-02 16:12:31 -04:00
Joey Hess	fb68a7881f	convert rsync special backend to using both hash directory types	2011-12-02 15:50:27 -04:00
Joey Hess	db5b479f3f	use lowercase hash by default; non-bare repos are a special case Directory special remotes will now always store keys in the lowercase name, which avoids the complication of catching failures to create the mixed case name. Git remotes using http will now try the lowercase name first.	2011-12-02 14:56:48 -04:00
Joey Hess	0815cc2fc1	refactor	2011-12-02 14:47:59 -04:00
Joey Hess	bff6ca2634	refactor	2011-11-28 23:20:31 -04:00
Joey Hess	da9cd315be	add support for using hashDirLower in addition to hashDirMixed Supporting multiple directory hash types will allow converting to a different one, without a flag day. gitAnnexLocation now checks which of the possible locations have a file. This means more statting of files. Several places currently use gitAnnexLocation and immediately check if the returned file exists; those need to be optimised.	2011-11-28 22:43:51 -04:00
Joey Hess	75a590bdd8	Put a workaround in the directory special remote for strange behavior with VFAT filesystems on Linux (mounted with shortname=mixed)	2011-11-22 18:21:28 -04:00
Joey Hess	1326bb8635	Avoid excessive escaping for rsync special remotes that are not accessed over ssh. This is actually tricky, `45bbf210a1` added the escaping because it's needed for rsync that does go over ssh. So I had to detect whether the remote's rsync url will use ssh or not, and vary the escaping.	2011-11-18 12:53:48 -04:00
Joey Hess	637b5feb45	lint	2011-11-11 01:52:58 -04:00
Joey Hess	49d2177d51	factored out some useful error catching methods	2011-11-10 20:57:28 -04:00
Joey Hess	d3e1a3619f	safer inannex checking git-annex-shell inannex now returns always 0, 1, or 100 (the last when it's unclear if content is currently in the index due to it currently being moved or dropped). (Actual locking code still not yet written.)	2011-11-09 18:33:15 -04:00
Joey Hess	56b8194470	cleanup	2011-11-09 01:33:20 -04:00
Joey Hess	bf460a0a98	reorder repo parameters last Many functions took the repo as their first parameter. Changing it consistently to be the last parameter allows doing some useful things with currying, that reduce boilerplate. In particular, g <- gitRepo is almost never needed now, instead use inRepo to run an IO action in the repo, and fromRepo to get a value from the repo. This also provides more opportunities to use monadic and applicative combinators.	2011-11-08 16:27:20 -04:00
Joey Hess	b11a63a860	clean up read/show abuse Avoid ever using read to parse a non-haskell formatted input string. show :: Key is arguably still show abuse, but displaying Keys as filenames is just too useful to give up.	2011-11-08 00:17:54 -04:00
Joey Hess	63a292324d	add a UUID type Should have done this a long time ago.	2011-11-07 15:59:16 -04:00
Joey Hess	aae0417d94	Don't try to read config from repos with annex-ignore set.	2011-11-07 11:50:30 -04:00
Joey Hess	c879eb873e	do commit location changes to remote in copy --to test suite pointed out that if a file was copied from B to A, and then A cloned, the clone ought to immediatly know it can get the file from A.	2011-10-27 18:03:36 -04:00
Joey Hess	f84d66fa15	reap in onLocal Each onLocal call involves a new Annex state, so needs to clean up after it.	2011-10-27 14:55:07 -04:00
Joey Hess	c30366e95a	improve config reading when operating on remote on same host Before the config was read each time onLocal was called, and entirely redundantly since it's read for same-host remotes on startup. Also a minor bug fix: When rsyncing to a same-host remote, use the rsync-options from the repository that the user ran git-annex in, not those of the receiving repository.	2011-10-27 14:55:06 -04:00
Joey Hess	373cad993d	Sped up some operations on remotes that are on the same host. Specifically, disabled trying to update the git-annex branch on the remote, since that data is never used by operations that act on such remotes. Also, when copying content to such a remote, skip committing the presence information changes to its git-annex branch. Leaving it in the journal there is ok: Any command run on the remote that needs the info will flush the journal. This may partially solve this bug: http://git-annex.branchable.com/bugs/fails_to_handle_lot_of_files/ Although I still see unreaped git processes piling up when doing a copy --to.	2011-10-27 14:55:06 -04:00
Joey Hess	23f2a12816	broke up Utility	2011-10-16 00:50:12 -04:00
Joey Hess	91366c896d	clean Annex stuff out of Utility/	2011-10-16 00:04:26 -04:00
Joey Hess	1480d71adb	fix	2011-10-15 18:45:32 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	ec169f84b1	migrate: Copy url logs for keys when migrating.	2011-10-15 16:36:56 -04:00
Joey Hess	b4015064e1	break web log handling into a separate module	2011-10-15 16:25:51 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00
Joey Hess	9fa9214106	A remote can have a annexUrl configured, that is used by git-annex instead of its usual url. (Similar to pushUrl.)	2011-10-14 18:18:28 -04:00
Joey Hess	b505ba83e8	minor syntax changes	2011-10-11 14:43:45 -04:00
Joey Hess	6a6ea06cee	rename	2011-10-05 16:02:51 -04:00
Joey Hess	cfe21e85e7	rename	2011-10-04 00:59:08 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	4bf1a5ef59	refactor	2011-09-23 18:13:24 -04:00
Joey Hess	9f6b7935dd	go go gadget hlint	2011-09-20 23:24:48 -04:00
Joey Hess	dd463a3100	rework annex-ignore handling Only one place need to filter the list of remotes for ignored remotes: keyPossibilities. Make the full list available to everything else. This allows getting rid of the special case handing for --from and --to to make ignored remotes not be ignored with those options.	2011-09-18 20:11:39 -04:00
Joey Hess	999d5df90b	factor out firstM and anyM Control.Monad.Loops has these, but has no Debian package yet.	2011-08-28 15:46:49 -04:00
Joey Hess	f82da1d9dc	show a message if asked to get something from the web that is not there	2011-08-27 07:08:15 -04:00
Joey Hess	203148363f	split groups of related functions out of Utility	2011-08-22 16:14:12 -04:00
Joey Hess	737b5d14c9	moved files around	2011-08-20 16:11:42 -04:00
Joey Hess	ec746c511f	note about why curl -# is used I'd rather use wget really, but as git-annex uses libcurl elsewhere, it seems best to stick with curl. And making this configurable seems overboard.	2011-08-20 12:52:29 -04:00
Joey Hess	b7a4ff1c31	optimise initialized check Avoid running external command if annex.version is set.	2011-08-17 18:38:26 -04:00
Joey Hess	32f27cc3e8	when reading configs of local repos, first initializeSafe This auto-generates a uuid if the local repo does not already have one.	2011-08-17 14:44:31 -04:00
Joey Hess	f5449aae16	error out when dropping from http repo	2011-08-16 21:20:14 -04:00
Joey Hess	5ccb926b51	support for getting files from http git remotes	2011-08-16 21:04:23 -04:00
Joey Hess	a55faff08f	reorg Remote/*	2011-08-16 20:49:54 -04:00
Joey Hess	4545a0e78c	split out generic url stuff into a helper library from Remote.Web	2011-08-16 20:49:44 -04:00
Joey Hess	07f2e7ee72	support reading git config from http remotes The config file is downloaded to a temp file, and git-config run on that to parse it.	2011-08-16 20:48:11 -04:00
Joey Hess	dd8e649f49	fix file name for web remote log files The key name was not being sufficiently escaped, although it didn't break anything due to luck. Switch to properly escaped key names for the log filename, with a fallback to the buggy old name.	2011-08-06 14:45:58 -04:00
Joey Hess	45bbf210a1	Fix shell escaping in rsync special remote.	2011-07-29 15:28:21 +02:00
Joey Hess	00153eed48	unify elipsis handling And add a simple dots-based progress display, currently only used in v2 upgrade.	2011-07-19 14:07:23 -04:00
Joey Hess	6c396a256c	finished hlint pass	2011-07-15 12:47:14 -04:00
Joey Hess	cab4ac247c	rename	2011-07-05 20:36:43 -04:00
Joey Hess	c98b5cf36e	rename	2011-07-05 20:24:10 -04:00
Joey Hess	9f1577f746	remove unused backend machinery The only remaining vestiage of backends is different types of keys. These are still called "backends", mostly to avoid needing to change user interface and configuration. But everything to do with storing keys in different backends was gone; instead different types of remotes are used. In the refactoring, lots of code was moved out of odd corners like Backend.File, to closer to where it's used, like Command.Drop and Command.Fsck. Quite a lot of dead code was removed. Several data structures became simpler, which may result in better runtime efficiency. There should be no user-visible changes.	2011-07-05 19:57:46 -04:00
Joey Hess	5c69ac14eb	Drop the dependency on the haskell curl bindings, use regular haskell HTTP.	2011-07-04 19:33:11 -04:00
Joey Hess	e6b9539a65	make curl follow redirs	2011-07-01 21:52:27 -04:00
Joey Hess	ace9de37e8	download urls via tmp file, and support resuming	2011-07-01 18:59:40 -04:00
Joey Hess	79016c197c	add hashing to web log files	2011-07-01 17:23:01 -04:00
Joey Hess	6bddebdb79	add the addurl command	2011-07-01 17:15:46 -04:00
Joey Hess	cdbcd6f495	add web special remote Generalized LocationLog to PresenceLog, and use a presence log to record urls for the web special remote.	2011-07-01 15:30:42 -04:00
Joey Hess	f6063a094e	renamed GitRepo to Git It was always imported qualified as Git anyway	2011-06-30 13:21:39 -04:00
Joey Hess	c4e6730042	commit git-annex branch when copying to a remote (locally) Otherwise, the location log changes are only staged in its index, and this can confuse matters if pulling or cloning from the remote. The test suite was failing because this wasn't done.	2011-06-22 21:21:09 -04:00
Joey Hess	d0482d4154	bigfix: stat parent dirs	2011-06-13 21:46:28 -04:00
Joey Hess	30d7cce7ec	rsync is now used when copying files from repos on other filesystems cp is still used when copying file from repos on the same filesystem, since --reflink=auto can make it significantly faster on filesystems such as btrfs. Directory special remotes still use cp, not rsync. It's not clear what tmp file should be used when rsyncing to such a remote.	2011-06-13 20:33:52 -04:00
Joey Hess	19428ea2f4	fix building with S3 stub	2011-06-10 12:11:34 -04:00
Joey Hess	703c437bd9	rename modules for data types into Types/ directory	2011-06-01 21:56:04 -04:00
Joey Hess	93a4f3d4e6	Add --debug option. Closes: #627499 This takes advantage of the debug logging done by missingh, and I added my own debug messages for executeFile calls. There are still some other low-level ways git-annex runs stuff that are not shown by debugging, but this gets most of it easily.	2011-05-21 11:52:13 -04:00
Joey Hess	21d9c84e72	more standard names for whenM and unlessM operators These are defined in ifelse, but it's not currently available and I don't want to pull in a library for 6 lines of code anyhow. Also, ifelse sets the fixity to 1, which does not allow >>? error $ ...	2011-05-17 11:45:24 -04:00
Joey Hess	c91929f693	add whenM and unlessM Just more golfing.. I am pretty sure something in a library somewhere can do this, but I have been unable to find it.	2011-05-17 03:13:11 -04:00
Joey Hess	760cde28b6	more pointless monadic golfing	2011-05-16 14:49:28 -04:00
Joey Hess	0a7bcd47ae	IA: do not create bucket at initremote time This way, the metadata sent when uploading a file is applied to the bucket then.	2011-05-16 13:10:26 -04:00
Joey Hess	1d2984441c	add a few tweaks to make it easy to use the Internet Archive's variant of S3 In particular, munge key filenames to comply with the IA's filename limits, disable encryption, support their nonstandard way of creating buckets, and allow x-amz-* headers to be specified in initremote to set item metadata. Still TODO: initremote does not handle multiword metadata headers right.	2011-05-16 11:20:35 -04:00
Joey Hess	79c74bf27d	refactor	2011-05-16 09:42:54 -04:00
Joey Hess	3e15a8a791	Maybe reduction pass 2	2011-05-15 12:25:58 -04:00
Joey Hess	cad0e1c8b7	simplified a bunch of Maybe handling	2011-05-15 03:38:08 -04:00
Joey Hess	3c319cd844	avoid always decrypting cipher Last change moved cipher decryption to remote setup time. Fixed this with a bit of a hack.	2011-05-01 15:13:54 -04:00
Joey Hess	2ddade8132	factor out base64 code	2011-05-01 14:27:40 -04:00
Joey Hess	1f84c7a964	S3: When encryption is enabled, the Amazon S3 login credentials are stored, encrypted, in .git-annex/remotes.log, so environment variables need not be set after the remote is initialized.	2011-05-01 14:05:10 -04:00
Joey Hess	cf501d3b9b	set ANNEX_HASH_* always	2011-04-29 14:04:20 -04:00
Joey Hess	3ab3f41aea	hook special remote implemented, and tested	2011-04-28 17:21:45 -04:00
Joey Hess	d7b330b33b	Fix hasKeyCheap setting for bup and rsync special remotes.	2011-04-28 14:39:51 -04:00
Joey Hess	39966ba4ee	filter out --delete rsync option rsync does not have a --no-delete, so do it this way instead	2011-04-27 20:31:56 -04:00
Joey Hess	e68f128a9b	rsync special remote Fully tested and working, including resuming and encryption. (Though not resuming when sending with encryption; gpg doesn't produce identical output each time.) Uses same layout as the directory special remote and the .git/annex/objects/ directory.	2011-04-27 20:23:09 -04:00
Joey Hess	45bdb2d413	ensure tmp dir exists	2011-04-21 10:53:29 -04:00
Joey Hess	6fcd3e1ef7	fix S3 upload buffering problem Provide file size to new version of hS3.	2011-04-21 10:33:17 -04:00
Joey Hess	4837176897	update on memory leak Finished applying to S3 the change that fixed the memory leak in bup, but it didn't seem to help S3.. with encryption it still grows to 2x file size.	2011-04-19 16:31:35 -04:00
Joey Hess	5985acdfad	bup: Avoid memory leak when transferring encrypted data. This was a most surprising leak. It occurred in the process that is forked off to feed data to gpg. That process was passed a lazy ByteString of input, and ghc seemed to not GC the ByteString as it was lazily read and consumed, so memory slowly leaked as the file was read and passed through gpg to bup. To fix it, I simply changed the feeder to take an IO action that returns the lazy bytestring, and fed the result directly to hPut. AFAICS, this should change nothing WRT buffering. But somehow it makes ghc's GC do the right thing. Probably I triggered some weakness in ghc's GC (version 6.12.1). (Note that S3 still has this leak, and others too. Fixing it will involve another dance with the type system.) Update: One theory I have is that this has something to do with the forking of the feeder process. Perhaps, when the ByteString is produced before the fork, ghc decides it need to hold a pointer to the start of it, for some reason -- maybe it doesn't realize that it is only used in the forked process.	2011-04-19 15:27:03 -04:00
Joey Hess	b1274b6378	refactor	2011-04-19 14:50:09 -04:00
Joey Hess	a441e08da1	Fix stalls in S3 when transferring encrypted data. Stalls were caused by code that did approximatly: content' <- liftIO $ withEncryptedContent cipher content return store content' The return evaluated without actually reading content from S3, and so the cleanup code began waiting on gpg to exit before gpg could send all its data. Fixing it involved moving the `store` type action into the IO monad: liftIO $ withEncryptedContent cipher content store Which was a bit of a pain to do, thank you type system, but avoids the problem as now the whole content is consumed, and stored, before cleanup.	2011-04-19 14:45:19 -04:00

... 5 6 7 8 9 ...

705 commits