git-annex

Author	SHA1	Message	Date
Joey Hess	f0195b2a43	Fix GETURLS in external special remote protocol to strip downloader prefix from logged url info before checking for the specified prefix. This doesn't change what GETURLS returns, but only whether it matches any prefix that the external special remote asked for.	2015-03-27 18:49:03 -04:00
Joey Hess	707293ba7e	remotedaemon: Fixed support for notifications of changes to gcrypt remotes, which was never tested and didn't quite work before.	2015-03-16 15:28:29 -04:00
Joey Hess	6045406deb	Added SETURIPRESENT and SETURIMISSING to external special remote protocol Useful for things like ipfs that don't use regular urls. An external special remote can add a regular url to a key, and then git-annex get will download it from the web. But for ipfs, we want to instead tell git-annex that the uri uses OtherDownloader. Before this change, the external special remote protocol lacked a way to do that.	2015-03-05 13:50:15 -04:00
Joey Hess	9b93278e8a	metadata: Fix encoding problem that led to mojibake when storing metadata strings that contained both unicode characters and a space (or '!') character. The fix is to stop using w82s, which does not properly reconstitute unicode strings. Instrad, use utf8 bytestring to get the [Word8] to base64. This passes unicode through perfectly, including any invalid filesystem encoded characters. Note that toB64 / fromB64 are also used for creds and cipher embedding. It would be unfortunate if this change broke those uses. For cipher embedding, note that ciphers can contain arbitrary bytes (should really be using ByteString.Char8 there). Testing indicated it's not safe to use the new fromB64 there; I think that characters were incorrectly combined. For credpair embedding, the username or password could contain unicode. Before, that unicode would fail to round-trip through the b64. So, I guess this is not going to break any embedded creds that worked before. This bug may have affected some creds before, and if so, this change will not fix old ones, but should fix new ones at least.	2015-03-04 12:54:30 -04:00
Joey Hess	450ee53ab6	When re-execing git-annex, use current program location, rather than ~/.config/git-annex/program, when possible. Most of the time, there will be no discreprancy between programPath and readProgramFile. But, the programFile might have been written by an old version of git-annex that is still installed, while a newer one is currently running. In this case, we want to run the same one that's currently running. This is especially important for things like the GIT_SSH=git-annex used for ssh connection caching. The only code that still uses readProgramFile directly is the upgrade code, which needs to know where the standalone git-annex was installed, in order to upgrade it.	2015-02-28 17:23:13 -04:00
Joey Hess	5be7ba7ee5	The ssh-options git config is now used by gcrypt, rsync, and ddar special remotes that use ssh as a transport.	2015-02-12 15:44:10 -04:00
Joey Hess	52e40970c8	avoid unncessary IO	2015-02-12 15:33:44 -04:00
Joey Hess	a22eaaae27	comment	2015-02-09 14:16:42 -04:00
Joey Hess	69a9c98e71	glacier: Detect when the glacier command in PATH is the wrong one, from boto, rather than from glacier-cli, and refuse to use it, since the boto program fails to fail when passed parameters it does not understand.	2015-02-06 14:39:27 -04:00
Joey Hess	1af8107fec	windows build fix	2015-01-29 13:46:57 -04:00
Joey Hess	e0187d5d12	test suite found a problem with today's work ". def" did not do what I thought it would, at all.	2015-01-28 18:05:08 -04:00
Joey Hess	009bd050c1	implement annex.tune.objecthashlower Split out Annex.DirHashes which never really belonged in Locations.	2015-01-28 16:52:08 -04:00
Joey Hess	e8c376e0ad	import Data.Default in Common	2015-01-28 16:11:28 -04:00
Joey Hess	0fd5f257d0	groundwork for parameterizing hash depth	2015-01-28 15:55:17 -04:00
Joey Hess	32fac4b71b	remove unnecessary use of MissingH	2015-01-21 13:36:48 -04:00
Joey Hess	afc5153157	update my email address and homepage url	2015-01-21 12:50:09 -04:00
Joey Hess	4f657aa14e	add getFileSize, which can get the real size of a large file on Windows Avoid using fileSize which maxes out at just 2 gb on Windows. Instead, use hFileSize, which doesn't have a bounded size. Fixes support for files > 2 gb on Windows. Note that the InodeCache code only needs to compare a file size, so it doesn't matter it the file size wraps. So it has been left as-is. This was necessary both to avoid invalidating existing inode caches, and because the code passed FileStatus around and would have become more expensive if it called getFileSize. This commit was sponsored by Christian Dietrich.	2015-01-20 17:09:24 -04:00
Joey Hess	534c29deae	implemented old Richih wishlist about remote/uuid info * info: Can now display info about a given uuid. * Added to remote/uuid info: Count of the number of keys present on the remote, and their size. This is rather expensive to calculate, so comes last and --fast will disable it. * Git remote info now includes the date of the last sync with the remote.	2015-01-13 18:13:14 -04:00
Joey Hess	3bab5dfb1d	revert parentDir change Reverts `965e106f24` Unfortunately, this caused breakage on Windows, and possibly elsewhere, because parentDir and takeDirectory do not behave the same when there is a trailing directory separator.	2015-01-09 13:11:56 -04:00
Joey Hess	965e106f24	made parentDir return a Maybe FilePath; removed most uses of it parentDir is less safe than takeDirectory, especially when working with relative FilePaths. It's really only useful in loops that want to terminate at / This commit was sponsored by Audric SCHILTKNECHT.	2015-01-06 18:55:56 -04:00
Joey Hess	6b3d0cb11a	bittorrent: Fix locking problem when using addurl file:// Fixes: /home/joey/tmp/xxx/.git/annex/misctmp/torrent18347: openFile: resource busy (file is locked)	2014-12-30 13:07:20 -04:00
Joey Hess	c9a3e80d32	fixed all remaining build warnings on Windows	2014-12-29 17:30:20 -04:00
Joey Hess	27fb7e514d	Fix build with -f-S3.	2014-12-19 16:53:25 -04:00
Joey Hess	ef12386924	When possible, build with the haskell torrent library for parsing torrent files.	2014-12-18 14:26:10 -04:00
Joey Hess	e2214f6ac8	remove default untrusted hack for bittorrent This is better handled by checkPresent always failing.	2014-12-17 15:38:00 -04:00
Joey Hess	e08fa65131	note about http://hackage.haskell.org/package/torrent	2014-12-17 15:34:38 -04:00
Joey Hess	6ca54c521d	make checkkey always fail for torrents See comment.	2014-12-17 14:54:54 -04:00
Joey Hess	2192c54877	more robust fallback when a file is available from multiple torrents and some torrent files cannot be downloaded	2014-12-17 14:38:04 -04:00
Joey Hess	bf9df3fc7e	fix fencepost error and aria resume after partial download of multi-file torrent	2014-12-17 14:21:48 -04:00
Joey Hess	3a7d0be120	remove excess directory	2014-12-17 14:17:19 -04:00
Joey Hess	d5cbbe1b9a	fix torrentUrlNum when there is no #n	2014-12-17 14:07:05 -04:00
Joey Hess	7e422269a6	move dummy uuids to Annex.UUID	2014-12-17 13:57:52 -04:00
Joey Hess	af05ac3ec2	add aria2 progress parsing	2014-12-17 13:40:04 -04:00
Joey Hess	a7690de016	Added bittorrent special remote addurl behavior change: When downloading an url ending in .torrent, it will download files from bittorrent, instead of the old behavior of adding the torrent file to the repository. Added Recommends on aria2 and bittornado \| bittorrent. This commit was sponsored by Asbjørn Sloth Tønnesen.	2014-12-16 23:22:46 -04:00
Joey Hess	65bce2c80d	reformat	2014-12-16 15:26:13 -04:00
Joey Hess	67c05daf5e	sanitize filepaths provided by checkUrl	2014-12-11 20:08:49 -04:00
Joey Hess	8a17bcb0be	simplify external special remote implementation	2014-12-11 17:44:27 -04:00
Joey Hess	bce7e0dd96	use subdir for addurl when it creates multiple files The --file parameter specifies the subdir in this mode.	2014-12-11 16:09:56 -04:00
Joey Hess	2cd84fcc8b	Expand checkurl to support recommended filename, and multi-file-urls This commit was sponsored by an anonymous bitcoiner.	2014-12-11 15:33:42 -04:00
Joey Hess	7ae16bb6f7	Revert "let url claims optionally include a suggested filename" This reverts commit `85df9c30e9`. Putting filename in the claim was a bad idea.	2014-12-11 14:09:57 -04:00
Joey Hess	85df9c30e9	let url claims optionally include a suggested filename	2014-12-11 12:47:57 -04:00
Joey Hess	aafb121068	unmangled mangled urls from the log before passing to external special remote	2014-12-08 19:27:40 -04:00
Joey Hess	30bf112185	Urls can now be claimed by remotes. This will allow creating, for example, a external special remote that handles magnet: and *.torrent urls.	2014-12-08 19:15:07 -04:00
Joey Hess	ee27298b91	implement CLAIMURL for external special remote	2014-12-08 13:57:13 -04:00
Joey Hess	cb6e16947d	add stub claimUrl	2014-12-08 13:40:15 -04:00
Joey Hess	8093008ef4	External special remote protocol now includes commands for setting and getting the urls associated with a key.	2014-12-08 13:32:46 -04:00
Joey Hess	911ba8d972	Merge branch 's3-aws'	2014-12-03 14:10:52 -04:00
Joey Hess	55fa1789dd	Don't show "(gpg)" when decrypting the remote encryption cipher, since this could be taken to read that's the only time git-annex runs gpg, which is not the case.	2014-12-02 13:50:45 -04:00
Joey Hess	0a891fcfc5	support S3 front-end used by globalways.net This threw an unusual exception w/o an error message when probing to see if the bucket exists yet. So rather than relying on tryS3, catch all exceptions. This does mean that it might get an exception for some transient network error, think this means the bucket DNE yet, and try to create it, and then fail when it already exists.	2014-11-05 12:42:12 -04:00
Joey Hess	93feefae05	Revert "work around minimum part size problem" This reverts commit `a42022d8ff`. I misunderstood the cause of the problem.	2014-11-04 16:21:55 -04:00
Joey Hess	a42022d8ff	work around minimum part size problem When uploading the last part of a file, which was 640229 bytes, S3 rejected that part: "Your proposed upload is smaller than the minimum allowed size" I don't know what the minimum is, but the fix is just to include the last part into the previous part. Since this can result in a part that's double-sized, use half-sized parts normally.	2014-11-04 16:06:13 -04:00
Joey Hess	ad2125e24a	fix a couple type errors and the progress bar	2014-11-04 15:39:48 -04:00
Joey Hess	fccdd61eec	fix memory leak Unfortunately, I don't fully understand why it was leaking using the old method of a lazy bytestring. I just know that it was leaking, despite neither hGetUntilMetered nor byteStringPopper seeming to leak by themselves. The new method avoids the lazy bytestring, and simply reads chunks from the handle and streams them out to the http socket.	2014-11-04 15:22:08 -04:00
Joey Hess	29871e320c	combine 2 checks	2014-11-04 14:47:18 -04:00
Joey Hess	0f78f197eb	casts; now fully working.. but still leaking Still seems to buffer the whole partsize in memory, but I'm pretty sure my code is not what's doing it. See https://github.com/aristidb/aws/issues/142	2014-11-03 21:12:15 -04:00
Joey Hess	f0551578d6	this should avoid leaking memory	2014-11-03 20:49:30 -04:00
Joey Hess	4230b56b79	logic error	2014-11-03 20:15:33 -04:00
Joey Hess	62de9a39bf	WIP 3	2014-11-03 20:04:42 -04:00
Joey Hess	d16382e99f	WIP 2	2014-11-03 19:50:33 -04:00
Joey Hess	5360417436	WIP try sending using RequestBodyStreamChunked May not work; if it does this is gonna be the simplest way to get good memory size and progress reporting.	2014-11-03 19:18:46 -04:00
Joey Hess	8f61bfad51	link to memory leak bug	2014-11-03 17:55:05 -04:00
Joey Hess	711b18a6eb	improve info display for multipart	2014-11-03 17:24:53 -04:00
Joey Hess	2c53f331bd	fix build	2014-11-03 17:23:46 -04:00
Joey Hess	6a965cf8d7	adjust version check I assume 0.10.6 will have the fix for the bug I reported, which got fixed in master already..	2014-11-03 16:23:00 -04:00
Joey Hess	5c3d9d6caa	show multipart configuration in git annex info s3remote	2014-11-03 16:07:41 -04:00
Joey Hess	a3ec6ed73b	Merge branch 'master' into s3-aws-multipart	2014-11-03 16:05:03 -04:00
Joey Hess	8faeb25076	finish multipart support using unreleased update to aws lib to yield etags Untested and not even compiled yet. Testing should include checks that file content streams through without buffering in memory. Note that CL.consume causes all the etags to be buffered in memory. This is probably nearly unavoidable, since a request has to be constructed that contains the list of etags in its body. (While it might be possible to stream generation of the body, that would entail making a http request that dribbles out parts of the body as the multipart uploads complete, which is not likely to work well.. To limit this being a problem, it's best for partsize to be set to some suitably large value, like 1gb. Then a full terabyte file will need only 1024 etags to be stored, which will probably use around 1 mb of memory.	2014-11-03 16:04:55 -04:00
Joey Hess	39dd5a2ac3	improve uuid mismatch message	2014-10-28 15:54:44 -04:00
Joey Hess	6e89d070bc	WIP multipart S3 upload I'm a little stuck on getting the list of etags of the parts. This seems to require taking the md5 of each part locally, which doesn't get along well with lazily streaming in the part from the file. It would need to read the file twice, or lose laziness and buffer a whole part -- but parts might be quite large. This seems to be a problem with the API provided; S3 is supposed to return an etag, but that is not exposed. I have filed a bug: https://github.com/aristidb/aws/issues/141	2014-10-28 14:17:30 -04:00
Joey Hess	8ed1a0afee	fix build	2014-10-23 16:52:05 -04:00
Joey Hess	8edf7a0fc3	fix build	2014-10-23 16:51:10 -04:00
Joey Hess	171e677a3c	update for aws 0.10's better handling of DNE for HEAD Kept support for older aws, since Debian has 0.9.2 still.	2014-10-23 16:32:18 -04:00
Joey Hess	fa1318479e	rename isIA to configIA Already done on s3-aws branch, so reduce divergence.	2014-10-23 15:56:35 -04:00
Joey Hess	6acc6863c5	fix build	2014-10-23 15:54:00 -04:00
Joey Hess	7489f516bc	one last build fix, yes it builds now	2014-10-23 15:50:41 -04:00
Joey Hess	76ee815e89	needs type families	2014-10-23 15:48:37 -04:00
Joey Hess	f0989cf0bd	fix build	2014-10-23 15:41:57 -04:00
Joey Hess	8b48bdfdc8	enable frankfurt The aws library supports the AWS4-HMAC-SHA256 that it requires.	2014-10-23 11:02:24 -04:00
Joey Hess	4eefc12295	Merge branch 'master' into s3-aws	2014-10-23 11:02:14 -04:00
Joey Hess	e687c61d04	add new frankfurt region to list in webapp But commented out for now, because: The authorization mechanism you have provided is not supported. Please use AWS4-HMAC-SHA256	2014-10-23 11:02:02 -04:00
Joey Hess	35551d0ed0	Merge branch 'master' into s3-aws Conflicts: Remote/S3.hs	2014-10-22 17:14:38 -04:00
Joey Hess	5c15d6d3cc	show in info whether a remote uses hybrid encryption or not	2014-10-22 14:39:59 -04:00
Joey Hess	3006b79c86	include creds info for glacier and webdav That and S3 are all that uses creds currently, except that external remotes can use creds. I have not handled showing info about external remote creds because they can have 0, 1, or more separate cred pairs, and there's no way for info to enumerate them or know how they're used. So it seems ok to leave out creds info for external remotes.	2014-10-22 13:56:14 -04:00
Joey Hess	1b90838bbd	add internet archive item url to info	2014-10-21 15:34:32 -04:00
Joey Hess	9280fe4cbe	include creds location in info This is intended to let the user easily tell if a remote's creds are coming from info embedded in the repository, or instead from the environment, or perhaps are locally stored in a creds file. This commit was sponsored by Frédéric Schütz.	2014-10-21 15:09:40 -04:00
Joey Hess	a0297915c1	add per-remote-type info Now `git annex info $remote` shows info specific to the type of the remote, for example, it shows the rsync url. Remote types that support encryption or chunking also include that in their info. This commit was sponsored by Ævar Arnfjörð Bjarmason.	2014-10-21 14:36:09 -04:00
Joey Hess	fced322834	glacier: Fix pipe setup when calling glacier-cli to retrieve an object.	2014-10-20 15:11:01 -04:00
Joey Hess	ef3804bdb3	S3: Fix embedcreds=yes handling for the Internet Archive. Before, embedcreds=yes did not cause the creds to be stored in remote.log, but also prevented them being locally cached.	2014-10-12 13:15:52 -04:00
Joey Hess	9fd95d9025	indent with tabs not spaces Found these with: git grep "^ " $(find -type f -name \*.hs) \|grep -v ': where' Unfortunately there is some inline hamlet that cannot use tabs for indentation. Also, Assistant/WebApp/Bootstrap3.hs is a copy of a module and so I'm leaving it as-is.	2014-10-09 15:09:26 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	0ed33c8b74	deal with old repositories with non-encrypted creds See `2f3c3aa01f` for backstory about how a repo could be in this state. When decryption fails, the repo must be using non-encrypted creds. Note that creds are encrypted/decrypted using the encryption cipher which is stored in the repo, so the decryption cannot fail due to missing gpg keys etc. (For !shared encryptiom, the cipher is iteself encrypted using some gpg key(s), and the decryption of the cipher happens earlier, so not affected by this change. Print a warning message for !shared repos, and continue on using the cipher. Wrote a page explaining what users hit by this bug should do. This commit was sponsored by Samuel Tardieu.	2014-09-18 17:58:03 -04:00
Joey Hess	2f3c3aa01f	glacier, S3: Fix bug that caused embedded creds to not be encypted using the remote's key. encryptionSetup must be called before setRemoteCredPair. Otherwise, the RemoteConfig doesn't have the cipher in it, and so no cipher is used to encrypt the embedded creds. This is a security fix for non-shared encryption methods! For encryption=shared, there's no security problem, just an inconsistentency in whether the embedded creds are encrypted. This is very important to get right, so used some types to help ensure that setRemoteCredPair is only run after encryptionSetup. Note that the external special remote bypasses the type safety, since creds can be set after the initial remote config, if the external special remote program requests it. Also note that IA remotes never use encryption, so encryptionSetup is not run for them at all, and again the type safety is bypassed. This leaves two open questions: 1. What to do about S3 and glacier remotes that were set up using encryption=pubkey/hybrid with embedcreds? Such a git repo has a security hole embedded in it, and this needs to be communicated to the user. Is the changelog enough? 2. enableremote won't work in such a repo, because git-annex will try to decrypt the embedded creds, which are not encrypted, so fails. This needs to be dealt with, especially for ecryption=shared repos, which are not really broken, just inconsistently configured. Noticing that problem for encryption=shared is what led to commit `fbdeeeed5f`, which tried to fix the problem by not decrypting the embedded creds. This commit was sponsored by Josh Taylor.	2014-09-18 17:26:12 -04:00
Joey Hess	d84eab8a8a	Revert "S3, Glacier, WebDAV: Fix bug that prevented accessing the creds when the repository was configured with encryption=shared embedcreds=yes." This reverts commit `fbdeeeed5f`. I can find no basis for that commit and think that I made it in error. setRemoteCredPair always encrypts using the cipher from remoteCipher, even when the cipher is shared.	2014-09-18 15:21:47 -04:00
Joey Hess	f7847ae98d	Merge branch 'master' into s3-aws Conflicts: Utility/Url.hs debian/changelog git-annex.cabal	2014-09-18 14:36:20 -04:00
Joey Hess	9964584c34	WebDav: Fix enableremote crash when the remote already exists. (Bug introduced in version 5.20140817.)	2014-09-17 13:04:55 -04:00
Joey Hess	a97c9e43b7	The annex-rsync-transport configuration is now also used when checking if a key is present on a rsync remote, and when dropping a key from the remote.	2014-09-11 13:21:35 -04:00
Joey Hess	b874f84086	New annex.hardlink setting. Closes: #758593 * New annex.hardlink setting. Closes: #758593 * init: Automatically detect when a repository was cloned with --shared, and set annex.hardlink=true, as well as marking the repository as untrusted. Had to reorganize Logs.Trust a bit to avoid a cycle between it and Annex.Init.	2014-09-05 13:44:09 -04:00
Joey Hess	6eb5c3f479	Do not preserve permissions and acls when copying files from one local git repository to another. Timestamps are still preserved as long as cp --preserve=timestamps is supported. This avoids cp -a overriding the default mode acls that the user might have set in a git repository. With GNU cp, this behavior change should not be a breaking change, because git-anex also uses rsync sometimes in the same situation, and has only ever preserved timestamps when using rsync. Systems without GNU cp will no longer use cp -a, but instead just cp. So, timestamps will no longer be preserved. Preserving timestamps when copying between repos is not guaranteed anyway. Closes: #729757	2014-08-26 17:10:25 -07:00
Joey Hess	aebcc395ff	use types to enforce that removeAnnex can only be called inside lockContent This fixed one bug where it needed to be and wasn't (in Assistant.Unused). And also found one place where lockContent was used unnecessarily (by drop --from remote). A few other places like uninit probably don't really need to lockContent, but it doesn't hurt to do call it anyway. This commit was sponsored by David Wagner.	2014-08-20 20:13:47 -04:00
Joey Hess	1994771215	more lock file refactoring Also fixes a test suite failures introduced in recent commits, where inAnnexSafe failed in indirect mode, since it tried to open the lock file ReadWrite. This is why the new checkLocked opens it ReadOnly. This commit was sponsored by Chad Horohoe.	2014-08-20 18:58:14 -04:00

1 2 3 4 5 ...

756 commits