git-annex

Author	SHA1	Message	Date
Joey Hess	8ac4126bd2	cleanup	2016-12-09 16:22:06 -04:00
Joey Hess	e152c322f8	refactor ref change watching Added to change notification to P2P protocol. Switched to a TBChan so that a single long-running thread can be started, and serve perhaps intermittent requests for change notifications, without buffering all changes in memory. The P2P runner currently starts up a new thread each times it waits for a change, but that should allow later reusing a thread. Although each connection from a peer will still need a new watcher thread to run. The dependency on stm-chans is more or less free; some stuff in yesod uses it, so it was already indirectly pulled in when building with the webapp. This commit was sponsored by Francois Marier on Patreon.	2016-12-09 15:01:09 -04:00
Joey Hess	15be5c04a6	git-annex-shell, remotedaemon, git remote: Fix some memory DOS attacks. The attacker could just send a very lot of data, with no \n and it would all be buffered in memory until the kernel killed git-annex or perhaps OOM killed some other more valuable process. This is a low impact security hole, only affecting communication between local git-annex and git-annex-shell on the remote system. (With either able to be the attacker). Only those with the right ssh key can do it. And, there are probably lots of ways to construct git repositories that make git use a lot of memory in various ways, which would have similar impact as this attack. The fix in P2P/IO.hs would have been higher impact, if it had made it to a released version, since it would have allowed DOSing the tor hidden service without needing to authenticate. (The LockContent and NotifyChanges instances may not be really exploitable; since the line is read and ignored, it probably gets read lazily and does not end up staying buffered in memory.)	2016-12-09 13:34:32 -04:00
Joey Hess	8e00efb938	didn't mean to commit this change yet	2016-12-08 17:10:48 -04:00
Joey Hess	43e7044b43	comment	2016-12-08 17:10:24 -04:00
Joey Hess	af41519126	convert P2P runners from Maybe to Either String So we get some useful error messages when things fail. This commit was sponsored by Peter Hogg on Patreon.	2016-12-08 15:47:49 -04:00
Joey Hess	e56506d83c	include error message when unable to connect to peer	2016-12-08 14:14:08 -04:00
Joey Hess	2fb6fd7434	Merge branch 'master' into tor	2016-12-07 14:32:25 -04:00
Joey Hess	0d9a11625c	remote uuid discovery in p2p --link This also tests that we can connect to the peer. This commit was sponsored by Jeff Goeke-Smith on Patreon.	2016-12-07 12:38:21 -04:00
Joey Hess	f61508aed4	add: Stage modified non-large files when running in indirect mode. (This was already done in v6 mode and direct mode.)	2016-12-05 14:10:21 -04:00
Joey Hess	82d01f5619	rekey: Added --batch mode. Would have liked to make the Parser parse the file and key pairs, but it seems that optparse-applicative is unable to handle eg: many ((,) <$> argument <*> argument) This commit was sponsored by Thomas Hochstein on Patreon.	2016-12-05 12:55:50 -04:00
Joey Hess	6246c4a6db	minor style	2016-12-05 12:16:07 -04:00
Joey Hess	b0978b0196	Merge kite:tmp/git-annex	2016-12-05 12:15:48 -04:00
Joey Hess	93852dd7e8	rmurl: --batch * rmurl: Multiple pairs of files and urls can be provided on the command line. * rmurl: Added --batch mode. This commit was sponsored by Trenton Cronholm on Patreon.	2016-12-05 12:10:07 -04:00
Daniel Brooks	24317be646	git-annex fromkey now takes multiple pairs of keys and filenames It also still reads from stdin when none are specified.	2016-12-05 09:59:20 -05:00
Joey Hess	3ab12ba923	implement p2p --link This commit was sponsored by Riku Voipio.	2016-11-30 15:16:25 -04:00
Joey Hess	bfc8305814	implement p2p command	2016-11-30 14:35:24 -04:00
Joey Hess	568d81944a	avoid too-long command synopsis It was making git-annex usage output columns far too wide	2016-11-30 14:16:57 -04:00
Joey Hess	24593aaa32	Merge branch 'master' into tor	2016-11-30 14:16:36 -04:00
Joey Hess	8354612131	prefer xdot over dot * map: Run xdot if it's available in PATH. On OSX, the dot command does not support graphical display, while xdot does. * Debian: xdot is a better interactive viewer than dot, so Suggest xdot, rather than graphviz.	2016-11-30 12:50:49 -04:00
Joey Hess	38425fdc39	finish git-annex enable-tor Make it stash the address away for git-annex p2p to use later, rather than outputting it. And, look up the UUID itself.	2016-11-29 17:30:27 -04:00
Joey Hess	398345cb26	Merge branch 'master' into tor	2016-11-29 15:45:29 -04:00
Markus Hauru	9e2073f331	Fixed typo in Schedule.hs.	2016-11-24 07:37:33 -04:00
Joey Hess	9f179ae8b9	fix regression The file matcher needs to be run on the destination file not the tmp file, in order for filename matches to work properly. However, it also needs to be able to probe the file for size and mime type. This is a quick fix to a regression. The double rename is not pretty. It would be good to either have a way to run the largeFileMatcher such that it is matching on the final filename but looks at the temp file, or to make addAnnexedFile not need the temp file in a different location.	2016-11-22 11:18:41 -04:00
Joey Hess	48d8c175f8	avoid backtrace when rekey cntent verification fails	2016-11-22 01:16:18 -04:00
Joey Hess	070fb9e624	Added git-remote-tor-annex, which allows git pull and push to the tor hidden service. Almost working, but there's a bug in the relaying. Also, made tor hidden service setup pick a random port, to make it harder to port scan. This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2016-11-21 17:27:38 -04:00
Joey Hess	6e6d1a8c15	addurl: Fix bug in checking annex.largefiles expressions using largerthan, mimetype, and smallerthan; the first two always failed to match, and the latter always matched.	2016-11-21 11:30:53 -04:00
Joey Hess	74691ddf0e	remotedaemon: serve tor hidden service	2016-11-20 15:48:12 -04:00
Joey Hess	a101b8de37	remotedaemon: Fork to background by default. Added --foreground switch to enable old behavior. Groundwork for tor hidden services, which the remotedaemon will serve.	2016-11-20 14:50:36 -04:00
Joey Hess	95916b2ecf	Merge branch 'master' into tor	2016-11-17 12:56:27 -04:00
Joey Hess	10703dc817	improve comment	2016-11-16 16:03:23 -04:00
Joey Hess	2577f1c0a2	fsck --all --from was checking the content of files in the local repository, rather than on the special remote. Straight up forgot to handle this case! This commit was sponsored by Fernando Jimenez on Patreon.	2016-11-16 15:33:57 -04:00
Joey Hess	0a4479b8ec	Avoid backtraces on expected failures when built with ghc 8; only use backtraces for unexpected errors. ghc 8 added backtraces on uncaught errors. This is great, but git-annex was using error in many places for a error message targeted at the user, in some known problem case. A backtrace only confuses such a message, so omit it. Notably, commands like git annex drop that failed due to eg, numcopies, used to use error, so had a backtrace. This commit was sponsored by Ethan Aubin.	2016-11-15 21:29:54 -04:00
Joey Hess	556b2ded2b	sync: Pass --allow-unrelated-histories to git merge when used with git git 2.9.0 or newer. This makes merging a remote into a freshly created direct mode repository work the same as it works in indirect mode. The git-annex branches would get merged in any case by a sync, since that doesn't use git merge. This might need to be revisited later to better mirror git's behavior.	2016-11-15 18:26:17 -04:00
Joey Hess	57d33f7923	use socket for tor hidden service This avoids needing to bind to the right port before something else does. The socket is in /var/run/user/$uid/ which ought to be writable by only that uid. At least it is on linux systems using systemd. For Windows, may need to revisit this and use ports or something. The first version of tor to support sockets for hidden services was 0.2.6.3. That is not in Debian stable, but is available in backports. This commit was sponsored by andrea rota.	2016-11-14 16:47:56 -04:00
Joey Hess	07ad19f421	git-annex enable-tor command Tor unfortunately does not come out of the box configured to let hidden services register themselves on the fly via the ControlPort. And, changing the config to enable the ControlPort and a particular type of auth for it may break something already using the ControlPort, or lessen the security of the system. So, this leaves only one option to us: Add a hidden service to the torrc. git-annex enable-tor does so, and picks an unused high port for tor to listen on for connections to the hidden service. It's up to the caller to somehow pick a local port to listen on that won't be used by something else. That may be difficult to do.. This commit was sponsored by Jochen Bartl on Patreon.	2016-11-14 13:48:35 -04:00
Joey Hess	5afc2eaa54	reinject --known: Avoid second, unncessary checksum of file.	2016-11-07 12:07:36 -04:00
Joey Hess	8dcf79694d	enable forwardRetry for command-line transfers If a transfer fails for some reason, but some data managed to be sent, the transfer will be retried. (The assistant already did this.) Possible impacts: * More ssh prompts if ssh needs to prompt for a password to connect to a host, or is prompting about some other problem like a ssh key mismatch. * More data transfer due to retrying, epecially when a remote does not support resuming a transfer. In the worst case, a lot of data will be transferred but it fails before the end, and then all that data gets transferred again plus one byte more; repeat until it manages to get the whole file.	2016-10-26 15:38:27 -04:00
Joey Hess	0b1c061382	importfeed: Drop URL parameters from file extension. Thanks, James MacMahon.	2016-10-17 16:02:05 -04:00
Joey Hess	8e22114735	upgrade: Handle upgrade to v6 when the repository already contains v6 unlocked files whose content is already present. Closes https://github.com/datalad/datalad/issues/1020 The use of runWriter in scanUnlockedFiles broke due to this change; it failed with blocked indefinitely in mvar, because the database write handle was taken while linkFromAnnex needed to also write to it (to update the inode cache). So, switched to using a separate runWriter for each call to addAssociatedFileFast. A little less efficient, but not greatly; the writes should all still be cached.	2016-10-17 15:19:47 -04:00
Joey Hess	ee309d6941	lock: Fix edge cases where data loss could occur in v6 mode. In the case where the pointer file is in place, and not the content of the object, lock's performNew was called with filemodified=True, which caused it to try to repopulate the object from an unmodified associated file, of which there were none. So, the content of the object got thrown away incorrectly. This was the cause (although not the root cause) of data loss in https://github.com/datalad/datalad/issues/1020 The same problem could also occur when the work tree file is modified, but the object is not, and lock is called with --force. Added a test case for this, since it's excercising the same code path and is easier to set up than the problem above. Note that this only occurred when the keys database did not have an inode cache recorded for the annex object. Normally, the annex object would be in there, but there are of course circumstances where the inode cache is out of sync with reality, since it's only a cache. Fixed by checking if the object is unmodified; if so we don't need to try to repopulate it. This does add an additional checksum to the unlock path, but it's already checksumming the worktree file in another case, so it doesn't slow it down overall. Further investigation found a similar problem occurred when smudge --clean is called on a file and the inode cache is not populated. cleanOldKeys deleted the unmodified old object file in this case. This was also fixed by checking if the object is unmodified. In general, use of getInodeCaches and sameInodeCache is potentially dangerous if the inode cache has not gotten populated for some reason. Better to use isUnmodified. I breifly auited other places that check the inode cache, and did not see any immediate problems, but it would be easy to miss this kind of problem.	2016-10-17 13:58:43 -04:00
Joey Hess	f867fc157f	When auto-upgrading a v3 remote, avoid upgrading to version 6, instead keep it at version 5. Fixes a bug introduced with v6 mode that I didn't notice until now. Probably not many v3 repos left out there, and upgrading them to v6 mode is not disastrous, only a little premature. This commit was sponsored by Riku Voipio	2016-10-05 16:23:09 -04:00
Joey Hess	166d70db77	convert TMVars that are never left empty into TVars This is probably more efficient, and it avoids mistakenly leaving them empty.	2016-09-30 19:51:16 -04:00
Joey Hess	c910004d50	addurl, importfeed: Improve behavior when file being added is gitignored.	2016-09-21 17:21:48 -04:00
Joey Hess	a569f195b7	fix bugs in handing of deep branches with sync and adjusted branches * sync: Previously, when run in a branch with a slash in its name, such as "foo/bar", the sync branch was "synced/bar". That conflicted with the sync branch used for branch "bar", so has been changed to "synced/foo/bar". * adjust: Previously, when adjusting a branch with a slash in its name, such as "foo/bar", the adjusted branch was "adjusted/bar(unlocked)". That conflicted with the adjusted branch used for branch "bar", so has been changed to "adjusted/foo/bar(unlocked)" * Also, running sync in an adjusted branch did not correctly sync changes back to the parent branch when it had a slash in its name. This bug has been fixed. Eliminate use of Git.Ref.under and Git.Ref.basename; using Git.Ref.underBase and Git.Ref.base make everything handle deep branches correctly. Probably noone was adjusting deep branches, and v6 is still experimental anyway, so I'm not going to worry about the mess that was left by that bug. In the case of git-annex sync, using a fixed git-annex with an old unfixed one will mean they use different sync branches for a deep branch, and so they may stop syncing until the old one is upgraded. However, that's only a problem when syncing between repositories without going via a central bare repository. Added a warning about this to the CHANGELOG, but it's probably not going to affect many people at all. This commit was sponsored by Riku Voipio.	2016-09-21 15:23:47 -04:00
Joey Hess	0e30e71e9c	info: Support being passed a treeish, and show info about the annexed files in it similar to how a directory is handled.	2016-09-15 12:51:00 -04:00
Joey Hess	3e22d60549	copy, move, mirror: Support --json and --json-progress.	2016-09-09 16:24:26 -04:00
Joey Hess	a108235565	better locking for json with -J Avoid threads emitting json at the same time and scrambling, which was still possible even with the buffering, just less likely. Converted json IO actions to JSONChunk data too.	2016-09-09 15:51:34 -04:00
Joey Hess	05d4438383	addurl, get: Added --json-progress option, which adds progress objects to the json output. This doesn't work right when used with -J yet, and there is some really ugly hand-crafting of part of the json output.	2016-09-09 15:06:54 -04:00
Joey Hess	8ef494a833	disentangle concurrency and message type This makes -Jn work with --json and --quiet, where before setting -Jn disabled those options. Concurrent json output is currently a mess though since threads output chunks over top of one-another.	2016-09-09 12:57:42 -04:00

1 2 3 4 5 ...

1761 commits