git-annex

Author	SHA1	Message	Date
Joey Hess	d689a5b338	git style filename quoting controlled by core.quotePath This is by no means complete, but escaping filenames in actionItemDesc does cover most commands. Note that for ActionItemBranchFilePath, the value is branch:file, and I choose to only quote the file part (if necessary). I considered quoting the whole thing. But, branch names cannot contain control characters, and while they can contain unicode, git coes not quote unicode when displaying branch names. So, it would be surprising for git-annex to quote unicode in a branch name. The find command is the most obvious command that still needs to be dealt with. There are probably other places that filenames also get displayed, eg embedded in error messages. Some other commands use ActionItemOther with a filename, I think that ActionItemOther should either be pre-sanitized, or should explicitly not be used for filenames, so that needs more work. When --json is used, unicode does not get escaped, but control characters were already escaped in json. (Key escaping may turn out to be needed, but I'm ignoring that for now.) Sponsored-by: unqueued on Patreon	2023-04-08 14:52:26 -04:00
Joey Hess	be19a68276	new matching options --want-get-by and --want-drop-by Sponsored-by: Graham Spencer on Patreon	2022-07-28 13:26:03 -04:00
Joey Hess	81d402216d	cache the serialization of a Key This will speed up the common case where a Key is deserialized from disk, but is then serialized to build eg, the path to the annex object. Previously attempted in `4536c93bb2` and reverted in `96aba8eff7`. The problems mentioned in the latter commit are addressed now: Read/Show of KeyData is backwards-compatible with Read/Show of Key from before this change, so Types.Distribution will keep working. The Eq instance is fixed. Also, Key has smart constructors, avoiding needing to remember to update the cached serialization. Used git-annex benchmark: find is 7% faster whereis is 3% faster get when all files are already present is 5% faster Generally, the benchmarks are running 0.1 seconds faster per 2000 files, on a ram disk in my laptop.	2019-11-22 17:49:16 -04:00
Joey Hess	40ecf58d4b	update licenses from GPL to AGPL This does not change the overall license of the git-annex program, which was already AGPL due to a number of sources files being AGPL already. Legally speaking, I'm adding a new license under which these files are now available; I already released their current contents under the GPL license. Now they're dual licensed GPL and AGPL. However, I intend for all my future changes to these files to only be released under the AGPL license, and I won't be tracking the dual licensing status, so I'm simply changing the license statement to say it's AGPL. (In some cases, others wrote parts of the code of a file and released it under the GPL; but in all cases I have contributed a significant portion of the code in each file and it's that code that is getting the AGPL license; the GPL license of other contributors allows combining with AGPL code.)	2019-03-13 15:48:14 -04:00
Joey Hess	2e0e557e75	Support being built with ghc 8.0.1 (MonadFail) Tested on an older ghc by enabling MonadFailDesugaring globally. In TransferQueue, the lack of a MonadFail for STM exposed what would normally be a bug in the pattern matching, although in this case an earlier check that the queue was not empty avoided a pattern match failure.	2019-01-05 11:55:15 -04:00
Joey Hess	fd5a392006	cache remotes via annex-speculate-present Added remote.name.annex-speculate-present config that can be used to make cache remotes. Implemented it in Remote.keyPossibilities, which is used by the get/move/copy/mirror commands, and nothing else. This way, things like whereis will not show content that's speculatively present. The assistant and sync --content were not using Remote.keyPossibilities, and were changed to use it. The efficiency hit should be small; Remote.keyPossibilities is only used before transferring a file, which is the expensive operation. And, it's only doing one lookup of the remoteList and a very cheap filter over it. Note that, git-annex still updates the location log when copying content to a remote with annex-speculate-present set. In this case, the location tracking will indicate that content is present in the remote. This may not be wanted for caches, or may not be a real problem for them. TBD. This commit was supported by the NSF-funded DataLad project.	2018-08-01 14:28:05 -04:00
Joey Hess	256d8f07e8	avoid insertWith' depreaction warning Switch to Data.Map.Strict everywhere that used it. There are still lots of lazy maps in git-annex. I think switching these is safe. The risk is that there might be a map that is used in a way that relies on the values not being evaluated to WHNF, and switching to strict might result in bad performance or memory use. So, I have not switched everything.	2018-04-22 13:28:31 -04:00
Joey Hess	46d19648ee	first pass at assistant knowing about export remotes Split exportRemotes out from syncDataRemotes; the parts of the assistant that upload keys and drop keys from remotes don't apply to exports, because those operations are not supported. Some parts of the assistant and webapp do operate on both syncDataRemotes and exportRemotes. Particularly when downloading from either of them. Added a downloadRemotes that combines both. With this, the assistant should download from exports, but it won't yet upload changes to them. This commit was sponsored by Fernando Jimenez on Patreon.	2017-09-20 13:58:27 -04:00
Joey Hess	1a0e2c9901	get, move, copy, mirror: Added --failed switch which retries failed copies/moves Note that get --from foo --failed will get things that a previous get --from bar tried and failed to get, etc. I considered making --failed only retry transfers from the same remote, but it was easier, and seems more useful, to not have the same remote requirement. Noisy due to some refactoring into Types/	2016-08-03 12:37:12 -04:00
Joey Hess	afc5153157	update my email address and homepage url	2015-01-21 12:50:09 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	964a181026	try to drop unused object if it does not need to be transferred anywhere	2014-01-23 16:51:16 -04:00
Joey Hess	3518c586cf	fix transfers of key with no associated file Several places assumed this would not happen, and when the AssociatedFile was Nothing, did nothing. As part of this, preferred content checks pass the Key around. Note that checkMatcher is sometimes now called with Just Key and Just File. It currently constructs a FileMatcher, ignoring the Key. However, if it constructed a FileKeyMatcher, which contained both, then it might be possible to speed up parts of Limit, which currently call the somewhat expensive lookupFileKey to get the Key. I have not made this optimisation yet, because I am not sure if the key is always the same. Will need some significant checking to satisfy myself that's the case..	2014-01-23 16:44:02 -04:00
Joey Hess	3da0064657	assistant unused file handling Make sanity checker run git annex unused daily, and queue up transfers of unused files to any remotes that will have them. The transfer retrying code works for us here, so eg when a backup disk remote is plugged in, any transfers to it are done. Once the unused files reach a remote, they'll be removed locally as unwanted. If the setup does not cause unused files to go to a remote, they'll pile up, and the sanity checker detects this using some heuristics that are pretty good -- 1000 unused files, or 10% of disk used by unused files, or more disk wasted by unused files than is left free. Once it detects this, it pops up an alert in the webapp, with a button to take action. TODO: Webapp UI to configure this, and also the ability to launch an immediate cleanup of all unused files. This commit was sponsored by Simon Michael.	2014-01-22 22:53:18 -04:00
Joey Hess	1c8f5f830b	revert use of Data.Map.Strict memory profile shows this did not contribute to the memory leaks fixed in `b92b54bd42`	2014-01-07 14:55:06 -04:00
Joey Hess	b92b54bd42	assistant: Fixed several minor memory leaks that manifested when adding a large number of files.	2014-01-05 21:30:48 -04:00
Joey Hess	362ed9f0e3	use DList for the transfer queue Some nice efficiency gains here for list appending, although mostly the small size of the transfer queue makes them irrelivant.	2013-04-25 01:33:44 -04:00
Joey Hess	c6da464051	use a DList for the deferred downloads queue	2013-04-25 01:26:23 -04:00
Joey Hess	69a80a9a4d	avoid queuing transfers that are currently running	2013-04-02 16:17:06 -04:00
Joey Hess	1f4fe92f61	avoid queuing transfer that is already queued, with a different associated file	2013-04-02 16:04:49 -04:00
Joey Hess	0f6a6f2a8b	avoid queueing uploads to remotes that already have the content	2013-04-02 15:51:58 -04:00
Joey Hess	60760cb430	tweak	2013-03-13 13:11:49 -04:00
Joey Hess	99989bcc8e	logic error	2013-03-07 14:08:50 -04:00
Joey Hess	265e440d22	avoid queuing transfers that are already queued I saw this happen in real life, when syncing to a newly added usb drive. I think it got scanned twice, and files were doubled in the queue. This could be optimised a little bit more, to only read from the mvar once, rather than twice.	2013-03-07 12:35:42 -04:00
Joey Hess	46c9cbeb1e	add additional debug info about reasons for transfers	2013-03-01 15:23:59 -04:00
Joey Hess	99a8a5297c	--auto fixes * get/copy --auto: Transfer data even if it would exceed numcopies, when preferred content settings want it. * drop --auto: Fix dropping content when there are no preferred content settings.	2012-12-06 13:22:16 -04:00
Joey Hess	5e44ab177c	don't try to transfer data to/from XMPP remotes Partition syncRemotes into ones needing git sync (ie, non-special remotes), and ones needing data sync (ie, non-XMPP remotes).	2012-11-11 16:23:16 -04:00
Joey Hess	93ffd47d76	finished pushing Assistant monad into all relevant files All temporary and old functions are removed.	2012-10-30 17:14:51 -04:00
Joey Hess	47d94eb9a4	pushed Assistant monad down into DaemonStatus code Currently have three old versions of functions that more reworking is needed to remove: getDaemonStatusOld, modifyDaemonStatusOld_, and modifyDaemonStatusOld	2012-10-30 15:39:15 -04:00
Joey Hess	ea8df8fe9f	cleanup daemonStatus accessors	2012-10-30 14:44:18 -04:00
Joey Hess	68118b8986	split remaining assistant types	2012-10-30 14:34:48 -04:00
Joey Hess	f78ca9bc58	split out daemonstatus types	2012-10-30 14:11:14 -04:00
Joey Hess	6144f325b9	reorder get list of remotes after, rather than before, a potentially blocking action	2012-10-14 15:09:37 -04:00
Joey Hess	6ea6a2e71c	better variable name	2012-10-14 14:50:03 -04:00
Joey Hess	5ac15149cc	assistant: Now honors preferred content settings when deciding what to transfer. Both when queueing downloads, and uploads, consults the preferred content settings. I didn't make it check yet when requeing failed transfers or queuing deferred downloads; dealing with the preferred content settings (or indeed, other settings) changing while the assistant is running still needs work.	2012-10-09 12:18:41 -04:00
Joey Hess	18bae020ed	make other repositories list list all autostarted repos And add a form to add another, unrelated repository	2012-09-18 17:50:07 -04:00
Joey Hess	9f05d19108	avoid sending uploads right back to where the download came from Just an optimisation.	2012-09-18 14:24:51 -04:00
Joey Hess	3c22977e44	deferred downloads Now when a download is queued and there's no known remote to get it from, it's added to a deferred download list, which will be retried later. The Merger thread tries to queue any deferred downloads when it receives a push to the git-annex branch. Note that the Merger thread now also forces an update of the git-annex branch. The assistant was not updating this branch before, and it saw a (mostly) correct view of state, but now that incoming pushes go to synced/git-annex, it needs to be merged in.	2012-09-17 21:05:50 -04:00
Joey Hess	df337bb63b	hlint	2012-09-13 00:57:52 -04:00
Joey Hess	8335a7ff7a	remove the TChan component from the data structure The code to maintain that TChan in parallel with the list was buggy, the two were not always the same. And all that TChan was needed for was blocking on the next transfer, which can be accomplished just as well by checking the size and retrying, thanks to STM. Also, this is faster, and uses less memory. Total win.	2012-08-31 12:59:00 -04:00
Joey Hess	8d32d54320	make start button work on queued transfers When multiple downloads of a key are queued, it starts the first, but leaves the other downloads in the queue. This ensures that we don't lose a queued download if the one that got started failed.	2012-08-29 16:30:40 -04:00
Joey Hess	c21a9fe04a	more generic	2012-08-29 15:56:47 -04:00
Joey Hess	0842e99637	forgot to update the tchan when dequeuing transfers	2012-08-29 15:46:42 -04:00
Joey Hess	99525f8454	when canceling a transfer, also cancel all other downloads of the same key	2012-08-29 15:24:09 -04:00
Joey Hess	271ea49978	add support for readonly remotes Currently only the web special remote is readonly, but it'd be possible to also have readonly drives, or other remotes. These are handled in the assistant by only downloading from them, and never trying to upload to them.	2012-08-26 15:39:02 -04:00
Joey Hess	715a9a2f8e	keep logs of failed transfers, and requeue them when doing a non-full scan of a remote	2012-08-23 15:24:15 -04:00
Joey Hess	8ba9830653	implement pausing of transfers A paused transfer's thread keeps running, keeping the slot in use. This is intentional; pausing a transfer should not let other queued transfers to run in its place.	2012-08-10 18:42:44 -04:00
Joey Hess	21bd92f077	send update notificaton when removing a queued transfer	2012-08-10 16:00:24 -04:00
Joey Hess	20203b45b9	transfer canceling Should work (untested) for transfers being run by other processes. Not yet by transfers being run by the assistant. killThread does not kill processes forked off by a thread. To fix this, will probably need to make `git annex getkey` and `git annex sendkey` commands that operate on keys, and write their own transfer info. Then the assistant can run them, and kill them, as needed.	2012-08-08 17:55:56 -04:00
Joey Hess	ebd8362d58	fix bug in transfer initiation checking Putting the transfer on the currentTransfers atomically introduced a bug: It checks to see if the transfer is in progress, and cancels it. Fixed by moving that check inside the STM transaction.	2012-07-29 13:37:26 -04:00

1 2

67 commits