git-annex

Author	SHA1	Message	Date
Joey Hess	c1b4d76e6b	make MatchFiles introspectable matchNeedsFileContent is not used yet, but shows how to add information about terminals. That one would be needed for https://git-annex.branchable.com/todo/sync_fast_import/ Note the tricky bit in Annex.FileMatcher.call where it folds over the included matcher to propagate the information. This commit was sponsored by Svenne Krap on Patreon.	2020-09-24 14:01:53 -04:00
Joey Hess	5cfcf1f05f	cache remote.log Unlikely to speed up any of the existing uses much, but I want to use it in a message that might be displayed many times.	2020-09-22 13:52:26 -04:00
Joey Hess	d0b06c17c0	Added --no-check-gitignore option for finer grained control than using --force. add, addurl, importfeed, import: Added --no-check-gitignore option for finer grained control than using --force. (--force is used for too many different things, and at least one of these also uses it for something else. I would like to reduce --force's footprint until it only forces drops or a few other data losses. For now, --force still disables checking ignores too.) addunused: Don't check .gitignores when adding files. This is a behavior change, but I justify it by analogy with git add of a gitignored file adding it, asking to add all unused files back should add them all back, not skip some. The old behavior was surprising. In Command.Lock and Command.ReKey, CheckGitIgnore False does not change behavior, it only makes explicit what is done. Since these commands are run on annexed files, the file is already checked into git, so git add won't check ignores.	2020-09-18 13:19:13 -04:00
Joey Hess	922621301a	Serialize use of C magic library, which is not thread safe. This fixes failures uploading to S3 when using -J. This commit was sponsored by Denis Dzyubenko on Patreon.	2020-09-17 17:27:42 -04:00
Joey Hess	77c42782d0	differentiate between concurrency enabled at command line and by git config The latter should not affect --batch mode.	2020-09-16 11:47:12 -04:00
Joey Hess	3a05d53761	add SeekInput (not yet used) No behavior changes (hopefully), just adding SeekInput and plumbing it through to the JSON display code for later use. Over the course of 2 grueling days. withFilesNotInGit reimplemented in terms of seekHelper should be the only possible behavior change. It seems to test as behaving the same. Note that seekHelper dummies up the SeekInput in the case where segmentPaths' gives up on sorting the expanded paths because there are too many input paths. When SeekInput later gets exposed as a json field, that will result in it being a little bit wrong in the case where 100 or more paths are passed to a git-annex command. I think this is a subtle enough problem to not matter. If it does turn out to be a problem, fixing it would require splitting up the input parameters into groups of < 100, which would make git ls-files run perhaps more than is necessary. May want to revisit this, because that fix seems fairly low-impact.	2020-09-15 15:41:13 -04:00
Joey Hess	62372ee052	resolvemerge: Improve cleanup of cruft left in the working tree by a conflicted merge This commit was sponsored by Jake Vosloo on Patreon.	2020-09-07 16:50:27 -04:00
Joey Hess	0e21a3221e	clean up old code withworktree is no longer doing anything useful so remove it	2020-09-07 16:16:15 -04:00
Joey Hess	03dee56546	revert change that broke test suite Opened a new bug about it. This commit was sponsored by Ethan Aubin.	2020-09-07 15:42:38 -04:00
Joey Hess	d120c73302	sync, assistant: When merge.directoryRenames is not set, default it it to "false" Works better with automatic merge conflict resolution than git's ususual default of "conflict". This is not done when automatic merge conflict resolution is disabled. This commit was sponsored by Mark Reidenbach on Patreon.	2020-09-07 13:50:58 -04:00
Joey Hess	f4c4b89aa3	refactor Make all calls to git merge go through autoMergeFrom, in preparation for fine-tuning git merge's config for automatic merge conflict resolution. This commit was sponsored by Ryan Newton on Patreon.	2020-09-07 13:26:16 -04:00
Joey Hess	69053a93a2	resolvemerge: Improve cleanup of files that were deleted by one side of a conflicted merge, and modified by the other side This case was handled by cleanConflictCruft, but only when the annexed file's object was present. When not present, it left the annexed file with the original name, not checked into git, while adding the variant file. So, add an explicit deletion of the deleted file in this case. My specific case where this happened actually involves merge.directoryRenames=conflict. After a merge involving that, the situation was the file appears as "added by them", because that caused the file that they added to be moved into a directory we renamed. That case is the same as them adding a modified version of the file, while we deleted it. (Except for the history of the file, since it's a new file, but this doesn't look at history.) This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2020-09-07 12:25:57 -04:00
Joey Hess	a360437215	make automerge behavior when one side deleted explict This does not actually change how the merge conflict is resolved when one side deleted the file, but it was not documented before, and I think it only worked by accident. This commit was sponsored by Brett Eisenberg on Patreon.	2020-09-07 12:01:03 -04:00
Joey Hess	e36bae74da	Exposed annex.forward-retry git config One reason is, 5 is an arbitrary number so ought to be configurable. The real reason though, is I wanted to make the man page explain when forward retry can override annex.retry, and having a config made the man page easier to write.	2020-09-04 15:16:40 -04:00
Joey Hess	2bb933eb60	import: Retry downloads that fail Also, using the transfer machinery for this makes eg, git-annex info show in-progress imports, and makes --notify-start/finish work.	2020-09-04 13:54:05 -04:00
Joey Hess	1a42b2c5a3	combine retry deciders in better way This fixes the problem that, if forwardRetry was checked for the first 5 and decided to retry, the 6th would go to configuredRetry which would see the counter was 6 and so wait retry-delay*2^5 seconds (default 32). Now, it waits for retry-delay before each retry, even when forwardRetry initiated the retry.	2020-09-04 12:48:30 -04:00
Joey Hess	1d244bafbd	Limit retrying of failed transfers when forward progress is being made to 5 To avoid some unusual edge cases where too much retrying could result in far more data transfer than makes sense.	2020-09-04 12:46:37 -04:00
Joey Hess	eed20fe3b7	fix some file modes in calls to withTmpFileIn to honor umask Also audited for other calls to openTempFile, and all are ok, except for viaTmp which will need further work. Remote.Directory fixed to set umask mode when writing to an export, although it has another one using viaTmp that's not fixed. Will make exports that are published via a http server running as another user work, for example. Remote.BitTorrent fixed to set umask mode when downloading the torrent file. Normally this does not matter as that file does not hang around after the download, but if a bittorrent download were started by one user, got interrupted and then another user ran it, this will let them access the torrent file created by the first user.	2020-09-02 14:36:08 -04:00
Joey Hess	00937c4813	when downloading same content from multiple urls, only display error if all fail	2020-09-02 11:35:07 -04:00
Joey Hess	571ec900ac	Added http special remote, which is useful for accessing other remotes that publish content stored in them via http/https. With automatic layout learning!	2020-09-01 15:16:35 -04:00
Joey Hess	f95664305b	remove unused imports	2020-08-28 11:16:51 -04:00
Joey Hess	b68f214312	Display a message when git-annex has to wait for a pid lock file held by another process	2020-08-26 13:05:34 -04:00
Joey Hess	b24ba92231	refactor out Annex.PidLock	2020-08-26 12:29:13 -04:00
Joey Hess	7bdb0cdc0d	add gitAnnexChildProcess and use instead of incorrect use of runsGitAnnexChildProcess Fixes reversion in 8.20200617 that made annex.pidlock being enabled result in some commands stalling, particularly those needing to autoinit. Renamed runsGitAnnexChildProcess to make clearer where it should be used. Arguably, it would be better to have a way to make any process git-annex runs have the env var set. But then it would need to take the pid lock when running any and all processes, and that would be a problem when git-annex runs two processes concurrently. So, I'm left doing it ad-hoc in places where git-annex really does run a child process, directly or indirectly via a particular git command.	2020-08-25 14:57:49 -04:00
Joey Hess	2b6fc17f70	fix comment format	2020-08-25 13:40:52 -04:00
Joey Hess	283d2f85d1	importfeed: Fix reversion that caused some '.' in filenames to be replaced with '_' sanitizeFilePath was changed to sanitize leading '.', but ImportFeed was running it on parts of the template. So eg the leading '.' in the extension got sanitized. Note the added case for sanitizeLeadingFilePathCharacter ('/':_) -- this was added because, if the template is title/episode and the title is not set, it would expand to "/episode". So this is another potential security fix.	2020-08-05 11:35:00 -04:00
Joey Hess	f75be32166	external backends wip It's able to start them up, the only thing not implemented is generating and verifying keys. And, the key translation for HasExt.	2020-07-29 15:23:18 -04:00
Joey Hess	555fe669e1	refactoring in preparation for external backends	2020-07-29 12:00:27 -04:00
Joey Hess	f5e65d680b	add back inAnnex check for drop here Needed again after last commit removed it from startLocal again.	2020-07-25 18:17:33 -04:00
Joey Hess	2a45b5ae9a	avoid failure to lock content of removed file causing drop etc to fail This was already prevented in other ways, but as seen in commit `c30fd24d91`, those were a bit fragile. And I'm not sure races were avoided in every case before. At least a race between two separate git-annex processes, dropping the same content, seemed possible. This way, if locking fails, and the content is not present, it will always do the right thing. Also, it avoids the overhead of an unncessary inAnnex check for every file. This commit was sponsored by Denis Dzyubenko on Patreon.	2020-07-25 11:59:33 -04:00
Joey Hess	c30fd24d91	add back inAnnex check after seeking The test suite noticed this case, where two files with the same key are dropped, and the seek stage sees both have content due to the way files stream through it. But then locking the content to drop fails on the second file, because the first file has already been dropped. So, add back otherwise redundant inAnnex check.	2020-07-25 11:18:50 -04:00
Joey Hess	18f1fb5841	drop performance improvements Sped up seeking files to drop by 2x, and also some performance improvements to checking numcopies. Interestingly, the seek speedup is not due to precaching, but I think is due to calling getParsed earlier. Annex.Drop had to be changed to check inAnnex there, since it was removed from Command.Drop. All other users of Command.Drop already checked inAnnex themselves. This commit was sponsored by Ryan Newton on Patreon.	2020-07-24 13:27:46 -04:00
Joey Hess	c4cc2cdf4c	rename getKey to genKey for consistency with external backend protocol	2020-07-20 14:06:05 -04:00
Joey Hess	172743728e	move cryptographicallySecure into Backend type This is groundwork for external backends, but also makes sense to keep this information with the rest of a Backend's implementation. Also, removed isVerifiable. I noticed that the same information is encoded by whether a Backend implements verifyKeyContent or not.	2020-07-20 12:17:42 -04:00
Joey Hess	2634a5ed99	avoid inflating error counter when forking and merging annex state	2020-07-19 18:31:25 -04:00
Joey Hess	7a42a47902	renaming	2020-07-10 14:17:35 -04:00
Joey Hess	9f6bd6cc05	add inRepoDetails planned to use for an optimisation most things using stagedDetails were not expecting to get dup files in a conflicted merge and deal with them, so converted them to use inRepoDetails.	2020-07-08 15:36:35 -04:00
Joey Hess	7347e50123	add stage number to stagedDetails parser And convert parser to attoparsec, probably faster. Before, a parse failure threw the whole --stage output line in to the filename, which was certianly a bad idea, so fixed that.	2020-07-08 15:05:12 -04:00
Joey Hess	9483b10469	cache one more log file for metadata My worry was that a preferred content expression that matches on metadata would have removed the location log from cache, causing an expensive re-read when a Seek action later checked the location log. Especially when the --all optimisation in the previous commit pre-cached the location log. This also means that the --all optimisation could cache the metadata log too, if it wanted too, but not currently done. The cache is a list, with the most recently accessed file first. That optimises it for the common case of reading the same file twice, eg a get, examine, followed by set reads it twice. And sync --content reads the location log 3 times in a row commonly. But, as a list, it should not be made to be too long. I thought about expanding it to 5 items, but that seemed unlikely to be a win commonly enough to outweigh the extra time spent checking the cache. Clearly there could be some further benchmarking and tuning here.	2020-07-07 14:18:55 -04:00
Joey Hess	e72ec8b9b2	add back git-annex branch read cache The cache was removed way back in 2012, commit `3417c55189` Then I forgot I had removed it! I remember clearly multiple times when I thought, "this reads the same data twice, but the cache will avoid that being very expensive". The reason it was removed was it messed up the assistant noticing when other processes made changes. That same kind of problem has recently been addressed when adding the optimisation to avoid reading the journal unnecessarily. Indeed, enableInteractiveJournalAccess is run in just the right places, so can just piggyback on it to know when it's not safe to use the cache.	2020-07-06 12:22:33 -04:00
Joey Hess	57cceac569	simplify interface by removing size Add size to the returned key after the fact, unless the remote happened to add it itself.	2020-07-03 14:22:22 -04:00
Joey Hess	85506a7015	import: Added --no-content option, which avoids downloading files from a special remote Only supported by some special remotes: directory I need to check the rest and they're currently missing methods until I do. git-annex sync --no-content does not yet use this to do imports	2020-07-03 13:41:57 -04:00
Joey Hess	b2f4b84d27	clean up some build warnings on windows	2020-07-02 11:34:18 -04:00
Joey Hess	087b7ee66a	Revert "data type that starts off using a set but converts to a bloom filter when large" This reverts commit `7e2c4ed216`. I was not able to use this in the end.. See comment in the previous commit.	2020-07-01 20:12:19 -04:00
Joey Hess	a09937580e	more windows build fixes	2020-07-01 15:22:56 -04:00
Joey Hess	7e2c4ed216	data type that starts off using a set but converts to a bloom filter when large This adds a dep on hashable, but it's a free dependency, since unordered-containers already pulled it in. Using unordered-containers for the set seems to make sense, since it hashes and bloom filter hashes too. (Though different hashes.) I dunno, never quite know if I should use unordered-containers or containers.	2020-07-01 14:06:12 -04:00
Joey Hess	d3d187c869	fix build on windows Annex.GitOverlay was using a module that needs posix to build.	2020-07-01 11:22:15 -04:00
Joey Hess	a59e95a82d	improve "unable to lock down 1 copy" message This is a fairly hard to understand situation for the user. Listing the remotes should help them understand it a bit better. This commit was sponsored by Ethan Aubin.	2020-06-26 13:00:40 -04:00
Joey Hess	b651d3ede0	test: Fix some test cases that assumed git's default branch name git is making that configurable, and configuring it globally would break the test suite in a few places. No other part of git-annex assumes any branch name. Renamed a few placeholders to make that clearer. This commit was sponsored by Jake Vosloo on Patreon.	2020-06-23 16:40:51 -04:00
Joey Hess	7757c0e900	Honor annex.largefiles when importing a tree from a special remote. This commit was sponsored by Martin D on Patreon.	2020-06-23 16:07:18 -04:00

1 2 3 4 5 ...

1498 commits