git-annex

Author	SHA1	Message	Date
Joey Hess	ea2876ae77	add PackageImports This makes loading it in ghci work when both crypton and cryptonite are installed.	2023-10-30 14:10:46 -04:00
Joey Hess	0f3b78ec29	simplify	2023-10-26 14:00:02 -04:00
Joey Hess	c873586e14	eliminate s2w8 and w82s Note that the use of s2w8 in genUUIDInNameSpace made it truncate unicode characters. Luckily, genUUIDInNameSpace is only ever used on ASCII strings as far as I can determine. In particular, git-remote-gcrypt's gcrypt-id is an ASCII string.	2023-10-26 13:12:57 -04:00
Joey Hess	3742263c99	simplify base64 to only use ByteString Note the use of fromString and toString from Data.ByteString.UTF8 dated back to commit `9b93278e8a`. Back then it was using the dataenc package for base64, which operated on Word8 and String. But with the switch to sandi, it uses ByteString, and indeed fromB64' and toB64' were already using ByteString without that complication. So I think there is no risk of such an encoding related breakage. I also tested the case that `9b93278e8a` fixed: git-annex metadata -s foo='a …' x git-annex metadata x metadata x foo=a … In Remote.Helper.Encryptable, it was avoiding using Utility.Base64 because of that UTF8 conversion. Since that's no longer done, it can just use it now.	2023-10-26 13:10:05 -04:00
Joey Hess	6a61c7ff45	Fix crash of enableremote when the special remote has embedcreds=yes The crash occurred because writeCreds got called twice, and writeFileProtected neglected to close its file handle, so the file was open for write when written the second time. It seems unncessary and suboptimal that writeCreds gets called twice. One call is from getRemoteCredPair and the other from setRemoteCredPair'. What happens is that in the enableremote case, code that also runs at initremote does unncessary work. Might be possible to improve that, but I've gone for the simple fix. Sponsored-by: k0ld on Patreon	2023-10-20 13:19:12 -04:00
Joey Hess	54da44d42a	Support being built with crypton rather than cryptonite crypton is a fork of cryptonite, and cryptonite's github repo has been archived. Some deps are already using cryptonite so it's clearly the way forward. Added a build flag without a default, so cabal configure will select on its own which to use. stack files pin to cryptonite for now. Sponsored-by: Nicholas Golder-Manning on Patreon	2023-09-21 12:43:42 -04:00
Joey Hess	50300a47fe	Removed the vendored git-lfs and the GitLfs build flag AFAICS all git-annex builds are using the git-lfs library not the vendored copy. Debian stable now includes a new enough haskell-git-lfs package as well. Last time this was tried it did not.	2023-08-28 13:12:31 -04:00
Joey Hess	88b0bb5793	fix build on windows thanks to jkniiv	2023-08-18 13:03:47 -04:00
Joey Hess	10b5f79e2d	fix empty tree import when directory does not exist Fix behavior when importing a tree from a directory remote when the directory does not exist. An empty tree was imported, rather than the import failing. Merging that tree would delete every file in the branch, if those files had been exported to the directory before. The problem was that dirContentsRecursive returned [] when the directory did not exist. Better for it to throw an exception. But in commit `74f0d67aa3` back in 2012, I made it never theow exceptions, because exceptions throw inside unsafeInterleaveIO become untrappable when the list is being traversed. So, changed it to list the contents of the directory before entering unsafeInterleaveIO. So exceptions are thrown for the directory. But still not if it's unable to list the contents of a subdirectory. That's less of a problem, because the subdirectory does exist (or if not, it got removed after being listed, and it's ok to not include it in the list). A subdirectory that has permissions that don't allow listing it will have its contents omitted from the list still. (Might be better to have it return a type that includes indications of errors listing contents of subdirectories?) The rest of the changes are making callers of dirContentsRecursive use emptyWhenDoesNotExist when they relied on the behavior of it not throwing an exception when the directory does not exist. Note that it's possible some callers of dirContentsRecursive that used to ignore permissions problems listing a directory will now start throwing exceptions on them. The fix to the directory special remote consisted of not making its call in listImportableContentsM use emptyWhenDoesNotExist. So it will throw an exception as desired. Sponsored-by: Joshua Antonishen on Patreon	2023-08-15 12:57:41 -04:00
Joey Hess	9aac41f86c	remove unused imports	2023-08-15 12:43:26 -04:00
Joey Hess	be028f10e5	split out Utility.Url.Parse This is mostly for git-repair which can't include all of Utility.Url without adding many dependencies that are not really necessary.	2023-08-14 12:28:10 -04:00
Joey Hess	adda6c1088	Add git-annex remote refs that are not newer to the merged refs list Significant startup speed increase by avoiding repeatedly checking if some remote git-annex branch refs need to be merged when it is not newer. One way this could happen is when there are 2 remotes that are themselves connected. The git-annex branch on the first remote gets updated. Then the second remote pulls from the first, and merges in its git-annex branch. Then the local repo pulls from the second remote, and merges its git-annex branch. At this point, a pull from the first remote will get a git-annex branch that is not newer, but is not on the merged refs list. In my big repo, git-annex startup time dropped from 4 seconds to 0.1 seconds. There were 5 to 10 such remote refs out of 18 remotes. Sponsored-by: Graham Spencer on Patreon	2023-08-09 13:31:36 -04:00
Joey Hess	85aadcfa1e	windows back to lts-18.13 temporarily I can't seem to get stack to resolve dependencies with Win32-2.13.4.0, no matter what I try. Why it blows up, I don't know. And allow-newer: true actually causes it to downgrade Win32 to the one version that won't build. Unbelivable that allows downgrades. So just gonna have to wait for that to get into stackage nightly, and then stack.yaml can be updated to use that, and the changes in this commit reverted.	2023-08-02 12:49:38 -04:00
Joey Hess	461330c585	remove support for building with older Win32 No need to preserve this since the cabal file depends on the newer one.	2023-08-02 11:59:57 -04:00
Joey Hess	9a60f5b65f	fix build on windows	2023-08-02 10:43:20 -04:00
Joey Hess	8adafdd013	avoid cpp failure on windows Seems that while the module is not imported by anything on windows, it still gets cpped, and MIN_VERSION_unix is not defined so it failed to preprocess.	2023-08-02 10:08:00 -04:00
Joey Hess	68c9b08faf	fix build with unix-2.8.0 Changed the parameters to openFd. So needed to add a small wrapper library to keep supporting older versions as well.	2023-08-01 18:41:27 -04:00
Joey Hess	63f76d0ac3	fix build with unix-2.8.0 It made UserInfo into a pattern to discourage manually constructing them, so just to use UserInfo in a type signature of a function that consumes them, have to import the new ByteString module.	2023-08-01 17:47:30 -04:00
Joey Hess	d76f088dc4	fix build on windows	2023-08-01 17:39:24 -04:00
Joey Hess	fb640bc2f4	support building with unix-compat 0.7 It removed System.PosixCompat.User.	2023-08-01 15:17:43 -04:00
Joey Hess	08071a1b90	improve match result display simplifier Sponsored-by: Dartmouth College's DANDI project	2023-07-26 15:28:57 -04:00
Joey Hess	70de4a7e6d	fix bug in match result display simplifier Sponsored-by: Dartmouth College's DANDI project	2023-07-26 15:28:49 -04:00
Joey Hess	518a51a8a0	--explain for preferred/required content matching And annex.largefiles and annex.addunlocked. Also git-annex matchexpression --explain explains why its input expression matches or fails to match. When there is no limit, avoid explaining why the lack of limit matches. This is also done when no preferred content expression is set, although in a few cases it defaults to a non-empty matcher, which will be explained. Sponsored-by: Dartmouth College's DANDI project	2023-07-26 14:50:04 -04:00
Joey Hess	f25eeedeac	initial implementation of --explain Currently it only displays explanations of options like --in and --copies. In the future, it should explain preferred content expression evaluation and other decisions. The explanations of a few things could be better. In particular, "standard" will just appear as-is (or as "!standard" if it doesn't match), rather than explaining why the standard preferred content expression for the group matches or not. Currently as implemented, it goes to stdout, and so commands like git-annex find that have custom output will not display --explain information. Perhaps that should change, dunno. Sponsored-by: Dartmouth College's DANDI project	2023-07-25 16:52:57 -04:00
Joey Hess	cf40e2d4b6	Revert "use existing debug machinery for explain" This reverts commit `409572c9e4`.	2023-07-25 15:53:50 -04:00
Joey Hess	409572c9e4	use existing debug machinery for explain explain is a kind of debug message, but not formatted in the same way. So it makes sense to reuse the debug machinery for it, since that is already quite optimised. Sponsored-by: Dartmouth College's DANDI project	2023-07-25 15:47:58 -04:00
Joey Hess	fbf19338be	remove excess doubled parens in match description Sponsored-by: Dartmouth College's DANDI project	2023-07-25 13:55:01 -04:00
Joey Hess	f280d38045	parenthesize match description as needed to avoid ambiguity While avoiding most unncessary parens. Once case where unncessary parens are not avoided is: not ( ( not foo and baz ) ) It would be good eventually to remove doubled parens like these. Sponsored-by: Dartmouth College's DANDI project	2023-07-25 13:40:23 -04:00
Joey Hess	0f63374be3	accumulate description while matching This is to be used to explain why something did or didn't match. Note that this reimplements match in terms of matchMrun. Implementing match' as a Writer and matchMrun' as a MonadWriter resulted in nearly identical implementations, which collapsed into the same thing thanks to Writer being WriterT Identity. MAnd and MOr implement short circuiting. So an expression like "not (foo and bar)" will be explained as [MatchedNot, MatchOperation "foo"] when foo does not match; whether bar matches is irrelevant. Similarly "foo or bar" will be explained as [MatchedOperation "foo"] when foo matches. It seems like that will keep the explanations more understandable. But also, matchMrun already did short circuiting, and it could be considerably more work to check if bar matches in these cases. Note that the type signature of matchMrun changed, but it was over-generic before. Note that these changes are licensed under the AGPL. Changed module license accordingly. Sponsored-by: Dartmouth College's DANDI project	2023-07-25 12:53:05 -04:00
Joey Hess	7298123520	build git trees using ContentIdentifier to speed up import This gets the trees built, but it does not use them. Next step will be to remember the tree for next time an import is done, and diff between old and new trees to find the files that have changed. Added --missing to the mktree parameters. That only disables a check, so it's ok to do everywhere mktree is used. It probably also speeds up mktree to disable the check. Note that git fsck does not complain about the resulting tree objects that point to shas that are not in the repository. Even with --strict. A quick benchmark, importing 10000 files, this slowed it down from 2:04.06 to 2:04.28. So it will more than pay for itself. Sponsored-by: Luke Shumaker on Patreon	2023-05-31 12:46:54 -04:00
Joey Hess	0da0e2efcc	add git config debugging (and process cwd debugging) Sponsored-by: Dartmouth College's Datalad project	2023-05-15 15:35:29 -04:00
Joey Hess	9812d9aaec	support aeson for Map Make unused --json use it, which is better than the doubly nested lists it was using. Sponsored-By: the NIH-funded NICEMAN (ReproNim TR&D3) project	2023-05-10 13:51:37 -04:00
Joey Hess	57c1b4f5e5	initremote: Avoid creating a remote that is not encrypted when gpg is broken checksize was applied lazily, so the exception didn't happen until the remote was set up. Sponsored-by: k0ld on Patreon	2023-05-01 13:00:05 -04:00
Joey Hess	aff37fc208	avoid annexFileMode special case This makes annexFileMode be just an application of setAnnexPerm', which avoids having 2 functions that do different versions of the same thing. Fixes some buggy behavior for some combinations of core.sharedRepository and umask. Sponsored-by: Jack Hill on Patreon	2023-04-27 15:58:37 -04:00
Joey Hess	2efceba789	fix windows build	2023-04-12 19:33:19 -04:00
Joey Hess	c5bcb55a8b	add ScopedTypeVariables	2023-04-12 19:19:22 -04:00
Joey Hess	6fc999193f	avoid displaying ExitCode exceptions Don't need to be sanitized and displaying them messes up actually exiting with the right exit code! And broke the test suite. Sponsored-by: Brett Eisenberg on Patreon	2023-04-12 17:04:57 -04:00
Joey Hess	2fdb6ca879	remove unused imports	2023-04-12 16:48:18 -04:00
Joey Hess	a576fc3b12	fix mojibake reversion in display of utf8 When displaying a ByteString like "💕", safeOutput operates on individual bytes like "\240\159\146\149" and isControl '\146' = True, so it got truncated to just "\240". So, only treat the low control characters, and DEL, as control characters. Also split Utility.Terminal out of Utility.SafeOutput. The latter needs win32, but Utility.SafeOutput is used by Control.Exception, which is used by Setup. Sponsored-by: Nicholas Golder-Manning on Patreon	2023-04-12 13:53:30 -04:00
Joey Hess	31111d15c8	newline and tab are safe control characters Oops, let's let git-annex display those! Lol	2023-04-11 15:38:47 -04:00
Joey Hess	afa5b883dc	find, findkeys, examinekey: escape output to terminal when --format is not used Note that filenames are not quoted, only escaped. This is to match the output of --format with escaping. Sponsored-by: Lawrence Brogan on Patreon	2023-04-11 15:27:07 -04:00
Joey Hess	df6f9f1ee8	filter out control characters and quote filenames Searched for uses of putStr and hPutStr and changed appropriate ones to filter out control characters and quote filenames. This notably does not make find and findkeys quote filenames in their default output. Because they should only do that when stdout is non a pipe. A few commands like calckey and lookupkey seem too low-level to make sense to filter output, so skipped those. Also when relaying output from other commands that is not progress output, have git-annex filter out control characters. Sponsored-by: k0ld on Patreon	2023-04-11 14:27:22 -04:00
Joey Hess	cd544e548b	filter out control characters in error messages giveup changed to filter out control characters. (It is too low level to make it use StringContainingQuotedPath.) error still does not, but it should only be used for internal errors, where the message is not attacker-controlled. Changed a lot of existing error to giveup when it is not strictly an internal error. Of course, other exceptions can still be thrown, either by code in git-annex, or a library, that include some attacker-controlled value. This does not guard against those. Sponsored-by: Noam Kremen on Patreon	2023-04-10 13:50:51 -04:00
Joey Hess	81bc57322f	clean up	2023-04-07 17:20:58 -04:00
Joey Hess	d9b6be7782	convert encode_c to ByteString This turns out to be possible after all, because the old one decomposed a unicode Char to multiple Word8s and encoded those. It should be faster in some places, particularly in Git.Filename.encodeAlways. The old version encoded all unicode by default as well as ascii control characters and also '"'. The new one only encodes ascii control characters by default. That old behavior was visible in Utility.Format.format, which did escape '"' when used in eg git-annex find --format='${escaped_file}\n' So made sure to keep that working the same. Although the man page only says it will escape "unusual" characters, so it might be able to be changed. Git.Filename.encodeAlways also needs to escape '"' ; that was the original reason that was escaped. Types.Transferrer I judge is ok to not escape '"', because the escaped value is sent in a line-based protocol, which is decoded at the other end by decode_c. So old git-annex and new will be fine whether that is escaped or not, the result will be the same. Note that when asked to escape a double quote, it is escaped to \" rather than to \042. That's the same behavior as git has. It's perhaps somehow more of a special case than it needs to be. Sponsored-by: k0ld on Patreon	2023-04-07 17:10:49 -04:00
Joey Hess	371d4f8183	decode_c converted to ByteString This speeds up a few things, notably CmdLine.Seek using Git.Filename which uses decode_c and this avoids a conversion to String and back, and probably the ByteString implementation of decode_c is also faster for simple cases at least than the string version. encode_c cannot be converted to ByteString (or if it did, it would have to convert right back to String in order to handle unicode). Sponsored-by: Brock Spratlen on Patreon	2023-04-07 14:44:19 -04:00
Joey Hess	d2d7d766d8	fix comment	2023-03-28 12:40:08 -04:00
Joey Hess	cd076cd085	Windows: Support urls like "file:///c:/path" That is a legal url, but parseUrl parses it to "/c:/path" which is not a valid path on Windows. So as a workaround, use parseURIPortable everywhere, which removes the leading slash when run on windows. Note that if an url is parsed like this and then serialized back to a string, it will be different from the input. Which could potentially be a problem, but is probably not in practice. An alternative way to do it would be to have an uriPathPortable that fixes up the path after parsing. But it would be harder to make sure that is used everywhere, since uriPath is also used when constructing an URI. It's also worth noting that System.FilePath.normalize "/c:/path" yields "c:/path". The reason I didn't use it is that it also may change "/" to "\" in the path and I wanted to keep the url changes minimal. Also noticed that convertToWindowsNativeNamespace handles "/c:/path" the same as "c:/path". Sponsored-By: the NIH-funded NICEMAN (ReproNim TR&D3) project	2023-03-27 13:38:02 -04:00
Joey Hess	79d00a3b9f	fix build warning on windows	2023-03-27 12:17:55 -04:00
Joey Hess	2b5fa091e2	annex.maxextensionlength for view view: Support annex.maxextensionlength when generating filenames for the view branch. Note that refining an existing view will reuse the extension length that was configured when initially constructing the view. This is necessarily the case because it reuses the filenames. Also view files used to have all extensions at the end, no matter how many there were. Since annex.maxextensionlength's documentation includes that it's limited to 2 extensions, I made it consistent with that. Sponsored-by: k0ld on Patreon	2023-03-24 14:01:38 -04:00

1 2 3 4 5 ...

1660 commits