git-annex

Author	SHA1	Message	Date
Joey Hess	afa5b883dc	find, findkeys, examinekey: escape output to terminal when --format is not used Note that filenames are not quoted, only escaped. This is to match the output of --format with escaping. Sponsored-by: Lawrence Brogan on Patreon	2023-04-11 15:27:07 -04:00
Joey Hess	81bc57322f	clean up	2023-04-07 17:20:58 -04:00
Joey Hess	d9b6be7782	convert encode_c to ByteString This turns out to be possible after all, because the old one decomposed a unicode Char to multiple Word8s and encoded those. It should be faster in some places, particularly in Git.Filename.encodeAlways. The old version encoded all unicode by default as well as ascii control characters and also '"'. The new one only encodes ascii control characters by default. That old behavior was visible in Utility.Format.format, which did escape '"' when used in eg git-annex find --format='${escaped_file}\n' So made sure to keep that working the same. Although the man page only says it will escape "unusual" characters, so it might be able to be changed. Git.Filename.encodeAlways also needs to escape '"' ; that was the original reason that was escaped. Types.Transferrer I judge is ok to not escape '"', because the escaped value is sent in a line-based protocol, which is decoded at the other end by decode_c. So old git-annex and new will be fine whether that is escaped or not, the result will be the same. Note that when asked to escape a double quote, it is escaped to \" rather than to \042. That's the same behavior as git has. It's perhaps somehow more of a special case than it needs to be. Sponsored-by: k0ld on Patreon	2023-04-07 17:10:49 -04:00
Joey Hess	371d4f8183	decode_c converted to ByteString This speeds up a few things, notably CmdLine.Seek using Git.Filename which uses decode_c and this avoids a conversion to String and back, and probably the ByteString implementation of decode_c is also faster for simple cases at least than the string version. encode_c cannot be converted to ByteString (or if it did, it would have to convert right back to String in order to handle unicode). Sponsored-by: Brock Spratlen on Patreon	2023-04-07 14:44:19 -04:00
Joey Hess	447d798987	export encode_c'	2020-12-09 15:28:45 -04:00
Joey Hess	30ac015b79	add a formatContainsVar function Also, the format function gets faster because it checks for "escaped_" at gen time instead of every time format is called.	2020-05-19 15:35:00 -04:00
Joey Hess	e006acc8e3	fix quickcheck failure prop_encode_decode_roundtrip failed on "\175" in C locale. This may be a new problem after the switch to RawFilePath, but it already had filtering for high chars, so changed to only test ascii chars.	2019-12-30 13:54:46 -04:00
Joey Hess	da8e84efe9	fix failing quickcheck properties QuickCheck 2.10 found a counterexample eg "\929184" broke the property. As far as I can tell, Git.Filename is matching how git handles encoding of strange high unicode characters in filenames for display. Git does not display high unicode characters, and instead displays the C-style escaped form of each byte. This is ambiguous, but since git is not unicode aware, it doesn't need to roundtrip parse it. So, making Git.FileName's roundtrip test only chars < 256 seems fine. Utility.Format.format uses encode_c, in order to mimic git, so that's ok. Utility.Format.gen uses decode_c, but only so that stuff like "\n" in the format string is handled. If the format string contains C-style octal escapes, they will be converted to ascii characters, and not combined into unicode characters, but that should not be a problem. If the user wants unicode characters, they can include them in the format string, without escaping them. Finally, decode_c is used by Utility.Gpg.secretKeys, because gpg --with-colons hex-escapes some characters in particular ':' and '\\'. gpg passes unicode through, so this use of decode_c is not a problem. This commit was sponsored by Henrik Riomar on Patreon.	2017-06-17 16:48:00 -04:00
Joey Hess	613d6056f5	better types	2016-02-14 16:26:39 -04:00
Joey Hess	b0626230b7	fix use of hifalutin terminology	2015-11-16 14:37:31 -04:00
Joey Hess	afc5153157	update my email address and homepage url	2015-01-21 12:50:09 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	2427832bed	relicense general utility library code to BSD Omitted a couple of files what have had significant contributions from others.	2014-05-10 11:01:27 -03:00
Joey Hess	e4290c61d7	gpg secret keys list parsing Note that Utility.Format.prop_idempotent_deencode does not hold now that hex escaped characters are supported. quickcheck fails to notice this, so I have left it as-is for now.	2013-09-16 12:57:39 -04:00
Joey Hess	f87a781aa6	finished where indentation changes	2012-12-13 00:24:19 -04:00
Joey Hess	a1e52f0ce5	hlint	2012-02-16 00:44:51 -04:00
Joey Hess	ba6088b249	rename readMaybe to readish a stricter (but also partial) readMaybe is getting added to base	2012-01-23 17:00:10 -04:00
Joey Hess	183bdacca2	treak	2012-01-21 02:49:32 -04:00
Joey Hess	f015ef5fde	cleanup	2011-12-23 01:08:19 -04:00
Joey Hess	7227dd8f21	add escape_var hack Makes it easy to find files with duplicate contents, anyway.. :)	2011-12-23 01:08:19 -04:00
Joey Hess	db964e358f	reorg	2011-12-23 01:08:18 -04:00
Joey Hess	cba3ce08df	handle C-style escapes in Format I was happily able to repurpose some code from Git.Filename to handle this. I remember writing that code... a whole afternoon at a coffee shop, after which I felt I'd struggled with Haskell and git, and sorta lost, in needing to write this nasty peice of code. But was also pleased at the use of a pair of functions and quickcheck that allowed me to get it 100% right. So, turns out I not only got it right, but the code wasn't as special-purpose as I'd feared. Yay!	2011-12-23 01:05:16 -04:00
Joey Hess	a0872a8ec3	better data type	2011-12-22 19:56:31 -04:00
Joey Hess	06bafae9e0	Format strings can be specified using the new --find option, to control what is output by git annex find.	2011-12-22 18:31:44 -04:00
Joey Hess	cf496f09ab	add a text formatter This is built for speed; a format string is parsed once, generating a Format, that can be applied repeatedly to different sets of variables to generate output.	2011-12-22 17:59:14 -04:00

25 commits