git-annex

Author	SHA1	Message	Date
Joey Hess	a2fc471e14	safer git sha object filename Rather than use the filename provided by INPUT, which could come from user input, and so could be something that looks like a dashed parameter, use a .git/object/<sha> filename. This avoids user input passing through INPUT and back out, with the file path then passed to a command, which could do something unexpected with a dashed parameter, or other special parameter. Added a note in the design about being careful of passing user input to commands. They still have to be careful of that in general, just not in this case.	2025-03-04 14:54:13 -04:00
Joey Hess	52f51d065a	rename config to annex.security.allowed-compute-programs And require for enable as well as autoenable. It seemed asking for trouble for `git-annex enable foo` to use whatever compute program is stored in the git config, without verifying that the user wants that program to be used. Note that it would be good to allow `git-annex enable foo program=...` to be used without the program being in the git config. Not implemented yet though.	2025-03-03 16:12:03 -04:00
Joey Hess	f32d2aecce	autoenable security for compute special remote Added annex.security.autoenable-compute-programs and only allow autoenabling special remotes that use compute programs on that list. The reason this is needed is a user might have some compute programs that are less safe to use than others. They might want to use an unsafe one only with one repository, where they are the only committer or other committers are trusted. They might be ok with others being used by any repository, and if so they can add them to the list. Another reason would be a user who has installed a compute program by accident. Eg, it might be included with git-annex at some point, or pulled in by some dependency. That user doesn't necessarily want that compute program to be used in an autoenabled special remote.	2025-03-03 15:52:56 -04:00
Joey Hess	89bfeada87	recompute: display one of the changed files	2025-03-03 15:12:19 -04:00
Joey Hess	a0d6a6ea2a	support git files as input to computations Using GIT keys, like those used when exporting git files to special remotes. Except here the GIT key refers to a file checked into the git repo. Note that, since the compute remote uses catObject to get the content, a symlink that is checked into git does not get followed. This is important for security, because following a symlink and adding the content to the repo as an annex object would allow exfiltrating content from outside the repository. Instead, the behavior with a symlink is to run the computation on the symlink target. This may turn out to be confusing, and it might be worth addcomputed checking if the file in git is a symlink and erroring out. Or it could follow symlinks as long as the destination is a file in the repository.	2025-03-03 12:09:25 -04:00
Joey Hess	e6ae5e8d56	many recompute improvements I've lost track of them all, but it includes: * Using the same key backend as was used in the original computation. * Fixing bug that prevented updating the source file key in the compute state * Handling --reproducible and --unreproducible. * recompute --original of a file using VURL, when the result is different, but the key remains the same, makes the object file be updated with the new content * Detecting some other ways the program behavior can change, just for completeness. * Also adds --backend to addcomputed.	2025-02-27 15:18:27 -04:00
Joey Hess	9c2c3002a6	fix recompute of renamed files When a computed file has been renamed, a recompute needs to write to the new filename. I decided to remove --others because it's not clear what it should do in the face of renames. Should it update only other files that have not been renamed? Or update files that use the old key to the new key anywhere in the tree? Or write the other files to the cwd, ignoring renames? Since --others is just a way to save on compute time, adding this complexity at this point seems like a bad idea. May revisit later. Added temporary TODO-compute file	2025-02-27 11:27:26 -04:00
Joey Hess	d6a010a615	recompute closer to working properly Proper behavior without --others implemented. And eliminated most of the code duplication through refactoring. Also, changed it to not stage recomputed files. This way, git diff will show files that have differences.	2025-02-26 15:52:52 -04:00
Joey Hess	3bec89a3c3	started git-annex recompute The perform action of this still needs work to do the right thing. In particular, it currently behaves as if --others was always set. And, it duplicates a lot of code from addcomputed.	2025-02-26 11:54:09 -04:00
Joey Hess	eed522a0f8	addcomputed inherits extra initremote parameters This is limited because the remote config is a field/value map. So order is not preserved, and when 2 parameters have the same field name, only the last one will be passed.	2025-02-26 09:45:35 -04:00
Joey Hess	2b8428bb17	wording	2025-02-25 17:26:28 -04:00
Joey Hess	f8c7cea019	update demo program needed a mkdir	2025-02-25 17:23:38 -04:00
Joey Hess	71e92a509a	use compute program REPRODUCIBLE by default	2025-02-25 17:10:41 -04:00
Joey Hess	16f529c05f	addcomputed --fast and --unreproducible working For these, use VURL and URL keys, with an "annex-compute:" URI prefix. These URL keys will look something like this: URL--annex-compute&cbar4,63pconvert,3-f4d3d72cf3f16ac9c3e9a8012bde4462 Generally it's too long so most of it gets md5summed. It's a little ugly, but it's what fell out of the existing URL key generation machinery. I did consider special casing to eg "URL--annex-compute&c4d3d72cf3f16ac9c3e9a8012bde4462". But it seems at least possibly useful that the name of the file that was computed is visible and perhaps one or two words of the git-annex compute command parameters. Note that two different output files from the same computation will get the same URL key. And these keys should remain stable.	2025-02-25 16:43:15 -04:00
Joey Hess	2e1fe1620e	handle computations in subdirs of the git repository Eg, a computation might be run in "foo/" and refer to "../bar" as an input or output. So, the subdir is part of the computation state. Also, prevent input or output of files that are outside the git repository. Of course, the program can access any file on disk if it wants to; this is just a guard against mistakes. And it may also be useful if the program communicates with something less trusted than it, eg a container image, so input/output files communicated by that are not the source of security problems.	2025-02-25 15:08:38 -04:00
Joey Hess	556f44d404	update for new interface	2025-02-24 16:15:04 -04:00
Joey Hess	921850d05c	support addcomputed --fast This complicates the interface but it's still simpler to understand than the old interface.	2025-02-24 13:48:46 -04:00
Joey Hess	490174b068	new compute program interface This is much more flexible, and also simpler to understand.	2025-02-24 12:44:20 -04:00
Joey Hess	b804f8a3cc	update	2025-02-21 15:09:46 -04:00
Joey Hess	e897229088	wip	2025-02-20 17:23:15 -04:00
Joey Hess	4f3d9f8115	update	2025-02-20 13:27:59 -04:00
Joey Hess	c1b53dbbd0	wip	2025-02-20 13:27:47 -04:00
Joey Hess	a2fa2a8c5f	update	2025-02-19 16:03:34 -04:00
Joey Hess	2f11c65491	comments	2025-02-19 15:14:52 -04:00
Joey Hess	b5319ec575	documentation for compute remote and associated commands None of this is implemented yet.	2025-02-19 14:29:18 -04:00
Joey Hess	ace9944d1c	add REPRODUCIBLE	2025-02-19 14:16:36 -04:00
Joey Hess	f52385f63d	optional and required inputs and some other changes	2025-02-19 12:47:32 -04:00
Joey Hess	f4c3fdeaed	improved draft design	2025-02-18 15:46:47 -04:00
Joey Hess	d394f0b020	git-lfs apiurl parameter git-lfs: Added an optional apiurl parameter. This needs version 1.2.5 of the haskell git-lfs library to be used. stack.yaml updated to use that. Note that git-annex enableremote can be used to add apiurl= to an existing git-lfs special remote. To allow unsetting the apiurl and instead use the probed url, support enableremote with apiurl set to an empty string. Sponsored-by: Luke T. Shumaker	2025-02-18 14:11:21 -04:00
sharad	dcf2f71696	Added a comment: Faced same issue for long time	2025-02-17 19:30:28 +00:00
Joey Hess	5324f34092	Merge branch 'ospath'	2025-02-17 11:58:20 -04:00
datamanager	93fb1ba536	Added a comment	2025-02-15 21:46:33 +00:00
puck	f32f22bc64		2025-02-15 10:36:03 +00:00
Joey Hess	e8b00faea8	Merge branch 'master' into ospath	2025-02-14 16:28:43 -04:00
anarcat	438ff929f3	more details on my issues	2025-02-14 17:54:24 +00:00
anarcat	73d4997047	Added a comment: similar topic	2025-02-14 17:51:29 +00:00
anarcat	376d2b25e3	Added a comment: similar topic	2025-02-14 17:47:02 +00:00
Joey Hess	e6e69f8f93	draft	2025-02-13 16:12:07 -04:00
Joey Hess	bf6446528d	comment	2025-02-13 13:51:21 -04:00
Joey Hess	2ff0adba0b	comment	2025-02-13 13:01:15 -04:00
Joey Hess	46d38b002d	remove the git-union-merge command This has never been built and shipped as part of git-annex, and including it as a pedagogical example in the source code doesn't have much benefit. The program was not currently buildable after recent OsPath changes. Of course, Git/UnionMerge.hs is still available and can be used.	2025-02-12 12:37:36 -04:00
Joey Hess	bab26da74b	Merge branch 'master' into ospath	2025-02-11 16:56:17 -04:00
Joey Hess	90eb1e2da6	update todo	2025-02-11 13:01:13 -04:00
thk	e333ef9337		2025-02-08 06:59:34 +00:00
thk	09d47726b7	Added a comment: iroh	2025-02-08 06:56:32 +00:00
Joey Hess	cb2c069ad1	Revert "update" This reverts commit `f5c6dc7cfb`.	2025-02-06 11:42:49 -04:00
Joey Hess	874882efc4	Revert "update" This reverts commit `3464612445`.	2025-02-06 11:41:37 -04:00
Joey Hess	3464612445	update	2025-02-06 11:41:10 -04:00
Joey Hess	f5c6dc7cfb	update	2025-02-06 11:40:03 -04:00
Joey Hess	9394197621	Merge branch 'master' into ospath	2025-02-05 13:31:07 -04:00

35,290 commits