git-annex

Author	SHA1	Message	Date
Joey Hess	3af29b3ba9	When annex.thin is set, allow hard links to be made between executable work tree files and annex objects. This is safe, because while the annex object ends up executable, there were already at least two other cases where it ended up executable: 1. git add an an executable file 2. chmod +x of a a non-executable worktree file that was hard linked to the annex object After copy/hard link, it always fixes up the permissions to match the mode of the worktree file, so when an executable annex object gets hard linked to a non-executable worktree file, its execute bit gets removed. Commit `b7c8bf5274` already said it would do this; I suspect the line of code I've removed was included in that commit accidentially. Also improves annex.thin documentation. This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2018-10-26 13:51:43 -04:00
Joey Hess	9f87133bf5	snap --version= to auto-upgrade This makes --version=6 still work, despite v6 not being in supportedVersions. Which is useful for scripts that use it. I didn't document it on the man page, because it's indistinguishable from an automatic upgrade after initting as v6.	2018-10-26 11:44:05 -04:00
Joey Hess	d59995b9ee	default to v7 adjusted unlocked in crippled filesystem init: When in a crippled filesystem, initialize a v7 repository using an adjusted unlocked branch, instead of a direct mode repository. Direct mode is deprecated, so this makes sense to do already I hope. This commit was sponsored by Ole-Morten Duesund on Patreon.	2018-10-25 18:49:57 -04:00
Joey Hess	b996b38b4f	fix autoupgrade from v6 to go to v7, not v5 v3 and v4 still autoupgrade to v5 And a few more upgrade doc updates.	2018-10-25 18:40:04 -04:00
Joey Hess	234842a347	v7 Install new git hooks in this version. This does beg the question of what to do if git later gets eg a post-smudge hook, that could run git-annex smudge --update. I think the thing to do in that case would be to make git-annex smudge --update install the new hooks. That way, as the user uses git-annex, the hook would be created pretty quickly and without needing any extra syscalls except for when git-annex smudge --update is called. I considered doing something like that for installation of the post-checkout and post-merge hooks, which would have avoided the need for v7. But the only place it was cheap to do it would be in git-annex smudge which could cheaply notice that smudge.log didn't exist yet and so know the hooks needed to be installed. But since smudge used to populate pointer files, it would be quite surprising if a single git checkout/merge failed to update the work tree, and so that idea didn't work out. The other reason for v7 is psychological -- users don't need to worry about whether they might be running an old version of git-annex that doesn't support their v7 repository very well. And bug reports about "v6" have gotten a bit of a bad association in my head since they often hit one of the known limitations and didn't realize it was experimental. newtyped RepoVersion Int to avoid needing 2 comparisons in versionSupportsUnlockedPointers etc. Also it's just nicer. This commit was sponsored by John Pellman on Patreon.	2018-10-25 18:24:23 -04:00
Joey Hess	ca7de61454	git post-checkout and post-merge hooks * init, upgrade: Install git post-checkout and post-merge hooks that run git annex smudge --update. * precommit: Run git annex smudge --update, because the post-merge hook is not run when there is a merge conflict. So the work tree will be updated when a commit is made to resolve the merge conflict. * precommit: Run git annex smudge --update, because the post-merge hook is not run when there is a merge conflict. So the work tree will be updated when a commit is made to resolve the merge conflict. * Note that git has no hooks run after git stash or git cherry-pick, so the user will have to manually run git annex smudge --update after such commands. Nothing currently installs the hooks into v6 repos that already exist. Something will need to be done about that, either move this behavior to v7, or document that the user will need to manually fix up their v6 repos. This commit was sponsored by Eric Drechsel on Patreon.	2018-10-25 15:59:51 -04:00
Joey Hess	917a2c6095	defer updating unlocked files until after smudge filter The smuge filter no longer provides git with annexed file content, to avoid a git memory leak, and because that did not honor annex.thin. git annex smudge --update has to be run after a checkout to update unlocked files in the working tree with annexed file contents. No hooks yet to run it. This commit was sponsored by Nick Piper on Patreon.	2018-10-25 15:08:20 -04:00
Joey Hess	c24e255de1	Fix concurrency bug that occurred on the first download from an exporttree remote Block other threads while the export database is being constructed (or updated) by the first thread to try to access it. This work is supported by the NIH-funded NICEMAN (ReproNim TR&D3) project.	2018-10-22 12:59:10 -04:00
Joey Hess	d0b0589146	link to tip	2018-10-20 14:25:03 -04:00
Joey Hess	4a6ebb1034	make sync update adjusted branch to hide/unhide This completes initial support for --hide-missing, although the assistant still needs to be updated and it perhaps needs to be sped up, and maybe there needs to be a way for git-annex get to operate on missing files. Opened some more todos for those things. This commit was sponsored by Henrik Riomar.	2018-10-20 14:22:28 -04:00
Joey Hess	4a788fbb3b	sync --content now supports --hide-missing adjusted branches This relies on git ls-files --with-tree, which I'm using in a way that its man page does not document. Hm. I emailed the git list to try to get the docs improved, but at least the git test suite does test the same kind of use case I'm using here. Performance impact when not in an adjusted branch is limited to some additional MVar accesses, and a single git call to determine the name of the current branch. So very minimal. When in an adjusted branch, the performance impact is in Annex.WorkTree.lookupFile, which starts doing an equal amount of work for files that didn't exist as it already did for files that were unlocked. This commit was sponsored by Jochen Bartl on Patreon.	2018-10-19 17:51:25 -04:00
Joey Hess	24838547e2	adjust --hide-missing * At long last there's a way to hide annexed files whose content is missing from the working tree: git-annex adjust --hide-missing * When already in an adjusted branch, running git-annex adjust again will update the branch as needed. This is mostly useful with --hide-missing to hide/unhide files after their content has been dropped or received. Still needs integration with sync and the assistant, and not as fast as it could be, but already usable. This commit was sponsored by Ethan Aubin.	2018-10-18 15:32:42 -04:00
Joey Hess	b2bafdb2fc	v6: Fix database inconsistency That could cause git-annex to get confused about whether a locked file's content was present, when the object file got touched. Unfortunately this means more work sometimes when annex.thin is set, since it has to checksum the file to tell if it's still got the right content. Had to suppress output when inAnnex calls isUnmodified, otherwise "(checksum...)" would be printed in places it ought not to be, eg "git annex get" could turn out not need to get anything, and so only display that. This commit was sponsored by Ole-Morten Duesund on Patreon.	2018-10-16 13:51:37 -04:00
Joey Hess	42842ea0ea	runshell: Use system locales when built with GIT_ANNEX_PACKAGE_INSTALL set This is to work around https://github.com/datalad/datalad/issues/2769 which I don't know how to reproduce outside that environment, nor do I understand the root cause of. For some time, Neurodebian has been working around it by building its standalone debs with a patch that disables use of the locales bundled with the standalone build, letting the system locales be used. Using the system locales is asking for trouble if there's significant version skew between the system and bundled glibc, and possibly also if the architeciture is different, or whatever. That's why git-annex bundles and uses its own locales, because numerous users reported real problems with using the system locales. ... However, in the specific case of the Neurodebian standalone debs, the deb is built on a system very like the one it's targeted to be installed on. Or well, so they assure me, although doc/install/Ubuntu.mdwn also promotes those for use across all versions of Ubuntu, and the deb is built avoiding xz so it will work with old versions of dpkg, so I wonder how true it is. It does seem that, at least currently, there is no bad version skew in the locales of the systems the deb is used on, since it's already been using the system locales for some time. Anyway, since the Neurodebian build already is setting GIT_ANNEX_PACKAGE_INSTALL=1 in runshell, I made runshell use system locales when that's set. This is a small scope creep for GIT_ANNEX_PACKAGE_INSTALL, but it's not documented and AFAIK only used for the Neurodebian build, so that seems ok. This will let them stop carrying their patch for this forward. This work is supported by the NIH-funded NICEMAN (ReproNim TR&D3) project.	2018-10-13 15:04:10 -04:00
Joey Hess	d14983ee68	webapp: fix termux detection The bundled uname -o says Linux in termux; have runshell on Android delete it so the termux one is used instead. This fixes the webapp so it will enter Android mode. This commit was sponsored by mo on Patreon.	2018-10-13 12:08:27 -04:00
Joey Hess	38d691a10f	removed the old Android app Running git-annex linux builds in termux seems to work well enough that the only reason to keep the Android app would be to support Android 4-5, which the old Android app supported, and which I don't know if the termux method works on (although I see no reason why it would not). According to [1], Android 4-5 remains on around 29% of devices, down from 51% one year ago. [1] https://www.statista.com/statistics/271774/share-of-android-platforms-on-mobile-devices-with-android-os/ This is a rather large commit, but mostly very straightfoward removal of android ifdefs and patches and associated cruft. Also, removed support for building with very old ghc < 8.0.1, and with yesod < 1.4.3, and without concurrent-output, which were only being used by the cross build. Some documentation specific to the Android app (screenshots etc) needs to be updated still. This commit was sponsored by Brett Eisenberg on Patreon.	2018-10-13 01:41:11 -04:00
Joey Hess	426f0f3f4b	releasing package git-annex version 6.20181011	2018-10-11 13:50:53 -04:00
Joey Hess	0240775f32	adding arm64 build, and improved termux installation process * Added arm64 Linux standalone build. (No autobuilder yet.) * Improved termux installation process. Added git-annex-install.sh script to avoid user needing to type as much in termux. The scope of this script is limited; runshell handles the rest. Runshell runs termux-fix-shebang on the shell scripts. The problem is the bundled bin/sh script, deleting that script also works, but then the others probably use the system Android /bin/sh, which could be old or broken or not posix or whatever. Using termux sh to run the scripts is better. This commit was sponsored by Eric Drechsel on Patreon.	2018-10-11 13:32:00 -04:00
Joey Hess	a97ef366fa	Linux standalone: Avoid using bundled cp before envionment is fully set up. On android arm64, I saw the cp fail with "Bad system call", because proot has not run yet. runshell only recently started using cp, and it's bundled with git-annex, so this fixes a reversion. This commit was sponsored by Nick Piper on Patreon.	2018-10-10 16:02:25 -04:00
Joey Hess	6f0d8870df	Fix crash when exporttree is set to a bad value. Made it impossible to recover from setting a bad value since enableremote to change it would crash. This commit was sponsored by Henrik Riomar on Patreon.	2018-10-10 10:44:54 -04:00
Joey Hess	def5d8b02c	Fix potential crash in exporttree database due to failure to honor uniqueness constraint I don't know the circumstances, but have a report of this: git-annex: failed to commit changes to sqlite database: Just SQLite3 returned ErrorConstraint while attempting to perform step. All 3 tables in the export db have uniqueness constraints on them, insertUnique is used for all the rest, but this use of insertMany means it doesn't check the constraint. I guess that's what caused the crash, but I have not been able to test it yet. Use putMany when available, as it should be faster than mapM of insertMany. This commit was sponsored by Brock Spratlen on Patreon.	2018-10-09 16:56:33 -04:00
Joey Hess	91b799d1a6	export: Fix false positive in export conflict detection It occurred when the same tree was exported by multiple clones. nub out identical trees. This commit was sponsored by Jochen Bartl on Patreon.	2018-10-09 15:54:12 -04:00
Joey Hess	451171b7c1	clean up url removal presence update * rmurl: Fix a case where removing the last url left git-annex thinking content was still present in the web special remote. * SETURLPRESENT, SETURIPRESENT, SETURLMISSING, and SETURIMISSING used to update the presence information of the external special remote that called them; this was not documented behavior and is no longer done. Done by making setUrlPresent and setUrlMissing only update presence info for the web, and only when the url is a web url. See the comment for reasoning about why that's the right thing to do. In AddUrl, had to make it update location tracking, to handle the non-web-url case. This commit was sponsored by Ewen McNeill on Patreon.	2018-10-04 17:35:49 -04:00
Joey Hess	4b793fb077	Fix reversion in support of annex.web-options Inverted logic added as part of the url security fix made it always use curl when annex.security.allowed-http-addresses=all unless annex.web-options was set. That nobody noticed kind of makes me wonder if anyone uses annex.web-options.. This commit was sponsored by Denis Dzyubenko on Patreon.	2018-10-04 13:43:29 -04:00
Joey Hess	6ba3dea566	annex.jobs Added annex.jobs setting, which is like using the -J option. Of course, -J overrides annex.jobs. This commit was sponsored by Trenton Cronholm on Patreon.	2018-10-04 12:47:27 -04:00
Joey Hess	303d10cee6	Improve display when git config download from a http remote fails. The error message displayed used to only come from curl/wget and perhaps was clearer than the one displayed now that http-client is used. In any case, it does make sense to hide it because git-annex prints its own warning message. This commit was sponsored by Jake Vosloo on Patreon.	2018-10-03 12:31:09 -04:00
Joey Hess	9adee3f2fb	sync: Warn when a remote's export is not updated to the current tree because export tracking is not configured. Only display the warning when the current branch has a tree that is not the same as the tree in the export. Note that it doesn't check to see if the current tree is in incompleteExportedTreeish; it might be worth checking that and reminding the user about an incomplete export, but when export tracking is not configured, they are probably not in the right clone of the repository to resolve the incomplete export. This commit was sponsored by Ethan Aubin.	2018-09-27 15:41:18 -04:00
Joey Hess	012d67c3eb	releasing package git-annex version 6.20180926	2018-09-26 13:16:54 -04:00
Joey Hess	bc31b93c77	remote.name.annex-security-allow-unverified-downloads Added remote.name.annex-security-allow-unverified-downloads, a per-remote setting for annex.security.allow-unverified-downloads. This commit was sponsored by Brock Spratlen on Patreon.	2018-09-25 15:34:47 -04:00
Joey Hess	177e45517f	improve back-compat of post-receive hook * init: Improve generated post-receive hook, so it won't fail when run on a system whose git-annex is too old to support git-annex post-receive * init: Update the post-receive hook when re-run in an existing repository. This commit was sponsored by Jack Hill on Patreon.	2018-09-25 15:02:12 -04:00
Joey Hess	16cbecbd09	Revert "clean P2P protocol shutdown on EOF" This reverts commit `b18fb1e343`. That broke support for old git-annex-shell before p2pstdio was added. The immediate problem is that postAuth had a fallthrough case that sent an error back to the peer, but sending an error back when the connection is closed is surely not going to work. But thinking about it some more, making every function that uses receiveMessage need to handle ProtocolEOF adds a lot of complication, so I don't want to do that. The commit only cleaned up the test suite output a tiny bit, so I'm just gonna revert it for now.	2018-09-25 14:04:12 -04:00
Joey Hess	4ecba916a1	annex.maxextensionlength Added annex.maxextensionlength for use cases where extensions longer than 4 characters are needed. This commit was sponsored by Henrik Riomar on Patreon.	2018-09-24 12:10:18 -04:00
Joey Hess	cc82f81227	More FreeBSD build fixes. Untested, on FreeBSD but enough to fix the listed build errors. Seems that System.Posix.Files must have used to export this stuff and it was split. This commit was sponsored by Peter on Patreon.	2018-09-24 11:25:56 -04:00
Joey Hess	1d1054faa6	added -z Added -z option to git-annex commands that use --batch, useful for supporting filenames containing newlines. It only controls input to --batch, the output will still be line delimited unless --json or etc is used to get some other output. While git often makes -z affect both input and output, I don't like trying them together, and making it affect output would have been a significant complication, and also git-annex output is generally not intended to be machine parsed, unless using --json or a format option. Commands that take pairs like "file key" still separate them with a space in --batch mode. All such commands take care to support filenames with spaces when parsing that, so there was no need to change it, and it would have needed significant changes to the batch machinery to separate tose with a null. To make fromkey and registerurl support -z, I had to give them a --batch option. The implicit batch mode they enter when not provided with input parameters does not support -z as that would have complicated option parsing. Seemed better to move these toward using the same --batch as everything else, though the implicit batch mode can still be used. This commit was sponsored by Ole-Morten Duesund on Patreon.	2018-09-20 16:11:47 -04:00
Joey Hess	2aae6e84af	Support newlines in filenames. Work around git cat-file --batch's protocol not supporting newlines by running git cat-file not batched and passing the filename as a parameter. Of course this is quite a lot less efficient, especially because it currently runs it multiple times to query for different pieces of information. Also, it has subtly different behavior when the batch process was started and then some changes were made, in which case the batch process sees the old index but this workaround sees the current index. Since that batch behavior is mostly a problem that affects the assistant and has to be worked around in it, I think I can get away with this difference. I don't know of any other problems with newlines in filenames, everything else in git I can think of supports -z. And git-annex's json output supports newlines in filenames so downstream parsers from git-annex will be ok. git-annex commands that use --batch themselves don't support newlines in input filenames; using --json --batch is currently a way around that problem. This commit was sponsored by Ewen McNeill on Patreon.	2018-09-20 13:45:44 -04:00
Yaroslav Halchenko	672973149f	BF: add netbase to Depends: see https://github.com/nipy/heudiconv/issues/260 for more context, but it seems to be required on a lean docker instances for git annex to be usable	2018-09-19 15:28:59 -04:00
Joey Hess	b3c9c59d3d	--debug urls When git-annex used wget and curl, --debug would show urls. So there can't be any new security problem with doing so. This commit was sponsored by John Pellman on Patreon.	2018-09-14 12:46:39 -04:00
Joey Hess	773084c49b	S3: Fix url construction bug When the publicurl has been set to an url that does not end with a slash, we need to add one in between it and the rest of the url. As far as I can see, git-annex does not default to such publicurls; it's careful to end them with slashes. But this was observed in the wild, and there may be documentation that doesn't include the slash. And it's an easy mistake to make in any case. This commit was sponsored by Eric Drechsel on Patreon.	2018-09-14 12:25:23 -04:00
Joey Hess	547d01fd0e	releasing package git-annex version 6.20180913	2018-09-13 15:50:50 -04:00
Joey Hess	677038199c	fix build with older aws S3: Multipart uploads are now only supported when git-annex is built with aws-0.16.0 or later, as earlier versions of the library don't support versioning with multipart uploads. This will affect the android build, and debian stable also has a too old aws to support both features at the same time. This commit was sponsored by Nick Piper on Patreon.	2018-09-13 09:58:39 -04:00
Joey Hess	2743224658	change v6 git-annex add of staged unmodified unlocked file v6: When a file is unlocked but has not been modified, and the unlocking is only staged, git-annex add did not lock it. Now it will, for consistency with how modified files are handled and with v5. Note the removal of the sameInodeCache check. Otherwise it would see that the unmodified file is unmodified and stop there. That check seems to have been copied from the direct mode branch. But, direct mode had a specific reason to check for unmodified content, that does not apply to v6. The second pass means there is potential for a race, eg the unlocked file could be modified in between the first and second passes. No problem with that, since both passes do the same thing. This commit was sponsored by Jake Vosloo on Patreon.	2018-09-12 14:00:05 -04:00
Joey Hess	942b466293	wording	2018-09-11 16:03:58 -04:00
Joey Hess	fdbdf64d87	fix reversions due to undocumented and buggy git behavior * Don't use GIT_PREFIX when GIT_WORK_TREE=. because it seems git does not intend GIT_WORK_TREE to be relative to GIT_PREFIX in that case, despite GIT_WORK_TREE=.. being relative to GIT_PREFIX. * Don't use GIT_PREFIX to fix up a relative GIT_DIR, because git 2.11 sets GIT_PREFIX set to a path it's not relative to. and apparently GIT_DIR is never relative to GIT_PREFIX. Commit `e50ed4ba48` led us down this path by working around a git bug by relying on the barely documented GIT_PREFIX. This commit was sponsored by Trenton Cronholm on Patreon.	2018-09-11 15:54:21 -04:00
Joey Hess	7407a80c27	S3: Support AWS_SESSION_TOKEN This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2018-09-05 15:53:57 -04:00
Joey Hess	b600ad71ce	make linkToAnnex freezeContent the object file v6: Fix annex object file permissions when git-annex add is run on a modified unlocked file, and in some related cases. If a hard link is made, don't freeze it; annex.thin uses writable object files. Also: For some reason, linkToAnnex used to thawContent src. I can see no reason why it needed to do that, so I eliminated that. This commit was sponsored by Brock Spratlen on Patreon.	2018-09-05 15:27:22 -04:00
Joey Hess	69907e397f	revert a few problem areas of git-annex.cabal patch	2018-09-05 11:47:00 -04:00
Joey Hess	69d4c8dce6	devblog	2018-08-30 15:52:44 -04:00
Joey Hess	f54c72d2e1	Fix build on FreeBSD This must have been broken for years.. This commit was sponsored by Jack Hill on Patreon.	2018-08-29 12:09:03 -04:00
Joey Hess	c565340adc	stop using external hash programs, since cryptonite is faster In 2013, I wrote "Cryptohash benchmarks 90 to 101% faster than external hashers". Re-benchmarking today, I found cryptonite's sha256 consistently outperformed coreutils by 10% for large files. Tested 10 mb, 100 mb, 1 gb files with both sha256 and sha512. And for smaller files, the external process startup time swamps the hash time. Perhaps cryptonite has improved. Or it could just do better on my current CPU Intel(R) Pentium(R) CPU 4410Y @ 1.50GHz). Anyway, even if cryptonite is slower in some situations, seems likely it would only be marginally slower; it's got the same class of highly optimised C code under the hood as coreutils. The main difference between the two sha256 implementations seems to be how much of the inner loop they unroll.. This commit was sponsored by Henrik Riomar on Patreon.	2018-08-28 18:10:58 -04:00
Joey Hess	759a87ad70	fix git command queue to be concurrency safe Probably not noticed until now because the queue is large enough that two threads each filling theirs at the same time and flushing is unlikely to happen. Also made explicit that each worker thread gets its own queue. I think that was the case before, but if something was put in the queue before worker threads were forked off, they could have each inherited the same queue. Could have gone with a single shared queue, but per-worker queues is more efficient, because a worker can add lots of stuff to its own queue without any locking. This commit was sponsored by Ole-Morten Duesund on Patreon.	2018-08-28 13:16:33 -04:00

1 2 3 4 5 ...

538 commits