git-annex

Author	SHA1	Message	Date
Joey Hess	2cea674d1e	Merge branch 'master' into v8	2020-01-01 14:26:43 -04:00
Joey Hess	503788238c	add --force-annex/--force-git options make it easier to override annex.largefiles configuration (and potentially safer as it avoids bugs like the smudge bug fixed in the last release) Deleted some old comments that were posted to the man page discussing such options. Updated docs that used -c annex.largefiles to use the options. Note that addSmallOverridden was needed to avoid the clean filter running on the file. It would be possible to make addFile also update the index directly, rather than going via git add. However, it was not necessary, and I want to avoid breaking on some edge case, particularly if the code in addSmallOverridden has some oversight. Also, when annex.addunlocked is set and annex.largefiles does not match a file, git annex add --force-large works, but git status will then show the file as added, with a unstaged modification. The unstaged modification adds the file to git. This is identical behavior to using -c annex.largefiles=nothing when annex.addunlocked is set. This does not prevent committing what was intended to be added. I have not gotten to the bottom of why git thinks the file is modified and runs it through the clean filter in this case.	2020-01-01 14:03:06 -04:00
Joey Hess	985373f8e7	releasing package git-annex version 7.20191230	2019-12-30 14:49:31 -04:00
Joey Hess	ea3cb7d277	fix a case where file tracked by git unexpectedly becomes annex pointer file smudge: When annex.largefiles=anything, files that were already stored in git, and have not been modified could sometimes be converted to being stored in the annex. Changes in 7.20191024 made this more of a problem. This case is now detected and prevented.	2019-12-27 15:08:03 -04:00
Joey Hess	3cd3757236	annex.dotfiles The git add behavior changes could be avoided if it turns out to be really annoying, but then it would need to behave the old way when annex.dotfiles=false and the new way when annex.dotfiles=true. I'd rather not have the config option result in such divergent behavior as `git annex add .` skipping a dotfile (old) vs adding to annex (new). Note that the assistant always adds dotfiles to the annex. This is surprising, but not new behavior. Might be worth making it also honor annex.dotfiles, but I wonder if perhaps some user somewhere uses it and keeps large files in a directory that happens to begin with a dot. Since dotfiles and dotdirs are a unix culture thing, and the assistant users may not be part of that culture, it seems best to keep its current behavior for now.	2019-12-26 16:33:39 -04:00
Joey Hess	2b821eb225	Merge branch 'master' into sqlite	2019-12-26 15:15:42 -04:00
Joey Hess	444d5591ee	Improve file ordering behavior when one parameter is "." and other parameters are other directories eg, `git-annex get . ..` used to order the files strangly, because it did not realize that when git ls-files output eg "foo", that should be grouped with the first set of files and not the second set. Fixed by making dirContains "." "./foo" = True which makes sense, because dirContains ".." "../foo" = True	2019-12-20 18:01:29 -04:00
Joey Hess	37467a008f	annex.addunlocked expressions * annex.addunlocked can be set to an expression with the same format used by annex.largefiles, in case you want to default to unlocking some files but not others. * annex.addunlocked can be configured by git-annex config. Added a git-annex-matching-expression man page, broken out from tips/largefiles. A tricky consequence of this is that git-annex add --relaxed honors annex.addunlocked, but an expression might want to know the size or content of an url, which it's not going to download. I decided it was better not to fail, and just dummy up some plausible data in that case. Performance impact should be negligible. The global config is already loaded for annex.largefiles. The expression only has to be parsed once, and in the simple true/false case, it should not do any additional work matching it.	2019-12-20 15:56:25 -04:00
Joey Hess	5591622731	git-annex-config --set/--unset: No longer change the local git config setting `e53070c1f` quietly made it set the local git config too, but that was never documented anywhere, and it had surprising results. If I set annex.largefiles globally in a repo, I would expect to be able to change it in another repo, and the original repo would get the change and use it, rather than being stuck on the old value set there. And, if I have a local annex.largefiles and set a different global default, I'd be surprised to have my local setting overwritten. annex.securehashesonly does need to be set locally, since it's a security feature and the global is only a default until it gets set locally. So special cased.	2019-12-20 13:17:28 -04:00
Joey Hess	4acbb40112	git-annex config annex.largefiles annex.largefiles can be configured by git-annex config, to more easily set a default that will also be used by clones, without needing to shoehorn the expression into the gitattributes file. The git config and gitattributes override that. Whenever something is added to git-annex config, we have to consider what happens if a user puts a purposfully bad value in there. Or, if a new git-annex adds some new value that an old git-annex can't parse. In this case, a global annex.largefiles that can't be parsed currently makes an error be thrown. That might not be ideal, but the gitattribute behaves the same, and is almost equally repo-global. Performance notes: git-annex add and addurl construct a matcher once and uses it for every file, so the added time penalty for reading the global config log is minor. If the gitattributes annex.largefiles were deprecated, git-annex add would get around 2% faster (excluding hashing), because looking that up for each file is not fast. So this new way of setting it is progress toward speeding up add. git-annex smudge does need to load the log every time. As well as checking the git attribute. Not ideal. Setting annex.gitaddtoannex=false avoids both overheads.	2019-12-20 13:01:41 -04:00
Joey Hess	ce3fb0b2e5	fixed an oversight that had always prevented annex.resolvemerge from being honored, when it was configured by git-annex config forgot to add it to the merge function	2019-12-20 11:00:08 -04:00
Joey Hess	f6c18f6940	Merge branch 'bs' into sqlite-bs	2019-12-18 15:14:44 -04:00
Joey Hess	7d9dff5b05	Merge branch 'master' into bs and update changelog	2019-12-18 15:13:30 -04:00
Joey Hess	d5628a16b8	Merge branch 'bs' into sqlite-bs	2019-12-18 14:51:03 -04:00
Joey Hess	7fd5376334	inprogress: Support --key	2019-12-18 14:14:16 -04:00
Joey Hess	1bc7055a21	add back changelog entry	2019-12-18 13:53:10 -04:00
Joey Hess	c19211774f	use filepath-bytestring for annex object manipulations git-annex find is now RawFilePath end to end, no string conversions. So is git-annex get when it does not need to get anything. So this is a major milestone on optimisation. Benchmarks indicate around 30% speedup in both commands. Probably many other performance improvements. All or nearly all places where a file is statted use RawFilePath now.	2019-12-11 15:25:07 -04:00
Joey Hess	2f9a80d803	merging sqlite and bs branches Since the sqlite branch uses blobs extensively, there are some performance benefits, ByteStrings now get stored and retrieved w/o conversion in some cases like in Database.Export.	2019-12-06 15:30:45 -04:00
Joey Hess	718fa83da6	mention optimisations	2019-12-05 11:46:55 -04:00
Joey Hess	960f62a564	typo	2019-11-22 19:48:34 -04:00
Joey Hess	81d402216d	cache the serialization of a Key This will speed up the common case where a Key is deserialized from disk, but is then serialized to build eg, the path to the annex object. Previously attempted in `4536c93bb2` and reverted in `96aba8eff7`. The problems mentioned in the latter commit are addressed now: Read/Show of KeyData is backwards-compatible with Read/Show of Key from before this change, so Types.Distribution will keep working. The Eq instance is fixed. Also, Key has smart constructors, avoiding needing to remember to update the cached serialization. Used git-annex benchmark: find is 7% faster whereis is 3% faster get when all files are already present is 5% faster Generally, the benchmarks are running 0.1 seconds faster per 2000 files, on a ram disk in my laptop.	2019-11-22 17:49:16 -04:00
Joey Hess	7263aafd2b	Merge branch 'master' into sqlite	2019-11-22 12:49:35 -04:00
Joey Hess	92e1bb250b	simplify the name of the test cases	2019-11-21 17:38:58 -04:00
Joey Hess	58a8005441	Merge branch 'master' into sqlite	2019-11-21 17:28:27 -04:00
Joey Hess	a9888f6151	Windows: Fix handling of changes to time zone. Used to work but was broken in version 7.20181031, specifically commit `5ab0f48ffb`. That this was not noticed over at least 1 daylight savings time zone changes makes me wonder if the TSDelta stuff is still needed. Perhaps the mtime on Windows no longer changes when the time zone is changed? (cherry picked from commit `09ee6b0ccb`)	2019-11-21 17:28:18 -04:00
Joey Hess	d4661959de	Merge branch 'master' into sqlite	2019-11-21 17:26:50 -04:00
Joey Hess	25ba8156bc	improve benchmark --databases * benchmark: Changed --databases to take a parameter specifiying the size of the database to benchmark. * benchmark --databases: Display size of the populated database. * benchmark --databases: Improve the "addAssociatedFile to (new)" benchmark to really add new values, not overwriting old values.	2019-11-21 17:25:20 -04:00
Joey Hess	43f19ef00a	Fix bug that made bare repos be treated as non-bare when --git-dir was used. Eg: git clone url --bare r git --git-dir r annex init This resulted in worktree = Just "." and so several things that check worktree to determine when the repo is bare ran code paths intended for non-bare. One such code path[1] ran git checkout with --worktree=. which actually makes it ignore core.bare config, and so the current directory got populated with a checkout of the master branch in this example. There was probably also other breakage. The fix is a bit complicated because whether the repo is bare is not known until after Git.Config reads the config, but Git.Config handles setting the RepoLocations's worktree when core.worktree is set. So have to assume the worktree is the cwd, let core.worktree override that, and then if the repo turns out to be bare, it's set back to Nothing. (And then GIT_WORK_TREE can still override all of that.) [1] switchHEADBack, which runs even when the clone is not from a bare repo.	2019-11-21 13:26:02 -04:00
Joey Hess	b207d944f3	sync, assistant: Pull and push from git-lfs remotes. Oversight, forgot to add it to gitSyncableRemote	2019-11-18 16:13:21 -04:00
Joey Hess	5877de5e80	git-lfs: remember urls, and autoenable remotes using known urls * git-lfs: The url provided to initremote/enableremote will now be stored in the git-annex branch, allowing enableremote to be used without an url. initremote --sameas can be used to add additional urls. * git-lfs: When there's a git remote with an url that's known to be used for git-lfs, automatically enable the special remote.	2019-11-18 16:09:09 -04:00
Joey Hess	cee14f147a	stop displaying rsync progress, and use git-annex's own progress display for local-to-local repo transfers Reasons to do this include: 1. I've gotten pretty used to git-annex's own progress display, which is used for all transfers over ssh (except to old git-annex-shell), and for most special remote transfers. It's getting to seem weird to see the rsync progress display instead. 2. When -J was used, the rsync output could not be shown, and so there was no progress display. Now there will be. Progress will also be displayed now when cp CoW is used. But I'd expect a CoW copy to typically run so fast that the progress display will barely be noticable. This commit was sponsored by Peter on Patreon.	2019-11-15 13:21:06 -04:00
Joey Hess	a95efcbc55	releasing package git-annex version 7.20191114	2019-11-14 21:58:23 -04:00
Joey Hess	b321526473	OSX link libs into git-core directory So that binaries in that directory can find the library next to them, where they get modified to look. This is a hack; it would be better for OSXMkLibs to build a list of what libraries are needed where. Unsure if this is needed due to a recent reversion, or is an older problem, so updated changelog accordingly.	2019-11-14 18:31:58 -04:00
Joey Hess	f037ad92ec	OSX git-annex.app: Fix a regression that broke git-remote-https, git-remote-http, and git-shell Putting the binaries in bundle/git-core/bin didn't work on OSX, linker can't find the libraries next to those binaries where it expects to. So instead put the binaries in the progDir.	2019-11-14 16:15:42 -04:00
Joey Hess	842449b086	linuxstandalone: Fix a regression that broke git-remote-https.	2019-11-14 15:08:23 -04:00
Joey Hess	667d38a8f1	Fix a crash (STM deadlock) when -J is used with multiple files that point to the same key See the comment for a trace of the deadlock. Added a new StartStage. New worker threads begin in the StartStage. Once a thread is ready to do work, it moves away from the StartStage, and no thread will ever transition back to it. A thread that blocks waiting on another thread that is processing the same key will block while in the StartStage. That other thread will never switch back to the StartStage, and so the deadlock is avoided.	2019-11-14 13:51:09 -04:00
Joey Hess	890330f0fe	make --json-error-messages capture url download errors Convert Utility.Url to return Either String so the error message can be displated in the annex monad and so captured. (When curl is used, its errors are still not caught.)	2019-11-12 13:52:38 -04:00
Joey Hess	3b34d123ed	Added annex.allowsign option. This commit was sponsored by Ilya Shlyakhter on Patreon.	2019-11-11 16:28:56 -04:00
Joey Hess	aa010108cd	Merge branch 'master' into sqlite	2019-11-07 13:20:04 -04:00
Joey Hess	09ee6b0ccb	Windows: Fix handling of changes to time zone. Used to work but was broken in version 7.20181031, specifically commit `5ab0f48ffb`. That this was not noticed over at least 1 daylight savings time zone changes makes me wonder if the TSDelta stuff is still needed. Perhaps the mtime on Windows no longer changes when the time zone is changed?	2019-11-06 14:36:49 -04:00
Joey Hess	73e928fcfb	prep release	2019-11-06 12:21:02 -04:00
Joey Hess	6147130e86	Merge branch 'master' into sqlite	2019-11-05 12:59:28 -04:00
Joey Hess	e2d4c133f5	init: fix data loss bug Fix bug that lost modifications to unlocked files when init is re-ran in an already initialized repo. In retrospect needing scanUnlockedFiles False in the direct mode upgrade path was a good hint that it was unsafe when used with True. However, this bug did not affect upgrade from v5. In such an upgrade, an unlocked file that is modified is left as-is. The only place scanUnlockedFiles True did overwrite modified unlocked files is during an git-annex init of a repo that was already initialized by git-annex. (I also tried a scenario where the repo had not been initialized by git-annex yet, but was cloned from a v7 repo with an unlocked file, and the pointer file replaced with some other content, and the data loss did not occur in that situation.) Since the fixed scanUnlockedFiles avoids overwriting non-pointer files, it should be safe to run in any situation, so there's no need any longer for the parameter.	2019-11-05 12:41:15 -04:00
Joey Hess	09c7cbbaa8	update for things already fixed in this branch	2019-10-30 13:57:22 -04:00
Joey Hess	25f912de5b	benchmark: Add --databases to benchmark sqlite databases Rescued from commit `11d6e2e260` which removed db benchmarks in favor of benchmarking arbitrary git-annex commands. Which is nice and general, but microbenchmarks are useful too.	2019-10-29 16:59:27 -04:00
Joey Hess	fd96408c67	releasing package git-annex version 7.20191024	2019-10-25 13:07:58 -04:00
Joey Hess	59b8294b2b	prep release	2019-10-24 14:40:36 -04:00
Joey Hess	31a5b58b2c	documentation for making git add only annex when configured by annex.largefiles Code change should be trvial, but not yet implemented. This significantly complicated the task of documenting how git-annex works. I'm not sure how useful the annex.gitaddtoannex confguration is after this change; seems that if a user has an annex.largefiles they will want it applied consistently. But the last thing I want to hear is more complaining from users about git add doing something they don't want it to. There's a pretty high risk users who got used to the git add behavior and don't have annex.largefiles configured will miss the NEWS and complain bitterly about their suddenly bloated repositories. Oh well. Removed outdated comments about the old behavior to avoid confusion. I don't know if I've found all the places that griping spread to.	2019-10-24 14:01:54 -04:00
Joey Hess	bd197be3ad	annex.gitaddtoannex configuration Added annex.gitaddtoannex configuration. Setting it to false prevents git add from usually adding files to the annex. (Unless the file was annexed before, or a renamed annexed file is detected.) Currently left at true; some users are encouraging it be set to false.	2019-10-23 15:29:46 -04:00
Joey Hess	bbdeb1a1a8	sync: Fix crash when there are submodules and an adjusted branch is checked out Reverse adjusting the branch uses treeItemToTreeContent, which was missed when adding submodule support earlier.	2019-10-23 11:52:56 -04:00
Joey Hess	9a5d9019ba	Deal with pkexec changing to root's home directory when running a command. Wow, that's not documented anywhere, and seems like a major gotcha in pkexec. Broke enable-tor.	2019-10-21 12:39:19 -04:00
Joey Hess	5db79339a1	init: Fix a failure when used in a submodule on a crippled filesystem. When the submodule's parent repo has an adjusted unlocked branch, it gets cloned by git, but git checks out master. git annex init then fails because it wants to enter the adjusted branch, but: adjusted branch adjusted/master(unlocked) already exists. Aborting because that branch may have changes that have not yet reached master Note that init actually then exits 0, leaving master checked out. This could also happen, absent submodules, if the parent repo has an adjusted unlocked branch, but it is not checked out. In the more common case where that branch is checked out, the clone uses the same branch, so no problem. The choices to fix this: * Init could delete the existing adjusted branch, and re-adjust. But then running init inside an adjusted branch on a crippled filesystem would lose any changes that have not been synced back to master. * Init could sync any changes back to master, but that would be very surprising behavior for it. * Init could simply check out the existing adjusted branch. If the branch is diverged from master, well, sync will sort that out later. This mirrors the behavior of cloning a repo that has an adjusted branch checked out that has not yet been synced back to master. Picked this choice.	2019-10-21 11:41:15 -04:00
Joey Hess	f60e8f2c93	releasing package git-annex version 7.20191017	2019-10-17 18:19:47 -04:00
Joey Hess	904b175707	Fix build with persistent-2.10. Added an additional constraint that persistent needs. This also builds with persistent-2.9.2 without needing any cpp.	2019-10-17 11:58:31 -04:00
Joey Hess	5463f97ca2	OSX: Deal with symbolic link problem that caused git to not be included in the git-annex.dmg Homebrew now has eg: datalads-imac:~ joey$ ls -l /Users/joey/homebrew/Cellar/git/2.23.0/libexec/git-core total 36776 lrwxr-xr-x 1 joey staff 13 Aug 29 13:38 git -> ../../bin/git lrwxr-xr-x 1 joey staff 13 Aug 29 13:38 git-add -> ../../bin/git So the target of the symlink also needs to be installed now. Doing it in shell code was too hairy for my dentistry-addled brain, so reimplemented in haskell. Also using it for building linuxstandalone.	2019-10-17 11:01:41 -04:00
Joey Hess	4306dfbe68	remove empty log files in transition forget --drop-dead: Remove several classes of git-annex log files when they become empty, further reducing the size of the git-annex branch. Noticed while testing sameas uuid removal, but it could happen other times too. An empty log file is always treated by git-annex the same as no file being present, and when the files are per-key, it can be a sizable space saving to exclude them from the tree.	2019-10-14 16:04:15 -04:00
Joey Hess	9828f45d85	add RemoteStateHandle This solves the problem of sameas remotes trampling over per-remote state. Used for: * per-remote state, of course * per-remote metadata, also of course * per-remote content identifiers, because two remote implementations could in theory generate the same content identifier for two different peices of content While chunk logs are per-remote data, they don't use this, because the number and size of chunks stored is a common property across sameas remotes. External special remote had a complication, where it was theoretically possible for a remote to send SETSTATE or GETSTATE during INITREMOTE or EXPORTSUPPORTED. Since the uuid of the remote is typically generate in Remote.setup, it would only be possible to pass a Maybe RemoteStateHandle into it, and it would otherwise have to construct its own. Rather than go that route, I decided to send an ERROR in this case. It seems unlikely that any existing external special remote will be affected. They would have to make up a git-annex key, and set state for some reason during INITREMOTE. I can imagine such a hack, but it doesn't seem worth complicating the code in such an ugly way to support it. Unfortunately, both TestRemote and Annex.Import needed the Remote to have a new field added that holds its RemoteStateHandle.	2019-10-14 13:51:42 -04:00
Joey Hess	37f725a9f7	Merge branch 'master' into sameas	2019-10-11 15:56:00 -04:00
Joey Hess	8131451c35	releasing package git-annex version 7.20191009	2019-10-09 12:33:09 -04:00
Joey Hess	f4dd7d5191	work around windows having infected git's plumbing Work around git cat-file --batch's odd stripping of carriage return from the end of the line (some windows infection), avoiding crashing when the repo contains a filename ending in a carriage return.	2019-10-08 15:27:05 -04:00
Joey Hess	8966ba2cff	git-annex-standalone.rpm: Fix the git-annex-shell symlink	2019-10-08 14:43:28 -04:00
Joey Hess	53da7f1cf8	update uninit to handle all the v7 stuff * uninit: Remove several git hooks that git-annex init sets up. * uninit: Remove the smudge and clean filters that git-annex init sets up.	2019-10-08 14:34:00 -04:00
Joey Hess	1113caa53e	preserve unlocked file mtime when dropping When dropping an unlocked file, preserve its mtime, which avoids git status unncessarily running the clean filter on the file. If the index file has close to the same mtime as a work tree file, git will not trust the index to be up-to-date, and re-runs the clean filter unncessarily. Preserving the mtime when depopulating a pointer file avoids git status doing a little (or maybe a lot) of unncessary work. There are other places that the mtime could be preserved, including other places where pointer files are written perhaps, but also populatePointerFile. But, I don't know of cases where those lead to git status doing unncessary work, so I just fixed the one I'm aware of for now.	2019-10-08 14:01:12 -04:00
Joey Hess	2e6fd5de71	fix flipped diffUTCTime fsck --incremental/--more: Fix bug that prevented the incremental fsck information from being updated every 5 minutes as it was supposed to be; it was only updated after 1000 files were checked, which may be more files that are possible to fsck in a given fsck time window. Thanks to Peter Simons for help with analysis of this bug. Auditing for other cases of the same mistake, the keys db also had it backwards. This seems unlikely to really have been a problem; it would need associated files updates etc to be coming in slowly for some reason and then be interrupted to cause any problem. IIRC the design of the keys db assumes that any interruped operation will be restarted, and so it can lose any buffered database updates safely.	2019-10-03 09:54:19 -04:00
Joey Hess	61b384d2b7	add --sameas option, not yet used	2019-10-01 12:36:25 -04:00
Joey Hess	3066bdb1fb	fix annex.largefiles largerthan/smallerthan bug Fix bug in handling of annex.largefiles that use largerthan/smallerthan. When adding a modified file, it incorrectly used the file size of the old version of the file, not the current size. That was the only largefiles limit that didn't directly look at the file on disk already. Added a new type to keep straight the two different ways such a limit can be matched. I kind of wanted to extend MatchingFile or FileInfo to indicate that the matcher is supposed to operate on files from disk or annex, but it turned out to be too complex to implement it that way. This also changes the LimitAnnexFiles case when lookupFileKey does not find a key. It used to fall back to statting the file, now it always returns False. I doubt the old code could really get to that point, but if it somehow does, it's better for preferred content matching to be consistent.	2019-09-30 17:15:08 -04:00
Joey Hess	b90ddbc383	enable-tor: Use pkexec to run command as root when gksu and kdesu are not available. gksu is no longer in debian, even stable kdesu in debian is not installed in PATH any longer, though the executable is still present under /usr/lib pkexec is packagekit's replacement for those older commands.	2019-09-30 15:19:01 -04:00
Joey Hess	f2737a5fbe	enable-tor: Run kdesu with -c option.	2019-09-30 15:14:05 -04:00
Joey Hess	2b55a2b882	remotedaemon: Don't list --stop in help since it's not supported. Also, move out of plumbing section. When using tor, the remotedaemon is part of the user's workflow, as it runs the tor hidden service.	2019-09-30 14:40:46 -04:00
Joey Hess	090898a138	adjust --lock: This enters an adjusted branch where files are locked. Straightforward, except for the issue of how to reverse LockAdjustment. With --unlock, a commit that modifies/adds unlocked files gets reverse adjusted to use locked files. That's fairly reasonable, I think. But reversing --lock by unlocking all modified files feels wrong. Maybe that's just because repositories typically seem to still have mostly locked files in them (unless one is in an adjusted unlocked branch of course!) It may be that eventually how to reverse both will need to be configurable, I don't know.	2019-09-27 14:23:25 -04:00
Joey Hess	9628ae2e67	Close sqlite databases more robustly. Had a report of close throwing ErrorBusy on CIFS. Retrying up to 16 seconds is a balance between hopefully waiting long enough for the problem to clear up and waiting so long that git-annex seems to hang. The new dependency is free; persistent depends on unliftio-core.	2019-09-26 12:25:21 -04:00
Joey Hess	8af791d769	Test: Use more robust directory removal method. I just had a test that crashed at cleanup on linux with: .t/gpgtest/12/S.gpg-agent.browser: removeDirectoryRecursive:removeContentsRecursive:removePathRecursive:removeContentsRecursive:removePathRecursive:removeContentsRecursive:removePathRecursive:getSymbolicLinkStatus: does not exist (No such file or directory) sleeping 10 seconds and will retry directory cleanup git-annex: .t/gpgtest/14/S.gpg-agent.browser: removeDirectoryRecursive:removeContentsRecursive:removePathRecursive:removeContentsRecursive:removePathRecursive:removeContentsRecursive:removePathRecursive:getSymbolicLinkStatus: does not exist (No such file or directory) removePathForcibly is supposed to be more robust to things in the directory vanishing while it's running, etc. Will probably avoid such crashes. It was added to directory-1.2.7, which comes with ghc since 8.0.2. Since base >= 4.11.1.0 means ghc 8.4.4, I expect all builds will have it, but I ifdefed it to be sure.	2019-09-24 16:59:37 -04:00
Joey Hess	6ae0a44c64	git-lfs: Added support for http basic auth	2019-09-24 14:46:20 -04:00
Joey Hess	de564df8b3	git-lfs: Only do endpoint discovery once when concurrency is enabled This avoids some extra work, but I don't think it was possible for two ssh endpoint discoveries run concurrently to both prompt for the ssh password; Annex.Ssh itself deals with concurrency. This is mostly groundwork for http password prompting.	2019-09-24 13:01:51 -04:00
Joey Hess	b13a350556	added --unlocked and --locked	2019-09-19 12:33:13 -04:00
Joey Hess	fda1bdd679	Added --mimetype and --mimeencoding file matching options. Already had these for largefiles matching, but I forgot to add them as command-line options.	2019-09-19 12:09:59 -04:00
Joey Hess	ab739242a3	releasing package git-annex version 7.20190912	2019-09-13 12:53:40 -04:00
Joey Hess	a8fea1644d	docs for git-annex-standalone rpm	2019-09-13 12:18:36 -04:00
Joey Hess	4508198507	building a standalone rpm from the standalone tarball This allows the rpm to be built anywhere the necessary build deps are available (including on debian) and the resulting package will work on as broad a range of rpm distributions as the libc/kernel supports. The DistributionUpdate changes to use the new script have not yet been tested.	2019-09-13 11:53:17 -04:00
Joey Hess	4a4e08e123	release prep	2019-09-12 13:53:22 -04:00
Joey Hess	fef3cd055d	Removed support for git versions older than 2.1 debian oldoldstable has 2.1, and that's what i386ancient uses. It would be better to require git 2.2, which is needed to use adjusted branches, but can't do that w/o losing support for some old linux kernels or a complicated git backport.	2019-09-11 16:14:43 -04:00
Joey Hess	061231621e	Merge branch 'master' into v7-default	2019-09-10 16:06:43 -04:00
Joey Hess	94c75d2bd9	init: Fix a reversion that broke initialization on systems that need to use pid locking This brings back .git/annex/misctmp, but only for init. If an init is interrupted while probing using that temp directory, the files it left will get deleted 1 week later by a subsequent git-annex run.	2019-09-10 13:37:07 -04:00
Joey Hess	0af7ebdc2a	info: Display trust level when getting info on a uuid, same as on a remote.	2019-09-01 16:48:46 -04:00
Joey Hess	f845195354	Added annex.autoupgraderepository configuration Can be set to false to prevent any automatic repository upgrades. Also, removed direct mode specific upgrade code in Annex.Init, and made needsUpgrade always include the name/path of the repo, so if there's a problem it's clear what repo has the problem. And, made needsUpgrade catch any exceptions that might occur during the upgrade, so it can display a more useful error message than just the exception.	2019-09-01 13:42:26 -04:00
Joey Hess	3f0eef4baa	v7 for all repositories * Default to v7 for new repositories. * Automatically upgrade v5 repositories to v7.	2019-08-30 14:09:14 -04:00
Joey Hess	1558e03014	Refuse to upgrade direct mode repositories when git is older than 2.22 That git fixed a memory leak that could cause an OOM during the upgrade. Most git-annex builds have a new enough git already. OSX git was upgraded with brew. Linux i386ancient build's git was too old. Upgrading it to a fixed git didn't work (due to the newer git not working with the old ssh, https://bugs.chromium.org/p/git/issues/detail?id=7 ) Choices to deal with that were: * Somehow make direct mode upgrade work with the old git, avoiding its OOM problem. One way would be to switch the repo to indirect mode first, and so upgrade to a repo with locked files. Not good when the filesystem does not support symlinks. * backport the OOM fix from git 2.22 (And do what about the version number so git-annex knows it's fixed?) * backport openssh (and possibly more stuff) * move the i386ancient build to at least Debian stretch (still backporting git) But this will make it no longer work with some of the ancient kernels it targets. Of those, backporting the OOM fix seemed the best approach. Put "oomfix" in the git version number to indicate it. I have not automated building the git backport, so here's the patch I used: diff -ur orig/git-2.1.4/convert.c git-2.1.4/convert.c --- orig/git-2.1.4/convert.c 2014-12-18 18:42:18.000000000 +0000 +++ git-2.1.4/convert.c 2019-08-29 20:05:04.371872338 +0100 @@ -404,7 +404,7 @@ if (start_async(&async)) return 0; /* error was already reported */ - if (strbuf_read(&nbuf, async.out, len) < 0) { + if (strbuf_read(&nbuf, async.out, 0) < 0) { error("read from external filter %s failed", cmd); ret = 0; } diff -ur orig/git-2.1.4/GIT-VERSION-GEN git-2.1.4/GIT-VERSION-GEN --- orig/git-2.1.4/GIT-VERSION-GEN 2014-12-18 18:42:18.000000000 +0000 +++ git-2.1.4/GIT-VERSION-GEN 2019-08-29 20:06:39.132743228 +0100 @@ -1,7 +1,7 @@ #!/bin/sh GVF=GIT-VERSION-FILE -DEF_VER=v2.1.4 +DEF_VER=v2.1.4.oomfix LF=' ' diff -ur orig/git-2.1.4/configure git-2.1.4/configure --- orig/git-2.1.4/configure 2014-12-18 18:42:19.000000000 +0000 +++ git-2.1.4/configure 2019-08-29 20:27:45.896380015 +0100 @@ -580,8 +580,8 @@ # Identity of this package. PACKAGE_NAME='git' PACKAGE_TARNAME='git' -PACKAGE_VERSION='2.1.4' -PACKAGE_STRING='git 2.1.4' +PACKAGE_VERSION='2.1.4.oomfix' +PACKAGE_STRING='git 2.1.4.oomfix' PACKAGE_BUGREPORT='git@vger.kernel.org' PACKAGE_URL='' diff -ur orig/git-2.1.4/version git-2.1.4/version --- orig/git-2.1.4/version 2014-12-18 18:42:19.000000000 +0000 +++ git-2.1.4/version 2019-08-29 20:06:17.572545210 +0100 @@ -1 +1 @@ -2.1.4 +2.1.4.oomfix	2019-08-29 15:24:41 -04:00
Joey Hess	4f59ac05b6	info: remove "repository mode" info: Removed the "repository mode" from its output (including the --json output) since with the removal of direct mode, there is no repository mode.	2019-08-29 14:12:22 -04:00
Joey Hess	d6e1f09ed2	init: Catch more exceptions when testing locking.	2019-08-29 12:19:07 -04:00
Joey Hess	586db7f06d	Avoid making a commit when upgrading from direct mode to v7 Three reasons: * Committing as part of an upgrade is very unusual and unexpected. * The commit was failing with a weird error message when done during an automatic upgrade. * Let me remove more of that sweet^Whorrible direct mode code.	2019-08-26 16:35:44 -04:00
Joey Hess	adb89ee71b	update test suite for removal of direct mode Removed that pass and all the complications of checking direct mode's edge cases.	2019-08-26 15:07:10 -04:00
Joey Hess	20741b1eb4	Automatically convert direct mode repositories to v7 with adjusted unlocked branches * Automatically convert direct mode repositories to v7 with adjusted unlocked branches and set annex.thin. * init: When run on a crippled filesystem with --version=5, will error out, since version 7 is needed for adjusted unlocked branch. * direct: This command always errors out as direct mode is no longer supported. * indirect: This command has become a deprecated noop. * proxy: This command is deprecated because it was only needed in direct mode. (But it continues to work.) Also removed mentions of direct mode throughough the documentation. I have not removed all the direct mode code yet.	2019-08-26 15:05:25 -04:00
Joey Hess	5877a15d7b	fix hard links when upgrading from direct mode When upgrading a direct mode repo to v7 with adjusted unlocked branches, fix a bug that prevented annex.thin from taking effect for the files in working tree. The hard links used to be ok, but commit `8e22114735` accidentially broke them. It repopulates the worktree file, which is already a hard link, and when it's creating the new file, the link count is already 2, and so it doesn't make a hard link then.	2019-08-26 13:54:39 -04:00
Joey Hess	2fd27c6df5	assistant: When creating a new repository use v7 adjusted branches with annex.thin Rather than direct mode, which this is a small step on the path to removing. Init on a crippled filesystem already used v7 adjusted branches, and like that, this doesn't pose any interoperability issues with old versions of git-annex that clone the same repo, because files are only unlocked on the adjusted branch.	2019-08-26 12:54:14 -04:00
Joey Hess	c650389118	info: error out when file matching options used on non-directory When file matching options are specified when getting info of something other than a directory, they won't have any effect, so error out to avoid confusion. This commit was sponsored by mo on Patreon.	2019-08-24 13:20:19 -04:00
Joey Hess	972fd11f4e	releasing package git-annex version 7.20190819	2019-08-19 12:26:45 -04:00
Joey Hess	7f97575941	Makefile: Changed default zsh completion location to zsh default fpath. Systems such as Debian that have overridden the default fpath will need to set ZSH_COMPLETIONS_PATH. I feel that Debian is causing unncessary complexity by making this change, and have filed a bug report about it. This also means that when git-annex is installed with PREFIX=/usr/local it will use /usr/local/share/zsh/site-functions which works with probably all versions of zsh.	2019-08-16 14:08:56 -04:00
Joey Hess	5fcaaf77db	Make git-annex-standalone.deb include the shell completions again Was lost when the install-completions target was added.	2019-08-16 13:47:48 -04:00
Joey Hess	fa62c32233	Fix intermittent failure of the test suite Its repeated opening and writing to the sqlite database somehow caused inode cache information to occasionally be lost. This loses code coverage, since running git-annex as a child process prevents tracking what parts of the code are exercised. I have not looked at the code coverage in a long time. It would probably be possible to collect code coverage for the child procesess and merge it together.	2019-08-16 11:11:55 -04:00
Joey Hess	708fc6567f	S3: Fix encoding when generating public urls of S3 objects. This code feels worryingly stringily typed, but using URI does not help because the uriPath still has to be constructed with the right uri-encoding.	2019-08-15 12:56:46 -04:00
Joey Hess	dc672863c3	init: Install working hook scripts when run on a crippled filesystem and on Windows	2019-08-13 15:14:17 -04:00
Joey Hess	b87ea12b6b	git-annex merge branch * merge: When run with a branch parameter, merges from that branch. This is especially useful when using an adjusted branch, because it applies the same adjustment to the branch before merging it.	2019-08-09 13:21:15 -04:00
Joey Hess	b90ee6dc52	test: Add pass using adjusted unlocked branch On second thought, the extra time running the test suite is worth it. It will be gained back once we finally get rid of direct mode. There are two failing tests, same two that have been failing on windows (though the failure does not look identical). So this should also spare me the Windows VM while fixing.	2019-08-09 11:34:10 -04:00
Joey Hess	298812a353	use separate main repo dir for each test suite pass This way a failure to clean up the main repo dir from a previous pass can't result in reusing that repo, which won't be configured right for the current pass.	2019-08-08 14:29:28 -04:00
Joey Hess	70b71bf660	have init --version fail when repo is already initialized with other version init: When the repo is already initialized, and --version requests a different version, error out rather than silently not changing the version.	2019-08-08 14:13:02 -04:00
Joey Hess	3adc251f9d	Build with silently-1.2.5.1 on Windows; the old one used "NUL" which is not supported with recent versions of ghc.	2019-08-07 17:42:16 -04:00
Joey Hess	30ca02928c	Windows installer: Always install to 64 bit program files directory, since it needs 64 bit git now I saw the installer not defaulting to any installation directory, and I had to manually enter C:\Program Files\Git Maybe it was choosing gitInstallDir32, and that was empty? Or the conditional somehow failed. Simplifying so it will hopefully work again.	2019-08-07 14:05:03 -04:00
Joey Hess	bf5dd723d3	Fix querying git for object type when operating on a file containing newlines This typo would make "git cat-file cat-file" fail, and the way it's used, I think it broke querying all info from filenames containing newlines, because the other queries are only run when it succeeds.	2019-08-07 13:35:42 -04:00
Joey Hess	fb7d92457f	support using gcrypt with git-lfs special remote	2019-08-05 13:43:45 -04:00
Joey Hess	8401b09e32	Allow setting up a gcrypt special remote with encryption=shared It was documented to work, but seems it has been broken for a while/forever.	2019-08-05 12:41:05 -04:00
Joey Hess	d1a0c7b16f	make --in=here fast Use the same optimisation for --in=here as has always been used for --in=. rather than the slow code path that unncessarily queries the git-annex branch. It looks like when "here" got added as an alias for "." back in 2012, I forgot about this place. Also sped up some very unlikely ways of referring to the current repository. Note that, this could in some rare corner case cause a behavior change, if the git-annex branch and inAnnex disagree about whether content is present in the local repository. But --in=. already behaved that way, and the truth on the ground should win also.	2019-08-01 00:29:47 -04:00
Joey Hess	018b5b8173	Support building with socks-0.6 and persistant-template-2.7 persistent-template now needs UndecidableInstances. socks changed defaultSocksConf to take a SockAddr.	2019-07-30 12:50:48 -04:00
Joey Hess	9fd37e65d0	prep release	2019-07-30 12:47:33 -04:00
Joey Hess	426053cb6c	Corrected some license statements In `40ecf58d4b` I changed the license of code I wrote from GPL to AGPL. But, two files containing code I wrote combined with code by others were updated to say their license is AGPL, while in fact part of it was (the code I wrote) but part remained under the original license (the code written by others). Remote/Ddar.hs is now changed entirely back to GPL 3. Annex/DirHashes.hs stays AGPL, but I broke out Utility/MD5.hs with the code not written by me, and corrected its license statement to GPL-2, which is the actual version of the GPL included with the code in its original distribution at http://www.cs.ox.ac.uk/people/ian.lynagh/md5/	2019-07-28 14:27:33 -04:00
Joey Hess	875c7b5cc9	windows long filenames should be fixed now by new ghc	2019-07-22 09:44:09 -04:00
Joey Hess	ff85adba76	remove bundled rsync from windows build rsync is only needed for rsync special remotes and git-annex-shell from Debian oldstable. Since the library situation on windows for rsync required a particular 32 bit build of git for it to work, and may also somehow need git-annex to be 32 bit build, it's better to not include it. This commit was sponsored by Jake Vosloo on Patreon.	2019-07-22 09:37:42 -04:00
Joey Hess	21ff5e1e5a	CoW probing Improved probing when CoW copies can be made between files on the same drive. Now supports CoW between BTRFS subvolumes. And, falls back to rsync instead of using cp when CoW won't work, eg copies between repos on the same EXT4 filesystem. Rather than trying cp --reflink=always for each file copied to a remote, it's tried once and if it fails it falls back to using rsync thereafter for the lifetime of the Remote object. That avoids overhead of calling cp which while small, will add up over a large number of files. This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2019-07-17 14:19:08 -04:00
Joey Hess	7be690f326	check headRef not Branch.current Support running v7 upgrade in a repo where there is no branch checked out, but HEAD is set directly to some other ref. This commit was sponsored by Jack Hill on Patreon.	2019-07-16 12:36:29 -04:00
Joey Hess	25f7a79217	stack.yaml: Build with http-client-0.5.14 to get a bug fix to http header parsing The cabal file does not yet demand this version because it's not in Debian yet and only affects use of certian broken http servers, but let's use it when it's easily available.	2019-07-09 10:10:05 -04:00
Joey Hess	5a8e26a817	fixup after branch merge	2019-07-08 09:01:50 -04:00
Joey Hess	5238610a05	Merge branch 'post-debian-stable-release'	2019-07-08 08:59:43 -04:00
Joey Hess	843b091093	releasing package git-annex version 7.20190708	2019-07-08 08:58:44 -04:00
Joey Hess	0c6b7e288d	Add BLAKE2BP512 and BLAKE2BP512E backends using a blake2 variant optimised for 4-way CPUs This had been deferred because the Debian package of cryptonite, and possibly other builds, was broken for blake2bp, but I've confirmed #892855 is fixed. This commit was sponsored by Brett Eisenberg on Patreon.	2019-07-05 15:30:03 -04:00
Joey Hess	9a5ddda511	remove many old version ifdefs Drop support for building with ghc older than 8.4.4, and with older versions of serveral haskell libraries than will be included in Debian 10. The only remaining version ifdefs in the entire code base are now a couple for aws! This commit should only be merged after the Debian 10 release. And perhaps it will need to wait longer than that; it would make backporting new versions of git-annex to Debian 9 (stretch) which has been actively happening as recently as this year. This commit was sponsored by Ilya Shlyakhter.	2019-07-05 15:09:37 -04:00
Joey Hess	b8ef1bf3be	Fix find --json to output json once more. Reversion from commit `436f10771`, CustomOutput was forcing quiet output which overrode the json setting. find happened to be the only command that uses CustomOutput and also outputs json. (metadata --get does also use CustomOutput and --json does not enable json output for that, which may be an oversight, but was already the behavior before this regression.)	2019-07-05 09:58:37 -04:00
Joey Hess	23f09790b6	releasing package git-annex version 7.20190626	2019-06-26 12:30:03 -04:00
Joey Hess	9273f80301	OSX dmg: Put git-annex's version in the Info.plist file.	2019-06-26 12:10:35 -04:00
Joey Hess	0cc8f2426c	arm ghc bug fixed	2019-06-26 00:55:05 -04:00
Joey Hess	42c386fc47	add: Display progress meter when hashing files. * add: Display progress meter when hashing files. * add: Support --json-progress option.	2019-06-25 13:12:47 -04:00
Joey Hess	84e729fda5	fix init default description reversion init: Fix a reversion in the last release that prevented automatically generating and setting a description for the repository. Seemed best to factor out uuidDescMapRaw that does not have the default mempty descrition behavior. I don't much like that behavior, but I know things depend on it. One thing in particular is `git annex info` which lists the uuids and descriptions; if the current repo has been initialized in some way that means it does not have a description, it would not show up w/o that. (Not only repos created due to this bug might lack that. For example a repo that was marked dead and had --drop-dead delete its git-annex branch info, and then came back from the dead would similarly not be in the uuid.log. Also there have been other versions of git-annex that didn't set a default description; for years there was no default description.)	2019-06-20 20:30:24 -04:00
Joey Hess	7264203eb1	importfeed: When there's a problem parsing the feed, --debug will output the feed content that was downloaded. And let the user know about it in the failure messages.	2019-06-20 12:37:07 -04:00
Joey Hess	759fd9ea68	avoid url resume from 0 When downloading an url and the destination file exists but is empty, avoid using http range to resume, since a range "bytes=0-" is an unusual edge case that it's best to avoid relying on working. This is known to fix a case where importfeed downloaded a partial feed from such a server. Since importfeed uses withTmpFile, the destination always exists empty, so it would particularly tickle such problem servers. Resuming from 0 is otherwise possible, but unlikely.	2019-06-20 12:26:17 -04:00
Joey Hess	04cc470201	run download checksum verification in separate job pool get, move, copy, sync: When -J or annex.jobs has enabled concurrency, checksum verification uses a separate job pool than is used for downloads, to keep bandwidth saturated. Not yet done for upload checksum verification, but that only affects remotes on local disks.	2019-06-17 14:58:02 -04:00
Joey Hess	502ce3f243	Merge branch 'starting'	2019-06-15 12:42:10 -04:00
Joey Hess	0bd9e8c0e2	releasing package git-annex version 7.20190615	2019-06-15 12:39:16 -04:00
Joey Hess	44de3fff0b	avoid rsync/gcrypt ssh startup delay with -J Avoid a delay at startup when concurrency is enabled and there are rsync or gcrypt special remotes, which was caused by git-annex opening a ssh connection to the remote too early. sshOptions makes a connection to the ssh server if one is not already open, when concurrency is enabled. Avoid doing that at startup, when the remote list is being built, but the remote may not be used at all. Instead, rsync/gcrypt now runs sshOptions once per ssh connection to the server. This should not be significant overhead since Remote.Git already has the same overhead (as do Bup and Ddar).	2019-06-13 11:16:38 -04:00
Joey Hess	e07003ab73	Revert "separate queue for cleanup actions" This reverts commit `659640e224` and `4932972487` Too early to include these in a release; they'll be de-reverted after the release.	2019-06-12 14:47:40 -04:00
Joey Hess	e1c48509d7	remove incorrect changelog entry I didn't speed up -J seek yet	2019-06-12 14:13:45 -04:00
Joey Hess	8e5ea28c26	finish CommandStart transition The hoped for optimisation of CommandStart with -J did not materialize. In fact, not runnign CommandStart in parallel is slower than -J3. So, CommandStart are still run in parallel. (The actual bad performance I've been seeing with -J in my big repo has to do with building the remoteList.) But, this is still progress toward making -J faster, because it gets rid of the onlyActionOn roadblock in the way of making CommandCleanup jobs run separate from CommandPerform jobs. Added OnlyActionOn constructor for ActionItem which fixes the onlyActionOn breakage in the last commit. Made CustomOutput include an ActionItem, so even things using it can specify OnlyActionOn. In Command.Move and Command.Sync, there were CommandStarts that used includeCommandAction, so output messages, which is no longer allowed. Fixed by using startingCustomOutput, but that's still not quite right, since it prevents message display for the includeCommandAction run inside it too.	2019-06-12 13:24:01 -04:00
Joey Hess	659640e224	separate queue for cleanup actions When running multiple concurrent actions, the cleanup phase is run in a separate queue than the main action queue. This can make some commands faster, because less time is spent on bookkeeping in between each file transfer. But as far as I can see, nothing will be sped up much by this yet, because all the existing cleanup actions are very light-weight. This is just groundwork for deferring checksum verification to cleanup time. This change does mean that if the user expects -J2 will mean that they see no more than 2 jobs running at a time, they may be surprised to see 4 in some cases (if the cleanup actions are slow enough to notice). It might also make sense to enable background cleanup without the -J, for at least one cleanup action. Indeed, that's the behavior that -J1 has now. At some point in the future, it make make sense to make the behavior with no -J the same as -J1. The only reason it's not currently is that git-annex can build w/o concurrent-output, and also any bugs in concurrent-output (such as perhaps misbehaving on non-VT100 compatible terminals) are avoided by default by only using it when -J is used.	2019-06-05 17:54:35 -04:00
Joey Hess	082e1f1738	Don't try to import .git directories from special remotes Because git does not support storing git repositories inside a git repository.	2019-06-04 15:14:20 -04:00
Joey Hess	67c06f5121	add back support for ftp urls Add back support for ftp urls, which was disabled as part of the fix for security hole CVE-2018-10857 (except for configurations which enabled curl and bypassed public IP address restrictions). Now it will work if allowed by annex.security.allowed-ip-addresses.	2019-05-30 14:51:34 -04:00
Joey Hess	1871295765	rename annex.security.allowed-http-addresses Renamed annex.security.allowed-http-addresses to annex.security.allowed-ip-addresses because it is not really specific to the http protocol, also limiting eg, git-annex's use of ftp and via youtube-dl, several other protocols. The old name for the config will still work. If both old and new name are set, the new name will win.	2019-05-30 12:43:40 -04:00
Joey Hess	8960f259b8	make readonly export remotes really be readonly When a remote is configured to be readonly, don't allow changing what's exported to it. This was missed in the original export remote implementation, but it makes sense for a readonly export remote to not be allowed to change.	2019-05-28 11:04:28 -04:00
Joey Hess	f2a54e3401	Android: Improve installation process when the user's login shell is not bash. ~/.profile works for bash, but not all other login shells. This setting PATH is a minor convenience for users, particuarly since typing on android is so much harder. The usual linux standalone bundle just expects the user to know how to add it to PATH. I don't want this code to grow special cases for every possible login shell. So displaying a message to the presumably minority who don't use bash seems like the best choice. Longer term, I'd hope termux gets some way to set an environment variable for all login shells. Systems using PAM can, via ~/.pam_environment. Or alternatively, add a git-annex package to termux, even if just an installer package. I'd rather spend time on either of those than on making this minor thing support more login shells. This commit was sponsored by mo on Patreon.	2019-05-23 13:06:31 -04:00
Joey Hess	a14f6ce758	fix repo description setting bugs * init: When the repository already has a description, don't change it. * describe: When run with no description parameter it used to set the description to "", now it will error out.	2019-05-23 12:51:01 -04:00
Joey Hess	e06feb7316	honor preferred content when importing Importing from a special remote honors its preferred content too; unwanted files are not imported. But, some preferred content expressions can't be checked before files are imported, and trying to import with such an expression will fail. Tested this with scenarios including changing the preferred content expression and making sure merging the import didn't delete files that were no longer wanted. There was one minor inefficiency mentioned in the todo that I punted on.	2019-05-21 14:38:06 -04:00
Joey Hess	3b9a19171a	Merge branch 'master' into preferred	2019-05-21 11:34:45 -04:00
Joey Hess	5e1221ad53	Improve shape of commit tree when importing from unversioned special remotes Make the import have the previous import as a parent, so eg `git log --stat` displays a useful diff. Also a minor optimisation, only calculate the depth of the imported history once.	2019-05-21 11:32:54 -04:00
Joey Hess	7d177b78e4	docs for export preferred content This includes a note about how include= and exclude= match when exporting a subtree. I don't know if the note is prominent enough, but the behavior seems unsurprising enough.	2019-05-20 12:06:02 -04:00
Joey Hess	82186ca58f	annex.jobs=cpus etc Added the ability to run one job per CPU (core), by setting annex.jobs=cpus, or using option --jobs=cpus or -Jcpus. Built with future expansion in mind, including not defaulting matching on Concurrency so more constructors can later be added, and using "cpu" instead of "0".	2019-05-10 13:27:08 -04:00
Joey Hess	e35f96aea9	Makefile: Added install-completions to install target.	2019-05-08 10:48:38 -04:00
Joey Hess	aaeb85361c	Merge branch 'wip'	2019-05-07 13:07:45 -04:00
Joey Hess	6eaa0af42f	releasing package git-annex version 7.20190507	2019-05-07 13:05:52 -04:00
Joey Hess	2d33122215	avoid ingest lockdown file escaping the withOtherTmp call Fixes bug that caused git-annex to fail to add a file when another git-annex process cleaned up the temp directory it was using. Solution is just to push withOtherTmp out to a higher level, so that the whole ingest process can be completed inside it. But in the assistant, that was not practical to do, since withOtherTmp runs in the Annex monad and the assistant does not. Worked around by introducing a separate temp directory that only the assistant uses for lockdown. Since only one assistant can run at a time, it's easy to clean up that directory of old cruft at startup.	2019-05-07 13:04:57 -04:00
Joey Hess	b03e65d260	Improved locking when multiple git-annex processes are writing to the .git/index file	2019-05-06 15:15:12 -04:00
Joey Hess	bf7ecd6892	fix export subtree reversion Fix reversion in last release that caused wrong tree to be written to remote tracking branch after an export of a subtree. The invariant "commitsha should have the treesha as its tree" was not met due to a bug. Guarantee it's met by catting the commitsha to find its actual tree. A little bit slower, but this is not run often.	2019-05-06 13:57:13 -04:00
Joey Hess	4da50456a3	releasing package git-annex version 7.20190503	2019-05-03 12:48:28 -04:00
Joey Hess	70d16d07fe	fix typos	2019-05-01 14:43:35 -04:00
Joey Hess	700a3f2787	Merge branch 'master' into import-from-s3	2019-05-01 14:30:52 -04:00
Joey Hess	9dd764e6f7	Added mimeencoding= term to annex.largefiles expressions. * Added mimeencoding= term to annex.largefiles expressions. This is probably mostly useful to match non-text files with eg "mimeencoding=binary" * git-annex matchexpression: Added --mimeencoding option.	2019-04-30 12:17:22 -04:00
Joey Hess	15bd7d57ca	info: Show when a remote is configured with importtree	2019-04-23 14:27:43 -04:00
Joey Hess	2f79cb4b45	versioned import from S3 is working Still some bugs and two stubbed methods to implement though.	2019-04-19 15:13:49 -04:00
Joey Hess	9dc7a10448	Drop support for building with aws older than 0.14. debian stable has 0.14 so lose the complexity for old versions	2019-04-19 14:27:59 -04:00
Joey Hess	c0c38e986d	added renameremote command	2019-04-15 13:49:03 -04:00
Joey Hess	f95f340c73	sync: When listing contents on an import remote fails, proceed with other syncing instead of aborting Switch listContents to being a proper CommandStart, so if it throws an exception, it will be treated like any other command action that fails. downloadImport apparently does not ever throw an exception, and itself uses commandAction, so it can't be a CommandStart.	2019-04-10 17:02:56 -04:00
Joey Hess	3d6f1b7dba	Made git-annex sync --content much faster when all the remotes it's syncing with are export/import remotes It was unnecessarily going over all files and checking preferred content against no remotes.	2019-04-10 12:42:10 -04:00
Joey Hess	6babb2c73f	remove wrong uniqueness constraint from ContentIdentifier db Fix bug that caused importing from a special remote to repeatedly download unchanged files when multiple files in the remote have the same content. Unfortunately, there's really no good way to remove a uniqueness constraint from a sqlite database. The best that can be done is to make a new table and copy the data over. But that would require using persistent's migrations or raw sql, and I don't want to do either. Instead, a sledgehammer approach: Renamed .git/annex/cid to .git/annex/cids. When the new database doesn't exist, it will be populated from the git-annex branch. Noting deletes the old database. Don't want to delete it out from under some long-running git-annex process that might be using it. It could eventually be deleted. But this is such a new feature, probably few repos have the database in any case.	2019-04-09 19:58:24 -04:00
Joey Hess	7b6d0da9b8	adb import As well as adding the necessary methods, a few other changes to the adb remote: * Use ".annextmp" extension for temp files, to avoid conflict with other temp files. * Stop using "echo $?" to get exit status of command inside adb. There were two problems; first the "echo" just before it meant it was always 0! And secondly, it seems kind of random on my phone whether it's 1 or 0, not dependant on whether the command seems to have succeeded.	2019-04-09 17:52:41 -04:00
Joey Hess	ece57002c6	releasing package git-annex version 7.20190322	2019-03-22 13:57:17 -04:00
Joey Hess	7d37011a11	S3: Added protocol= initremote setting, to allow https to be used on a non-standard port protocol=https implies port=443 and port=443 implies protocol=https -- this was necessary because the existing configs set port=443, but with a protocol setting, users will naturally want to use it, and then there's no need for them to supply the default https port. So we keep back-compat, add a nicer way to enable https, and also add support for non-standard https ports.	2019-03-22 12:17:05 -04:00
Joey Hess	97ae0f2c22	Android: Fix typo of name of armv7l in installation script. Thanks, 4omecha.	2019-03-22 09:39:18 -04:00
Joey Hess	5ab97333e4	import: Let --force overwrite symlinks, not only regular files The docs already implied this should work.	2019-03-18 16:40:15 -04:00
Joey Hess	258e8f8f29	Removed bundled gpg from the Linux standalone build and OSX dmg Because gpg now always wants to use gpg-agent, and shipping such a daemon in those is not a good idea.	2019-03-18 16:31:07 -04:00
Joey Hess	d5ee5fef65	fsck: Detect situations where annex.thin has caused data loss to the content of locked files. In particular, when two files had the same content, and one was unlocked and modified, with annex.thin that can corrupt the content of the annex object, and so fsck on the other file should detect that. getKeyStatus was relying on Database.Keys.getAssociatedFiles to tell when a file is unlocked, but that can false positive because the database can list old associated files. Instead, separate out the case of unlocked object which has multiple hardlinks when annex.thin is in use.	2019-03-18 15:59:43 -04:00
Joey Hess	60ca3ce043	Add -- before %f in the smudge/clean filter configuration To support filenames starting with dashes. To update the config of existing repositories, you can re-run git-annex init. Perhaps it should check every time for the old config and update it, but that has several problems: - read-only repos - unexpected commands like `git annex find` changing git configs might be surprising behavior Since filenames starting with dashes are not super common and the user can re-init easily enough if their repo needs fixed, I went for the simplest fix.	2019-03-18 14:12:13 -04:00
Joey Hess	8758f9c561	addurl --file: Fix a bug that made youtube-dl be used unneccessarily when adding an html url that does not contain any media.	2019-03-18 13:34:29 -04:00
Joey Hess	6491b62614	Makefile: Added install-home target which installs git-annex into the HOME directory	2019-03-18 12:36:03 -04:00
Joey Hess	353e4f6d24	update changelog	2019-03-11 14:17:49 -04:00
Joey Hess	633021e135	--no-push and remote.name.annex-push prevent exporting trees to special remotes Users may want sync to only export, or only import and this is broadly analagous to push and pull, so it makes sense to use the same configuration for it.	2019-03-09 13:21:49 -04:00
Joey Hess	5f17a9cc50	docs for importtree config	2019-03-04 15:39:19 -04:00
Joey Hess	18d7a1dbbb	make export and sync update special remote tracking branch The branch is only updated once the export is 100% complete. This way, if an export is started but interrupted and so the remote does not yet contain some of the files, an import will make a commit on the old branch, and so won't delete the missing files.	2019-03-01 16:35:48 -04:00
Joey Hess	760f26ebc6	Merge branch 'master' into importtree	2019-02-26 11:36:36 -04:00
Joey Hess	19f833b0b1	aws-0.21.1 * S3: Support enabling bucket versioning when built with aws-0.21.1. * stack.yaml: Build with aws-0.21.1	2019-02-24 12:45:09 -04:00
Joey Hess	4747fa923d	export: Deprecated the --tracking option. Instead, users can configure remote.<name>.annex-tracking-branch themselves.	2019-02-23 15:54:33 -04:00
Joey Hess	d65a78ff5b	Fix cleanup of git-annex:export.log after git-annex forget --drop-dead This log, unlike all other current top-level logs, is a new format log. I have not checked what throwing it at the old log parser did, but it seems likely it ignored unparsable lines, and so perhaps deleted all lines from the log.	2019-02-22 21:34:31 -04:00
Joey Hess	7af55de83c	optimisation: use graftTree to remember the export branch Sped up git-annex export in repositories with lots of keys. Old method read whole git-annex branch tree into memory.	2019-02-22 11:16:22 -04:00
Joey Hess	d839c2110a	fix encoding of metadata containing newlines This fixes a reversion in the ByteString conversion. The old code used isSpace to decide when the metadata value needs to be base64 encoded, and that incorrectly changed to only checking if it contained ' '. Note that only '\n' and '\r' were added and not other sorts of whitespace that isSpace matches, like '\t' and '\v'. Only the former would cause problems.	2019-02-20 14:26:18 -04:00
Joey Hess	f47ee98337	releasing package git-annex version 7.20190219	2019-02-19 12:19:53 -04:00
Joey Hess	1647b9c7a4	improve wording	2019-02-18 17:52:18 -04:00
Joey Hess	9f6b7d6258	On Windows, avoid using rsync for file-to-file copies, since rsync is not always available there. Installing git-annex with stack rsync won't be available. Also, using the git-annex installer with 64 bit git installs a non-working rsync binary because it's linked with libraries provided by 32 bit git.	2019-02-18 17:27:34 -04:00
Joey Hess	1a367cad83	Fix path separator bug on Windows that completely broke git-annex since version 7.20190122.	2019-02-18 17:16:39 -04:00
Joey Hess	c7893bf9b7	init: Fix bug when direct mode needs to be enabled on a crippled filesystem, that left the repository in indirect mode.	2019-02-15 12:34:03 -04:00
Joey Hess	3fa6be1fef	Added NetworkBSD build flag to deal with Network.BSD moving to a new package. Like with the network-uri split, cabal will automatically turn off the flag when building with an old network. I have not tested building with the new network-3.0.0.0 yet; several other dependencies including aws are still pinned on network-2.*	2019-02-08 13:36:39 -04:00
Joey Hess	60c1b5c994	deal with attempt to export filename with # or ? to webdav xporting files with '#' or '?' in their name won't work because urls get truncated on those. Fail in a better way in this case, and avoid failing when removing such files from the export, so after the user has renamed the problem files the export will succeed.	2019-02-07 13:47:57 -04:00
Joey Hess	c3f47ba389	make .noannex file prevent repo fixups Avoid performing repository fixups for submodules and git-worktrees when there's a .noannex file that will prevent git-annex from being used in the repository. This change is ok as long as the .noannex file is really going to prevent git-annex from being used. But, init --force could override the file. Which would result in the repo being initialized without the fixups having run. To avoid that situation decided to change init, to not let --force be used to override a .noannex file. Instead the user can just delete the file.	2019-02-05 14:43:23 -04:00
Joey Hess	b080699a95	fromkey --json * fromkey: Added --json. * fromkey --batch output changed to support using it with --json. The old output was not parseable for any useful information, so this is not expected to break anything.	2019-02-05 14:03:29 -04:00
Joey Hess	7b46b43c48	fromkey: Made idempotent If the worktree file already exists, and is annexed and uses the same key, avoid failing, nothing needs to be done. Had to add lookupFileNotHidden to handle the case where an adjust --hide-missing is in use, and the worktree file was hidden due to the object content being missing. lookupFile would return the key of the hidden file, but it makes sense that after fromkey succeeds, the worktree must contain the file it was supposed to set up.	2019-02-05 13:13:13 -04:00
Joey Hess	a64fca92f6	Fix race in cleanup of othertmp directory that could result in a failure attempting to access it. Need to create the directory after the lock is held, not before. The other racing process would need to shut down at just the wrong time, running cleanupOtherTmp. This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2019-02-02 13:56:31 -04:00
Joey Hess	7b9701675e	Display progress bar when getting files from export remotes And moved the progress bar display into storeExport as well. This commit was sponsored by John Pellman on Patreon.	2019-01-31 13:34:12 -04:00

... 2 3 4 5 6 ...

964 commits