* Added post-receive hook, which makes updateInstead work with direct
mode and adjusted branches.
* init: Set up the post-receive hook.
This commit was sponsored by Fernando Jimenez on Patreon.
... to avoid it consuming stdin that it shouldn't.
This fixes git-annex-checkpresentkey --batch remote, which didn't output
results for all keys passed into it.
Other git-annex commands that communicate with a remote over ssh may also
have been consuming stdin that they shouldn't have, which could have
impacted using them in eg, shell scripts. For example, a shell script
reading files from stdin and passing them to git annex drop would be
impacted by this bug: whenever git annex drop ran git-annex-shell
checkpresent, it would consume part or all of the stdin that the shell
script was supposed to consume.
Fixed by adding a ConsumeStdin parameter to Annex.Ssh.sshOptions, which
is used throughout git-annex to run ssh (in order for ssh connection
caching to work). Every call site was checked to see if it used
CreatePipe for stdin, and if not was marked NoConsumeStdin.
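To illustrate the idea (a sketch only; runSsh and its plumbing here are
illustrative, not git-annex's actual code):

    import System.Process
    import System.IO

    data ConsumeStdin = ConsumeStdin | NoConsumeStdin

    -- Run ssh, letting it inherit the caller's stdin only when
    -- explicitly allowed; otherwise feed it /dev/null so it cannot
    -- eat input meant for the calling program.
    runSsh :: ConsumeStdin -> [String] -> IO ()
    runSsh cs args = do
        stdinhandling <- case cs of
            ConsumeStdin -> return Inherit
            NoConsumeStdin -> UseHandle <$> openFile "/dev/null" ReadMode
        (_, _, _, p) <- createProcess (proc "ssh" args)
            { std_in = stdinhandling }
        _ <- waitForProcess p
        return ()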
import: --deduplicate and --skip-duplicates were implemented inefficiently;
they unnecessarily hashed each file twice. They have been improved to only
hash once.
The new approach is to lock down (minimally) and hash files, and then
reuse that information when importing them.
This was rather tricky, especially in detecting changes to files while
they are being imported.
The output of import changed slightly. While before it silently skipped
over files with eg --skip-duplicates, now it shows each file as it starts
to act on it. Since every file is hashed first thing, it would otherwise
not be clear what file import is chewing on. (Actually, it wasn't clear
before when any of the duplicates switches were used.)
This commit was sponsored by Alexander Thompson on Patreon.
Most remotes have an idempotent setup that can be reused for
enableremote, but in a few cases, setup needs to be told which of the
two is being run, and whether a UUID provided to setup was used.
This is groundwork for making initremote be able to provide a UUID.
It should not change any behavior.
Note that it would be nice to make the UUID always be provided to setup,
and make setup not need to generate and return a UUID. What prevented
this simplification is Remote.Git.gitSetup, which needs to reuse the
UUID of the git remote when setting it up, and so has to return that
UUID.
This commit was sponsored by Thom May on Patreon.
Turns out that Data.List.Utils.split is slow and makes a lot of
allocations. Here's a much simpler single-character splitter that behaves
the same (even in wacky corner cases) while running in half the time and
making 75% of the allocations.
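The splitter is along these lines (a sketch; minor details may differ
from the actual implementation):

    -- Split a string on a single character separator.
    splitc :: Char -> String -> [String]
    splitc c s = case break (== c) s of
        (i, _c:rest) -> i : splitc c rest
        (i, []) -> [i]

For example, splitc ',' "a,,b" is ["a","","b"], the same as
Data.List.Utils.split "," "a,,b".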
As well as being an optimisation, this helps move toward eliminating use of
missingh.
(Data.List.Split.splitOn is nearly as slow as Data.List.Utils.split and
allocates even more.)
I have not benchmarked the effect on git-annex, but would not be surprised
to see some parsing of eg, large streams from git commands run twice as
fast, and possibly in less memory.
This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.
Any config names can be set using this; git-annex commands will only look
at specific ones that make sense and are worth the overhead of querying the
branch.
This might also be useful for storing whatever other config-type stuff the
user might want to shove into the git-annex branch.
This commit was sponsored by Jochen Bartl on Patreon.
Revert ServerAliveInterval change in 6.20161111, which caused problems
with too many old versions of ssh and unusual ssh configurations.
It should not have been needed anyway, since ssh is supposed to
have TCPKeepAlive enabled by default.
Added change notification to the P2P protocol.
Switched to a TBChan so that a single long-running thread can be
started, and serve perhaps intermittent requests for change
notifications, without buffering all changes in memory.
The P2P runner currently starts up a new thread each time it waits
for a change, but that should allow later reusing a thread. Although
each connection from a peer will still need a new watcher thread to run.
The dependency on stm-chans is more or less free; some stuff in yesod
uses it, so it was already indirectly pulled in when building with the
webapp.
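The shape of it, assuming stm-chans's Control.Concurrent.STM.TBChan
(a sketch; the surrounding names are illustrative):

    import Control.Concurrent
    import Control.Concurrent.STM
    import Control.Concurrent.STM.TBChan
    import Control.Monad (forever)

    type Change = String  -- stand-in for the real change type

    -- One long-running watcher feeds a bounded channel. Since
    -- writeTBChan blocks once the bound is reached, intermittent
    -- readers cannot cause unbounded buffering of changes.
    startWatcher :: IO Change -> IO (TBChan Change)
    startWatcher waitchange = do
        chan <- newTBChanIO 100
        _ <- forkIO $ forever $
            waitchange >>= atomically . writeTBChan chan
        return chan

    getChange :: TBChan Change -> IO Change
    getChange = atomically . readTBChan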
This commit was sponsored by Francois Marier on Patreon.
ReadContent can't update the log, since it reads lazily. This part of
the P2P monad will need to be rethought.
Associated files are heavily sanitized when received from a peer;
they could be an exploit vector.
This commit was sponsored by Jochen Bartl on Patreon.
ghc 8 added backtraces on uncaught errors. This is great, but git-annex was
using error in many places for an error message targeted at the user, in
some known problem case. A backtrace only confuses such a message, so omit it.
Notably, commands like git annex drop that failed due to eg, numcopies,
used to use error, so had a backtrace.
This commit was sponsored by Ethan Aubin.
When used with an older version of ssh, any ServerAliveInterval in
~/.ssh/config will be overridden by .git/annex/ssh.config.
This commit was sponsored by Josh Taylor on Patreon.
So that stalled transfers will be noticed within about 3 minutes,
even if TCPKeepAlive is disabled or doesn't work.
Rather than setting with -o, use -F with another config file,
so that any settings in ~/.ssh/config or /etc/ssh/ssh_config override this.
Closes https://github.com/datalad/datalad/issues/1020
The use of runWriter in scanUnlockedFiles broke due to this change;
it failed with blocked indefinitely in mvar, because the database write
handle was taken while linkFromAnnex needed to also write to it (to update
the inode cache). So, switched to using a separate runWriter for each call
to addAssociatedFileFast. A little less efficient, but not greatly; the
writes should all still be cached.
In the case where the pointer file is in place, and not the content
of the object, lock's performNew was called with filemodified=True,
which caused it to try to repopulate the object from an unmodified
associated file, of which there were none. So, the content of the object
got thrown away incorrectly. This was the cause (although not the root
cause) of data loss in https://github.com/datalad/datalad/issues/1020
The same problem could also occur when the work tree file is modified,
but the object is not, and lock is called with --force. Added a test case
for this, since it's exercising the same code path and is easier to set up
than the problem above.
Note that this only occurred when the keys database did not have an inode
cache recorded for the annex object. Normally, the annex object would be in
there, but there are of course circumstances where the inode cache is out
of sync with reality, since it's only a cache.
Fixed by checking if the object is unmodified; if so we don't need to
try to repopulate it. This does add an additional checksum to the unlock
path, but it's already checksumming the worktree file in another case,
so it doesn't slow it down overall.
Further investigation found a similar problem occurred when smudge --clean
is called on a file and the inode cache is not populated. cleanOldKeys
deleted the unmodified old object file in this case. This was also
fixed by checking if the object is unmodified.
In general, use of getInodeCaches and sameInodeCache is potentially
dangerous if the inode cache has not gotten populated for some reason.
Better to use isUnmodified. I briefly audited other places that check the
inode cache, and did not see any immediate problems, but it would be easy
to miss this kind of problem.
An easy change now that supportedVersions is a list. Since v3 and v5 are
identical other than version number, just add v3 to the list.
This commit was sponsored by andrea rota.
Fixes a bug introduced with v6 mode that I didn't notice until now.
Probably not many v3 repos left out there, and upgrading them to v6 mode
is not disastrous, only a little premature.
This commit was sponsored by Riku Voipio
.. and have to be checked to see if they point to an annexed file.
Cases where such memory use could occur included, but were not limited to:
- git commit -a of a large unlocked file (in v5 mode)
- git-annex adjust when a large file was checked into git directly
Generally, any use of catKey was a potential problem.
Fix by using git cat-file --batch-check to check size before catting.
This adds another git batch process, which is included in the CatFileHandle
for simplicity.
There could be a performance impact anywhere catKey is used. Particularly
likely to affect adjusted branch generation speed, and operations on
unlocked files in v6 mode. Hopefully since the --batch-check and
--batch read the same data, disk buffering will avoid most overhead.
Leaving only the overhead of talking to the process over the pipe and
whatever computation --batch-check needs to do.
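Roughly, the extra process gets used like this (a sketch; error
handling simplified):

    import System.IO
    import System.Process
    import Text.Read (readMaybe)

    -- Start a long-running git cat-file --batch-check process.
    startBatchCheck :: IO (Handle, Handle)
    startBatchCheck = do
        (Just hin, Just hout, _, _) <- createProcess
            (proc "git" ["cat-file", "--batch-check"])
                { std_in = CreatePipe, std_out = CreatePipe }
        return (hin, hout)

    -- Query an object's size before deciding to cat it; replies
    -- look like "<sha> <type> <size>", or "<ref> missing".
    objectSize :: (Handle, Handle) -> String -> IO (Maybe Integer)
    objectSize (hin, hout) ref = do
        hPutStrLn hin ref
        hFlush hin
        reply <- hGetLine hout
        return $ case words reply of
            [_sha, _objtype, sz] -> readMaybe sz
            _ -> Nothing

A caller can then skip catting anything too large to be a pointer file.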
This commit was sponsored by Bruno BEAUFILS on Patreon.
Speeds up commands like "git-annex find --in remote" by over 50%.
Profiling showed that adjustGitEnv was 21% of the time and 37% of the
allocations of that command. It copied the environment each time with
getEnvironment.
The only repeated use of adjustGitEnv is in withIndexFile, which tends to
be run at least once per file. So, it was optimised by keeping a cache of
the environment, which can be reused.
There could be other, better ways to optimise this. Maybe get the whole
environment once at startup. But then it would have to be serialized back
out each time a child process is run, so I doubt that would be a net win.
It might be better to cache a version of the environment that is
pre-modified to use .git-annex/index. But, profiling doesn't show that
modifying the environment is taking any significant time.
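A sketch of the caching idea (illustrative; the real cache does not
live in a bare IORef):

    import Data.IORef
    import System.Environment (getEnvironment)

    -- Get the environment once and cache it; later calls reuse the
    -- cached copy instead of copying the environment each time.
    cachedEnvironment :: IORef (Maybe [(String, String)])
        -> IO [(String, String)]
    cachedEnvironment cache = readIORef cache >>= \c -> case c of
        Just env -> return env
        Nothing -> do
            env <- getEnvironment
            writeIORef cache (Just env)
            return env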
key2file and file2key were top cost centers according to profiling.
The repeated use of replace was not efficient. This new approach is quite a
lot more efficient.
This commit was sponsored by Denis Dzyubenko on Patreon.
* sync: Previously, when run in a branch with a slash in its name,
such as "foo/bar", the sync branch was "synced/bar". That conflicted
with the sync branch used for branch "bar", so has been changed to
"synced/foo/bar".
* adjust: Previously, when adjusting a branch with a slash in its name,
such as "foo/bar", the adjusted branch was "adjusted/bar(unlocked)".
That conflicted with the adjusted branch used for branch "bar",
so has been changed to "adjusted/foo/bar(unlocked)".
* Also, running sync in an adjusted branch did not correctly sync
changes back to the parent branch when it had a slash in its name.
This bug has been fixed.
Eliminate use of Git.Ref.under and Git.Ref.basename; using
Git.Ref.underBase and Git.Ref.base makes everything handle deep branches
correctly.
Probably no one was adjusting deep branches, and v6 is still experimental
anyway, so I'm not going to worry about the mess that was left by that bug.
In the case of git-annex sync, using a fixed git-annex with an old unfixed
one will mean they use different sync branches for a deep branch, and so
they may stop syncing until the old one is upgraded. However, that's only
a problem when syncing between repositories without going via a central
bare repository. Added a warning about this to the CHANGELOG, but it's
probably not going to affect many people at all.
This commit was sponsored by Riku Voipio.
This makes -Jn work with --json and --quiet, where before
setting -Jn disabled those options.
Concurrent json output is currently a mess though, since threads output
chunks on top of one another.
Only done in -J mode because only if there's concurrency can downloading
from two remotes be faster. Without concurrency, it's likely the case that
sequential downloads from the same remote are faster than switching back
and forth between two remotes.
There is some hairy MVar code here, but basically it just keeps
the activeremotes MVar full except when deciding which remote to assign
to a thread.
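The core of the decision looks something like this (a sketch; the real
code tracks more state per remote):

    import Control.Concurrent.MVar
    import Data.List (minimumBy)
    import Data.Ord (comparing)
    import qualified Data.Map as M

    -- The MVar is kept full except while a thread is choosing a
    -- remote, which serializes the choice across threads.
    pickRemote :: Ord r => MVar (M.Map r Int) -> [r] -> IO r
    pickRemote activeremotes remotes = do
        active <- takeMVar activeremotes
        let least = minimumBy
                (comparing (\r -> M.findWithDefault 0 r active))
                remotes
        putMVar activeremotes (M.insertWith (+) least 1 active)
        return least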
Also affects gets by sync --content -J
This commit was sponsored by Jochen Bartl.
This was disabled in commit 61ccf95004,
because only the assistant used them, and they were clutter. But, now
--failed also uses them.
Remove the failure log files after successful transfers. Should avoid
most of the clutter problems.
Commit 61ccf95004 mentions a subtle behavior
change, which has now been reverted:
There is one behavior change from this. If glacier is being used, and a
manual git annex get --from glacier fails because the file isn't available
yet, the assistant will no longer later see that failed transfer file and
retry the get.
Note that get --from foo --failed will get things that a previous get --from bar
tried and failed to get, etc. I considered making --failed only retry
transfers from the same remote, but it was easier, and seems more useful,
to not have the same remote requirement.
Noisy due to some refactoring into Types/
Use nextRandom to generate the random UUID, rather than using randomIO.
This gets fixes for the following two bugs in the uuid library.
However, this did not impact git-annex much, so a hard dependency has
not been added on uuid-1.3.12.
https://github.com/aslatter/uuid/issues/15
"v4 UUIDs are not that random"
This doesn't greatly affect git-annex, because even with only
2^64 possible UUIDs, the chance that two git-annex repositories
that are clones of the same git repo get the same UUID is minuscule.
And, git-annex generates only one UUID per run, so predicting
subsequent UUIDs is not a problem.
https://github.com/aslatter/uuid/issues/16
"Remove Random instance for UUID, or mark it as deprecated"
git-annex was using that instance; let's stop before it gets
deprecated or removed.
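So UUID generation now amounts to this (using the uuid package):

    import Data.UUID (toString)
    import Data.UUID.V4 (nextRandom)

    -- nextRandom uses a properly seeded v4 generator, avoiding the
    -- Random instance that upstream may deprecate or remove.
    genUUID :: IO String
    genUUID = toString <$> nextRandom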
Show branch:file that is being operated on.
I had to make ActionItem a type and not a type class because
withKeyOptions' passed two different types of values when using the type
class, and I could not get the type checker to accept that.
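The shape of it (constructor names here are illustrative, not
necessarily the actual ones):

    -- A concrete sum type can carry either kind of value, where the
    -- type class version forced each call site to pick a single
    -- instance type.
    data ActionItem
        = ActionItemWorkTreeFile FilePath
        | ActionItemBranchFilePath String FilePath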
This bug caused broken tree objects to get built by a later git annex sync.
This is a somewhat unlikely but not impossible situation, and the test
suite's union_merge_regression test tickled it when it was run on FAT.
Mostly the username is only used for the git committer or other display
purposes, and we can just fall back to a dummy value in these cases.
The only remaining place where an error is thrown is when starting local
pairing, which needs the username to be known.
The queue could potentially contain changes from before withAltRepo, and
get flushed inside the call, which would apply the changes to the modified
repo.
Or, changes could be queued in withAltRepo that were intended to affect
the modified repo, but don't get flushed until later.
I don't know of any cases where either happens, but better safe than sorry.
Note that this affects withIndexFile, which is used in git-annex branch
updates. So, it potentially makes things slower. Should not be by much;
the overhead consists only of querying the current queue a couple of times,
and potentially flushing changes queued within withAltRepo earlier, that
could have maybe been bundled with other later changes.
Notice in particular that the existing queue is not flushed when calling
withAltRepo. So eg when git annex add needs to stage files in the index,
it will still bundle them together efficiently.
Added guard in Annex.Transfer to prevent this problem at a deeper level.
I'm unhappy with NoUUID, but having Maybe UUID instead wouldn't help either
if nothing checked that there was a UUID. Since there legitimately need to
be Remotes that do not have a UUID, I can't see a way to fix it at the type
level, short of making there be two separate types of Remotes.
Removed the instance LensGpgEncParams RemoteConfig because it encouraged
code that does not take the RemoteGitConfig into account.
RemoteType's setup was changed to take a RemoteGitConfig,
although the only place that is able to provide a non-empty one is
enableremote, when it's changing an existing remote. This led to several
follow-on changes, and got RemoteGitConfig plumbed through.
This is actually worse than I thought; when git is being run with a
detached work tree, GIT_INDEX_FILE is treated as a path relative to CWD,
instead of the normal behavior of being relative to the top of the work tree.
This seems to make it basically impossible for any program that wants to
use GIT_INDEX_FILE to use anything other than an absolute path to it; there
are too many configurations to keep straight that can change how git
interprets what should be a simple relative path to a file.
(I have complained to the git developers.)
This affected git annex view. It turns out that some other places
that use GIT_INDEX_FILE were already working around the bug. I removed the
workaround from Annex.Branch since the new workaround will do.
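The workaround boils down to absolutizing the path before setting the
variable (a sketch):

    import System.Directory (makeAbsolute)

    -- git may interpret a relative GIT_INDEX_FILE relative to CWD or
    -- to the top of the work tree depending on its configuration, so
    -- always hand it an absolute path.
    indexEnv :: FilePath -> IO (String, String)
    indexEnv f = do
        f' <- makeAbsolute f
        return ("GIT_INDEX_FILE", f')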
Could not think of a foolproof way to detect if the old adjusted branch was
just behind the current branch. It's possible that the user amended the
adjusting commit at the head of the adjusted branch, for example.
I decided to bail in this situation, instead of just entering the old
branch, so that if git annex adjust succeeds the user is always in a
*current* adjusted branch, not some old and out of date one.
What could perhaps be done is enter the old branch and then update it. But
that seems too magical; the user may have rebased master or something or
may not want to propagate the changes from the old branch. Best to error
out.
It started exporting an isSymbolicLink that supports Windows. But
git-annex does not use symlinks on Windows yet, and this conflicts with
the function of the same name from unix-compat, so hide it.
git 2.8.1 (or perhaps 2.9.0) is going to prevent git merge from merging in
unrelated branches. Since the webapp's pairing etc features often combine
together repositories with unrelated histories, work around this behavior
change by setting GIT_MERGE_ALLOW_UNRELATED_HISTORIES when the assistant
merges.
Note though that this is not done for git annex sync's merges, so
it will follow git's default or configured behavior.
When git-annex is used with a git version older than 2.2.0, disable support for
adjusted branches, since GIT_COMMON_DIR is needed to update them and was first
added in that version of git.
Made all Annex.Perms file mode changing functions ignore errors when
core.sharedRepository is set, because the file might be owned by someone
else. I don't fancy getting bug reports about crashes due to set modes in
this configuration, which is a very foot-shooty configuration in the first
place.
The fsck warning is necessary because old repos kept files mode 444, which
doesn't allow locking them, and so if the mode remains 444 due to the file
being owned by someone else, the user should be told about it.
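The pattern, roughly (a POSIX-only sketch; the real code gets the
config from the Annex monad):

    import Control.Exception (IOException, try)
    import System.Posix.Files (setFileMode)
    import System.Posix.Types (FileMode)

    -- With core.sharedRepository, the file may be owned by another
    -- user, so failing to change its mode is not treated as an error.
    setModeMaybeShared :: Bool -> FilePath -> FileMode -> IO ()
    setModeMaybeShared shared f mode
        | shared = do
            r <- try (setFileMode f mode)
                :: IO (Either IOException ())
            either (const (return ())) return r
        | otherwise = setFileMode f mode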
When annex.thin is set, adding an object will add the execute bits to the
work tree file, and this does mean that the annex object file ends up
executable.
This doesn't add any complexity that wasn't already present, because git
annex add of an executable file has always ingested it so that the annex
object ends up executable.
But, since an annex object file can be executable or not, when populating
an unlocked file from one, the executable bit is always added or removed
to match the mode of the pointer file.
This is how direct mode does it too, and somehow, for reasons that
currently escape me, this makes git merge not care if it's run with an
empty work tree.
Was using L.readFile, so the Handle would remain open until the garbage
collector got around to it. Changed to explicit open and close, so we know
it's always closed when the function returns.
This makes the direct mode to v6 upgrade able to be performed in one clone
of a repository without affecting other clones, which can continue using v5
and direct mode.
This does mean that it has to write out temp files containing updated
objects for the merge. So may use more disk space, and disk IO, but that
should generally win out over needing to launch N separate
git hash-object processes.
An unlocked present file does not have a pointer file in the worktree, so
info skipped counting it.
It may be that unused was also affected by the problem, but it seemed not
to be in my tests. I think because of the use of the associatedFilesFilter.
This fix slows down both info and unused a little bit, since they have to
query the contents of files from git, but only when handling unlocked files.
Only reverse adjust the changes in the commit, which means that adjustments
do not need to be generally cleanly reversible.
For example, an adjustment can unlock all locked files, but does not need
to worry about files that were originally unlocked when reversing, because
it will only ever be run on files that have been changed. So, it's ok
if it locks all files when reversed, or even leaves all files as-is when
reversed.
Using adjusted/unlocked/master made lots of git stuff dealing with "master"
complain that it was ambiguous. This new approach is more like view branch
names, and shows the adjustment right there in the branch display even if
only the basename of the branch is shown.
There's a race here, but entering an adjusted branch for the first time is
not something to do when a commit is being made at the same time. Although,
may want to prevent the assistant from committing while entering the
adjusted branch.
So, it will pull and push the original branch, not the adjusted one.
And, for merging, it will use updateAdjustedBranch (not implemented yet).
Note that remaining uses of Git.Branch.current need to be checked too;
for things that should act on the original branch, and not the adjusted
branch.
"git annex adjust" may be a temporary interface, but works for a proof of
concept.
It is pretty fast at creating the adjusted branch. The main overhead is
injecting pointer files. It might be worth optimising that by reusing the
symlink target as the pointer file content. When I tried to do that,
the problem was that the clean filter doesn't use that same format, and so
git thought files had changed. Could be dealt with, perhaps make the clean
filter use symlink format for pointer files when on an adjusted branch?
But the real overhead is in checking out the branch, when git runs the
smudge filter once per file. That is perhaps too slow to be usable,
although it may only affect initial checkout of the branch, and not
updates. TBD.
* add, addurl, import, importfeed: When in a v6 repository on a crippled
filesystem, add files unlocked.
* annex.addunlocked: New configuration setting, makes files always be
added unlocked. (v6 only)
The problem with having the slashes unescaped is that it broke parsing, since
the parser takes the filename to get the part containing the key.
That particularly affected URL keys.
This makes the format be the same as symlinks point to, which keeps things
simple.
Existing pointer files will continue to work ok.
Before, the call to mkProgressUpdater created the directory as a
side-effect, but since that ignored failure to create it, this led to
a "does not exist" exception when the transfer lock file was created,
rather than a permissions error.
So, make sure the directory exists before trying to lock the file in it.
When a PermissionDenied exception is caught, skip making the transfer lock.
This lets downloads from readonly remotes happen.
If an upload is being tried, and the lock file can't be written due to
permissions, then probably the actual transfer will fail for the same
reason, so I think it's ok that it continues w/o taking the lock in that
case.
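Sketch of the guard (illustrative; the real locking code does more):

    import Control.Exception (try)
    import System.IO.Error (isPermissionError)

    -- Try to take the transfer lock; when it cannot be created due
    -- to permissions (eg, a readonly remote), proceed without it.
    withTransferLock :: IO lock -> (Maybe lock -> IO a) -> IO a
    withTransferLock takelock go = do
        r <- try takelock
        case r of
            Right l -> go (Just l)
            Left e
                | isPermissionError e -> go Nothing
                | otherwise -> ioError e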
The type checker should have noticed this, but the changes to mapM
that make it accept any Traversable hid the fact that it was not being
passed a list at all. Thus, what should have returned an empty list most
of the time instead returned [""] which was treated as the name of the
associated file, with disastrous consequences.
When I have time, I should add a test case checking what sync --content
drops. I should also consider replacing mapM with one re-specialized to
lists.
Previously, it only flushed when the queue got larger than 1.
Also, make the queue auto-flush when items are added, rather than needing
to be flushed as a separate step. This simplifies the code and makes it more
efficient too, as it avoids needing to read the queue out of the state to
check if it should be flushed.
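The shape of the auto-flushing queue (a simplified sketch):

    -- A queue that flushes itself as soon as adding an item reaches
    -- the limit, so callers never flush as a separate step.
    data Queue a = Queue
        { queueLimit :: Int
        , queueItems :: [a]  -- newest first
        }

    addQueue :: (a -> IO ()) -> a -> Queue a -> IO (Queue a)
    addQueue flusher item q
        | length items' >= queueLimit q = do
            mapM_ flusher (reverse items')
            return q { queueItems = [] }
        | otherwise = return q { queueItems = items' }
      where
        items' = item : queueItems q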
The homomorphs are back, just encoded such that it doesn't crash in LANG=C
However, I noticed a bug in the old escaping; [pseudoSlash] was escaped the
same as ['/','/']. Fixed by using '%' to escape pseudoSlash. Which requires
doubling '%' to escape it, but that's already done in the escaping of
worktree filenames in a view, so is probably ok.
Linking the file to the tmp dir was not necessary in the clean
filter, and it caused the ctime to change, which caused git to think
the file was changed. This caused git status to get slow as it kept
re-cleaning unchanged files.
Fixes several bugs with updates of pointer files. When eg, running
git annex drop --from localremote
it was updating the pointer file in the local repository, not the remote.
Also, fixes drop ../foo when run in a subdir, and probably lots of other
problems. Test suite drops from ~30 to 11 failures now.
TopFilePath is used to force thinking about what the filepath is relative
to.
The data stored in the sqlite db is still just a plain string, and
TopFilePath is a newtype, so there's no overhead involved in using it in
Database.Keys.
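That is, roughly:

    -- A path known to be relative to the top of the git repository.
    -- The newtype has no runtime cost, but forces call sites to say
    -- what their paths are relative to.
    newtype TopFilePath = TopFilePath { getTopFilePath :: FilePath }
        deriving (Show, Eq)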
WorkTree.lookupFile was finding a key for a file that's deleted from the
work tree, which is different than the v5 behavior (though perhaps the same
as the direct mode behavior). Fix by checking that the work tree file exists
before catting its key.
Hopefully this won't slow things down much; the catKey is probably much more expensive.
I can't see any way to optimise this, except perhaps to make Command.Unused
check if work tree files exist before/after calling lookupFile. But,
it seems better to make lookupFile really only find keys for worktree files;
that's what it's intended to do.
Else, queued file stages won't have reached the index, and it won't find
everything.
This evidently fixes a reversion in my work today, although I don't see how
I broke it. It didn't use to flush the queue first, before, and worked
somehow.
Test suite for v5 is back to 100% green now.
The smudge filter does need to be run, because if the key is in the local
annex already (due to renaming, or a copy of a file added, or a new file
added and its content has already arrived), git merge smudges the file and
this should provide its content.
This does probably mean that in merge conflict resolution, git smudges the
existing file, re-copying all its content to it, and then the file is
deleted. So, not efficient.
This is a behavior change for merge conflicts between locked files
that both pointed to the same key, in different ways.
Before, the conflict was resolved, but the file was renamed to .variant.
This was unnecessary, because there was only one variant.
Of course, this also handles conflicts between unlocked and locked, or even
two unlocked files with different pointer contents.
Since the file was present and locked, its annex object was not in the
inode cache. So, despite not needing to update the annex object when the
clean filter is run on the content by git merge, it does need to record the
inode cache of the annex object. Otherwise, the annex object will be
assumed to be bad, since its inode is not cached.
Several tricky parts:
* When the conflict is just between the same key being locked and unlocked,
the unlocked version wins, and the file is not renamed in this case.
* Need to update associated file map when conflict resolution renames
an unlocked file.
* git merge runs the smudge filter on the conflicting file, and actually
overwrites the file with the same content it had before, and so
invalidates its inode cache. This makes it difficult to know when it's
safe to remove such files as conflict cruft, without going so far as to
compare their entire contents.
Dealt with this by preventing the smudge filter from populating the file
when a merge is run. However, that also prevents the smudge filter being
run for non-conflicting files, so eg moving a file won't put its new
content into place.
* Ideally, if a merge or a merge conflict resolution renames an unlocked
file, the file in the work tree can just be moved, rather than copying
the content to a new worktree file.
This is attempted in merge conflict resolution, but
due to git merge's behavior of running smudge filters, what actually
seems to happen is the old worktree file with the content is deleted and
rewritten as a pointer file, so doesn't get reused.
So, this is probably not as efficient as it optimally could be.
If that becomes a problem, could look into running the merge in a separate
worktree and updating the real worktree more efficiently, similarly to the
direct mode merge. However, the direct mode merge had a lot of bugs, and
I'd rather not use that more error-prone method unless really needed.
Decided it's too scary to make v6 unlocked files have 1 copy by default,
but that should be available to those who need it. This is consistent with
git-annex not dropping unused content without --force, etc.
* Added annex.thin setting, which makes unlocked files in v6 repositories
be hard linked to their content, instead of a copy. This saves disk
space but means any modification of an unlocked file will lose the local
(and possibly only) copy of the old version.
* Enable annex.thin by default on upgrade from direct mode to v6, since
direct mode made the same tradeoff.
* fix: Adjusts unlocked files as configured by annex.thin.
This optimisation was not necessary, and didn't work for v6 unlocked files.
Typically only a small number of files will be changed by a commit, so just
catKey them all.
This can happen when ingesting a new file in either locked or unlocked
mode, when some unlocked files in the repo use the same key, and the
content was not locally available before.
This fixes a race where the modified file ended up in annex/objects, and
the InodeCache stored in the database was for the modified version, so
git-annex didn't know it had gotten modified.
The race could occur when the smudge filter was running; now it gets the
InodeCache before generating the Key, which avoids the race.
In v5, lookupFile is supposed to only look at symlinks on disk (except when
in direct mode).
Note that v6 also has a bug when a locked file's symlink is deleted and is
replaced with a new file. It sees that a link is staged and gets that
key.
The problem is that shutdown is not always called, particularly in the test
suite. So, a database connection would be opened, possibly some changes
queued, and then not shut down.
One way this can happen is when using Annex.eval or Annex.run with a new
state. A better fix might be to make both of them call Keys.shutdown
(and be sure to do it even if the annex action threw an error).
Complication: Sometimes they're run reusing an existing state, so shutting
down a database connection could cause problems for other users of that
same state. I think this would need a MVar holding the database handle,
so it could be emptied once shut down, and another user of the database
connection could then start up a new one if it got shut down. But, what if
2 threads were concurrently using the same database handle and one shut it
down while the other was writing to it? Urgh.
Might have to go that route eventually to get the database access to run
fast enough. For now, a quick fix to get the test suite happier, at the
expense of speed.
This covers the case where multiple files have the same content and are
added with git add. Previously only the one that was linked to the annex
got its inode cached; now both are.
When a v6 unlocked file is removed from the work tree,
unused doesn't show it. When it gets removed from the index,
unused does show it. This is the same as a locked file.
If multiple files point to the same annex object, the user may want to
modify them independently, so don't use a hard link.
Also, check diskreserve when copying.
Before the smudge filter added a trailing newline, but other things that
wrote formatPointer to a file did not.
also some new pointer staging code to use later
This avoids querying the database when the content file doesn't exist
(or otherwise fails the provided check). However, it does add the overhead
of querying the database, and will certainly impact performance.
The Keys database can hold multiple inode caches for a given key. One for
the annex object, and one for each pointer file, which may not be hard
linked to it.
Inode caches for a key are recorded when its content is added to the annex,
but only if it has known pointer files. This is to avoid the overhead of
maintaining the database when not needed.
When the smudge filter outputs a file's content, the inode cache is not
updated, because git's smudge interface doesn't let us write the file. So,
dropping will fall back to doing an expensive verification then. Ideally,
git's interface would be improved, and then the inode cache could be
updated too.
Renamed the db to keys, since it holds various info about Keys.
Dropping a key will update its pointer files, as long as their content can
be verified to be unmodified. This falls back to checksum verification, but
I want it to use an InodeCache of the key, for speed. But, I have not made
anything populate that cache yet.
This removes ambiguity, because while someone might have "WORM--foo" in a
file that's not intended to be a git-annex pointer file,
"annex/objects/WORM--foo" is less likely.
Also, 664cc987e8 had a caveat about symlink
targets being parsed as pointer files, and now the same parser is used for
both.
I did not include any hash directories before the key in the pointer file,
as they're not needed. However, if they were included, the parser would
still work ok.
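A sketch of a parser in that style (simplified; the real one validates
that the last component is a well-formed key):

    import Data.List (isInfixOf)
    import System.FilePath (takeFileName)

    -- Both symlink targets (".git/annex/objects/aa/bb/KEY/KEY") and
    -- pointer file contents ("/annex/objects/KEY") contain
    -- "annex/objects/", with the key as the last path component.
    parsePointer :: String -> Maybe String
    parsePointer s
        | "annex/objects/" `isInfixOf` firstline =
            Just (takeFileName firstline)
        | otherwise = Nothing
      where
        firstline = takeWhile (/= '\n') s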
Backend.lookupFile is changed to always fall back to catKey when
operating on a file that's not a symlink.
catKey is changed to understand pointer files, as well as annex symlinks.
Before, catKey needed a file mode witness, to be sure it was looking at a
symlink. That was complicated stuff. Now, it doesn't actually care if a
file in git is a symlink or not; in either case asking git for the content
of the file will get the pointer to the key.
This does mean that git-annex will treat a link
foo -> WORM--bar as a git-annex file, and also treats
a regular file containing annex/objects/WORM--bar as a git-annex file.
Calling catKey could make git-annex commands need to do more work than
before. This would especially be the case if a repo contained many regular
files, and only a few annexed files, as now git-annex will need to ask
git about the contents of the regular files.
Was not putting it inside the temp dir, but next to it!
This was just wrong, and it led to a longer filename than desired being
used, leading to some bug reports.
Since all places where a repo is used in direct mode need to have git-annex
upgraded before the repo can safely be converted to v6, the upgrade needs
to be manual for now.
I suppose that at some point I'll want to drop all the direct mode support
code. At that point, will stop supporting v5, and will need to auto-upgrade
any remaining v5 repos. If possible, I'd like to carry the direct mode
support for say, a year or so, to give people plenty of time to upgrade and
avoid disruption.
When core.sharedRepository is set, annex object files are not made mode
444, since that prevents a user other than the file owner from locking
them. Instead, a mode such as 664 is used in this case.
replaceFile created a temp file, which was guaranteed to not overlap with
another temp file. However, makeAnnexLink then deleted that file, in
preparation for making the symlink in its place. This caused a race, since
some other replaceFile could create a temp file, using the same name!
I was able to reproduce the race easily running git-annex add -J10 in a
directory with 100 files (all with different contents). Some files would
get ingested into the annex, but their annex links would fail to be added.
There could be other situations where this same problem could occur.
Perhaps when the assistant is adding a file, if the user manually also ran
git-annex add. Perhaps in cases not involving adding a file.
The new replaceFile makes a temporary directory, which is guaranteed to be
unique, and doesn't make a temp file in there. makeAnnexLink can thus
create the symlink without problem and the race is avoided.
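A sketch of the new approach (using the temporary package; a real
implementation would want the temp directory on the same filesystem as
the destination, so the rename works):

    import System.Directory (renameFile)
    import System.FilePath ((</>))
    import System.IO.Temp (withSystemTempDirectory)

    -- Each call gets a fresh directory, so the path handed to the
    -- action stays unique even if the action deletes and recreates
    -- the file, as makeAnnexLink does.
    replaceFile :: FilePath -> (FilePath -> IO ()) -> IO ()
    replaceFile dest action =
        withSystemTempDirectory "replacefile" $ \tmpdir -> do
            let tmpfile = tmpdir </> "file"
            action tmpfile
            renameFile tmpfile dest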
Audited all calls to replaceFile to make sure that the old behavior of
providing an empty temp file was not relied on.
The general problem of asking for a temp file and deleting it as part of
the process of using it could reach beyond replaceFile. Did some quick
audits and didn't find other cases of it. Probably only symlink creation
stuff would tend to make that mistake.