git-annex

Author	SHA1	Message	Date
Joey Hess	15107d2c5a	propigate ssh-options everywhere ssh caching is used * sync: Use the ssh-options git config when doing git pull and push. * remotedaemon: Use the ssh-options git config. Note that the rename env var means that if a new git-annex calls an old one for git-annex ssh, or a new calls an old, nothing much will go wrong; just ssh caching won't happen.	2015-02-12 16:14:53 -04:00
Joey Hess	5be7ba7ee5	The ssh-options git config is now used by gcrypt, rsync, and ddar special remotes that use ssh as a transport.	2015-02-12 15:44:10 -04:00
Joey Hess	7fce85adac	Improve race recovery code when committing to git-annex branch.	2015-02-09 18:34:48 -04:00
Joey Hess	b94eb9b22c	relFile does not have to be relative; rename to currFile	2015-02-06 16:03:02 -04:00
Joey Hess	c8163ce29a	use a Set	2015-01-28 18:17:10 -04:00
Joey Hess	b0575c621f	implement annex.tune.branchhash1 I hope this doesn't impact speed much -- it does have to pull out a value from Annex state every time it accesses the branch now. The test case I dropped has never caught any problems that I can remember, and would have been rather difficult to convert.	2015-01-28 17:17:26 -04:00
Joey Hess	009bd050c1	implement annex.tune.objecthashlower Split out Annex.DirHashes which never really belonged in Locations.	2015-01-28 16:52:08 -04:00
Joey Hess	e8c376e0ad	import Data.Default in Common	2015-01-28 16:11:28 -04:00
Joey Hess	ba3825441c	rework Differences data type Eliminated complexity and future proofed. The most important change is that all functions over Difference are now total; any Difference that can be expressed should be handled. Avoids needs for sanity checking of inputs, and version skew with the future. Also, the difference.log now serializes a [Difference], not a Differences. This saves space and keeps it simpler. Note that [Difference] might contain conflicting differences (eg, [Version5, Version6]. In this case, one of them needs to consistently win over the others, probably based on Ord.	2015-01-28 13:50:02 -04:00
Joey Hess	70736d2b41	Repository tuning parameters can now be passed when initializing a repository for the first time. * init: Repository tuning parameters can now be passed when initializing a repository for the first time. For details, see http://git-annex.branchable.com/tuning/ * merge: Refuse to merge changes from a git-annex branch of a repo that has been tuned in incompatable ways.	2015-01-27 17:38:06 -04:00
Joey Hess	f50b6779f9	Fix default repository description created by git annex init, which got broken by the relative path changes in the last release.	2015-01-22 14:59:57 -04:00
Joey Hess	afc5153157	update my email address and homepage url	2015-01-21 12:50:09 -04:00
Joey Hess	068aaf943b	on second thought, InodeCache should use getFileSize This is necessary for interop between inode caches created on unix and windows. Which is more important than supporting inodecaches for large keys with the wrong size, which are broken anyway. There should be no slowdown from this change, except on Windows.	2015-01-20 19:35:50 -04:00
Joey Hess	4f657aa14e	add getFileSize, which can get the real size of a large file on Windows Avoid using fileSize which maxes out at just 2 gb on Windows. Instead, use hFileSize, which doesn't have a bounded size. Fixes support for files > 2 gb on Windows. Note that the InodeCache code only needs to compare a file size, so it doesn't matter it the file size wraps. So it has been left as-is. This was necessary both to avoid invalidating existing inode caches, and because the code passed FileStatus around and would have become more expensive if it called getFileSize. This commit was sponsored by Christian Dietrich.	2015-01-20 17:09:24 -04:00
Joey Hess	6035f94666	Windows: Fix running of the pre-commit-annex hook.	2015-01-20 14:48:16 -04:00
Joey Hess	f4de021a54	convert parentDir to be based on takeDirectory, but fixed for trailing /	2015-01-09 14:26:52 -04:00
Joey Hess	3bab5dfb1d	revert parentDir change Reverts `965e106f24` Unfortunately, this caused breakage on Windows, and possibly elsewhere, because parentDir and takeDirectory do not behave the same when there is a trailing directory separator.	2015-01-09 13:11:56 -04:00
Joey Hess	184ad45b42	Merge branch 'master' into relativepaths	2015-01-06 21:10:01 -04:00
Joey Hess	d7f1449b2b	fix view generation code to work when run in a subdirectory; no longer needs to setCurrentDirectory to top of repo	2015-01-06 21:01:05 -04:00
Joey Hess	858d776352	Merge branch 'master' into relativepaths Conflicts: Locations.hs debian/changelog	2015-01-06 19:00:01 -04:00
Joey Hess	965e106f24	made parentDir return a Maybe FilePath; removed most uses of it parentDir is less safe than takeDirectory, especially when working with relative FilePaths. It's really only useful in loops that want to terminate at / This commit was sponsored by Audric SCHILTKNECHT.	2015-01-06 18:55:56 -04:00
Joey Hess	8a1c5956eb	absolute path to index file; test suite passes There are still known problems; for example git annex view a=b fails when run in a subdir of the repo.	2015-01-06 17:34:02 -04:00
Joey Hess	d8a2f658dd	direct mode merge relative path trickiness This fixes 9 test suite failures. There are some tricky things going on with the paths to the index file, and git's working directory, which are hard to get right with relative paths. So, I switched back to absolute here, at least for now. Only 2 test suite failures remain on this branch, but there are other potential problems the test suite doesn't catch. Including some calls to setCurrentDirectory -- I was wrong and git-annex does do that in a few places, like when generating a view.	2015-01-06 17:18:12 -04:00
Joey Hess	cd865c3b8f	Switch to using relative paths to the git repository. This allows the git repository to be moved while git-annex is running in it, with fewer problems. On Windows, this avoids some of the problems with the absurdly small MAX_PATH of 260 bytes. In particular, git-annex repositories should work in deeper/longer directory structures than before. See http://git-annex.branchable.com/bugs/__34__git-annex:_direct:_1_failed__34___on_Windows/ There are several possible ways this change could break git-annex: 1. If it changes its working directory while it's running, that would be Bad News. Good news everyone! git-annex never does so. It would also break thread safety, so all such things were stomped out long ago. 2. parentDir "." -> "" which is not a valid path. I had to fix one instace of this, and I should probably wipe all calls to parentDir out of the git-annex code base; it was never a good idea. 3. Things like relPathDirToFile require absolute input paths, and code assumes that the git repo path is absolute and passes it to it as-is. In the case of relPathDirToFile, I converted it to not make this assumption. Currently, the test suite has 16 failures.	2015-01-06 16:19:41 -04:00
Joey Hess	a4cf80f460	Windows: Fix handling of views of filenames containing '%'	2014-12-30 17:48:04 -04:00
Joey Hess	402bfff665	fix test case on windows "a:" is an absolute path, so viewedfile test cannot be run on it.	2014-12-30 16:04:06 -04:00
Joey Hess	33f1062bc3	Revert "temporary debugging code for windows autobuilder test suite failure" This reverts commit `0d9fbd18c1`.	2014-12-30 15:18:38 -04:00
Joey Hess	0d9fbd18c1	temporary debugging code for windows autobuilder test suite failure	2014-12-30 15:17:51 -04:00
Joey Hess	c9a3e80d32	fixed all remaining build warnings on Windows	2014-12-29 17:30:20 -04:00
Joey Hess	7e422269a6	move dummy uuids to Annex.UUID	2014-12-17 13:57:52 -04:00
Joey Hess	7ae16bb6f7	Revert "let url claims optionally include a suggested filename" This reverts commit `85df9c30e9`. Putting filename in the claim was a bad idea.	2014-12-11 14:09:57 -04:00
Joey Hess	85df9c30e9	let url claims optionally include a suggested filename	2014-12-11 12:47:57 -04:00
Joey Hess	6ecd3ff421	diffdriver: New git-annex command, to make git external diff drivers work with annexed files. Closes https://github.com/datalad/datalad/issues/18	2014-11-24 16:14:06 -04:00
Joey Hess	864086a956	proxy: for all your direct mode repository munging needs This allows bypassing the direct mode guard in a safe way to do all sorts of things including git revert, git mv, git checkout ... This commit was sponsored by the WikiMedia Foundation.	2014-11-12 15:51:46 -04:00
Joey Hess	5ccc2a2d7c	no longer used imports	2014-11-06 14:18:38 -04:00
Joey Hess	334f366979	Remove fixup code for bad bare repositories created by versions 5.20131118 through 5.20131127. That fixup code would accidentially fire when --git-dir was incorrectly pointed at the working tree of a git-annex repository, resulting in data loss. Closes: #768093	2014-11-04 18:04:19 -04:00
Joey Hess	0f6aaf8012	Windows: Fix crash when user.name is not set in git config.	2014-10-31 16:14:12 -04:00
Joey Hess	4edfda59c0	fix windows build	2014-10-16 15:48:30 -04:00
Joey Hess	1e59df083d	Use haskell setenv library to clean up several ugly workarounds for inability to manipulate the environment on windows. Didn't know that this library existed! This includes making git-annex not re-exec itself on start on windows, and making the test suite on Windows run tests without forking.	2014-10-15 20:33:52 -04:00
Joey Hess	db9121ecee	vicfg: Deleting configurations now resets to the default, where before it has no effect. Added a Default instance for TrustLevel, and was able to use that to clear up several other parts of the code too. This commit was sponsored by Stephan Schulz	2014-10-14 14:15:07 -04:00
Joey Hess	9fd95d9025	indent with tabs not spaces Found these with: git grep "^ " $(find -type f -name \*.hs) \|grep -v ': where' Unfortunately there is some inline hamlet that cannot use tabs for indentation. Also, Assistant/WebApp/Bootstrap3.hs is a copy of a module and so I'm leaving it as-is.	2014-10-09 15:09:26 -04:00
Joey Hess	7b50b3c057	fix some mixed space+tab indentation This fixes all instances of " \t" in the code base. Most common case seems to be after a "where" line; probably vim copied the two space layout of that line. Done as a background task while listening to episode 2 of the Type Theory podcast.	2014-10-09 15:09:11 -04:00
Joey Hess	0598412e5c	Fix transfer lock file FD leak that could occur when two separate git-annex processes were both working to perform the same set of transfers.	2014-09-11 13:53:26 -04:00
Joey Hess	b874f84086	New annex.hardlink setting. Closes: #758593 * New annex.hardlink setting. Closes: #758593 * init: Automatically detect when a repository was cloned with --shared, and set annex.hardlink=true, as well as marking the repository as untrusted. Had to reorganize Logs.Trust a bit to avoid a cycle between it and Annex.Init.	2014-09-05 13:44:09 -04:00
Joey Hess	6eb5c3f479	Do not preserve permissions and acls when copying files from one local git repository to another. Timestamps are still preserved as long as cp --preserve=timestamps is supported. This avoids cp -a overriding the default mode acls that the user might have set in a git repository. With GNU cp, this behavior change should not be a breaking change, because git-anex also uses rsync sometimes in the same situation, and has only ever preserved timestamps when using rsync. Systems without GNU cp will no longer use cp -a, but instead just cp. So, timestamps will no longer be preserved. Preserving timestamps when copying between repos is not guaranteed anyway. Closes: #729757	2014-08-26 17:10:25 -07:00
Joey Hess	2b234634f6	fix imports for windows	2014-08-23 16:27:24 -07:00
Joey Hess	aebcc395ff	use types to enforce that removeAnnex can only be called inside lockContent This fixed one bug where it needed to be and wasn't (in Assistant.Unused). And also found one place where lockContent was used unnecessarily (by drop --from remote). A few other places like uninit probably don't really need to lockContent, but it doesn't hurt to do call it anyway. This commit was sponsored by David Wagner.	2014-08-20 20:13:47 -04:00
Joey Hess	1994771215	more lock file refactoring Also fixes a test suite failures introduced in recent commits, where inAnnexSafe failed in indirect mode, since it tried to open the lock file ReadWrite. This is why the new checkLocked opens it ReadOnly. This commit was sponsored by Chad Horohoe.	2014-08-20 18:58:14 -04:00
Joey Hess	e386e26ef2	avoid trying to create a content file in order to lock it The nice refactoring in `ec7dd0446a` highlighted a bug in lockContent -- when the content is not present, this incorrectly created an empty lock file, using the same filename as the content file. This seems like it could result in empty objects, which fsck would detect and complain about. Both drop and move --to call lockContent, as does Remote.Git.dropKey -- I think we got lucky and this bug didn't show up because both all of those only operate on files that are present. So this bug could only manifest if there was a race, and a file's content was dropped at just the wrong time, just as another process was about to drop it. (And then only if the other process's dropping failed, otherwise it'd delete the empty object file.) Hmm, move --from also called lockContent. Unnecessarily, since the content is not being removed from the local annex. In this case, the combination of the 2 bugs could result in an empty lock file being written, and then if the download of the content failed, left in the object directory as the content. This commit also optimises lockContent, avoiding an unncessary doesFileExist test and instead just catching the exception that's thrown when the file doesn't exist. This commit was sponsored by Justine Lam.	2014-08-20 17:25:30 -04:00
Joey Hess	ec7dd0446a	more lock file refactoring	2014-08-20 17:03:04 -04:00
Joey Hess	d279180266	reorganize and refactor lock code Added a convenience Utility.LockFile that is not a windows/posix portability shim, but still manages to cut down on the boilerplate around locking. This commit was sponsored by Johan Herland.	2014-08-20 16:45:58 -04:00
Joey Hess	0a4d301051	fix lockFileShared to actually create lock file This was a bug, but it was only used for ssh locks and by the hook special remote locking. At least in the case of ssh locks, the lock files happened to already exist before this tried to use them, so the bug didn't cause anything to break.	2014-08-20 15:49:49 -04:00
Joey Hess	bf3133ebb0	whoops, I the debug prints	2014-08-20 12:14:56 -04:00
Joey Hess	96dc423e39	When accessing a local remote, shut down git-cat-file processes afterwards, to ensure that remotes on removable media can be unmounted. Closes: #758630 This does mean that eg, copying multiple files to a local remote will become slightly slower, since it now restarts git-cat-file after each copy. Should not be significant slowdown. The reason git-cat-file is run on the remote at all is to update its location log. In order to add an item to it, it needs to get the current content of the log. Finding a way to avoid needing to do that would be a good path to avoiding this slowdown if it does become a problem somehow. This commit was sponsored by Evan Deaubl.	2014-08-20 12:07:57 -04:00
Joey Hess	83dc82c232	forgot some lifts	2014-08-20 11:51:47 -04:00
Joey Hess	092041fab0	Ensure that all lock fds are close-on-exec, fixing various problems with them being inherited by child processes such as git commands. (With the exception of daemon pid locking.) This fixes at part of #758630. I reproduced the assistant locking eg, a removable drive's annex journal lock file and forking a long-running git-cat-file process that inherited that lock. This did not affect Windows. Considered doing a portable Utility.LockFile layer, but git-annex uses posix locks in several special ways that have no direct Windows equivilant, and it seems like it would mostly be a complication. This commit was sponsored by Protonet.	2014-08-20 11:37:02 -04:00
Joey Hess	e0227dfedf	memoize construction of the Request -> Request function to apply the UrlOptions	2014-08-15 17:47:21 -04:00
Joey Hess	852185c242	git-annex-shell sendkey: Don't fail if a remote asks for a key to be sent that already has a transfer lock file indicating it's being sent to that remote. The remote may have moved between networks, or reconnected.	2014-08-15 14:17:05 -04:00
Joey Hess	bb6cec3461	direct: Avoid leaving file content in misctemp if interrupted.	2014-08-15 13:38:05 -04:00
Joey Hess	d8be828734	direct: Fix ugly warning messages. replaceFileOr was broken and ran the rollback action always. Luckily, for replaceFile, the rollback action was safe to run, since it just nuked a temp file that had already been moved into place. However, when `git annex direct` used replaeFileOr, its rollback printed a scary message: /home/joey/tmp/rrrr/.git/annex/misctmp/tmp32268: rename: does not exist (No such file or directory) There was actually no bad result though.	2014-08-12 13:00:08 -04:00
Joey Hess	c784ef4586	unify exception handling into Utility.Exception Removed old extensible-exceptions, only needed for very old ghc. Made webdav use Utility.Exception, to work after some changes in DAV's exception handling. Removed Annex.Exception. Mostly this was trivial, but note that tryAnnex is replaced with tryNonAsync and catchAnnex replaced with catchNonAsync. In theory that could be a behavior change, since the former caught all exceptions, and the latter don't catch async exceptions. However, in practice, nothing in the Annex monad uses async exceptions. Grepping for throwTo and killThread only find stuff in the assistant, which does not seem related. Command.Add.undo is changed to accept a SomeException, and things that use it for rollback now catch non-async exceptions, rather than only IOExceptions.	2014-08-07 22:03:29 -04:00
Joey Hess	5aa2286e7b	Merge branch 'newchunks' I am happy enough with this to make it live!	2014-08-01 18:00:47 -04:00
Joey Hess	76d894f2e5	Display exception message when a transfer fails due to an exception. For example, I had a copy to a remote that was failing for an unknown reason. This let me see the exception was createDirectory: permission denied; the underlying problem being a permissions issue.	2014-07-30 15:57:19 -04:00
Joey Hess	f3e457a195	add missing Ord constraint (fixes android build) Probably the new ghc used on android is the root cause of needing this constraint.	2014-07-30 11:57:40 -04:00
Joey Hess	bc9e4697b9	better type for Retriever Putting a callback in the Retriever type allows for the callback to remove the retrieved file when it's done with it. I did not really want to make Retriever be fixed to Annex Bool, but when I tried to use Annex a, I got into some type of type mess.	2014-07-29 18:41:41 -04:00
Joey Hess	47e522979c	allow Retriever action to update the progress meter Needed for eg, Remote.External. Generally, any Retriever that stores content in a file is responsible for updating the meter, while ones that procude a lazy bytestring cannot update the meter, so are not asked to.	2014-07-29 17:18:49 -04:00
Joey Hess	7496355031	add some more exception handling primitives	2014-07-26 23:24:27 -04:00
Joey Hess	e2c44bf656	implement chunk logs Slightly tricky as they are not normal UUIDBased logs, but are instead maps from (uuid, chunksize) to chunkcount. This commit was sponsored by Frank Thomas.	2014-07-24 16:23:36 -04:00
Joey Hess	3e4cb0e7f9	fix windows build	2014-07-14 15:55:48 -04:00
Joey Hess	822f4619ae	resolvemerge: finish up by committing	2014-07-11 16:59:49 -04:00
Joey Hess	61a35de433	Deal with change in git 2.0 that made indirect mode merge conflict resolution leave behind old files. I think this is a git behavior change, but have not checked to be sure. Conflict cruft used to look like $foo~HEAD, but now just $foo is left behind as conflict cruft. With test case.	2014-07-11 16:56:19 -04:00
Joey Hess	cb66ca3a76	resolvemerge: New plumbing command that runs the automatic merge conflict resolver.	2014-07-11 16:45:18 -04:00
Joey Hess	1efa51f344	direct: Fix handling of case where a work tree subdirectory cannot be written to due to permissions. Running `git annex direct` would cause loss of data, because the object was moved to a temp file, which it then tried to replace the work tree file with, and on failure, the temp file got deleted. Now it's instead moved back into the annex object location.	2014-07-10 14:15:46 -04:00
Joey Hess	26ee27915a	refactor locking	2014-07-10 00:32:23 -04:00
Joey Hess	e5b88713a1	refactor	2014-07-10 00:16:53 -04:00
Joey Hess	d9d76cf98b	Fix minor FD leak in journal code. Minor because normally only 1 FD is leaked per git-annex run. However, the test suite leaks a few hundred FDs, and this broke it on the Debian autobuilders, which seem to have a tigher than usual ulimit. The leak was introduced by the lazy getDirectoryContents' that was introduced in `e6330988dd` in order to scale to millions of journal files -- if the lazy list was never fully consumed, the directory handle did not get closed. Instead, pull in openDirectory/readDirectory/closeDirectory code that I already developed and submitted in a patch to the haskell directory library earlier. Using this in journalDirty avoids the place that the lazy list caused a problem. And using it in stageJournal eliminates the need for getDirectoryContents'. The getJournalFiles* functions are switched back to using the regular strict getDirectoryContents. I'm not sure if those always consume the whole list, so this avoids any leak. And the things that call those are things like git annex unused, which also look at every file committed to the git-annex branch, so would need more work to scale to insane numbers of files anyway.	2014-07-09 23:36:53 -04:00
Joey Hess	c75193e88b	fix build warning	2014-07-09 15:39:19 -04:00
Joey Hess	84186ee626	fix windows build	2014-07-09 15:37:25 -04:00
Joey Hess	58acaf8026	prospective fix for bad_merge_commit_deleting_all_files Assuming my analysis of a race is correct. In any case, this certianly closes a race..	2014-07-09 15:08:19 -04:00
Joey Hess	ba42b67c70	Fix bug in automatic merge conflict resolution When one side is an annexed symlink, and the other side is a non-annexed symlink. In this case, git-merge does not replace the annexed symlink in the work tree with the non-annexed symlink, which is different from it's handling of conflicts between annexed symlinks and regular files or directories. So, while git-annex generated the correct merge commit, the work tree didn't get updated to reflect it. See comments on bug for additional analysis. Did not add this to the test suite yet; just unloaded a truckload of firewood and am feeling lazy. This commit was sponsored by Adam Spiers.	2014-07-08 13:55:11 -04:00
Joey Hess	4a66cd3f91	assistant: Fix bug, introduced in last release, that caused the assistant to make many unncessary empty merge commits.	2014-07-05 17:12:05 -04:00
Joey Hess	c90e4e8778	work around getDirectoryContents not streaming lazily	2014-07-04 17:59:26 -04:00
Joey Hess	e6330988dd	Fix memory leak when committing millions of changes to the git-annex branch Eg after git-annex add has run on 2 million files in one go. Slightly unhappy with the neeed to use a temp file here, but I cannot see any other alternative (see comments on the bug report). This commit was sponsored by Hamish Coleman.	2014-07-04 15:28:07 -04:00
Joey Hess	d41849bc23	support commit.gpgsign Support users who have set commit.gpgsign, by disabling gpg signatures for git-annex branch commits and commits made by the assistant. The thinking here is that a user sets commit.gpgsign intending the commits that they manually initiate to be gpg signed. But not commits made in the background, whether by a deamon or implicitly to the git-annex branch. gpg signing those would be at best a waste of CPU and at worst would fail, or flood the user with gpg passphrase prompts, or put their signature on changes they did not directly do. See Debian bug #753720. Also makes all commits done by git-annex go through a few central control points, to make such changes easier in future. Also disables commit.gpgsign in the test suite. This commit was sponsored by Antoine Boegli.	2014-07-04 11:53:51 -04:00
Joey Hess	fc80956092	really add non-date metadata too	2014-07-03 14:35:20 -04:00
Joey Hess	d0c1a22e7c	import metadata from feeds When annex.genmetadata is set, metadata from the feed is added to files that are imported from it. Reused the same feedtitle and itemtitle, feedauthor, itemauthor, etc names that are used in --template. Also added title and author, which are the item title/author if available, falling back to the feed title/author. These are more likely to be common metadata fields. (There is a small bit of dupication here, but once git gets around to packing the object, it will compress it away.) The itempubdate field is not included in the metadata as a string; instead it is used to generate year and month fields, same as is done when adding files with annex.genmetadata set. This commit was sponsored by Amitai Schlair, who cooincidentially is responsible for ikiwiki generating nice feed metadata!	2014-07-03 14:15:00 -04:00
Joey Hess	7a8f8b5ac9	refactor	2014-06-16 18:59:23 -04:00
Joey Hess	b30de0dfd2	work around a bug in git http://marc.info/?l=git&m=140262402204212&w=2 This git bug manifested on FAT and Windows as the test suite failing in 3 places. All involved merge conflict resolution. It turned out that the associated file mappings were getting messed up, and that happened because this git bug lost track of what files were supposed to be symlinks. This commit was sponsored by Eric Kidd.	2014-06-12 22:00:02 -04:00
Joey Hess	4fe2e53f5b	finish fixing windows timezone madness Rather than calculating the TSDelta once, and caching it, this now reads the inode sential file's InodeCache file once, and then each time a new InodeCache is generated, looks at the sentinal file to get the current delta. This way, if the time zone changes while git-annex is running, it will adapt. This adds some inneffiency, but only on Windows, and only 1 stat per new file added. The worst innefficiency is that `git annex status` and `git annex sync` will now (on Windows) stat the inode sentinal file once per file in the repo. It would be more efficient to use getCurrentTimeZone, rather than needing to stat the sentinal file. This should be easy to do, once the time package gets my bugfix patch. This commit was sponsored by Jürgen Lüters.	2014-06-12 13:54:08 -04:00
Joey Hess	e4d7e2ebde	fix for Windows file timestamp timezone madness On Windows, changing the time zone causes the apparent mtime of files to change. This confuses git-annex, which natually thinks this means the files have actually been modified (since THAT'S WHAT A MTIME IS FOR, BILL <sheesh>). Work around this stupidity, by using the inode sentinal file to detect if the timezone has changed, and calculate a TSDelta, which will be applied when generating InodeCaches. This should add no overhead at all on unix. Indeed, I sped up a few things slightly in the refactoring. Seems to basically work! But it has a big known problem: If the timezone changes while the assistant (or a long-running command) runs, it won't notice, since it only checks the inode cache once, and so will use the old delta for all new inode caches it generates for new files it's added. Which will result in them seeming changed the next time it runs. This commit was sponsored by Vincent Demeester.	2014-06-12 13:42:21 -04:00
Joey Hess	a44fd2c019	export CreateProcess fields from Utility.Process update code to avoid cwd and env redefinition warnings	2014-06-10 19:20:14 -04:00
Joey Hess	f08fcb5030	simplify	2014-06-09 20:32:11 -04:00
Joey Hess	ab72456bb3	avoid fast-forwarding when a merge conflict was auto-resolved	2014-06-09 20:10:12 -04:00
Joey Hess	d6711800ad	avoid bad commits after interrupted direct mode sync (or merge) It was possible for a interrupted sync or merge in direct mode to leave the work tree out of sync with the last recorded commit. This would result in the next commit seeing files missing from the work tree, and committing their removal. Now, a direct mode merge happens not only in a throwaway work tree, but using a temporary index file, and without any commits or index changes being made until the real work tree has been updated. If the merge is interrupted, the work tree may have some updated files, but worst case a commit will redundantly commit changes that come from the merge. This commit was sponsored by Tony Cantor.	2014-06-09 19:40:28 -04:00
Joey Hess	dcddacfd5c	fixed getting files from bare repos on windows	2014-06-05 15:54:06 -04:00
Joey Hess	eb86f1338f	wip	2014-06-05 15:31:23 -04:00
Joey Hess	1ab3d7c810	Windows: Fix bug introduced in last release that caused files in the git-annex branch to have lines teminated with \r.	2014-06-05 14:57:01 -04:00
Joey Hess	2dd274e4ca	webapp: When adding a new local repository, fix bug that caused its group and preferred content to be set in the current repository, even when not combining. There was a tricky bit here, when it does combine, the edit form is shown, and so the info needs to be committed to the new repository, but then pulled into the current one. And caches need to be invalidated for it to be visible in the edit form.	2014-05-29 20:17:05 -04:00
Ben Gamari	99b89b22fd	Use exceptions in place of deprecated MonadCatchIO-transformers	2014-05-28 17:03:40 -04:00
Joey Hess	95ca3bb022	Fix encoding of data written to git-annex branch. Avoid truncating unicode characters to 8 bits. Allow any encoding to be used, as with filenames (but utf8 is the sane choice). Affects metadata and repository descriptions, and preferred content expressions. The question of what's the right encoding for the git-annex branch is a vexing one. utf-8 would be a nice choice, but this leaves the possibility of bad data getting into a git-annex branch somehow, and this resulting in git-annex crashing with encoding errors, which is a failure mode I want to avoid. (Also, preferred content expressions can refer to filenames, and filenames can have any encoding, so limiting to utf-8 would not be ideal.) The union merge code already took care to not assume any encoding for a file. Except it assumes that any \n is a literal newline, and not part of some encoding of a character that happens to contain a newline. (At least utf-8 avoids using newline for anything except liternal newlines.) Adapted the git-annex branch code to use this same approach. Note that there is a potential interop problem with Windows, since FileSystemEncoding doesn't work there, and instead things are always decoded as utf-8. If someone uses non-utf8 encoding for data on the git-annex branch, this can lead to an encoding error on windows. However, this commit doesn't actually make that any worse, because the union merge code would similarly fail with an encoding error on windows in that situation. This commit was sponsored by Kyle Meyer.	2014-05-27 14:16:33 -04:00
Joey Hess	79cf404e75	support being run by ssh as ssh-askpass replacement To use, set GIT_ANNEX_SSHASKPASS to point to a fifo or regular file (FIFO is better, avoids touching disk or multiple readers) that contains the password. Then set SSH_ASKPASS=git-annex, and when ssh runs it, it will tell ssh the password. This is not yet used..	2014-04-29 18:08:10 -04:00
Joey Hess	2807e14904	use a subdir of GIT_ANNEX_TMP for ssh connection caching sockets To prevent any possible collisions with other, non-socket files, like the xmppgit directory.	2014-04-20 16:56:01 -04:00
Joey Hess	a8bd7a607d	When init detects that git is not configured to commit, and sets user.email to work around the problem, also make it set user.name. I was able to reproduce git failing to commit despite user.email being set, in a test account on my laptop. The account had no GECOS information.	2014-04-20 14:17:57 -04:00
Joey Hess	e880d0d22c	replace (Key, Backend) with Key Only fsck and reinject and the test suite used the Backend, and they can look it up as needed from the Key. This simplifies the code and also speeds it up. There is a small behavior change here. Before, all commands would warn when acting on an annexed file with an unknown backend. Now, only fsck and reinject show that warning.	2014-04-17 18:03:39 -04:00
Joey Hess	915d038bec	reinit: New command that can initialize a new reposotory using the configuration of a previously known repository. Useful if a repository got deleted and you want to clone it back the way it was.	2014-04-15 20:13:35 -04:00
Joey Hess	138d25518d	Merge branch 'master' into remotecontrol Conflicts: doc/devblog/day_152__more_ssh_connection_caching.mdwn	2014-04-14 13:38:35 -04:00
Joey Hess	2ff9ba9f74	add missing Network.URI Ord instance for Debian stable	2014-04-14 13:25:49 -04:00
Joey Hess	a0ef99b3f9	don't try to use ssh connection caching for non-ssh urls	2014-04-13 21:39:04 -04:00
Joey Hess	a33b30d0c4	remotedaemon: When network connection is lost, close all cached ssh connections. This commit was sponsored by Cedric Staub.	2014-04-12 16:32:59 -04:00
Joey Hess	15917ec1a8	sync, assistant, remotedaemon: Use ssh connection caching for git pushes and pulls. For sync, saves 1 ssh connection per remote. For remotedaemon, the same ssh connection that is already open to run git-annex-shell notifychanges is reused to pull from the remote. Only potential problem is that this also enables connection caching when the assistant syncs with a ssh remote. Including the sync it does when a network connection has just come up. In that case, cached ssh connections are likely to be stale, and so using them would hang. Until I'm sure such problems have been dealt with, this commit needs to stay on the remotecontrol branch, and not be merged to master. This commit was sponsored by Alexandre Dupas.	2014-04-12 15:59:34 -04:00
Johan Kiviniemi	4025515616	Notification: Add action/status-dependent icon and urgency	2014-04-05 20:45:11 +03:00
Johan Kiviniemi	7760dfcc7f	Notification: summary is not optional Use the summary field instead of body.	2014-04-05 20:44:06 +03:00
Joey Hess	a772b23c62	fix warning on !dbus	2014-04-02 18:10:03 -04:00
Joey Hess	fe19e15040	reorg matcher types; no non-type code changes	2014-03-29 14:43:34 -04:00
Joey Hess	0df4848a9e	forget --drop-dead: Avoid removing the dead remote from the trust.log, so that if git remotes for it still exist anywhere, git annex info will still know it's dead and not show it.	2014-03-26 13:28:26 -04:00
Joey Hess	16025a4f12	fix build w/o DesktopNotification	2014-03-23 18:17:35 -04:00
Joey Hess	fb8a32cc7f	notifications on drop	2014-03-22 15:01:48 -04:00
Joey Hess	a5dcd0e4bd	fix failure notification	2014-03-22 14:26:36 -04:00
Joey Hess	e426fac273	add desktop notifications Motivation: Hook scripts for nautilus or other file managers need to provide the user with feedback that a file is being downloaded. This commit was sponsored by THM Schoemaker.	2014-03-22 14:12:19 -04:00
Joey Hess	f64c2d6138	toplevel lastchanged field	2014-03-19 19:10:55 -04:00
Joey Hess	1052eeface	Windows: Fix some filename encoding bugs. http://git-annex.branchable.com/bugs/Unicode_file_names_ignored_on_Windows/ Not a complete fix yet.	2014-03-19 15:57:56 -04:00
Joey Hess	caa97d1271	Each for each metadata field, there's now an automatically maintained "$field-lastchanged" that gives the timestamp of the last change to that field. Note that this is a nearly entirely free feature. The data was already stored in the metadata log in an easily accessible way, and already was parsed to a time when parsing the log. The generation of the metadata fields may even be done lazily, although probably not entirely (the map has to be evaulated to when queried).	2014-03-18 18:55:43 -04:00
Joey Hess	6a4dd42328	finish wiring up groupwanted	2014-03-15 17:08:55 -04:00
Joey Hess	3551d40b05	"standard" can now be used as a first-class keyword in preferred content expressions. For example "standard or (include=otherdir/*)" or even "not standard" Note that the implementation avoids any potential for loops (if a standard preferred content expression itself mentioned standard). This commit was sponsored by Jochen Bartl.	2014-03-14 15:04:33 -04:00
Joey Hess	ba02cd8a80	Fix ssh connection caching stop method to work with openssh 6.5p1, which broke the old method. Old ssh did not check the hostname passed to -O stop, so I had used "any". But now ssh does check it! I think this happened as part of the client-side hostname canonicalization changes in 6.5p1, but have not verified that introduced the problem. The symptom was that it would try to dns lookup "any", which often caused a bit of a delay at shutdown. And the old ssh connection kept running, so it would do it over and over again. Fixed by using localhost, which hopefully reliably resolves to some address that ssh will accept.. Also nukeFile the socket after ssh has been asked to shutdown, just in case.	2014-03-13 19:35:06 -04:00
Joey Hess	8e2997aa69	only run sshCleanup when the command actually used ssh connection caching Optimises query commands that do not. More importantly, avoids any ssh connection cleanup delay causing problems at the end of such commands.	2014-03-13 19:30:13 -04:00
Joey Hess	1f99a6778f	Fix direct mode getKeysPresent false positive & also sped up direct mode unused and unannex unused: In direct mode, files that are deleted from the work tree are no longer incorrectly detected as unused. Direct mode `git annex info` slows down a bit due to more stringent checking, but not by a lot.	2014-03-07 12:43:56 -04:00
Joey Hess	8ee3b47d2b	style	2014-03-04 22:55:40 -04:00
Joey Hess	14d1e878ab	sync: Automatically resolve merge conflict between and annexed file and a regular git file. This is a new feature, it was not handled before, since it's a bit of an edge case. However, it can be handled exactly the same as a file/dir conflict, just leave the non-annexed item alone. While implementing this, the core resolveMerge' function got a lot simpler and clearer. Note especially that where before there was an asymetric call to stagefromdirectmergedir, now graftin is called symmetrically in both cases. And, in order to add that `graftin us`, the current branch needed to be known (if there is no current branch, there cannot be a merge conflict). This led to some cleanups of how autoMergeFrom behaved when there is no current branch. This commit was sponsored by Philippe Gauthier.	2014-03-04 19:35:55 -04:00
Joey Hess	99295f2c1d	factor out Annex.AutoMerge from Command.Sync	2014-03-04 16:26:15 -04:00
Joey Hess	4a847cdc08	finish fixing direct mode merge bug involving unstaged local files Added test cases for both ways this can happen, with a conflict involving a file, or a directory. Cleaned up resolveMerge to not touch the work tree in direct mode, which turned out to be the only way to handle things.. And makes it much nicer. Still need to run test suite on windows.	2014-03-04 02:03:15 -04:00
Joey Hess	fb88e0f02c	fix `1192d98721` to handle annexed files in conflicted merge In the case of a conflicted merge where the remote adds a directory, and we have a file (which is checked in), resolveMerge' will create the link, and so the fix for `1192d98721` looked at that, thought it was an unannexed file (it's not in the oldref), and preserved it. This is a hacky fix. It would be better for resolveMerge' to not update the work tree, at least in direct mode, and only stage the changes, which mergeDirectCleanUp could then move into tree. I want to make that change, but this is not the time to do it.	2014-03-03 17:09:53 -04:00
Joey Hess	1192d98721	sync: Fix bug in direct mode that caused a file not checked into git to be deleted when merging with a remote that added a file by the same name. (Thanks, jkt)	2014-03-03 14:57:16 -04:00
Joey Hess	04b77328ef	fix handling of nonexistant hook	2014-03-03 13:59:36 -04:00
Joey Hess	d0fce426c4	pre-commit-annex hook script to automatically extract metadata from lots of types of files Using the extract(1) program to do the heavy lifting. Decided to make git-annex run pre-commit-annex when committing. Since git-annex pre-commit also runs it, it'll be run when git commit is run too, via the pre-commit hook. This basically gives back the pre-commit hook that git-annex took away. The implementation avoids repeatedly looking for the hook script when the assistant is running and committing repeatedly; only checks if the hook is available once. To make the script simpler, made git-annex metadata -s field?=value only set a field when it's not already got a value. This commit was sponsored by bak.	2014-03-02 20:11:58 -04:00
Joey Hess	308d4b67f3	fix combining of FIlterValues	2014-03-02 15:44:14 -04:00
Joey Hess	7d9486a709	vadd: Allow listing multiple desired values for a field.	2014-03-02 15:36:45 -04:00
Joey Hess	c2e8c21ca6	view, vfilter: Add support for filtering tags and values out of a view, using !tag and field!=value. Note that negated globs are not supported. Would have complicated the code to add them, without changing the data type serialization in a non-backwards-compatable way. This commit was sponsored by Denver Gingerich.	2014-03-02 14:53:19 -04:00
Joey Hess	7ac37a7854	Probe for quvi version at run time. Overhead: git annex addurl runs quvi --version once. And more bloat to Annex state..	2014-02-28 14:54:02 -04:00
Joey Hess	a1432bce2f	Put non-object tmp files in .git/annex/misctmp, leaving .git/annex/tmp for only partially transferred objects. This allows eg, putting .git/annex/tmp on a ram disk, if the disk IO of temp object files is too annoying (and if you don't want to keep partially transferred objects across reboots). .git/annex/misctmp must be on the same filesystem as the git work tree, since files are moved to there in a way that will not work cross-device, as well as symlinked into there. I first wanted to put the tmp objects in .git/annex/objects/tmp, but that would pose transition problems on upgrade when partially transferred objects existed. git annex info does not currently show the size of .git/annex/misctemp, since it should stay small. It would also be ok to make something clean it out, periodically.	2014-02-26 16:52:56 -04:00
Joey Hess	06e9080f01	metadata: FIeld names are now case insensative.	2014-02-25 18:45:09 -04:00
Joey Hess	b9147b4012	fix test to work on Windows	2014-02-25 18:09:45 -04:00
Joey Hess	003fc2b7e1	add UrlOptions sum type	2014-02-24 22:00:25 -04:00
Joey Hess	c69d6eb035	Make annex.web-options be used in several places that call curl.	2014-02-24 21:29:37 -04:00
Joey Hess	8d5158fa31	Preserve metadata when staging a new version of an annexed file. Performance impact: When adding a large tree of new files, this needs to do some git cat-file queries to check if any of the files already existed and might need a metadata copy. I tried a benchmark in a copy of my sound repository (so there was already a significant git tree to check against. Adding 10000 small files, with a cold cache: before: 1m48.539s after: 1m52.791s So, impact is 0.0004 seconds per file added. Which seems acceptable, so did not add some kind of configuration to enable/disable this. This commit was sponsored by Lisa Feilen.	2014-02-24 14:41:33 -04:00
Joey Hess	7498c5dd96	annex.genmetadata can be set to make git-annex automatically set metadata (year and month) when adding files	2014-02-23 00:08:29 -04:00
Joey Hess	fa6f553083	exclude derived metadata when extracting metadata from a viewed file	2014-02-22 18:16:28 -04:00
Joey Hess	079b35a1a8	views: add automatically constructed file location metadata When constructing views, metadata is available about the location of the file in the view's reference branch. Allows incorporating parts of the directory hierarchy in a view. For example `git annex view tag=* podcasts/=` makes a view in the form tag/showname. Performance impact: I benchmarked git annex view tag= in the conference proceedings repo to take 6.459s before this change, and 6.544s after. FWIW, I considered making the syntax for this be podcasts/, which might be easier for the user to learn. However, I think it's not as good: The user has to then juggle two different syntaxes, and podcasts/* will be expanded by the shell so they also need to quote it, while podcasts/=* is unlikely to be expanded by the shell. * It would allow for things like podcasts// and *.mp3 which do not map well into views. This commit was sponsored by Aurélien Pinceaux.	2014-02-22 16:27:53 -04:00
Joey Hess	73a5245502	prune imports	2014-02-22 14:58:05 -04:00
Joey Hess	cc0a576ab0	change directory encoding in ViewedFile such that the original directory can be extracted from it	2014-02-22 14:54:53 -04:00
Joey Hess	1435c4f149	factor out new module	2014-02-22 13:35:50 -04:00
Joey Hess	24f8136504	--metadata field=value can now use globs to match, and matches case insensatively, the same as git annex view field=value does. Also refactored glob code into its own module.	2014-02-21 18:34:34 -04:00
Joey Hess	db48b8a4a3	unused: Fix to actually detect unused keys when in direct mode.	2014-02-20 13:53:49 -04:00
Joey Hess	dd7b99c860	add tip about metadata driven views (and more flexible view filtering) While writing this documentation, I realized that there needed to be a way to stay in a view like tag=* while adding a filter like tag=work that applies to the same field. So, there are really two ways a view can be refined. It can have a new "field=explicitvalue" filter added to it, which does not change the "shape" of the view, but narrows the files it shows. Or, it can have a new view added, which adds another level of subdirectories. So, added a vfilter command, which takes explicit values to add to the filter, and rejects changes that would change the shape of the view. And, made vadd only accept changes that change the shape of the view. And, changed the View data type slightly; now components that can match multiple metadata values can be visible, or not visible. This commit was sponsored by Stelian Iancu.	2014-02-19 16:29:56 -04:00
Joey Hess	39ebfa1a2e	pre-commit: Update metadata when committing changes to annexed files within a view. So the user can now switch to a view and then move files around within it to manage metadata. For example, moving a file into a new directory when in the tags=* view adds a tag to it. Implementation is fairly efficient. One diff-index, which is no more expensive than the first stage of a git commit, followed by possibly some cat-file --batch traffic to find the key (when deleting a file). Very similar to what's done in direct mode when committing. And like direct mode when updating the WC after a merge, it has to buffer the diff-tree values in order to make 2 passes over them. When not in a view, pre-commit now does one extra git symbolic-ref, which is tiny overhead. This commit was sponsored by Andrew Eskridge.	2014-02-19 14:17:58 -04:00
Joey Hess	1fe6cd3c0d	decruft	2014-02-19 02:32:22 -04:00
Joey Hess	fb266e2da6	make view globs case-insensative, memoized, and bring back TFDA I was careful to write the code so its clear how laziness memoizes it, although it's likely that much less explicit currying would have had the same effect. Verified that the memoization works using a Debug.Trace.	2014-02-19 02:30:14 -04:00
Joey Hess	0b7ede2088	reject views with too many nested subdirs	2014-02-19 01:28:48 -04:00
Joey Hess	4e0be2792b	remove Read instance for Ref Removed instance, got it all to build using fromRef. (With a few things that really need to show something using a ref for debugging stubbed out.) Then added back Read instance, and made Logs.View use it for serialization. This changes the view log format.	2014-02-19 01:19:57 -04:00
Joey Hess	72c118152f	fix view changing when in subdir Failed reading some files with relative paths. This is a quick and dirty fix.	2014-02-18 20:57:14 -04:00
Joey Hess	9b51d43318	view: preserve toplevel dotfiles	2014-02-18 20:32:00 -04:00
Joey Hess	5790acc19f	improve view filenames	2014-02-18 20:01:44 -04:00
Joey Hess	67fd06af76	add git annex view command (And a vpop command, which is still a bit buggy.) Still need to do vadd and vrm, though this also adds their documentation. Currently not very happy with the view log data serialization. I had to lose the TDFA regexps temporarily, so I can have Read/Show instances of View. I expect the view log format will change in some incompatable way later, probably adding last known refs for the parent branch to View or something like that. Anyway, it basically works, although it's a bit slow looking up the metadata. The actual git branch construction is about as fast as it can be using the current git plumbing. This commit was sponsored by Peter Hogg.	2014-02-18 18:22:20 -04:00
Joey Hess	103dab702b	better data types	2014-02-17 00:38:33 -04:00
Joey Hess	e806152f77	split out types	2014-02-17 00:18:57 -04:00
Joey Hess	d7a95098fb	tricky view refining code that keeps track of whether the view is widenening or narrowing	2014-02-16 22:44:28 -04:00
Joey Hess	410f603383	support globs when built w/o TDFA, just slower	2014-02-16 21:26:57 -04:00
Joey Hess	613f8f02e3	add another quickcheck property, and several edge cases handled	2014-02-16 21:00:12 -04:00
Joey Hess	81628d24c8	simplify type	2014-02-16 17:46:52 -04:00
Joey Hess	9633c67842	filter branches (incomplete) Promosing work toward metadata driven filter branches. A few methods to construct them are stubbed out; all the data types and pure code seems good. This commit was sponsored by Walter Somerville.	2014-02-16 17:39:54 -04:00
Joey Hess	2075cdeb59	limiting files based on metadata Note that there is currently no caching, so --metadata foo=bar --metadata tag=blah will currently read the log 2x per file.	2014-02-13 02:24:30 -04:00
Joey Hess	9f7e76130e	add metadata command to get/set metadata Adds metadata log, and command. Note that unsetting field values seems to currently be broken. And in general this has had all of 2 minutes worth of testing. This commit was sponsored by Julien Lefrique.	2014-02-12 21:30:33 -04:00
Joey Hess	2d480602aa	random hlint (to give the autobuilder something new to build)	2014-02-11 00:41:19 -04:00
Joey Hess	43d17632f6	remove workaround for old bug `ef24751922` described a bug moving between remotes in direct mode; I can no longer reproduce it with this strange workaround removed. Also test suite still passes. Hope the broken code just got fixed in the meantime.	2014-02-06 17:36:14 -04:00
Joey Hess	897d877472	work around absNormPath not working on Windows When making git-annex links, we want unix-style paths in the link targets.	2014-02-06 17:17:35 -04:00
Joey Hess	a44e01c29c	--in can now refer to files that were located in a repository at some past date. For example, --in="here@{yesterday}"	2014-02-06 12:43:56 -04:00
Joey Hess	08afe3a1f6	fix failing test case on Windows ensure file being modified is all read before it's opened for write	2014-02-03 10:20:18 -04:00
Joey Hess	1572c460e8	avoid using openFile when withFile can be used Potentially fixes some FD leak if an action on an opened file handle fails for some reason. There have been some hard to reproduce reports of git-annex leaking FDs, and this may solve them.	2014-02-03 10:19:06 -04:00
Joey Hess	fd1382f96f	factor out utility function	2014-02-03 10:08:28 -04:00
Joey Hess	a542781f6a	remove some monkey faces	2014-02-01 17:14:38 -04:00
Joey Hess	1669e80e85	Windows: Avoid using unix-compat's rename, which refuses to rename directories. Opened a bug about this: https://github.com/jystic/unix-compat/issues/10	2014-01-29 15:19:03 -04:00
Joey Hess	721cc0cd22	rework annexed object locking in direct mode & support Windows Seems that locking of annexed objects when they're being dropped was broken in direct mode: * When taking the lock before dropping, it created the .git/annex/objects file, as an empty file. It seems that the dropping code deleted that, but that is not right, and for all I know could in some situation cause a corrupted object to leak out. * When the lock was checked, it actually tried to open each direct mode file, and checked if it was locked. Not the same lock used above, and could also fail if some consumer of the file locked it. Fixed this, and added windows support by switching direct mode to lock a .lck file.	2014-01-28 16:43:11 -04:00
Joey Hess	891c85cd88	use locking on Windows This is all the easy cases, where there was already a separate lock file.	2014-01-28 14:42:03 -04:00
Joey Hess	2ecd42b43b	remove debug print just saw it legitimately occur when 2 git-annex were running	2014-01-26 17:04:12 -04:00
Joey Hess	74b101d1dd	reorg	2014-01-26 16:36:31 -04:00
Joey Hess	b93e485ef1	added annex.secure-erase-command config option.	2014-01-24 12:58:52 -04:00
Joey Hess	3518c586cf	fix transfers of key with no associated file Several places assumed this would not happen, and when the AssociatedFile was Nothing, did nothing. As part of this, preferred content checks pass the Key around. Note that checkMatcher is sometimes now called with Just Key and Just File. It currently constructs a FileMatcher, ignoring the Key. However, if it constructed a FileKeyMatcher, which contained both, then it might be possible to speed up parts of Limit, which currently call the somewhat expensive lookupFileKey to get the Key. I have not made this optimisation yet, because I am not sure if the key is always the same. Will need some significant checking to satisfy myself that's the case..	2014-01-23 16:44:02 -04:00
Joey Hess	4b55afe9e9	add "unused" preferred content expression With a really nice optimisation that keeps it from having any overhead in normal operation! This commit was sponsored by Ulises Vitulli.	2014-01-22 16:35:32 -04:00
Joey Hess	f2713a3bb9	benchmarked numcopies .gitattributes in preferred content Checking .gitattributes adds a full minute to a git annex find looking for files that don't have enough copies. 2:25 increasts to 3:27. I feel this is too much of a slowdown to justify making it the default. So, exposed two versions of the preferred content expression, a slow one and a fast but approximate one. I'm using the approximate one in the default preferred content expressions to avoid slowing down the assistant.	2014-01-21 18:49:25 -04:00
Joey Hess	f7cdc40f7b	reorg	2014-01-21 18:08:56 -04:00
Joey Hess	0ef282a116	numcopies cleanup, part 2 This includes several bug fixes.	2014-01-21 17:25:39 -04:00
Joey Hess	b40df4f0d0	reorganize numcopies code (no behavior changes) Move stuff into Logs.NumCopies. Add a NumCopies newtype. Better names for various serialization classes that are specific to one thing or another.	2014-01-21 16:08:59 -04:00
Joey Hess	3159da2693	Add and use numcopiesneeded preferred content expression. * Add numcopiesneeded preferred content expression. * Client, transfer, incremental backup, and archive repositories now want to get content that does not yet have enough copies. This means the asssistant will make copies of files that don't yet meet the configured numcopies, even to places that would not normally want the file. For example, if numcopies is 4, and there are 2 client repos and 2 transfer repos, and 2 removable backup drives, the file will be sent to both transfer repos in order to make 4 copies. Once a removable drive get a copy of the file, it will be dropped from one transfer repo or the other (but not both). Another example, numcopies is 3 and there is a client that has a backup removable drive and two small archive repos. Normally once one of the small archives has a file, it will not be put into the other one. But, to satisfy numcopies, the assistant will duplicate it into the other small archive too, if the backup repo is not available to receive the file. I notice that these examples are fairly unlikely setups .. the old behavior was not too bad, but it's nice to finally have it really correct. .. Almost. I have skipped checking the annex.numcopies .gitattributes out of fear it will be too slow. This commit was sponsored by Florian Schlegel.	2014-01-20 17:35:29 -04:00
Joey Hess	d66535f065	global numcopies setting * numcopies: New command, sets global numcopies value that is seen by all clones of a repository. * The annex.numcopies git config setting is deprecated. Once the numcopies command is used to set the global number of copies, any annex.numcopies git configs will be ignored. * assistant: Make the prefs page set the global numcopies. This global numcopies setting is needed to let preferred content expressions operate on numcopies. It's also convenient, because typically if you want git-annex to preserve N copies of files in a repo, you want it to do that no matter which repo it's running in. Making it global avoids needing to warn the user about gotchas involving inconsistent annex.numcopies settings. (See changes to doc/numcopies.mdwn.) Added a new variety of git-annex branch log file, that holds only 1 value. Will probably be useful for other stuff later. This commit was sponsored by Nicolas Pouillard.	2014-01-20 16:47:56 -04:00
Joey Hess	73c420ffcf	much better command action handling for sync --content	2014-01-20 13:31:03 -04:00
Joey Hess	34c8af74ba	fix inversion of control in CommandSeek (no behavior changes) I've been disliking how the command seek actions were written for some time, with their inversion of control and ugly workarounds. The last straw to fix it was sync --content, which didn't fit the Annex [CommandStart] interface well at all. I have not yet made it take advantage of the changed interface though. The crucial change, and probably why I didn't do it this way from the beginning, is to make each CommandStart action be run with exceptions caught, and if it fails, increment a failure counter in annex state. So I finally remove the very first code I wrote for git-annex, which was before I had exception handling in the Annex monad, and so ran outside that monad, passing state explicitly as it ran each CommandStart action. This was a real slog from 1 to 5 am. Test suite passes. Memory usage is lower than before, sometimes by a couple of megabytes, and remains constant, even when running in a large repo, and even when repeatedly failing and incrementing the error counter. So no accidental laziness space leaks. Wall clock speed is identical, even in large repos. This commit was sponsored by an anonymous bitcoiner.	2014-01-20 04:57:36 -04:00
Joey Hess	b6ba0bd556	sync --content: New option that makes the content of annexed files be transferred. Similar to the assistant, this honors any configured preferred content expressions. I am not entirely happpy with the implementation. It would be nicer if the seek function returned a list of actions which included the individual file gets and copies and drops, rather than the current list of calls to syncContent. This would allow getting rid of the somewhat reundant display of "sync file [ok\|failed]" after the get/put display. But, do that, withFilesInGit would need to somehow be able to construct such a mixed action list. And it would be less efficient than the current implementation, which is able to reuse several values between eg get and drop. Note that currently this does not try to satisfy numcopies when getting/putting files (numcopies are of course checked when dropping files!) This makes it like the assistant, and unlike get --auto and copy --auto, which do duplicate files when numcopies is not yet satisfied. I don't know if this is the right decision; it only seemed to make sense to have this parallel the assistant as far as possible to start with, since I know the assistant works. This commit was sponsored by Øyvind Andersen Holm.	2014-01-19 17:49:54 -04:00
Joey Hess	8ce515ffe4	improve matcher data type to allow matching Keys, instead of just files (no behavior changes)	2014-01-18 14:51:55 -04:00
Joey Hess	207ac67aaa	avoid needing a build-dep on hxt for Data.AssocList	2014-01-14 16:42:10 -04:00
Joey Hess	d07f2d7865	Fix a long-standing bug that could cause the wrong index file to be used when committing to the git-annex branch, if GIT_INDEX_FILE is set in the environment. This typically resulted in git-annex branch log files being committed to the master branch and later showing up in the work tree. (These log files can be safely removed.)	2014-01-14 15:36:33 -04:00
Joey Hess	78c7c54fdb	also check diskreserve for quvi downloads	2014-01-04 15:38:59 -04:00
Joey Hess	f9e7b6cf61	addurl, importfeed: Honor annex.diskreserve as long as the size of the url can be checked. This adds a http HEAD before the download is done. That was already the case when the assistant was running, and it seems worth it to avoid filling up the whole disk, like happened to my server today.	2014-01-04 15:08:06 -04:00
Joey Hess	3e68c1c2fd	add remote state logs This allows a remote to store a piece of arbitrary state associated with a key. This is needed to support Tahoe, where the file-cap is calculated from the data stored in it, and used to retrieve a key later. Glacier also would be much improved by using this. GETSTATE and SETSTATE are added to the external special remote protocol. Note that the state is left as-is even when a key is removed from a remote. It's up to the remote to decide when it wants to clear the state. The remote state log, $KEY.log.rmt, is a UUID-based log. However, rather than using the old UUID-based log format, I created a new variant of that format. The new varient is more space efficient (since it lacks the "timestamp=" hack, and easier to parse (and the parser doesn't mess with whitespace in the value), and avoids compatability cruft in the old one. This seemed worth cleaning up for these new files, since there could be a lot of them, while before UUID-based logs were only used for a few log files at the top of the git-annex branch. The transition code has also been updated to handle these new UUID-based logs. This commit was sponsored by Daniel Hofer.	2014-01-03 16:35:57 -04:00
Joey Hess	b1d7474c1d	Auto-upgrade v3 indirect repos to v5 with no changes. This also fixes a problem when a direct mode repo was somehow set to v3 rather than v4, and so the automatic direct mode upgrade to v5 was not done.	2013-12-29 13:06:23 -04:00
Joey Hess	7d5b25515c	Add plumbing-level lookupkey command.	2013-12-15 14:02:23 -04:00
Joey Hess	bef567c31f	Fix direct mode's handling when modifications to non-annexed files are pulled from a remote. A bug prevented the files from being updated in the work tree, and this caused the modification to be reverted.	2013-12-12 15:57:09 -04:00
Joey Hess	c160bf9d88	format comment	2013-12-12 15:16:44 -04:00
Joey Hess	03932212ec	Avoid using git commit in direct mode, since in some situations it will read the full contents of files in the tree. The assistant's commit code also always avoids git commit, for simplicity. Indirect mode sync still does a git commit -a to catch unstaged changes. Note that this means that direct mode sync no longer runs the pre-commit hook or any other hooks git commit might call. The git annex pre-commit hook action for direct mode is however explicitly run. (The assistant already ran git commit with hooks disabled, so no change there.)	2013-12-01 13:59:45 -04:00
Joey Hess	b25abdb3e6	fix reversion in relative paths to local remotes of direct mode repos `0980f3dae6` broke support for local remotes from direct mode repos, because the relative path was taken to be from the gitdir, rather than from the work tree.	2013-11-26 19:33:26 -04:00
Joey Hess	f913deab78	move programPath out of Config.Files to Annex.Path This works around horribleness in the Mavericks cpp, which falls over on the #if when configure is running. Moving it avoids the file being built at that point. But it's also a location that makes sense..	2013-11-24 16:03:03 -04:00
Joey Hess	e563c7e6f4	fsck distribution key	2013-11-23 21:58:39 -04:00
Joey Hess	b8e74bf489	fix standalone build of this module	2013-11-22 12:21:37 -04:00
Joey Hess	b876df6fdb	Ensure that core.sharedrepository is honored when creating the .git/annex directory.	2013-11-18 18:20:20 -04:00
Joey Hess	310c549b5a	Ensure execute bit is set on directories when core.sharedrepsitory is set.	2013-11-18 18:13:09 -04:00
Joey Hess	5561b46416	fix windows build	2013-11-18 11:05:16 -04:00
Joey Hess	d48b00ebed	Direct mode .git/annex/objects directories are no longer left writable Because that allowed writing to symlinks of files that are not present, which followed the link and put bad content in an object location. fsck: Fix up .git/annex/object directory permissions. This commit was sponsored by an anonymous bitcoin donor.	2013-11-15 14:52:03 -04:00
Joey Hess	b0f85b3e22	Fix direct mode merge bug when a direct mode file was deleted and replaced with a directory. An ordering problem caused the directory to not get created in this case. Thanks to Tim for the test cases.	2013-11-15 13:40:12 -04:00
Joey Hess	59ecc804cd	add new status command This works for both direct and indirect mode. It may need some performance tuning. Note that unlike git status, it only shows the status of the work tree, not the status of the index. So only one status letter, not two .. and since files that have been added and not yet committed do not differ between the work tree and the index, they are not shown. Might want to add display of the index vs the last commit eventually. This commit was sponsored by an unknown bitcoin contributor, whose contribution as been going up lately! ;)	2013-11-07 14:07:25 -04:00
Joey Hess	00c91816fb	Merge branch 'master' into directguard	2013-11-06 13:02:35 -04:00
Joey Hess	81117e8a9d	typo	2013-11-06 12:39:14 -04:00
Joey Hess	ee23be55fd	Fix exception handling bug that could cause .git/annex/index to be used for git commits outside the git-annex branch. Known to affect git-annex when used with the git shipped with Ubuntu 13.10.	2013-11-06 12:21:50 -04:00
Joey Hess	3802f2f270	work around lack of receive.denyCurrentBranch in direct mode Now that direct mode sets core.bare=true, git's normal prohibition about pushing into the currently checked out branch doesn't work. A simple fix for this would be an update hook which blocks the pushes.. but git hooks must be executable, and git-annex needs to be usable on eg, FAT, which lacks x bits. Instead, enabling direct mode switches the branch (eg master) to a special purpose branch (eg annex/direct/master). This branch is not pushed when syncing; instead any changes that git annex sync commits get written to master, and it's pushed (along with synced/master) to the remote. Note that initialization has been changed to always call setDirect, even if it's just setDirect False for indirect mode. This is needed because if the user has just cloned a direct mode repo, that nothing has synced with before, it may have no master branch, and only a annex/direct/master. Resulting in that branch being checked out locally too. Calling setDirect False for indirect mode moves back out of this branch, to a new master branch, and ensures that a manual "git push" doesn't push changes directly to the annex/direct/master of the remote. (It's possible that the user makes a commit w/o using git-annex and pushes it, but nothing I can do about that really.) This commit was sponsored by Jonathan Harrington.	2013-11-05 21:08:31 -04:00
Joey Hess	4510819215	v5 for direct mode, with automatic upgrade This includes storing the current state of the HEAD ref, which git annex sync is going to need, but does not make sync use it.	2013-11-05 17:05:03 -04:00
Joey Hess	0edd9ec03a	refactored hook setup	2013-11-05 15:29:56 -04:00
Joey Hess	230bfa9688	add --want-get and --want-drop options New --want-get and --want-drop options which can be used to test preferred content settings. For example, "git annex find --in . --want-drop"	2013-10-28 14:50:17 -04:00
Joey Hess	049e80e865	refactor	2013-10-28 14:05:55 -04:00
Joey Hess	435ea52f3c	repair command: add handling of git-annex branch and index	2013-10-23 13:00:45 -04:00
Joey Hess	4f871f89ba	git-recover-repository 1/2 done	2013-10-20 17:50:51 -04:00
Joey Hess	19816bca41	update for DiffTree type change (which fixes assistant in subdir confusion bug)	2013-10-17 15:11:21 -04:00
Joey Hess	78acbfeb6a	ensure merge directory is empty before starting merge Don't want some past failed merge to lead to bad results, potentially.	2013-10-16 14:57:58 -04:00
Joey Hess	18f4d1b400	queue downloads of keys that fsck finds with bad content	2013-10-10 17:27:00 -04:00
Joey Hess	267c124f67	run ssh in the directory with its socket when stopping This guarantees that stopping an existing socket never fails. This might be the route out of the mess of needing to worry about socket lengths in general. However, it would need quite a lot of refactoring to make every place in git-annex that runs ssh run it with a cwd that was determined by the location of its connection caching socket. If this wasn't already such a mess, I'd consider even the thought of that API a bad idea..	2013-10-06 21:11:39 -04:00
Joey Hess	6f38426cb8	work around ssh brain-damange The control socket path passed to ssh needs to be 17 characters shorter than the maximum unix domain socket length, because ssh appends stuff to it to make a temporary filename. Closes: #725512 Also, take the shorter of the relative and the absolute paths to the socket. Typically the relative path will be a lot shorter (unless deep inside a subdirectory of the repository), and so using it will avoid flirting with the maximum safe socket lenghts in more situations, and so lead to less breakage if all my attempts at fixing this are still buggy.	2013-10-06 20:59:36 -04:00
Joey Hess	f8880c4fe4	Automatically and safely detect and recover from dangling .git/annex/index.lock files, which would prevent git from committing to the git-annex branch, eg after a crash.	2013-10-03 15:43:08 -04:00
Joey Hess	83b4b8d589	rename confusing function The index.lck file is not a lock file. Kept the historical name for now as changing it would be work.	2013-10-03 15:06:58 -04:00
Joey Hess	f2ee4ef86d	ensure that commitBranch is only called when the journal is locked This is not strictly a requirement, since it does not actually update the journal. But it's a nice invariant to enforce.	2013-10-03 14:48:46 -04:00
Joey Hess	56c3f68a53	use types to partially prove correctness of journal locking code My implementation does not guard against double locking of the journal. But it does ensure that the journal is always locked when operated on, by using a type that is only produced by lockJournal, and which is required as a parameter of all functions that operate on the journal. Note that I had to add the fooStale functions for cases where it does not make sense to lock the journal when querying it. I was more concerned about ensuring that anything that modifies the journal is locked. setJournalFile's implementation ensures that any query of the journal will get one value or the other atomically, even if the journal is being changed at the time.	2013-10-03 14:41:57 -04:00
Joey Hess	7a9a16b337	lockJournal when running performTransitions This may not strictly be needed -- the transition code bypasses the journal. However, this ensures that the git-annex branch is only committed with the journal locked. This will allow for further improvements.	2013-10-03 14:37:46 -04:00
Joey Hess	12f6b9693a	Send a git-annex user-agent when downloading urls. Overridable with --user-agent option. Not yet done for S3 or WebDAV due to limitations of libraries used -- nether allows a user-agent header to be specified. This commit sponsored by Michael Zehrer.	2013-09-28 14:35:21 -04:00
Joey Hess	c45f5fbdb3	indirect: Better behavior when a file in direct mode is not owned by the user running the conversion.	2013-09-25 15:29:56 -04:00
Joey Hess	b405295aee	hlint test suite still passes	2013-09-25 03:09:06 -04:00
Joey Hess	3588729f0d	completely solve catKey memory leak Since `006cf7976f` was incomplete, not being able to get the right mode of the file when the index differs from HEAD, this is a final workaround. Only buffering the start of the file in this case avoids leaking memory. This does not prevent git-cat-file being asked to output the whole file, which needs to be consumed, and can be slow. But this only happens in a rare edge case.	2013-09-19 20:09:03 -04:00
Joey Hess	006cf7976f	more completely solve catKey memory leak Done using a mode witness, which ensures it's fixed everywhere. Fixing catFileKey was a bear, because git cat-file does not provide a nice way to query for the mode of a file and there is no other efficient way to do it. Oh, for libgit2.. Note that I am looking at tree objects from HEAD, rather than the index. Because I cat-file cannot show a tree object for the index. So this fix is technically incomplete. The only cases where it matters are: 1. A new large file has been directly staged in git, but not committed. 2. A file that was committed to HEAD as a symlink has been staged directly in the index. This could be fixed a lot better using libgit2.	2013-09-19 16:41:21 -04:00
Joey Hess	eb42bde19a	sync, pre-commit, indirect: Avoid unnecessarily catting non-symlink files from git, which can be so large it runs out of memory.	2013-09-19 14:48:42 -04:00
Joey Hess	ab9dd6d8a0	sync: Fix bug that caused direct mode mappings to not be updated when merging files into the tree on Windows.	2013-09-13 13:49:28 -04:00
Joey Hess	a48a4e2f8a	automatically derive an annex-uuid from a gcrypt-uuids	2013-09-05 16:02:39 -04:00
Joey Hess	4079f9cfe8	avoid double commit during transition The second commit had some bad refs which resulted in the race detection code running. But that commit was unnecessary anyway, it only was there to merge in the other refs.	2013-09-03 16:33:15 -04:00
Joey Hess	db83cc82d6	Merge branch 'forget' Conflicts: debian/changelog	2013-09-03 14:36:00 -04:00
Joey Hess	67fda9e669	Honor core.sharedrepository when receiving and adding files in direct mode.	2013-09-03 13:35:49 -04:00
Joey Hess	0831e18372	forget --drop-dead: Completely removes mentions of repositories that have been marked as dead from the git-annex branch. Wrote nice pure transition calculator, and ugly code to stage its results into the git-annex branch. Also had to split up several Log modules that Annex.Branch needed to use, but that themselves used Annex.Branch. The transition calculator is limited to looking at and changing one file at a time. While this made the implementation relatively easy, it precludes transitions that do stuff like deleting old url log files for keys that are being removed because they are no longer present anywhere.	2013-08-31 17:51:13 -04:00

... 3 4 5 6 7 ...

781 commits