git-annex

Author	SHA1	Message	Date
Joey Hess	207ac67aaa	avoid needing a build-dep on hxt for Data.AssocList	2014-01-14 16:42:10 -04:00
Joey Hess	d07f2d7865	Fix a long-standing bug that could cause the wrong index file to be used when committing to the git-annex branch, if GIT_INDEX_FILE is set in the environment. This typically resulted in git-annex branch log files being committed to the master branch and later showing up in the work tree. (These log files can be safely removed.)	2014-01-14 15:36:33 -04:00
Joey Hess	78c7c54fdb	also check diskreserve for quvi downloads	2014-01-04 15:38:59 -04:00
Joey Hess	f9e7b6cf61	addurl, importfeed: Honor annex.diskreserve as long as the size of the url can be checked. This adds a http HEAD before the download is done. That was already the case when the assistant was running, and it seems worth it to avoid filling up the whole disk, like happened to my server today.	2014-01-04 15:08:06 -04:00
Joey Hess	3e68c1c2fd	add remote state logs This allows a remote to store a piece of arbitrary state associated with a key. This is needed to support Tahoe, where the file-cap is calculated from the data stored in it, and used to retrieve a key later. Glacier also would be much improved by using this. GETSTATE and SETSTATE are added to the external special remote protocol. Note that the state is left as-is even when a key is removed from a remote. It's up to the remote to decide when it wants to clear the state. The remote state log, $KEY.log.rmt, is a UUID-based log. However, rather than using the old UUID-based log format, I created a new variant of that format. The new varient is more space efficient (since it lacks the "timestamp=" hack, and easier to parse (and the parser doesn't mess with whitespace in the value), and avoids compatability cruft in the old one. This seemed worth cleaning up for these new files, since there could be a lot of them, while before UUID-based logs were only used for a few log files at the top of the git-annex branch. The transition code has also been updated to handle these new UUID-based logs. This commit was sponsored by Daniel Hofer.	2014-01-03 16:35:57 -04:00
Joey Hess	b1d7474c1d	Auto-upgrade v3 indirect repos to v5 with no changes. This also fixes a problem when a direct mode repo was somehow set to v3 rather than v4, and so the automatic direct mode upgrade to v5 was not done.	2013-12-29 13:06:23 -04:00
Joey Hess	7d5b25515c	Add plumbing-level lookupkey command.	2013-12-15 14:02:23 -04:00
Joey Hess	bef567c31f	Fix direct mode's handling when modifications to non-annexed files are pulled from a remote. A bug prevented the files from being updated in the work tree, and this caused the modification to be reverted.	2013-12-12 15:57:09 -04:00
Joey Hess	c160bf9d88	format comment	2013-12-12 15:16:44 -04:00
Joey Hess	03932212ec	Avoid using git commit in direct mode, since in some situations it will read the full contents of files in the tree. The assistant's commit code also always avoids git commit, for simplicity. Indirect mode sync still does a git commit -a to catch unstaged changes. Note that this means that direct mode sync no longer runs the pre-commit hook or any other hooks git commit might call. The git annex pre-commit hook action for direct mode is however explicitly run. (The assistant already ran git commit with hooks disabled, so no change there.)	2013-12-01 13:59:45 -04:00
Joey Hess	b25abdb3e6	fix reversion in relative paths to local remotes of direct mode repos `0980f3dae6` broke support for local remotes from direct mode repos, because the relative path was taken to be from the gitdir, rather than from the work tree.	2013-11-26 19:33:26 -04:00
Joey Hess	f913deab78	move programPath out of Config.Files to Annex.Path This works around horribleness in the Mavericks cpp, which falls over on the #if when configure is running. Moving it avoids the file being built at that point. But it's also a location that makes sense..	2013-11-24 16:03:03 -04:00
Joey Hess	e563c7e6f4	fsck distribution key	2013-11-23 21:58:39 -04:00
Joey Hess	b8e74bf489	fix standalone build of this module	2013-11-22 12:21:37 -04:00
Joey Hess	b876df6fdb	Ensure that core.sharedrepository is honored when creating the .git/annex directory.	2013-11-18 18:20:20 -04:00
Joey Hess	310c549b5a	Ensure execute bit is set on directories when core.sharedrepsitory is set.	2013-11-18 18:13:09 -04:00
Joey Hess	5561b46416	fix windows build	2013-11-18 11:05:16 -04:00
Joey Hess	d48b00ebed	Direct mode .git/annex/objects directories are no longer left writable Because that allowed writing to symlinks of files that are not present, which followed the link and put bad content in an object location. fsck: Fix up .git/annex/object directory permissions. This commit was sponsored by an anonymous bitcoin donor.	2013-11-15 14:52:03 -04:00
Joey Hess	b0f85b3e22	Fix direct mode merge bug when a direct mode file was deleted and replaced with a directory. An ordering problem caused the directory to not get created in this case. Thanks to Tim for the test cases.	2013-11-15 13:40:12 -04:00
Joey Hess	59ecc804cd	add new status command This works for both direct and indirect mode. It may need some performance tuning. Note that unlike git status, it only shows the status of the work tree, not the status of the index. So only one status letter, not two .. and since files that have been added and not yet committed do not differ between the work tree and the index, they are not shown. Might want to add display of the index vs the last commit eventually. This commit was sponsored by an unknown bitcoin contributor, whose contribution as been going up lately! ;)	2013-11-07 14:07:25 -04:00
Joey Hess	00c91816fb	Merge branch 'master' into directguard	2013-11-06 13:02:35 -04:00
Joey Hess	81117e8a9d	typo	2013-11-06 12:39:14 -04:00
Joey Hess	ee23be55fd	Fix exception handling bug that could cause .git/annex/index to be used for git commits outside the git-annex branch. Known to affect git-annex when used with the git shipped with Ubuntu 13.10.	2013-11-06 12:21:50 -04:00
Joey Hess	3802f2f270	work around lack of receive.denyCurrentBranch in direct mode Now that direct mode sets core.bare=true, git's normal prohibition about pushing into the currently checked out branch doesn't work. A simple fix for this would be an update hook which blocks the pushes.. but git hooks must be executable, and git-annex needs to be usable on eg, FAT, which lacks x bits. Instead, enabling direct mode switches the branch (eg master) to a special purpose branch (eg annex/direct/master). This branch is not pushed when syncing; instead any changes that git annex sync commits get written to master, and it's pushed (along with synced/master) to the remote. Note that initialization has been changed to always call setDirect, even if it's just setDirect False for indirect mode. This is needed because if the user has just cloned a direct mode repo, that nothing has synced with before, it may have no master branch, and only a annex/direct/master. Resulting in that branch being checked out locally too. Calling setDirect False for indirect mode moves back out of this branch, to a new master branch, and ensures that a manual "git push" doesn't push changes directly to the annex/direct/master of the remote. (It's possible that the user makes a commit w/o using git-annex and pushes it, but nothing I can do about that really.) This commit was sponsored by Jonathan Harrington.	2013-11-05 21:08:31 -04:00
Joey Hess	4510819215	v5 for direct mode, with automatic upgrade This includes storing the current state of the HEAD ref, which git annex sync is going to need, but does not make sync use it.	2013-11-05 17:05:03 -04:00
Joey Hess	0edd9ec03a	refactored hook setup	2013-11-05 15:29:56 -04:00
Joey Hess	230bfa9688	add --want-get and --want-drop options New --want-get and --want-drop options which can be used to test preferred content settings. For example, "git annex find --in . --want-drop"	2013-10-28 14:50:17 -04:00
Joey Hess	049e80e865	refactor	2013-10-28 14:05:55 -04:00
Joey Hess	435ea52f3c	repair command: add handling of git-annex branch and index	2013-10-23 13:00:45 -04:00
Joey Hess	4f871f89ba	git-recover-repository 1/2 done	2013-10-20 17:50:51 -04:00
Joey Hess	19816bca41	update for DiffTree type change (which fixes assistant in subdir confusion bug)	2013-10-17 15:11:21 -04:00
Joey Hess	78acbfeb6a	ensure merge directory is empty before starting merge Don't want some past failed merge to lead to bad results, potentially.	2013-10-16 14:57:58 -04:00
Joey Hess	18f4d1b400	queue downloads of keys that fsck finds with bad content	2013-10-10 17:27:00 -04:00
Joey Hess	267c124f67	run ssh in the directory with its socket when stopping This guarantees that stopping an existing socket never fails. This might be the route out of the mess of needing to worry about socket lengths in general. However, it would need quite a lot of refactoring to make every place in git-annex that runs ssh run it with a cwd that was determined by the location of its connection caching socket. If this wasn't already such a mess, I'd consider even the thought of that API a bad idea..	2013-10-06 21:11:39 -04:00
Joey Hess	6f38426cb8	work around ssh brain-damange The control socket path passed to ssh needs to be 17 characters shorter than the maximum unix domain socket length, because ssh appends stuff to it to make a temporary filename. Closes: #725512 Also, take the shorter of the relative and the absolute paths to the socket. Typically the relative path will be a lot shorter (unless deep inside a subdirectory of the repository), and so using it will avoid flirting with the maximum safe socket lenghts in more situations, and so lead to less breakage if all my attempts at fixing this are still buggy.	2013-10-06 20:59:36 -04:00
Joey Hess	f8880c4fe4	Automatically and safely detect and recover from dangling .git/annex/index.lock files, which would prevent git from committing to the git-annex branch, eg after a crash.	2013-10-03 15:43:08 -04:00
Joey Hess	83b4b8d589	rename confusing function The index.lck file is not a lock file. Kept the historical name for now as changing it would be work.	2013-10-03 15:06:58 -04:00
Joey Hess	f2ee4ef86d	ensure that commitBranch is only called when the journal is locked This is not strictly a requirement, since it does not actually update the journal. But it's a nice invariant to enforce.	2013-10-03 14:48:46 -04:00
Joey Hess	56c3f68a53	use types to partially prove correctness of journal locking code My implementation does not guard against double locking of the journal. But it does ensure that the journal is always locked when operated on, by using a type that is only produced by lockJournal, and which is required as a parameter of all functions that operate on the journal. Note that I had to add the fooStale functions for cases where it does not make sense to lock the journal when querying it. I was more concerned about ensuring that anything that modifies the journal is locked. setJournalFile's implementation ensures that any query of the journal will get one value or the other atomically, even if the journal is being changed at the time.	2013-10-03 14:41:57 -04:00
Joey Hess	7a9a16b337	lockJournal when running performTransitions This may not strictly be needed -- the transition code bypasses the journal. However, this ensures that the git-annex branch is only committed with the journal locked. This will allow for further improvements.	2013-10-03 14:37:46 -04:00
Joey Hess	12f6b9693a	Send a git-annex user-agent when downloading urls. Overridable with --user-agent option. Not yet done for S3 or WebDAV due to limitations of libraries used -- nether allows a user-agent header to be specified. This commit sponsored by Michael Zehrer.	2013-09-28 14:35:21 -04:00
Joey Hess	c45f5fbdb3	indirect: Better behavior when a file in direct mode is not owned by the user running the conversion.	2013-09-25 15:29:56 -04:00
Joey Hess	b405295aee	hlint test suite still passes	2013-09-25 03:09:06 -04:00
Joey Hess	3588729f0d	completely solve catKey memory leak Since `006cf7976f` was incomplete, not being able to get the right mode of the file when the index differs from HEAD, this is a final workaround. Only buffering the start of the file in this case avoids leaking memory. This does not prevent git-cat-file being asked to output the whole file, which needs to be consumed, and can be slow. But this only happens in a rare edge case.	2013-09-19 20:09:03 -04:00
Joey Hess	006cf7976f	more completely solve catKey memory leak Done using a mode witness, which ensures it's fixed everywhere. Fixing catFileKey was a bear, because git cat-file does not provide a nice way to query for the mode of a file and there is no other efficient way to do it. Oh, for libgit2.. Note that I am looking at tree objects from HEAD, rather than the index. Because I cat-file cannot show a tree object for the index. So this fix is technically incomplete. The only cases where it matters are: 1. A new large file has been directly staged in git, but not committed. 2. A file that was committed to HEAD as a symlink has been staged directly in the index. This could be fixed a lot better using libgit2.	2013-09-19 16:41:21 -04:00
Joey Hess	eb42bde19a	sync, pre-commit, indirect: Avoid unnecessarily catting non-symlink files from git, which can be so large it runs out of memory.	2013-09-19 14:48:42 -04:00
Joey Hess	ab9dd6d8a0	sync: Fix bug that caused direct mode mappings to not be updated when merging files into the tree on Windows.	2013-09-13 13:49:28 -04:00
Joey Hess	a48a4e2f8a	automatically derive an annex-uuid from a gcrypt-uuids	2013-09-05 16:02:39 -04:00
Joey Hess	4079f9cfe8	avoid double commit during transition The second commit had some bad refs which resulted in the race detection code running. But that commit was unnecessary anyway, it only was there to merge in the other refs.	2013-09-03 16:33:15 -04:00
Joey Hess	db83cc82d6	Merge branch 'forget' Conflicts: debian/changelog	2013-09-03 14:36:00 -04:00

1 2 3 4 5 ...

383 commits