git-annex

Author	SHA1	Message	Date
Joey Hess	138d25518d	Merge branch 'master' into remotecontrol Conflicts: doc/devblog/day_152__more_ssh_connection_caching.mdwn	2014-04-14 13:38:35 -04:00
Joey Hess	e53a85743e	adjust to not use cpp in modules used by configure	2014-04-14 13:37:12 -04:00
Joey Hess	f67d5abc41	support gcrypt remotes (assuming them to be over ssh transport)	2014-04-08 16:16:46 -04:00
Joey Hess	43909723b3	added git-annex remotedaemon So far, handling connecting to git-annex-shell notifychanges, and pulling immediately when a change is pushed to a remote. A little bit buggy (crashes after the first pull), but it already works! This commit was sponsored by Mark Sheppard.	2014-04-06 19:10:23 -04:00
Joey Hess	1052eeface	Windows: Fix some filename encoding bugs. http://git-annex.branchable.com/bugs/Unicode_file_names_ignored_on_Windows/ Not a complete fix yet.	2014-03-19 15:57:56 -04:00
Joey Hess	67f09bca6d	fully fix fsck memory use by iterative fscking Not very well tested, but I'm sure it doesn't eg, loop forever.	2014-03-12 15:18:43 -04:00
Joey Hess	475bf70af6	read stdout and stderr concurrently Avoids any buffering-related blocking.	2014-03-12 13:54:29 -04:00
Joey Hess	85d13b4302	better streaming when cleaning up corrupt objects A repo with a lot of objects will now stream them through, rather than buffering a list of them all in memory.	2014-03-10 16:36:18 -04:00
Joey Hess	0e0d396b27	Improve memory usage when git fsck finds a great many broken objects. From 1.7 gb to 900 mb on 300 thousand unique reported shas. When shas are not unique, this streams much better than before, so won't buffer the full list before putting them into the Set and throwing away dups. And when fsck output includes ignorable lines, especially dangling object lines, they won't be buffered in memory at all.	2014-03-10 15:14:09 -04:00
Joey Hess	8496d8aa63	improved direct mode dir/file conflicted merge resultion, using tree grafting	2014-03-04 15:00:19 -04:00
Joey Hess	1192d98721	sync: Fix bug in direct mode that caused a file not checked into git to be deleted when merging with a remote that added a file by the same name. (Thanks, jkt)	2014-03-03 14:57:16 -04:00
Joey Hess	d0fce426c4	pre-commit-annex hook script to automatically extract metadata from lots of types of files Using the extract(1) program to do the heavy lifting. Decided to make git-annex run pre-commit-annex when committing. Since git-annex pre-commit also runs it, it'll be run when git commit is run too, via the pre-commit hook. This basically gives back the pre-commit hook that git-annex took away. The implementation avoids repeatedly looking for the hook script when the assistant is running and committing repeatedly; only checks if the hook is available once. To make the script simpler, made git-annex metadata -s field?=value only set a field when it's not already got a value. This commit was sponsored by bak.	2014-03-02 20:11:58 -04:00
Joey Hess	f8cfcd4e44	couple more warning fixes	2014-02-25 14:53:43 -04:00
Joey Hess	3f6e4b8c7c	fix all remaining -Wall warnings on Windows	2014-02-25 14:48:50 -04:00
Joey Hess	46cc39f1a4	repair: Optimise unpacking of pack files, and avoid repeated error messages about corrupt pack files.	2014-02-24 19:36:58 -04:00
Joey Hess	4e0be2792b	remove Read instance for Ref Removed instance, got it all to build using fromRef. (With a few things that really need to show something using a ref for debugging stubbed out.) Then added back Read instance, and made Logs.View use it for serialization. This changes the view log format.	2014-02-19 01:19:57 -04:00
Joey Hess	67fd06af76	add git annex view command (And a vpop command, which is still a bit buggy.) Still need to do vadd and vrm, though this also adds their documentation. Currently not very happy with the view log data serialization. I had to lose the TDFA regexps temporarily, so I can have Read/Show instances of View. I expect the view log format will change in some incompatable way later, probably adding last known refs for the parent branch to View or something like that. Anyway, it basically works, although it's a bit slow looking up the metadata. The actual git branch construction is about as fast as it can be using the current git plumbing. This commit was sponsored by Peter Hogg.	2014-02-18 18:22:20 -04:00
Joey Hess	9633c67842	filter branches (incomplete) Promosing work toward metadata driven filter branches. A few methods to construct them are stubbed out; all the data types and pure code seems good. This commit was sponsored by Walter Somerville.	2014-02-16 17:39:54 -04:00
Joey Hess	61ecf76644	unbreak the build	2014-02-12 14:34:01 -04:00
Joey Hess	029a1c431a	remove windows --git-dir unix style path hack This is no longer necessary, at least with msysgit 1.8.5.2.msysgit.0. Its root cause may have been fixed by other recent git path fixes. It was causing the webapp to fail to make repos on other drives.	2014-02-11 16:12:22 -04:00
Joey Hess	c95d0cf7a8	Windows: Fix handling of absolute unix-style git repository paths. Note that on Windows a remote with a path like /home/foo/bar is interpreted by git as being some screwy relative path (relative to what exactly seems ill-defined -- it seemed relative to C:\Program Files\Git\ in my tests!) So no attempt has been made to handle such a path sanely, just not to crash when encountering it. Note that "C:\\foo" </> "/home/foo/bar" yields /home/foo/bar even though that is not absolute! I don't know what to make of all this, except that I will be very happy when this crock of **** vanishes from the face of the earth.	2014-02-08 15:39:04 -04:00
Joey Hess	92edee0b04	remove workaround This was needed when absNormPath was not being used on Windows, since path normalization includes removing ./	2014-02-08 14:47:57 -04:00
Joey Hess	a44e01c29c	--in can now refer to files that were located in a repository at some past date. For example, --in="here@{yesterday}"	2014-02-06 12:43:56 -04:00
Joey Hess	ed7c61914c	assistant: Run the periodic git gc in batch mode.	2014-01-22 17:11:41 -04:00
Joey Hess	78ead70ea4	repair: Check git version at run time.	2014-01-21 13:22:48 -04:00
Joey Hess	4e19e87921	repair: Fix bug in packed refs file exploding code that caused a .gitrefs directory to be created instead of .git/refs	2014-01-15 16:34:18 -04:00
Joey Hess	5e6e89f423	repair: Support old git versions from before git fsck --no-dangling was implemented.	2014-01-13 18:10:45 -04:00
Joey Hess	858eb26303	Avoid looping if long-running git cat-file or git hash-object crashes and keeps crashing when restarted.	2014-01-01 21:42:25 -04:00
Joey Hess	49aad120b9	Windows: Fix bug in direct mode merge code that could cause files in subdirectories to go missing.	2013-12-31 16:39:11 -04:00
Richard Hartmann	974fe009bf	Another round of s/amoung/among/	2013-12-19 12:30:53 -04:00
Joey Hess	c99d6a8151	assistant: Fix OSX-specific bug that caused the startup scan to try to follow symlinks to other directories, and add their contents to the annex.	2013-12-18 15:05:29 -04:00
Joey Hess	625076f9a5	status: Ignore new files that are gitignored.	2013-12-12 14:01:24 -04:00
Joey Hess	e6c4f550d8	repair: Remove damaged git-annex sync branches.	2013-12-10 16:17:49 -04:00
Joey Hess	b37323d857	update	2013-12-10 15:48:24 -04:00
Joey Hess	c0ce3269e9	accidentially committed wrong version of file	2013-12-10 15:45:22 -04:00
Joey Hess	ce045a51af	Improve repair of git-annex index file. Fixes a test case I received where a corrupted repo was repaired, but the git-annex branch was not. The root of the problem was that the MissingObject returned by the repair code was not necessarily a complete set of all objects that might have been deleted during the repair. So, stop trying to return that at all, and instead make the index file checking code explicitly verify that each object the index uses is present.	2013-12-10 15:40:01 -04:00
Joey Hess	c717905d15	work around msysgit very strange behavior on ./ or .\ at start of path Seems that verify_path() rejects such a path on Windows, but I cannot see why. Git bug?	2013-12-04 23:49:18 -04:00
Joey Hess	4882a611e5	assistant: Batch jobs are now run with ionice and nocache, when those commands are available.	2013-12-01 14:53:15 -04:00
Joey Hess	03932212ec	Avoid using git commit in direct mode, since in some situations it will read the full contents of files in the tree. The assistant's commit code also always avoids git commit, for simplicity. Indirect mode sync still does a git commit -a to catch unstaged changes. Note that this means that direct mode sync no longer runs the pre-commit hook or any other hooks git commit might call. The git annex pre-commit hook action for direct mode is however explicitly run. (The assistant already ran git commit with hooks disabled, so no change there.)	2013-12-01 13:59:45 -04:00
Joey Hess	6edac746f0	merge improved fsck types from git-repair and some associated changes	2013-11-30 14:29:11 -04:00
Joey Hess	0980f3dae6	Fix bug that broke switching between local repositories in the webapp when they use the new guarded direct mode. git treats eg ~/annex as a bare git repository located in ~/.annex/.git if ~/annex/.git/config has core.bare=true.	2013-11-22 23:27:15 -04:00
Joey Hess	d490bbb891	make runRepairOf run preRepair This may be a little late, since a fsck has already been done, but it can't hurt.	2013-11-21 20:13:55 -04:00
Joey Hess	7d682dd844	merge from git-repair	2013-11-21 20:07:44 -04:00
Joey Hess	ff2b0a9df6	merge from git-repair	2013-11-21 00:43:30 -04:00
Joey Hess	8217e97d88	merge from git-repair	2013-11-20 19:34:30 -04:00
Joey Hess	e80d935b53	merge from git-repair	2013-11-20 19:16:42 -04:00
Joey Hess	8a466247ed	merge from git-repair	2013-11-20 18:45:22 -04:00
Joey Hess	7dbb702edd	merge from git-repair	2013-11-20 18:31:00 -04:00
Joey Hess	ef34316c45	fix repair failure that occurred when index was corrupted, and other objects too In this case, the index problem prevented fsck from finding the other problems.	2013-11-19 17:16:33 -04:00
Joey Hess	b1ed98636b	merge with git-repair	2013-11-19 17:08:57 -04:00
Joey Hess	b245aa40df	moving git-repair to its own package	2013-11-18 13:24:55 -04:00
Joey Hess	eab4470440	better handling of missing index file	2013-11-13 14:39:26 -04:00
Joey Hess	13108b7196	assistant: Notice on startup when the index file is corrupt, and auto-repair.	2013-11-13 14:27:17 -04:00
Joey Hess	5e7e0c7dc0	repair: Handle case where index file is corrupt, but all objects are ok.	2013-11-13 13:41:02 -04:00
Joey Hess	958312885f	webapp: Improve UI around remote that have no annex.uuid set, either because setup of them is incomplete, or because the remote git repository is not a git-annex repository. Complicated by such repositories potentially being repos that should have an annex.uuid, but it failed to be gotten, perhaps due to the past ssh repo setup bugs. This is handled now by an Upgrade Repository button.	2013-11-07 18:02:00 -04:00
Joey Hess	59ecc804cd	add new status command This works for both direct and indirect mode. It may need some performance tuning. Note that unlike git status, it only shows the status of the work tree, not the status of the index. So only one status letter, not two .. and since files that have been added and not yet committed do not differ between the work tree and the index, they are not shown. Might want to add display of the index vs the last commit eventually. This commit was sponsored by an unknown bitcoin contributor, whose contribution as been going up lately! ;)	2013-11-07 14:07:25 -04:00
Joey Hess	3802f2f270	work around lack of receive.denyCurrentBranch in direct mode Now that direct mode sets core.bare=true, git's normal prohibition about pushing into the currently checked out branch doesn't work. A simple fix for this would be an update hook which blocks the pushes.. but git hooks must be executable, and git-annex needs to be usable on eg, FAT, which lacks x bits. Instead, enabling direct mode switches the branch (eg master) to a special purpose branch (eg annex/direct/master). This branch is not pushed when syncing; instead any changes that git annex sync commits get written to master, and it's pushed (along with synced/master) to the remote. Note that initialization has been changed to always call setDirect, even if it's just setDirect False for indirect mode. This is needed because if the user has just cloned a direct mode repo, that nothing has synced with before, it may have no master branch, and only a annex/direct/master. Resulting in that branch being checked out locally too. Calling setDirect False for indirect mode moves back out of this branch, to a new master branch, and ensures that a manual "git push" doesn't push changes directly to the annex/direct/master of the remote. (It's possible that the user makes a commit w/o using git-annex and pushes it, but nothing I can do about that really.) This commit was sponsored by Jonathan Harrington.	2013-11-05 21:08:31 -04:00
Joey Hess	cf34e59c8c	factor out update	2013-11-05 18:20:52 -04:00
Joey Hess	4510819215	v5 for direct mode, with automatic upgrade This includes storing the current state of the HEAD ref, which git annex sync is going to need, but does not make sync use it.	2013-11-05 17:05:03 -04:00
Joey Hess	04768e44b2	automatically set and unset core.bare when switching to/from direct mode	2013-11-05 15:41:24 -04:00
Joey Hess	0edd9ec03a	refactored hook setup	2013-11-05 15:29:56 -04:00
Joey Hess	c2862d9585	pass -c option on to all git commands run The -c option now not only modifies the git configuration seen by git-annex, but it is passed along to every git command git-annex runs. This was easy to plumb through because gitCommandLine is already used to construct every git command line, to add --git-dir and --work-tree	2013-11-05 13:38:37 -04:00
Joey Hess	58db042033	map: Work when there are gcrypt remotes.	2013-11-04 14:14:44 -04:00
Joey Hess	7ed8e87a34	assistant: Support repairing git remotes that are locally accessible (eg, on removable drives) gcrypt remotes are not yet handled. This commit was sponsored by Sören Brunk.	2013-10-27 15:38:59 -04:00
Joey Hess	0036139b33	wire git repair into webapp	2013-10-23 14:43:58 -04:00
Joey Hess	1ab2ad86c7	minor	2013-10-23 13:19:37 -04:00
Joey Hess	435ea52f3c	repair command: add handling of git-annex branch and index	2013-10-23 13:00:45 -04:00
Joey Hess	d5eb85acf4	add repair command	2013-10-23 12:21:59 -04:00
Joey Hess	d345e5b52f	add git fsck to cronner, and UI for repository repair (not yet wired up)	2013-10-22 16:02:52 -04:00
Joey Hess	44bb9a808f	clean warnings	2013-10-22 14:52:17 -04:00
Joey Hess	ff3f654cbe	make git fsck batch-capable	2013-10-22 14:49:41 -04:00
Joey Hess	3e61749d08	index file recovery	2013-10-22 12:58:04 -04:00
Joey Hess	2fb08acda5	add reflog	2013-10-21 16:41:46 -04:00
Joey Hess	18487c779f	corrupt branch resetting (but not yet reflog walking)	2013-10-21 16:20:54 -04:00
Joey Hess	fcd91be6f0	implemented removal of corrupt tracking branches Oh, git, you made this so hard. Not determining if a branch pointed to some corrupt object, that was easy, but dealing with corrupt branches using git plumbing is a PITA.	2013-10-21 15:28:06 -04:00
Joey Hess	6d8250c255	avoid redundant fsck when no changes are made	2013-10-20 19:42:17 -04:00
Joey Hess	4f871f89ba	git-recover-repository 1/2 done	2013-10-20 17:50:51 -04:00
Joey Hess	f482de1b76	remove workaround for bug in git 1.8.4r0	2013-10-20 15:23:06 -04:00
Joey Hess	edbf177628	fix lsTreeFiles to use --full-tree This makes it show the full tree, not just the current directory, and enables --full-name, which yields TopFilePaths.	2013-10-18 15:50:26 -04:00
Joey Hess	c979e0ea62	fix	2013-10-17 19:51:16 -04:00
Joey Hess	c116383b5d	fix	2013-10-17 19:49:44 -04:00
Joey Hess	81c4259a0d	fix	2013-10-17 19:41:00 -04:00
Joey Hess	16243b9972	missing import	2013-10-17 19:39:22 -04:00
Joey Hess	e93206e294	Windows: Deal with strange msysgit 1.8.4 behavior of not understanding DOS formatted paths for --git-dir and --work-tree.	2013-10-17 19:35:57 -04:00
Joey Hess	aff125ddab	try working around windows xargs problem	2013-10-17 15:56:56 -04:00
Joey Hess	d785432f78	use TopFilePath for DiffTree and LsTree	2013-10-17 14:51:19 -04:00
Joey Hess	82ff37520f	fix off-by-one	2013-10-16 12:14:14 -04:00
Joey Hess	bac078742d	Deal with git check-attr -z output format change in git 1.8.5. I have not actually tested with 1.8.5, which is not yet relesaed, but git.git commit f7cd8c50b9ab83e084e8f52653ecc8d90665eef2 changes -z to also apply to output, without regards to back-compat. (But with pretty good reasons.) New code should work with both versions, by fingerprinting for NULs and newlines.	2013-10-15 16:05:27 -04:00
Joey Hess	f1295b5141	fix windows build	2013-10-02 20:26:00 -04:00
Joey Hess	1536ebfe47	Disable receive.denyNonFastForwards when setting up a gcrypt special remote gcrypt needs to be able to fast-forward the master branch. If a git repository is set up with git init --shared --bare, it gets that set, and pushing to it will then fail, even when it's up-to-date.	2013-10-01 15:23:48 -04:00
Joey Hess	57d49a6d04	remove >=> and >=> ; use <$$> instead I forgot I had <$$> hidden away in Utility.Applicative. It allows doing the same kind of currying as does >=> and I found using it made the code more readable for me. (>=> was not used)	2013-09-27 19:58:48 -04:00
Joey Hess	e864c8d033	blind enabling gcrypt repos on rsync.net This pulls off quite a nice trick: When given a path on rsync.net, it determines if it is an encrypted git repository that the user has the key to decrypt, and merges with it. This is works even when the local repository had no idea that the gcrypt remote exists! (As previously done with local drives.) This commit sponsored by Pedro Côrte-Real	2013-09-27 16:21:56 -04:00
Joey Hess	1550759220	enabling rsync.net gcrypt repos Still need to detect when the user is trying to create a repo that already exists, and jump to the enabling code.	2013-09-26 23:47:30 -04:00
Joey Hess	735ed3b822	prep for enabling remotre gcrypt repos in webapp	2013-09-26 17:26:13 -04:00
Joey Hess	3192b059b5	add back lost check that git-annex-shell supports gcrypt	2013-09-24 17:51:12 -04:00
Joey Hess	7390f08ef9	Use cryptohash rather than SHA for hashing. This is a massive win on OSX, which doesn't have a sha256sum normally. Only use external hash commands when the file is > 1 mb, since cryptohash is quite close to them in speed. SHA is still used to calculate HMACs. I don't quite understand cryptohash's API for those. Used the following benchmark to arrive at the 1 mb number. 1 mb file: benchmarking sha256/internal mean: 13.86696 ms, lb 13.83010 ms, ub 13.93453 ms, ci 0.950 std dev: 249.3235 us, lb 162.0448 us, ub 458.1744 us, ci 0.950 found 5 outliers among 100 samples (5.0%) 4 (4.0%) high mild 1 (1.0%) high severe variance introduced by outliers: 10.415% variance is moderately inflated by outliers benchmarking sha256/external mean: 14.20670 ms, lb 14.17237 ms, ub 14.27004 ms, ci 0.950 std dev: 230.5448 us, lb 150.7310 us, ub 427.6068 us, ci 0.950 found 3 outliers among 100 samples (3.0%) 2 (2.0%) high mild 1 (1.0%) high severe 2 mb file: benchmarking sha256/internal mean: 26.44270 ms, lb 26.23701 ms, ub 26.63414 ms, ci 0.950 std dev: 1.012303 ms, lb 925.8921 us, ub 1.122267 ms, ci 0.950 variance introduced by outliers: 35.540% variance is moderately inflated by outliers benchmarking sha256/external mean: 26.84521 ms, lb 26.77644 ms, ub 26.91433 ms, ci 0.950 std dev: 347.7867 us, lb 210.6283 us, ub 571.3351 us, ci 0.950 found 6 outliers among 100 samples (6.0%) import Crypto.Hash import Data.ByteString.Lazy as L import Criterion.Main import Common testfile :: FilePath testfile = "/run/shm/data" -- on ram disk main = defaultMain [ bgroup "sha256" [ bench "internal" $ whnfIO internal , bench "external" $ whnfIO external ] ] sha256 :: L.ByteString -> Digest SHA256 sha256 = hashlazy internal :: IO String internal = show . sha256 <$> L.readFile testfile external :: IO String external = do s <- readProcess "sha256sum" [testfile] return $ fst $ separate (== ' ') s	2013-09-22 20:06:02 -04:00
Joey Hess	006cf7976f	more completely solve catKey memory leak Done using a mode witness, which ensures it's fixed everywhere. Fixing catFileKey was a bear, because git cat-file does not provide a nice way to query for the mode of a file and there is no other efficient way to do it. Oh, for libgit2.. Note that I am looking at tree objects from HEAD, rather than the index. Because I cat-file cannot show a tree object for the index. So this fix is technically incomplete. The only cases where it matters are: 1. A new large file has been directly staged in git, but not committed. 2. A file that was committed to HEAD as a symlink has been staged directly in the index. This could be fixed a lot better using libgit2.	2013-09-19 16:41:21 -04:00
Joey Hess	f26c996dc6	interface to parse git tree objects	2013-09-19 15:58:35 -04:00
Joey Hess	eb42bde19a	sync, pre-commit, indirect: Avoid unnecessarily catting non-symlink files from git, which can be so large it runs out of memory.	2013-09-19 14:48:42 -04:00
Joey Hess	e8e209f4e5	better probing for gcrypt repositories using new --check option Now can tell if a repo uses gcrypt or not, and whether it's decryptable with the current gpg keys. This closes the hole that undecryptable gcrypt repos could have before been combined into the repo in encrypted mode.	2013-09-19 12:53:24 -04:00
Joey Hess	8062f6337f	webapp: support adding existing gcrypt special remotes from removable drives When adding a removable drive, it's now detected if the drive contains a gcrypt special remote, and that's all handled nicely. This includes fetching the git-annex branch from the gcrypt repo in order to find out how to set up the special remote. Note that gcrypt repos that are not git-annex special remotes are not supported. It will attempt to detect such a gcrypt repo and refuse to use it. (But this is hard to do any may fail; see https://github.com/blake2-ppc/git-remote-gcrypt/issues/6) The problem with supporting regular gcrypt repos is that we don't know what the gcrypt.participants setting is intended to be for the repo. So even if we can decrypt it, if we push changes to it they might not be visible to other participants. Anyway, encrypted sneakernet (or mailnet) is now fully possible with the git-annex assistant! Assuming that the gpg key distribution is handled somehow, which the assistant doesn't yet help with. This commit was sponsored by Navishkar Rao.	2013-09-18 15:55:31 -04:00
Joey Hess	6c35038643	gcrypt: Ensure that signing key is set to one of the participants keys. Otherwise gcrypt will fail to pull, since it requires this to be the case. This needs a patched gcrypt, which is in my forked version.	2013-09-17 16:06:29 -04:00
Joey Hess	ab9dd6d8a0	sync: Fix bug that caused direct mode mappings to not be updated when merging files into the tree on Windows.	2013-09-13 13:49:28 -04:00
Joey Hess	7c1a9cdeb9	partially complete gcrypt remote (local send done; rest not) This is a git-remote-gcrypt encrypted special remote. Only sending files in to the remote works, and only for local repositories. Most of the work so far has involved making initremote work. A particular problem is that remote setup in this case needs to generate its own uuid, derivied from the gcrypt-id. That required some larger changes in the code to support. For ssh remotes, this will probably just reuse Remote.Rsync's code, so should be easy enough. And for downloading from a web remote, I will need to factor out the part of Remote.Git that does that. One particular thing that will need work is supporting hot-swapping a local gcrypt remote. I think it needs to store the gcrypt-id in the git config of the local remote, so that it can check it every time, and compare with the cached annex-uuid for the remote. If there is a mismatch, it can change both the cached annex-uuid and the gcrypt-id. That should work, and I laid some groundwork for it by already reading the remote's config when it's local. (Also needed for other reasons.) This commit was sponsored by Daniel Callahan.	2013-09-07 18:38:00 -04:00
Joey Hess	dad34e0ea8	add getParticipantList Note that it needs to look at global git config, since git-remote-gcrypt will see any setting there as a fallback.	2013-09-05 16:34:13 -04:00
Joey Hess	a48a4e2f8a	automatically derive an annex-uuid from a gcrypt-uuids	2013-09-05 16:02:39 -04:00
Joey Hess	6cdac3a003	sync, assistant: Force push of the git-annex branch. Necessary to ensure it gets pushed to remotes after being rewritten by forget. See inline rationalles for why I think this is safe!	2013-08-29 14:27:53 -04:00
guilhem	f754779c02	Unused: bugfix Detect staged files that are not in the working tree.	2013-08-26 13:50:09 -04:00
guilhem	f15fda60ed	Speed up the 'unused' command. Instead of populating the second-level Bloom filter with every key referenced in every Git reference, consider only those which differ from what's referenced in the index. Incidentaly, unlike with its old behavior, staged modifications/deletion/... will now be detected by 'unused'. Credits to joeyh for the algorithm. :-)	2013-08-25 21:02:13 -04:00
guilhem	b4a32c7506	Unescape characters in 'file://...' URIs. That allows, in Git remotes, such URIs to contain spaces or UTF-8 characters. Closes http://git-annex.branchable.com/bugs/Unable_to_use_remotes_with_space_in_the_path/ .	2013-08-22 11:33:16 -04:00
Joey Hess	6fd2935a5a	unused: Pay attention to symlinks that are not yet staged in the index.	2013-08-22 10:20:03 -04:00
Joey Hess	a3224ce35b	avoid more build warnings on Windows	2013-08-04 14:05:36 -04:00
Joey Hess	b191d5c595	gitignore support for the assistant and watcher Requires git 1.8.4 or newer. When it's installed, a background git check-ignore process is run, and used to efficiently check ignores whenever a new file is added. Thanks to Adam Spiers, for getting the necessary support into git for this. A complication is what to do about files that are gitignored but have been checked into git anyway. git commands assume the ignore has been overridden in this case, and not need any more overriding to commit a changed version. However, for the assistant to do the same, it would have to run git ls-files to check if the ignored file is in git. This is somewhat expensive. Or it could use the running git-cat-file process to query the file that way, but that requires transferring the whole file content over a pipe, so it can be quite expensive too, for files that are not git-annex symlinks. Now imagine if the user knows that a file or directory tree will be getting frequent changes, and doesn't want the assistant to sync it, so gitignores it. The assistant could overload the system with repeated ls-files checks! So, I've decided that the assistant will not automatically commit changes to files that are gitignored. This is a tradeoff. Hopefully it won't be a problem to adjust .gitignore settings to not ignore files you want the assistant to autocommit, or to manually git annex add files that are listed in .gitignore. (This could be revisited if git-annex gets access to an interface to check the content of the index w/o forking a git command. This could be libgit2, or perhaps a separate git cat-file --batch-check process, so it wouldn't need to ship over the whole file content.) This commit was sponsored by Francois Marier. Thanks!	2013-08-02 20:37:03 -04:00
Joey Hess	672cfc3923	better git version checking	2013-08-02 18:32:26 -04:00
Joey Hess	93f2371e09	get rid of __WINDOWS__, use mingw32_HOST_OS The latter is harder for me to remember, but avoids build failures in code used by the configure program.	2013-08-02 12:27:32 -04:00
Joey Hess	d16114d024	Slow and ugly work around for bug #718517 in git, which broke git-cat-file --batch for filenames containing spaces. This runs git-cat-file in non-batch mode for all files with spaces. If a directory tree has a lot of them, and is in direct mode, even "git annex add" when there are few new files will need a lot of forks! The only reason buffering the whole file content to get the sha is not a memory leak is that git-annex only ever uses this on symlinks. This needs to be reverted as soon as a fix is available in git!	2013-08-01 17:30:47 -04:00
Joey Hess	ebd778c519	Escape ':' in file/directory names to avoid it being treated as a pathspec by some git commands A git pathspec is a filename, except when it starts with ':', it's taken to refer to a branch, etc. Rather than special case ':', any filename starting with anything unusual is prefixed with "./" This could have been a real mess to deal with, but luckily SafeCommand is already extensively used and so we know at the type level the difference between parameters that are files, and parameters that are command options. Testing did show that Git.Queue was not using SafeCommand on filenames fed to xargs. (Filenames starting with '-' worked before only because -- was used to separate filenames from options when calling eg git add.) The test suite now passes with filenames starting with ':'. However, I did not keep that change to it, because such filenames are probably not legal on windows, and I have enough ugly windows ifdefs in there as it is. This commit was sponsored by Otavio Salvador. Thanks!	2013-08-01 15:15:49 -04:00
Joey Hess	7e66d260ea	importfeed: git-annex becomes a podcatcher in 150 LOC	2013-07-28 16:55:42 -04:00
Joey Hess	4e2fab90d5	avoid newline translation when writing to git hash-object They're like mushrooms, just keep popping up.	2013-06-18 15:08:51 -04:00
Joey Hess	02c51266ec	missed another hash-object call, disable filtering there too	2013-06-18 14:48:15 -04:00
Joey Hess	a1f8771d2b	avoid filtering object being hashed This avoids newline conversion being done on it in Windows.	2013-06-18 13:42:16 -04:00
Joey Hess	077ca355d0	Revert "flush stream after each write to update-index, to possibly avoid buffering issues on Windows" Didn't help.	2013-06-14 14:34:24 -04:00
Joey Hess	b97a9ea786	flush stream after each write to update-index, to possibly avoid buffering issues on Windows	2013-06-14 14:25:17 -04:00
Joey Hess	91c4dcfc69	Can now restart certain long-running git processes if they crash, and continue working. Fuzz tests have shown that git cat-file --batch sometimes stops running. It's not yet known why (no error message; repo seems ok). But this is something we can deal with in the CoProcess framework, since all 3 types of long-running git processes should be restartable if they fail. Note that, as implemented, only IO errors are caught. So an error thrown by the reveiver, when it sees something that is not valid output from git cat-file (etc) will not cause a restart. I don't want it to retry if git commands change their output or are just outputting garbage. This does mean that if the command did a partial output and crashed in the middle, it would still not be restarted. There is currently no guard against restarting a command repeatedly, if, for example, it crashes repeatedly on startup.	2013-05-31 12:42:13 -04:00
Joey Hess	a600471a23	include HEAD in CanPush shas	2013-05-21 20:04:38 -04:00
Joey Hess	08c03b2af3	XMPP: Avoid redundant and unncessary pushes. Note that this breaks compatibility with previous versions of git-annex, which will refuse to accept any XMPP pushes from this version.	2013-05-21 18:24:29 -04:00
Joey Hess	25dba9da24	fix windows build	2013-05-21 13:07:43 -04:00
Joey Hess	369fb69fe7	fix warning	2013-05-20 18:01:27 -04:00
Joey Hess	25cb9a48da	fix the day's Windows permissions damage	2013-05-14 20:15:14 -04:00
Joey Hess	959536ef03	fill in a few windows stubs	2013-05-14 16:32:03 -05:00
Joey Hess	306a36260f	typo	2013-05-14 15:44:49 -04:00
Joey Hess	7b92ffc3a1	more leaning toothpick fixes	2013-05-14 15:43:23 -04:00
Joey Hess	dc66b1f27d	Merge branch 'master' into windows Conflicts: Annex/Environment.hs Build/Configure.hs Git/Construct.hs Utility/FileMode.hs	2013-05-14 15:37:24 -04:00
Joey Hess	81cded2b9d	detect local urls on DOS	2013-05-14 15:27:39 -04:00
Joey Hess	03e8594369	fix the day's windows permissions damage	2013-05-12 19:09:48 -04:00
Joey Hess	73d2f8b280	deal with git using / internally, even on DOS	2013-05-12 17:29:49 -05:00
Joey Hess	06551ad86b	set raw mode for git check-attr	2013-05-12 16:37:06 -05:00
Joey Hess	abe8d549df	fix permission damage (thanks, Windows)	2013-05-11 23:54:25 -04:00
Joey Hess	5e1458152f	refactoring	2013-05-11 23:11:56 -04:00
Joey Hess	1e2ddcb68a	use setCurrentDirectory On POSIX, this just calls changeWorkingDirectory.	2013-05-11 19:14:30 -04:00
Joey Hess	18bdff3fae	clean up from windows porting	2013-05-11 18:23:41 -04:00
Joey Hess	dc22549ab3	git annex init works on Windows! git hash-object and cat-file both only use \n at ends of line, even on Windows.	2013-05-11 16:02:35 -05:00
Joey Hess	c45a723876	catFile expects no \r, even on Windows	2013-05-11 15:32:34 -05:00
Joey Hess	3c7e30a295	git-annex now builds on Windows (doesn't work)	2013-05-11 15:03:00 -05:00
Joey Hess	763cbda14f	fixup #if 0 stubs to use #ifndef mingw32_HOST_OS That's needed in files used to build the configure program. For the other files, I'm keeping my __WINDOWS__ define, as I find that much easier to type. I may search and replace it to use the mingw32_HOST_OS thing later.	2013-05-10 16:57:21 -05:00
Joey Hess	6c74a42cc6	stub out POSIX stuff	2013-05-10 16:29:59 -05:00
Joey Hess	8a2d1988d3	expose Control.Monad.join I think I've been looking for that function for some time. Ie, I remember wanting to collapse Just Nothing to Nothing.	2013-04-22 20:24:53 -04:00
Joey Hess	a5dded0401	assistant: The ConfigMonitor left one zombie behind each time it checked for changes, now fixed.	2013-03-18 22:09:51 -04:00
Joey Hess	2c05c85437	webapp: DTRT when told to create a git repo that already exists.	2013-03-12 08:09:31 -04:00
Joey Hess	ea672b7c77	Bugfix: git annex add, when ran without any file or directory specified, should add files in the current directory, but not act on unlocked files elsewhere in the tree.	2013-03-07 19:03:06 -04:00
Joey Hess	82f639c70f	fix type introduced in `0c13d306` Doubled command name broke show-ref, which broke git annex sync. Re-read all of `0c13d306` to check for other problems.	2013-03-07 11:09:30 -04:00
Joey Hess	0c13d3065e	git subcommand cleanup Pass subcommand as a regular param, which allows passing git parameters like -c before it. This was already done in the pipeing set of functions, but not the command running set.	2013-03-03 13:39:07 -04:00
Joey Hess	4d33423067	assistant: Avoid noise in logs from git commit about typechanged files in direct mode repositories.	2013-03-01 16:21:29 -04:00
Joey Hess	8d9c2afd89	Additional GIT_DIR support bugfixes. May actually work now. Two fixes. First, and most importantly, relax the isLinkToAnnex check to only look for /annex/objects/, not [^\|/].git/annex/objects. If GIT_DIR is used with a detached work tree, the git directory is not necessarily named .git. There are important caveats with doing that at all, since git-annex will make symlinks that point at GIT_DIR, which means that the relative path between GIT_DIR and GIT_WORK_TREE needs to remain stable across all clones of the repository. ---- The other fix is just fixing crazy and wrong code that, when GIT_DIR is set, expects to still find a git repository in the path below the work tree, and uses some of its configuration, and some of GIT_DIR. What was I thinking, and why can't I seem to get this code right?	2013-02-23 12:41:22 -04:00
Joey Hess	52902c0945	make adding modified files work on crippled filesystems	2013-02-20 14:12:55 -04:00
Joey Hess	547d7745fb	pre-commit: Update direct mode mappings. Making the pre-commit hook look at git diff-index to find changed direct mode files and update the mappings works pretty well. One case where it does not work is when a file is git annex added, and then git rmed, and then this is committed. That's a no-op commit, so the hook probably doesn't even run, and it certianly never notices that the file was deleted, so the mapping will still have the original filename in it. For this and other reasons, it's important that the mappings still be treated as possibly inconsistent. Also, the assistant now allows the pre-commit hook to run when in direct mode, so the mappings also get updated there.	2013-02-06 12:44:19 -04:00
Joey Hess	5cd152b8a9	annex.autocommit New setting, can be used to disable autocommit of changed files by the assistant, while it still does data syncing and other tasks. Also wired into webapp UI	2013-01-27 22:43:05 +11:00
Joey Hess	0214e0fb17	union merge bugfix Union merges involving two or more repositories could sometimes result in data from one repository getting lost. This could result in the location log data becoming wrong, and fsck being needed to fix it. NB: I audited for any other occurrences of this problem. There are other places than union merge where multiple changes are fed into update-index in a stream, but they all involve working copy files being staged, or their deletion being staged, and in this case it's fine for the later changes to override the earlier ones.	2013-01-16 21:31:06 -04:00
Joey Hess	95db595e91	make startup scan for deleted files work in direct mode git add --update cannot be used, because it'll stage typechanged direct mode files. Intead, use ls-files to find deleted files, and stage them ourselves. It seems that no commit was made before when the scan staged deleted files. (Probably masked since if files were added, a commit happened then..) Now that I'm doing the staging, I was also able to fix that bug.	2012-12-24 14:24:13 -04:00
Joey Hess	92bd889e61	unused	2012-12-18 17:15:11 -04:00
Joey Hess	53dbcce645	direct mode merging works! Automatic merge resoltion code needs to be fixed to preserve objects from direct mode files.	2012-12-18 15:04:44 -04:00
Joey Hess	ffdd08fd2e	Merge branch 'master' into desymlink	2012-12-13 00:46:10 -04:00
Joey Hess	0d50a6105b	whitespace fixes	2012-12-13 00:45:27 -04:00
Joey Hess	b080a58b76	Merge branch 'master' into desymlink Conflicts: Annex/CatFile.hs Annex/Content.hs Git/LsFiles.hs Git/LsTree.hs	2012-12-13 00:29:06 -04:00
Joey Hess	f87a781aa6	finished where indentation changes	2012-12-13 00:24:19 -04:00
Joey Hess	e7b8cb0063	direct mode committing	2012-12-12 19:20:38 -04:00
Joey Hess	b0c5cbfde2	add notStaged	2012-12-12 13:25:26 -04:00
Joey Hess	e8a74e9493	where indentation	2012-12-12 13:20:58 -04:00
Joey Hess	0714b0bd03	remove unused function	2012-12-12 13:17:41 -04:00
Joey Hess	715c67a3e5	git diff-tree interface	2012-12-10 14:36:57 -04:00
Joey Hess	444e984727	don't treat foo::bar as a ssh url It's a git-remote-helper location, and will be stored as just an url.	2012-11-09 13:50:23 -04:00
Joey Hess	39e82b1af8	webapp: Generate better git remote names. Wrote a better git remote name sanitizer. Git blows up on lots of weird stuff, especially if it starts the remote name, but I managed to get some common punctuation working.	2012-10-31 15:26:19 -04:00
Joey Hess	7ee0ffaeb9	Use USER and HOME environment when set, and only fall back to getpwent, which doesn't work with LDAP or NIS.	2012-10-25 18:17:54 -04:00
Joey Hess	c7c2015435	add ConfigMonitor thread Monitors git-annex branch for changes, which are noticed by the Merger thread whenever the branch ref is changed (either due to an incoming push, or a local change), and refreshes cached config values for modified config files. Rate limited to run no more often than once per minute. This is important because frequent git-annex branch changes happen when files are being added, or transferred, etc. A primary use case is that, when preferred content changes are made, and get pushed to remotes, the remotes start honoring those settings. Other use cases include propigating repository description and trust changes to remotes, and learning when a remote has added a new special remote, so the webapp can present the GUI to enable that special remote locally. Also added a uuid.log cache. All other config files already had caches.	2012-10-20 16:43:35 -04:00
Joey Hess	b281584422	remove some more !!	2012-10-20 16:21:43 -04:00
Joey Hess	e6b1f36e1d	Fix handling of GIT_DIR when it refers to a git submodule. The old code was just wrong in taking fromPath of GIT_DIR -- that made an localUnknown location with the GIT_DIR in it, which only worked by accident, and failed in submodules.	2012-10-17 14:28:05 -04:00
Joey Hess	919fec85cd	better fix for zombie problem, which turns out to be a zombie ssh started by rsync When rsyncProgress pipes rsync's stdout, this turns out to cause a ssh process started by rsync to be left behind as a zombie. I don't know why, but my recent zombie reaping cleanup was correct, it's just that this other zombie, that's not directly started by git-annex, was no longer reaped due to changes in the cleanup. Make rsyncProgress reap the zombie started by rsync, as a workaround. FWIW, the process tree looks like this. It seems like the rsync child is for some reason starting but not waiting on this extra ssh process. Ssh connection caching may be involved -- disabling it seemed to change the shape of the tree, but did not eliminate the zombie. 9378 pts/14 S+ 0:00 \| \_ rsync -p --progress --inplace -4 -e 'ssh' '-S' ... 9379 pts/14 S+ 0:00 \| \| \_ ssh ... 9380 pts/14 S+ 0:00 \| \| \_ rsync -p --progress --inplace -4 -e 'ssh' '-S' ... 9381 pts/14 Z+ 0:00 \| \_ [ssh] <defunct>	2012-10-17 00:47:52 -04:00
Joey Hess	4f95cc8ef1	ensure that gitdir is absolute calcGitLink turns out to need it to be absolute, and it normally is, but not if it's read from a .git file in a submodule, or perhaps from GIT_DIR. I should look into dropping this invariant.	2012-10-16 16:25:45 -04:00
Joey Hess	8fec62d299	A relative core.worktree is relative to the gitdir. Now that this is handled correctly, git-annex can be used in git submodules. Also, fixed infelicity where Git.CurrentRepo and Git.Config.updateLocation were both dealing with core.worktree. Now updateLocation handles it for Local as well as for LocalUnknown repos.	2012-10-16 00:08:39 -04:00
Joey Hess	148d9f0088	simplify	2012-10-15 23:12:50 -04:00
Joey Hess	429b77844e	drop old config when rereading repo config Before, the new config was merged into the old, so if eg, a remote was renamed, it would have both the new and the old remote name.	2012-10-14 17:23:40 -04:00
Joey Hess	06831e7754	fix slightly incorrect comment	2012-10-12 12:20:45 -04:00
Joey Hess	e05c21cb73	Fix a crash when merging files in the git-annex branch that contain invalid utf8. The crash actually occurred when writing out the file, which was done to a handle that had not had fileSystemEncoding applied to it.	2012-10-12 12:19:30 -04:00
Joey Hess	47314c0fad	fix last zombies in the assistant Made Git.LsFiles return cleanup actions, and everything waits on processes now, except of course for Seek.	2012-10-04 19:56:32 -04:00
Joey Hess	f7f1d25df8	bugfix	2012-10-04 19:41:58 -04:00
Joey Hess	de3ea4adb6	remove now-unnecessary manual reaps	2012-10-04 18:58:57 -04:00
Joey Hess	5594bf0643	more zombie fighting I'm down to 9 places in the code that can produce unwaited for zombies. Most of these are pretty innocuous, at least for now, are only used in short-running commands, or commands that run a set of actions and explicitly reap zombies after each one. The one from Annex.Branch.files could be trouble later, since both Command.Fsck and Command.Unused can trigger it, and the assistant will be doing those eventally. Ditto the one in Git.LsTree.lsTree, which Command.Unused uses. The only ones currently affecting the assistant though, are in Git.LsFiles. Several threads use several of those. (And yeah, using pipes or ResourceT would be a less ad-hoc approach, but I don't really feel like ripping my entire code base apart right now to change a foundation monad. Maybe one of these days..)	2012-10-04 18:47:31 -04:00
Joey Hess	f67b54e5e3	make a pipeReadStrict, that properly waits on the process Nearly everything that's reading from git is operating on a small amount of output and has been switched to use that. Only pipeNullSplit stuff continues using the lazy version that yields zombies.	2012-10-04 18:04:09 -04:00
Joey Hess	582316f66f	avoid webapp crash on startup when there's no ~/.gitconfig git config --list --global exits nonzero when there's no global config	2012-09-23 12:43:14 -04:00
Joey Hess	e8188ea611	flip catchDefaultIO	2012-09-17 00:18:07 -04:00
Joey Hess	ba744c84a4	better name for fallback sync refs Don't expose these as branches in refs/heads/. Instead hide them away in refs/synced/ where only show-ref will find them. Make unused only look at branches and tags, not these other things, so it won't care if some stale sync ref used to use a file. This means they don't need to be deleted, which could have led to an incoming sync being missed.	2012-09-16 23:09:08 -04:00
Joey Hess	6cddda4143	make the merger merge any equivilant sync branch into the current branch Not just synced/master, but synced/UUID/master, for example	2012-09-16 19:41:26 -04:00
Joey Hess	da63b7e96c	Support repositories created with --separate-git-dir. Closes: #684405	2012-09-15 22:40:04 -04:00
Joey Hess	ca45cea113	Revert "add catFileIndex" This interface is not a good idea, because a running git cat-file --batch does not notice when existing files in the index are changed.	2012-09-15 18:30:53 -04:00
Joey Hess	0b63ee6cd5	run git coprocesses with gitEnv	2012-09-15 17:43:37 -04:00
Joey Hess	e1baf48d88	add catFileIndex	2012-09-15 17:06:10 -04:00
Joey Hess	c9b3b8829d	thread safe git-annex index file use	2012-08-24 20:50:39 -04:00
Joey Hess	fb4b19deed	make the webapp honor the web.browser git config	2012-08-08 13:15:35 -04:00
Joey Hess	5ae1f75a39	handle case of adding populated drive to just created repo The just created repo has no master branch commits yet. This is now handled, merging in the master branch from the populated drive.	2012-08-05 16:35:30 -04:00
Joey Hess	34fc0d358e	fix crashes when run in a git repo that has been initted but has no master branch yet	2012-08-05 15:53:47 -04:00

... 2 3 4 5 6 ...

479 commits