git-annex

Author	SHA1	Message	Date
Joey Hess	d7c93b8913	fully support core.symlinks=false in all relevant symlink handling code Refactored annex link code into nice clean new library. Audited and dealt with calls to createSymbolicLink. Remaining calls are all safe, because: Annex/Link.hs: ( liftIO $ createSymbolicLink linktarget file only when core.symlinks=true Assistant/WebApp/Configurators/Local.hs: createSymbolicLink link link test if symlinks can be made Command/Fix.hs: liftIO $ createSymbolicLink link file command only works in indirect mode Command/FromKey.hs: liftIO $ createSymbolicLink link file command only works in indirect mode Command/Indirect.hs: liftIO $ createSymbolicLink l f refuses to run if core.symlinks=false Init.hs: createSymbolicLink f f2 test if symlinks can be made Remote/Directory.hs: go [file] = catchBoolIO $ createSymbolicLink file f >> return True fast key linking; catches failure to make symlink and falls back to copy Remote/Git.hs: liftIO $ catchBoolIO $ createSymbolicLink loc file >> return True ditto Upgrade/V1.hs: liftIO $ createSymbolicLink link f v1 repos could not be on a filesystem w/o symlinks Audited and dealt with calls to readSymbolicLink. Remaining calls are all safe, because: Annex/Link.hs: ( liftIO $ catchMaybeIO $ readSymbolicLink file only when core.symlinks=true Assistant/Threads/Watcher.hs: ifM ((==) (Just link) <$> liftIO (catchMaybeIO $ readSymbolicLink file)) code that fixes real symlinks when inotify sees them It's ok to not fix psdueo-symlinks. Assistant/Threads/Watcher.hs: mlink <- liftIO (catchMaybeIO $ readSymbolicLink file) ditto Command/Fix.hs: stopUnless ((/=) (Just link) <$> liftIO (catchMaybeIO $ readSymbolicLink file)) $ do command only works in indirect mode Upgrade/V1.hs: getsymlink = takeFileName <$> readSymbolicLink file v1 repos could not be on a filesystem w/o symlinks Audited and dealt with calls to isSymbolicLink. (Typically used with getSymbolicLinkStatus, but that is just used because getFileStatus is not as robust; it also works on pseudolinks.) Remaining calls are all safe, because: Assistant/Threads/SanityChecker.hs: \| isSymbolicLink s -> addsymlink file ms only handles staging of symlinks that were somehow not staged (might need to be updated to support pseudolinks, but this is only a belt-and-suspenders check anyway, and I've never seen the code run) Command/Add.hs: if isSymbolicLink s \|\| not (isRegularFile s) avoids adding symlinks to the annex, so not relevant Command/Indirect.hs: \| isSymbolicLink s -> void $ flip whenAnnexed f $ only allowed on systems that support symlinks Command/Indirect.hs: whenM (liftIO $ not . isSymbolicLink <$> getSymbolicLinkStatus f) $ do ditto Seek.hs:notSymlink f = liftIO $ not . isSymbolicLink <$> getSymbolicLinkStatus f used to find unlocked files, only relevant in indirect mode Utility/FSEvents.hs: \| Files.isSymbolicLink s = runhook addSymlinkHook $ Just s Utility/FSEvents.hs: \| Files.isSymbolicLink s -> Utility/INotify.hs: \| Files.isSymbolicLink s -> Utility/INotify.hs: checkfiletype Files.isSymbolicLink addSymlinkHook f Utility/Kqueue.hs: \| Files.isSymbolicLink s = callhook addSymlinkHook (Just s) change all above are lower-level, not relevant Audited and dealt with calls to isSymLink. Remaining calls are all safe, because: Annex/Direct.hs: \| isSymLink (getmode item) = This is looking at git diff-tree objects, not files on disk Command/Unused.hs: \| isSymLink (LsTree.mode l) = do This is looking at git ls-tree, not file on disk Utility/FileMode.hs:isSymLink :: FileMode -> Bool Utility/FileMode.hs:isSymLink = checkMode symbolicLinkMode low-level Done!!	2013-02-17 16:43:14 -04:00
Joey Hess	397082013a	proper fix for dropunused Now getKeysPresent checks that the key's content, not only its directory, exists. In direct mode, the inode cache file is used as a standin for the content. removeAnnex always removes the inode cache file, and drop and move --from always call removeAnnex, even if the object does not seem to be inAnnex, to ensure it's always deleted.	2013-02-15 17:58:49 -04:00
Joey Hess	5a8fb26d0a	Revert "Clean up direct mode cache and mapping info when dropping keys." This reverts commit `57780cb3a4`. This was buggy, it caused the direct mode cache to be lost when dropping keys, so when the file is gotten back, it's stored in indirect mode. Note to self: Do not attempt bug fixes at 6 am!	2013-02-15 16:37:57 -04:00
Joey Hess	2e49a7e729	don't allow setting indirect mode on a crippled filesystem	2013-02-15 14:17:31 -04:00
Joey Hess	5e6a60c17d	migrate, rekey: copy rather than hard linking in crippled filesystem mode	2013-02-15 13:51:50 -04:00
Joey Hess	7ce30b534f	add: Improved detection of files that are modified while being added. In indirect mode, now checks the inode cache to detect changes to a file. Note that a file can still be changed if a process has it open for write, after landing in the annex. In direct mode, some checking of the inode cache was done before, but from a much later point, so fewer modifications could be detected. Now it's as good as indirect mode. On crippled filesystems, no lock down is done before starting to add a file, so checking the inode cache is the only protection we have.	2013-02-14 16:54:36 -04:00
Joey Hess	a52f8f382b	split out Utility.InodeCache	2013-02-14 16:17:40 -04:00
Joey Hess	47477b2807	crippled filesystem support, probing and initial support git annex init probes for crippled filesystems, and sets direct mode, as well as `annex.crippledfilesystem`. Avoid manipulating permissions of files on crippled filesystems. That would likely cause an exception to be thrown. Very basic support in Command.Add for cripped filesystems; avoids the lock down entirely since doing it needs both permissions and hard links. Will make this better soon.	2013-02-14 14:15:26 -04:00
Joey Hess	43b4b7d43a	can now build Android targeted binary Various things that don't work on Android are just ifdefed out. * the webapp (needs template haskell for arm) * --include and --exclude globbing (needs libpcre, which is not ported; probably I'll make it use the pure haskell glob library instead) * annex.diskreserve checking (missing sys/statvfs.h) * timestamp preservation support (yawn) * S3 * WebDAV * XMPP The resulting 17mb binary has been tested on Android, and it is able to, at least, print its usage message.	2013-02-10 15:48:38 -04:00
Joey Hess	57780cb3a4	Clean up direct mode cache and mapping info when dropping keys. These files were left behind, and made getKeysPresent find keys that were not present. It would be expensive to make getKeysPresent check that the actual key files are present (it just lists the directories). But that's not needed if we just clean up the stale cache and mapping files. To handle systems that were in direct mode and got switched back with stale direct mode files, made cleanObjectLoc remove all files in the key's directory. git annex unused will still list keys that are gone but for which the stale direct mode files exists. To deal with that, made dropunused remove the key's directory even if the key does not seem to be present.	2013-02-07 08:28:40 -04:00
Joey Hess	b1de99c1d4	uninit, unannex --fast: If hard link creation fails, fall back to slow mode.	2013-02-06 14:02:18 -04:00
Joey Hess	547d7745fb	pre-commit: Update direct mode mappings. Making the pre-commit hook look at git diff-index to find changed direct mode files and update the mappings works pretty well. One case where it does not work is when a file is git annex added, and then git rmed, and then this is committed. That's a no-op commit, so the hook probably doesn't even run, and it certianly never notices that the file was deleted, so the mapping will still have the original filename in it. For this and other reasons, it's important that the mappings still be treated as possibly inconsistent. Also, the assistant now allows the pre-commit hook to run when in direct mode, so the mappings also get updated there.	2013-02-06 12:44:19 -04:00
Joey Hess	39d5f3f11c	avoid queueing rm of no files	2013-02-05 15:11:05 -04:00
Joey Hess	b19c2e6122	assistant: Fix location log when adding new file in direct mode.	2013-02-05 13:41:48 -04:00
Joey Hess	76ddf9b6d3	webapp: Now allows restarting any threads that crash.	2013-01-26 17:09:33 +11:00
Joey Hess	1713ed95f7	use async to track and manage threads	2013-01-26 14:14:32 +11:00
Joey Hess	672f8b5b83	fsck: Detect and fix consistency errors in direct mode mapping files.	2013-01-19 14:11:23 -04:00
Joey Hess	49f4ba297c	sync: Automatic merge conflict resolution now stages deleted files.	2013-01-17 21:19:00 -04:00
Joey Hess	7272179979	avoid running pre-commit hook in direct mode The code that handles committing unlocked files in indirect mode did something unexpected and data lossy.	2013-01-17 14:11:01 -04:00
Joey Hess	52e6eeaf06	drop: fix misleading message	2013-01-16 21:44:42 -04:00
Joey Hess	d7ca6fb856	webapp: Now always logs to .git/annex/daemon.log It used to not log to daemon.log when a repository was first created, and when starting the webapp. Now both do. Redirecting stdout and stderr to the log is tricky when starting the webapp, because the web browser may want to communicate with the user. (Either a console web browser, or web.browser = echo) This is handled by restoring the original fds when running the browser.	2013-01-15 13:34:59 -04:00
Joey Hess	f51ad2a00c	assistant: Avoid committer crashing if a file is deleted at the wrong instant.	2013-01-14 15:02:13 -04:00
Joey Hess	18a6935e42	safe recv-key in direct mode Checks the key's size and checksum. This is sorta expensive, but it avoids needing to add another round-trip to the protocol.	2013-01-11 16:03:45 -04:00
Joey Hess	2e11a6013b	drop: Suggest using git annex move when numcopies prevents dropping a file.	2013-01-09 18:53:59 -04:00
Joey Hess	1bc49b7158	Special remotes now all rollback storage of keys that get modified during the transfer, which can happen in direct mode.	2013-01-09 18:42:29 -04:00
Joey Hess	0da2507fd6	improve direct mode fsck An earlier commit (mislabeled) made direct mode fsck check file checksums. While it's expected for files to change at any time in direct mode, and so fsck cannot complain every time there's a checksum mismatch, it is possible for it to detect when a file does not seem to have changed, then check its checksum, and so detect disk corruption or other problems. This commit improves that, by checking a second time, if the checksum fails, that the file is still not modified, before taking action. This way, a direct mode file can be modified while being fscked.	2013-01-08 15:07:00 -04:00
Joey Hess	174867b846	blog for yesterday	2013-01-08 12:41:09 -04:00
Joey Hess	fc63e3b660	fix a stupid typo that made fsck loop when it found bad content Thank goodness for test suites!	2013-01-07 13:01:53 -04:00
Joey Hess	248090064d	addurl in direct mode	2013-01-06 17:34:44 -04:00
Joey Hess	858ad6783b	add works in direct mode Also, changed sync to no longer automatically add files in direct mode. That was only necessary before because add didn't work.	2013-01-06 17:24:22 -04:00
Joey Hess	f12202f771	optimize pre-commit in direct mode	2013-01-06 16:56:55 -04:00
Joey Hess	9d3e571f77	support fsck in direct mode	2013-01-06 15:42:49 -04:00
Joey Hess	b68eee625f	More commands work in direct mode repositories: find, whereis, move, copy, drop, log. These started working, for free, once lookupFile supported direct mode. yay!!	2013-01-05 17:17:04 -04:00
Joey Hess	aedfcde969	guard readSymbolicLink throws an exception if the file is not a symlink	2013-01-05 16:07:27 -04:00
Joey Hess	20fafc6a2d	avoid pre-commit in direct mode It was a no-op until my recent change that made lookupFile work in direct mode.	2013-01-05 16:06:20 -04:00
Joey Hess	15ecce2bfd	squelch warning	2013-01-05 15:09:43 -04:00
Joey Hess	bf1981f60e	committer: Fix a file handle leak.	2013-01-05 13:42:31 -04:00
Joey Hess	4008590c68	type based git config handling for remotes Still a couple of places that use git config ad-hoc, but this is most of it done.	2013-01-01 13:58:14 -04:00
Joey Hess	7f7c31df1c	type based git config handling Now there's a Config type, that's extracted from the git config at startup. Note that laziness means that individual config values are only looked up and parsed on demand, and so we get implicit memoization for all of them. So this is not only prettier and more type safe, it optimises several places that didn't have explicit memoization before. As well as getting rid of the ugly explicit memoization code. Not yet done for annex.<remote>.* configuration settings.	2012-12-29 23:10:18 -04:00
Joey Hess	92287f6905	ensure that direct mode file is not modified while generating its key	2012-12-29 15:32:29 -04:00
Joey Hess	e872c3f648	convert notBareRepo to a CommandCheck This avoids some small overhead by only running the check once per command; it also ensures that, even if the command doesn't find anything to run on, it still fails to run when in a bare repo.	2012-12-29 14:45:19 -04:00
Joey Hess	2ce736ac50	block all commands that don't work in direct mode I left status working in direct mode, although it doesn't show correct stats for known annex keys.	2012-12-29 14:28:19 -04:00
Joey Hess	8a8380f1b7	use sync command merge engine in assistant To handle direct mode merging.	2012-12-25 14:10:07 -04:00
Joey Hess	c3a35eb857	add a guard against using git annex add in direct mode repo Currently, it deletes files when run in one, so until I get a chance to fix it, block foot shooting.	2012-12-24 14:54:36 -04:00
Joey Hess	c6d2bbe402	assistant adding of files in direct mode	2012-12-24 13:37:29 -04:00
Joey Hess	e71f85645e	handle shasum's leading \ in checksum with certian unsual filenames Bugfix: Remove leading \ from checksums output by shasum commands, when the filename contains \ or a newline. Closes: #696384 fsck: Still accept checksums with a leading \ as valid, now that above bug is fixed. * migrate: Remove leading \ in checksums	2012-12-20 17:07:10 -04:00
Joey Hess	ddb0adb998	more quickcheck fun	2012-12-19 16:36:19 -04:00
Joey Hess	915cd7f676	comment	2012-12-19 12:50:24 -04:00
Joey Hess	05ec4587dd	partial and incomplete automatic merging in direct mode Handles our file right, but not theirs.	2012-12-18 17:15:16 -04:00
Joey Hess	53dbcce645	direct mode merging works! Automatic merge resoltion code needs to be fixed to preserve objects from direct mode files.	2012-12-18 15:04:44 -04:00
Joey Hess	d62a58b9c8	Merge branch 'master' into desymlink	2012-12-18 12:36:29 -04:00
Joey Hess	77931c1e92	vicfg: Quote filename. Closes: #696193	2012-12-18 12:19:24 -04:00
Joey Hess	44402159dd	add ok's	2012-12-13 16:02:10 -04:00
Joey Hess	2cfda59174	reorder for better display	2012-12-13 15:58:38 -04:00
Joey Hess	5df3c66a85	added direct and indirect commands	2012-12-13 15:44:56 -04:00
Joey Hess	cf129c2545	show direct/indirect mode	2012-12-13 13:48:07 -04:00
Joey Hess	ffdd08fd2e	Merge branch 'master' into desymlink	2012-12-13 00:46:10 -04:00
Joey Hess	0d50a6105b	whitespace fixes	2012-12-13 00:45:27 -04:00
Joey Hess	e7b8cb0063	direct mode committing	2012-12-12 19:20:38 -04:00
Joey Hess	676c78436d	also update direct mode associated files in local merge	2012-12-12 13:06:03 -04:00
Joey Hess	514957914d	direct mode mappings now updated by git annex sync Still lots to do to make sync handle direct mode, but this is a good first step.	2012-12-10 14:37:24 -04:00
Joey Hess	b4c6da9cbd	Got object sending working in direct mode. However, I don't yet have a reliable way to deal with files being modified while they're being transferred. I have code that detects it on the sending side, but the receiver is still free to move the wrong content into its annex, and record that it has the content. So that's not acceptable, and I'll need to work on it some more. However, at this point I can use a direct mode repository as a remote and transfer files from and to it.	2012-12-08 17:03:39 -04:00
Joey Hess	99a8a5297c	--auto fixes * get/copy --auto: Transfer data even if it would exceed numcopies, when preferred content settings want it. * drop --auto: Fix dropping content when there are no preferred content settings.	2012-12-06 13:22:16 -04:00
Joey Hess	2525fefbb9	The standalone builds now unset their special path and library path variables before running the system web browser. Should fix a crash reported on OSX.	2012-11-27 17:05:29 -04:00
Joey Hess	e80ca5f43d	formatting	2012-11-25 15:52:35 -04:00
Joey Hess	0ab05c32c8	avoid commits when running fix and find	2012-11-24 17:58:16 -04:00
Joey Hess	463cf58140	webapp and assistant glacier support	2012-11-24 16:30:15 -04:00
Joey Hess	5f977cc725	directory special remote: Made more efficient and robust. Files are now written to a tmp directory in the remote, and once all chunks are written, etc, it's moved into the final place atomically. For now, checkpresent still checks every single chunk of a file, because the old method could leave partially transferred files with some chunks present and others not.	2012-11-19 13:18:23 -04:00
Joey Hess	83993a2ba0	remove showOutput; git is run in quiet mode	2012-11-15 15:19:02 -04:00
Joey Hess	ebd576ebcb	where indentation	2012-11-12 01:05:04 -04:00
Joey Hess	887fe1714b	flush stdout It's block-buffered here.	2012-11-09 14:33:34 -04:00
Joey Hess	82ccb385e3	use xmpp::user@host for xmpp remotes Inject the required git-remote-xmpp into PATH when running xmpp git push. Rest of the time it will not be in PATH, and git won't be able to talk to xmpp remotes.	2012-11-09 13:35:23 -04:00
Joey Hess	cb7523b9e8	add xmppgit command; roughed out xmpp push protocol and design	2012-11-06 00:59:20 -04:00
Joey Hess	8f08aa3f45	better handling of lifting from XMPP -> Assistant	2012-11-05 19:39:08 -04:00
Joey Hess	68118b8986	split remaining assistant types	2012-10-30 14:34:48 -04:00
Joey Hess	f78ca9bc58	split out daemonstatus types	2012-10-30 14:11:14 -04:00
Joey Hess	4e765327ca	Assistant monad, stage 1 This adds the Assistant monad, and an AssistantData structure. So far, none of the assistant's threads run in the monad yet.	2012-10-29 00:15:43 -04:00
Joey Hess	4ac2fd0a22	ensure that git-annex branch is pushed after a successful transfer I now have this topology working: assistant ---> {bare repo, special remote} <--- assistant And, I think, also this one: +----------- bare repo --------+ v v assistant ---> special remote <--- assistant While before with assistant <---> assistant connections, both sides got location info updated after a transfer, in this topology, the bare repo might get its location info updated, but the other assistant has no way to know that it did. And a special remote doesn't record location info, so transfers to it won't propigate out location log changes at all. So, for these to work, after a transfer succeeds, the git-annex branch needs to be pushed. This is done by recording a synthetic commit has occurred, which lets the pusher handle pushing out the change (which will include actually committing any still journalled changes to the git-annex branch). Of course, this means rather a lot more syncing action than happened before. At least the pusher bundles together very close together pushes, somewhat. Currently it just waits 2 seconds between each push.	2012-10-28 16:05:34 -04:00
Joey Hess	c71836269b	(re)start XMPP when it's configured in the webapp	2012-10-27 00:50:14 -04:00
Joey Hess	9856641ef1	deal with mtl/monads-tf conflict I had been using -ignore-package monads-tf to deal with this, but the XMPP library uses monads-tf, so that also ignores it. Instead, use PackageImports to force use of mtl in my own code.	2012-10-24 14:43:32 -04:00
Joey Hess	b05981d973	uninit: Check and abort if there are symlinks to annexed content that are not checked into git.	2012-10-22 11:54:50 -04:00
Joey Hess	b281584422	remove some more !!	2012-10-20 16:21:43 -04:00
Joey Hess	f7f34d2072	drop unwanted content in the transfer scan This was complicated quite a bit by needing to check numcopies. I optimised that, so it only looks up numcopies once per file, no matter how many remotes it checks to drop from. Although it did just occur to me that it might be better to first check if it wants to drop content, and only then check numcopies..	2012-10-18 15:07:11 -04:00
Joey Hess	919fec85cd	better fix for zombie problem, which turns out to be a zombie ssh started by rsync When rsyncProgress pipes rsync's stdout, this turns out to cause a ssh process started by rsync to be left behind as a zombie. I don't know why, but my recent zombie reaping cleanup was correct, it's just that this other zombie, that's not directly started by git-annex, was no longer reaped due to changes in the cleanup. Make rsyncProgress reap the zombie started by rsync, as a workaround. FWIW, the process tree looks like this. It seems like the rsync child is for some reason starting but not waiting on this extra ssh process. Ssh connection caching may be involved -- disabling it seemed to change the shape of the tree, but did not eliminate the zombie. 9378 pts/14 S+ 0:00 \| \_ rsync -p --progress --inplace -4 -e 'ssh' '-S' ... 9379 pts/14 S+ 0:00 \| \| \_ ssh ... 9380 pts/14 S+ 0:00 \| \| \_ rsync -p --progress --inplace -4 -e 'ssh' '-S' ... 9381 pts/14 Z+ 0:00 \| \_ [ssh] <defunct>	2012-10-17 00:47:52 -04:00
Joey Hess	51ef707a59	nub the autostart file It's possible for the file to get duplicate lines in it, and if so, we want to ignore the dups.	2012-10-14 15:19:34 -04:00
Joey Hess	4571ad9590	add help command	2012-10-13 19:07:56 -04:00
Joey Hess	9c3e1ca3c9	full analysis of ways content could stop being preferred and need to be dropped	2012-10-13 13:21:43 -04:00
Joey Hess	e52fc5ba89	vicfg: New file format, avoids ambiguity with repos that have the same description, or no description. This is also nice in that uuids are all the same length, so the values of each line, line up. Also a great deal of boilerplate elimination.	2012-10-12 23:11:26 -04:00
Joey Hess	589d1711f2	git config remote.name.annex-sync can be used to control whether a remote gets synced.	2012-10-11 18:39:21 -04:00
Joey Hess	80b3952930	webapp: display message about starting web browser One reason to do this is that on OSX, it doesn't jump to the web browser when opening a new page. Linux seems ahead in usability here... :P	2012-10-11 15:19:48 -04:00
Joey Hess	bbf2c31aa7	better message	2012-10-11 12:14:23 -04:00
Joey Hess	c0aec874a2	webapp: avoid infinite loop on start If the autostart file lists a repository, for which a directory exists, but there's not actually a valid git repo in there, the web app used to try to use it, and see it wasn't valid, and then try to autostart again. The ensuing runaway loop also ate memory, although not as fast as I was led to belive was happening to someone on IRC yesterday. So that guy may have had a different problem. But this seems otherwise a reasonable fit for the circumstances described, if git-annex was started before something that occurred during desktop login that made the repository available.	2012-10-11 12:08:11 -04:00
Joey Hess	bf72760af2	dead: Remove dead repository from all groups. This is less expensive than having inallgroup weed out dead repositories.	2012-10-10 15:39:13 -04:00
Joey Hess	5ac15149cc	assistant: Now honors preferred content settings when deciding what to transfer. Both when queueing downloads, and uploads, consults the preferred content settings. I didn't make it check yet when requeing failed transfers or queuing deferred downloads; dealing with the preferred content settings (or indeed, other settings) changing while the assistant is running still needs work.	2012-10-09 12:18:41 -04:00
Joey Hess	fee40dd374	generalized Annex.Wanted this should make it easy to use from inside the assistant, where everything is an AssociatedFile.	2012-10-08 17:14:01 -04:00
Joey Hess	1eedf495c3	make copy --to check preferred content of the remote	2012-10-08 16:06:56 -04:00
Joey Hess	17543f6e80	drop --auto --from with preferred content With --from, it needs to examine the preferred content of the repository being dropped from, instead of the local repository.	2012-10-08 15:34:44 -04:00
Joey Hess	71fd18a97f	wired preferred content up to get, copy, and drop --auto	2012-10-08 13:16:53 -04:00
Joey Hess	47314c0fad	fix last zombies in the assistant Made Git.LsFiles return cleanup actions, and everything waits on processes now, except of course for Seek.	2012-10-04 19:56:32 -04:00
Joey Hess	de3ea4adb6	remove now-unnecessary manual reaps	2012-10-04 18:58:57 -04:00

1 2 3 4 5 ...

829 commits