git-annex

Author	SHA1	Message	Date
Joey Hess	40cec65ace	more hlint	2014-02-11 10:48:52 -04:00
Joey Hess	897d877472	work around absNormPath not working on Windows When making git-annex links, we want unix-style paths in the link targets.	2014-02-06 17:17:35 -04:00
Joey Hess	721cc0cd22	rework annexed object locking in direct mode & support Windows Seems that locking of annexed objects when they're being dropped was broken in direct mode: * When taking the lock before dropping, it created the .git/annex/objects file, as an empty file. It seems that the dropping code deleted that, but that is not right, and for all I know could in some situation cause a corrupted object to leak out. * When the lock was checked, it actually tried to open each direct mode file, and checked if it was locked. Not the same lock used above, and could also fail if some consumer of the file locked it. Fixed this, and added windows support by switching direct mode to lock a .lck file.	2014-01-28 16:43:11 -04:00
Joey Hess	d345e5b52f	add git fsck to cronner, and UI for repository repair (not yet wired up)	2013-10-22 16:02:52 -04:00
Joey Hess	396e47b07e	tighten file2key to not produce invalid keys with no keyName A file named "foo-" or "foo-bar" was taken as a key's file, with a backend of "foo", and an empty keyName. This led to various problems, especially because converting that key back to a file did not yeild the same filename.	2013-10-16 12:46:24 -04:00
Joey Hess	af5e1d0494	half way complete cronner thread to run scheduled activities	2013-10-08 11:48:28 -04:00
Joey Hess	635c9a1549	assistant: Detect stale git lock files at startup time, and remove them. Extends the index.lock handling to other git lock files. I surveyed all lock files used by git, and found more than I expected. All are handled the same in git; it leaves them open while doing the operation, possibly writing the new file content to the lock file, and then closes them when done. The gc.pid file is excluded because it won't affect the normal operation of the assistant, and waiting for a gc to finish on startup wouldn't be good. All threads except the webapp thread wait on the new startup sanity checker thread to complete, so they won't try to do things with git that fail due to stale lock files. The webapp thread mostly avoids doing that kind of thing itself. A few configurators might fail on lock files, but only if the user is explicitly trying to run them. The webapp needs to start immediately when the user has opened it, even if there are stale lock files. Arranging for the threads to wait on the startup sanity checker was a bit of a bear. Have to get all the NotificationHandles set up before the startup sanity checker runs, or they won't see its signal. Perhaps the NotificationBroadcaster is not the best interface to have used for this. Oh well, it works. This commit was sponsored by Michael Jakl	2013-10-05 17:04:21 -04:00
Joey Hess	1be4d281d6	Better sanitization of problem characters when generating URL and WORM keys. FAT has a lot of characters it does not allow in filenames, like ? and * It's probably the worst offender, but other filesystems also have limitiations. In 2011, I made keyFile escape : to handle FAT, but missed the other characters. It also turns out that when I did that, I was also living dangerously; any existing keys that contained a : had their object location change. Oops. So, adding new characters to escape to keyFile is out. Well, it would be possible to make keyFile behave differently on a per-filesystem basis, but this would be a real nightmare to get right. Consider that a rsync special remote uses keyFile to determine the filenames to use, and we don't know the underlying filesystem on the rsync server.. Instead, I have gone for a solution that is backwards compatable and simple. Its only downside is that already generated URL and WORM keys might not be able to be stored on FAT or some other filesystem that dislikes a character used in the key. (In this case, the user can just migrate the problem keys to a checksumming backend. If this became a big problem, fsck could be made to detect these and suggest a migration.) Going forward, new keys that are created will escape all characters that are likely to cause problems. And if some filesystem comes along that's even worse than FAT (seems unlikely, but here it is 2013, and people are still using FAT!), additional characters can be added to the set that are escaped without difficulty. (Also, made WORM limit the part of the filename that is embedded in the key, to deal with filesystem filename length limits. This could have already been a problem, but is more likely now, since the escaping of the filename can make it longer.) This commit was sponsored by Ian Downes	2013-10-05 15:01:49 -04:00
Joey Hess	3dac026598	move some code around	2013-10-05 13:49:45 -04:00
Joey Hess	83b4b8d589	rename confusing function The index.lck file is not a lock file. Kept the historical name for now as changing it would be work.	2013-10-03 15:06:58 -04:00
Joey Hess	7a9a16b337	lockJournal when running performTransitions This may not strictly be needed -- the transition code bypasses the journal. However, this ensures that the git-annex branch is only committed with the journal locked. This will allow for further improvements.	2013-10-03 14:37:46 -04:00
Joey Hess	4c954661a1	git-annex-shell: Added support for operating inside gcrypt repositories. * Note that the layout of gcrypt repositories has changed, and if you created one you must manually upgrade it. See http://git-annex.branchable.com/upgrades/gcrypt/	2013-09-24 17:25:47 -04:00
Joey Hess	fcd5c167ef	untested transition detection on merging, and transition running code	2013-08-28 15:57:42 -04:00
Joey Hess	24c8a6042b	importfeed: Ignores transient problems with feeds. Only exits nonzero when a feed has repeatedly had a problems for at least 1 day.	2013-08-03 01:40:21 -04:00
Joey Hess	a96e982bd3	fuzz tester	2013-05-23 19:00:46 -04:00
Joey Hess	03e8594369	fix the day's windows permissions damage	2013-05-12 19:09:48 -04:00
Joey Hess	a2f83b28f3	fix path separator	2013-05-12 17:18:34 -05:00
Joey Hess	f1b0a4b404	Use lower case hash directories for storing files on crippled filesystems, same as is already done for bare repositories. * since this is a crippled filesystem anyway, git-annex doesn't use symlinks on it * so there's no reason to use the mixed case hash directories that we're stuck using to avoid breaking everyone's symlinks to the content * so we can do what is already done for all bare repos, and make non-bare repos on crippled filesystems use the all-lower case hash directories * which are, happily, all 3 letters long, so they cannot conflict with mixed case hash directories * so I was able to 100% fix this and even resuming `git annex add` in the test case will recover and it will all just work.	2013-04-04 15:46:33 -04:00
Joey Hess	38d61f934d	Update working tree files fully atomically This avoids commit churn by the assistant when eg, replacing a file with a symlink. But, just as importantly, it prevents the working tree being left with a deleted file if git-annex, or perhaps the whole system, crashes at the wrong time. (It also probably avoids confusing displays in file managers.)	2013-04-02 15:02:00 -04:00
Joey Hess	8d9c2afd89	Additional GIT_DIR support bugfixes. May actually work now. Two fixes. First, and most importantly, relax the isLinkToAnnex check to only look for /annex/objects/, not [^\|/].git/annex/objects. If GIT_DIR is used with a detached work tree, the git directory is not necessarily named .git. There are important caveats with doing that at all, since git-annex will make symlinks that point at GIT_DIR, which means that the relative path between GIT_DIR and GIT_WORK_TREE needs to remain stable across all clones of the repository. ---- The other fix is just fixing crazy and wrong code that, when GIT_DIR is set, expects to still find a git repository in the path below the work tree, and uses some of its configuration, and some of GIT_DIR. What was I thinking, and why can't I seem to get this code right?	2013-02-23 12:41:22 -04:00
Joey Hess	624e34649f	Direct mode: Support filesystems like FAT which can change their inodes each time they are mounted.	2013-02-19 17:31:03 -04:00
Joey Hess	a52f8f382b	split out Utility.InodeCache	2013-02-14 16:17:40 -04:00
Joey Hess	909f67443f	Fix transferring files to special remotes in direct mode.	2013-01-06 14:29:01 -04:00
Joey Hess	53dbcce645	direct mode merging works! Automatic merge resoltion code needs to be fixed to preserve objects from direct mode files.	2012-12-18 15:04:44 -04:00
Joey Hess	ef24751922	support for checking presence of objects in direct mode Also for dropping objects in direct mode. Checking presence reliably needs a cache of mtime, size, and inode. This way, if a file is modified, keys that point to it are no longer present. Also, the code for restoring the symlink when removing objects is unnecessarily messy. calcGitLink was generating links starting with "../../remote/.git/", when running "git annex move --from remote". I put in a workaround, but calcGitLink should probably be fixed. There is not yet support for getting objects from repositories in direct mode; it still looks for content in .git/annex/objects, and there's no once place I can change to fix that. Also, getting objects from direct mode repositories is problematic since the can be changed while the object is being transferred. It probably needs to quarantine it first.	2012-12-07 17:29:55 -04:00
Joey Hess	3898d8c091	support for storing files in direct mode	2012-12-07 14:53:02 -04:00
Joey Hess	d3dfeeb3d9	remove annex/ from key locations used for webdav	2012-11-18 23:59:39 -04:00
Joey Hess	bb28c6114a	drop webdav compatability with the directory special remote etc The benefit of using a compatable directory structure does not outweigh the cost in complexity of handling the multiple locations content can be stored in directory special remotes. And this also allows doing away with the parent directories, which can't be made unwritable in DAV, so have no benefit there. This will save 2 http calls per file store. But, kept the directory hashing, just in case.	2012-11-16 00:42:33 -04:00
Joey Hess	6eca362c5d	indentation foo, and a new coding style page. no code changes	2012-10-28 21:27:15 -04:00
Joey Hess	7a7f63182c	vicfg: New command, allows editing (or simply viewing) most of the repository configuration settings stored in the git-annex branch. Incomplete; I need to finish parsing and saving. This will also be used for editing transfer control expresssions. Removed the group display from the status output, I didn't really like that format, and vicfg can be used to see as well as edit rempository group membership.	2012-10-03 17:04:52 -04:00
Joey Hess	e4bf74a965	store S3 creds in a 600 mode file inside the local git repo	2012-09-26 14:42:32 -04:00
Joey Hess	6885b2deda	add recordStartTime and getStartTime	2012-09-25 14:16:34 -04:00
Joey Hess	18bae020ed	make other repositories list list all autostarted repos And add a form to add another, unrelated repository	2012-09-18 17:50:07 -04:00
Joey Hess	750c4ac6c2	bugfix: avoid staging but not committing changes to git-annex branch Branch.get is not able to see changes that have been staged to the index but not committed. This is a limitation of git cat-file --batch; when reading from the index, as opposed to from a branch, it does not notice changes made after the first time it reads the index. So, had to revert the changes made in `1f73db3469` to make annex.alwayscommit=false stage changes. Also, ensure that Branch.change and Branch.get always see changes at all points during a commit, by not deleting journal files when staging to the index. Delete them only after committing the branch. Before, there was a race during commits where a different git-annex could see out-of-date info from the branch while a commit was in progress. That's also done when updating the branch to merge in remote branches. In the case where the local git-annex branch has had changes pushed into it that are not yet reflected in the index, and there are journalled changes as well, a merge commit has to be done.	2012-09-15 20:15:16 -04:00
Joey Hess	60c31afc38	add decodeW8	2012-09-13 19:14:29 -04:00
Joey Hess	54a492db5f	UI for adding a ssh or rsync remote	2012-08-31 18:59:57 -04:00
Joey Hess	487bdf0e24	add transfer scanned flag files	2012-08-23 13:42:26 -04:00
Joey Hess	94fcd0cf59	add routes to pause/start/cancel transfers This commit includes a paydown on technical debt incurred two years ago, when I didn't know that it was bad to make custom Read and Show instances for types. As the routes need Read and Show for Transfer, which includes a Key, and deriving my own Read instance of key was not practical, I had to finally clean that up. So the compact Key read and show functions are now file2key and key2file, and Read and Show are now derived instances. Changed all code that used the old instances, compiler checked. (There were a few places, particularly in Command.Unused, and the test suite where the Show instance continue to be used for legitimate comparisons; ie show key_x == show key_y (though really in a bloom filter))	2012-08-08 16:20:24 -04:00
Joey Hess	1ffef3ad75	git annex webapp now opens a browser to the webapp Also, starts the assistant if it wasn't already running.	2012-07-25 23:13:01 -04:00
Joey Hess	be0e38bcc3	add transfer information files	2012-07-01 17:15:11 -04:00
Joey Hess	ff2414427b	implement daemon status serialization to a file Also afterLastDaemonRun, with 10 minute slop to handle majority of clock skew issues.	2012-06-13 13:35:15 -04:00
Joey Hess	942d8f7298	hlint	2012-06-12 11:32:06 -04:00
Joey Hess	0b3e2bed78	add a pid file Writes pid to a file. Is supposed to take an exclusive lock, but that's not working, and it's too late for me to understand why.	2012-06-11 01:20:19 -04:00
Joey Hess	d5884388b0	daemonize git annex watch	2012-06-11 00:39:09 -04:00
Joey Hess	6fd83851c1	Fix display of warning message when encountering a file that uses an unsupported backend.	2012-05-31 21:03:24 -04:00
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	626697b459	cabal file now autodetects whether S3 support is available.	2012-04-14 14:22:33 -04:00
Joey Hess	f9d44cccd9	perhaps more clear type	2012-03-10 11:38:38 -04:00
Joey Hess	bca3fd65b9	fix key directory hash calculation code Fix Key directory hash calculation code to behave as it did before version 3.20120227 when a key contains non-ascii. The hash directories for a given Key are based on its md5sum. Prior to ghc 7.4, Keys contained raw, undecoded bytes, so the md5sum was taken of each byte in turn. With the ghc 7.4 filename encoding change, keys contains decoded unicode characters (possibly with surrigates for undecodable bytes). This changes the result of the md5sum, since the md5sum used is pure haskell and supports unicode. And that won't do, as git-annex will start looking in a different hash directory for the content of a key. The surrigates are particularly bad, since that's essentially a ghc implementation detail, so could change again at any time. Also, changing the locale changes how the bytes are decoded, which can also change the md5sum. Symptoms would include things like: * git annex fsck would complain that no copies existed of a file, despite its symlink pointing to the content that was locally present * git annex fix would change the symlink to use the wrong hash directory. Only WORM backend is likely to have been affected, since only it tends to include much filename data (SHA1E could in theory also be affected). I have not tried to support the hash directories used by git-annex versions 3.20120227 to 3.20120308, so things added with those versions with WORM will require manual fixups. Sorry for the inconvenience!	2012-03-09 20:03:51 -04:00
Joey Hess	52e88f3ebf	add remote start and stop hooks Locking is used, so that, if there are multiple git-annex processes using a remote concurrently, the stop hook is only run by the last process that uses it.	2012-03-04 19:12:58 -04:00

1 2 3

136 commits