git-annex

Author	SHA1	Message	Date
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	626697b459	cabal file now autodetects whether S3 support is available.	2012-04-14 14:22:33 -04:00
Joey Hess	f9d44cccd9	perhaps more clear type	2012-03-10 11:38:38 -04:00
Joey Hess	bca3fd65b9	fix key directory hash calculation code Fix Key directory hash calculation code to behave as it did before version 3.20120227 when a key contains non-ascii. The hash directories for a given Key are based on its md5sum. Prior to ghc 7.4, Keys contained raw, undecoded bytes, so the md5sum was taken of each byte in turn. With the ghc 7.4 filename encoding change, keys contains decoded unicode characters (possibly with surrigates for undecodable bytes). This changes the result of the md5sum, since the md5sum used is pure haskell and supports unicode. And that won't do, as git-annex will start looking in a different hash directory for the content of a key. The surrigates are particularly bad, since that's essentially a ghc implementation detail, so could change again at any time. Also, changing the locale changes how the bytes are decoded, which can also change the md5sum. Symptoms would include things like: * git annex fsck would complain that no copies existed of a file, despite its symlink pointing to the content that was locally present * git annex fix would change the symlink to use the wrong hash directory. Only WORM backend is likely to have been affected, since only it tends to include much filename data (SHA1E could in theory also be affected). I have not tried to support the hash directories used by git-annex versions 3.20120227 to 3.20120308, so things added with those versions with WORM will require manual fixups. Sorry for the inconvenience!	2012-03-09 20:03:51 -04:00
Joey Hess	52e88f3ebf	add remote start and stop hooks Locking is used, so that, if there are multiple git-annex processes using a remote concurrently, the stop hook is only run by the last process that uses it.	2012-03-04 19:12:58 -04:00
Joey Hess	1f73db3469	improve alwayscommit=false mode Now changes are staged into the branch's index, but not committed, which avoids growing a large journal. And sync and merge always explicitly commit, ensuring that even when they do nothing else, they commit the staged changes. Added a flag file to indicate that the branch's journal contains uncommitted changes. (Could use git ls-files, but don't want to run that every time.) In the future, this ability to have uncommitted changes staged in the journal might be used on remotes after a series of oneshot commands.	2012-02-25 16:18:55 -04:00
Joey Hess	47250a153a	ssh connection caching Ssh connection caching is now enabled automatically by git-annex. Only one ssh connection is made to each host per git-annex run, which can speed some things up a lot, as well as avoiding repeated password prompts. Concurrent git-annex processes also share ssh connections. Cached ssh connections are shut down when git-annex exits. Note: The rsync special remote does not yet participate in the ssh connection caching.	2012-01-20 17:14:56 -04:00
Joey Hess	0f9859ae51	avoid partial function	2011-12-15 16:58:58 -04:00
Joey Hess	cfbbda99f4	optimize index updating The last branch ref that the index was updated to is stored in .git/annex/index.lck, and the index only updated when the current branch ref differs. (The .lck file should later be used for locking too.) Some more optimization is still needed, since there is some redundancy in calls to git show-ref.	2011-12-11 16:14:59 -04:00
Joey Hess	0ba4b1de18	move a file location to Locations.hs	2011-12-11 14:14:28 -04:00
Joey Hess	583ba80992	better syntax	2011-12-10 20:53:42 -04:00
Joey Hess	10e8028a42	Fix bug in last version in getting contents from bare repositories.	2011-12-10 18:45:55 -04:00
Joey Hess	64672c6262	refactor	2011-12-03 09:10:23 -04:00
Joey Hess	fb68a7881f	convert rsync special backend to using both hash directory types	2011-12-02 15:50:27 -04:00
Joey Hess	db5b479f3f	use lowercase hash by default; non-bare repos are a special case Directory special remotes will now always store keys in the lowercase name, which avoids the complication of catching failures to create the mixed case name. Git remotes using http will now try the lowercase name first.	2011-12-02 14:56:48 -04:00
Joey Hess	0815cc2fc1	refactor	2011-12-02 14:47:59 -04:00
Joey Hess	bff6ca2634	refactor	2011-11-28 23:20:31 -04:00
Joey Hess	e6ef66cea3	optimize gitAnnexLocation For non-bare it's back to doing no work.	2011-11-28 23:08:11 -04:00
Joey Hess	f4bf444ae0	store content in hashDirLower directories in bare repositories When storing content in bare repositories, use the hashDirLower directories. Bare repositories can be on USB drives, which might use the FAT filesystem, and fall afoul of recent bugs in linux's handling of mixed case on FAT. Using hashDirLower avoids that.	2011-11-28 22:55:40 -04:00
Joey Hess	da9cd315be	add support for using hashDirLower in addition to hashDirMixed Supporting multiple directory hash types will allow converting to a different one, without a flag day. gitAnnexLocation now checks which of the possible locations have a file. This means more statting of files. Several places currently use gitAnnexLocation and immediately check if the returned file exists; those need to be optimised.	2011-11-28 22:43:51 -04:00
Mark Wright	041d324125	Remove haskell98 to build with ghc 7.2.2, also built with ghc 7.0.4 Signed-off-by: Joey Hess <joey@kitenet.net>	2011-11-26 12:05:08 -04:00
Joey Hess	bf460a0a98	reorder repo parameters last Many functions took the repo as their first parameter. Changing it consistently to be the last parameter allows doing some useful things with currying, that reduce boilerplate. In particular, g <- gitRepo is almost never needed now, instead use inRepo to run an IO action in the repo, and fromRepo to get a value from the repo. This also provides more opportunities to use monadic and applicative combinators.	2011-11-08 16:27:20 -04:00
Joey Hess	91366c896d	clean Annex stuff out of Utility/	2011-10-16 00:04:26 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	d357556141	Add locking to avoid races when changing the git-annex branch.	2011-10-03 16:32:36 -04:00
Joey Hess	6c396a256c	finished hlint pass	2011-07-15 12:47:14 -04:00
Joey Hess	e784757376	hlint tweaks Did all sources except Remotes/* and Command/*	2011-07-15 03:12:05 -04:00
Joey Hess	79016c197c	add hashing to web log files	2011-07-01 17:23:01 -04:00
Joey Hess	f6063a094e	renamed GitRepo to Git It was always imported qualified as Git anyway	2011-06-30 13:21:39 -04:00
Joey Hess	5f494154a3	add journaling to speed up changes to the git-annex branch git is slow when the index file is large and has to be rewritten each time a file is changed. To speed this up, added a journal where changes are recorded before being fed into the index file and committed to the git-annex branch. The entire journal can be fed into git with just 2 commands, and only one write of the index file.	2011-06-23 11:37:26 -04:00
Joey Hess	23e765b67c	update re git-annex branch direct modification	2011-06-23 09:56:04 -04:00
Joey Hess	9e37898e21	remove stateDir	2011-06-23 08:48:13 -04:00
Joey Hess	1a182d4d04	stub in v2 upgrade	2011-06-22 17:51:48 -04:00
Joey Hess	4c4ebf2d75	store trust.log and remote.log in the git-annex branch .. and I think that's everything that will use the branch	2011-06-22 17:08:51 -04:00
Joey Hess	235e2e63a1	move	2011-06-22 16:30:34 -04:00
Joey Hess	1870186632	fixed logFile	2011-06-22 16:17:16 -04:00
Joey Hess	a5e6802b5b	typos in comments	2011-06-21 14:44:56 -04:00
Joey Hess	703c437bd9	rename modules for data types into Types/ directory	2011-06-01 21:56:04 -04:00
Joey Hess	760cde28b6	more pointless monadic golfing	2011-05-16 14:49:28 -04:00
Joey Hess	43f0a666f0	unused: Now also lists files fsck places in .git/annex/bad/	2011-04-29 13:59:00 -04:00
Joey Hess	f005a84e56	add loggedKeys	2011-04-02 15:50:51 -04:00
Joey Hess	616e6f8a84	Use lowercase hash directories for locationlog files to avoid some issues with git on OSX with the mixed-case directories. No migration is needed; the old mixed case hash directories are still read; new information is written to the new directories.	2011-04-02 13:49:03 -04:00
Joey Hess	fdd455e913	use same directory structure as .git/annex/objects for directory remotes And same file perms.	2011-03-30 14:56:31 -04:00
Joey Hess	137257ded1	better letter choice for hashing	2011-03-16 02:50:13 -04:00
Joey Hess	dd5448eb07	added 2 level hashing This means there can be 1024 subdirs, each with up to 1024 sub-subdirs. So with hundreds of millions of annexed objects, each leaf directory will have only a few files on average.	2011-03-15 23:58:27 -04:00
Joey Hess	6c412fb9f5	escape colons in key files	2011-03-15 23:39:04 -04:00
Joey Hess	9d49fe2c17	first pass at using new keys It compiles. It sorta works. Several subcommands are FIXME marked and broken, because things that used to accept separate --backend and --key params need to be changed to accept just a --key that encodes all the key info, now that there is metadata in keys.	2011-03-15 21:34:13 -04:00
Joey Hess	0e0f85e09d	add hash directory stuff, not used yet	2011-03-15 17:47:00 -04:00
Joey Hess	d651d4985b	some initial support for local bare repos This relies on git-annex's behavior of reading the config of local repos. That allows repoIsLocalBare to examine the git config for core.bare. Hopefully, gitAnnexLocation, gitAnnexDir, and gitAnnexObjectDir are only used on local repos. But, I have not audited fully, since they're probably not (see for example copyToRemote). And so, the functions fall back to their old non-bare-aware behavior for non-local repos.	2011-03-03 14:51:57 -04:00
Joey Hess	e6da7eb177	Improved temp file handling * Improved temp file handling. Transfers of content can now be resumed from temp files later; the resume does not have to be the immediate next git-annex run. * unused: Include partially transferred content in the list.	2011-01-28 14:10:50 -04:00

1 2

91 commits