git-annex

Author	SHA1	Message	Date
Joey Hess	2de1e2c2ce	Optimized copy --from and get --from to avoid checking the location log for files that are already present. This can be a significant speedup when running in large trees that are only missing a few files; it makes copy --from just as fast as get.	2011-11-10 21:32:42 -04:00
Joey Hess	cf0174c922	content locking: I've tested that this solves the cyclic drop problem. Have not looked at cyclic move, etc.	2011-11-09 21:54:42 -04:00
Joey Hess	faa4935047	Handle a case where an annexed file is moved into a gitignored directory, by having fix --force add its change.	2011-11-07 18:10:31 -04:00
Joey Hess	f8911cc69d	releasing version 3.20111107	2011-11-07 13:06:58 -04:00
Joey Hess	41eecb4601	Bugfix: In the past two releases, git-annex init has written the uuid.log in the wrong format, with the UUID and description flipped. This is my own damn fault for not making UUID a real type, and then relying on the type checker to ensure my refactoring was correct -- which it wasn't! I should probably add code to clean up bogus entries in the uuid.log, but right now I want to get the fix out there to prevent people experiencing this bug. I should also make UUID a real data type.	2011-11-07 12:47:41 -04:00
Joey Hess	aae0417d94	Don't try to read config from repos with annex-ignore set.	2011-11-07 11:50:30 -04:00
Joey Hess	c99fb58909	merge: Use fast-forward merges when possible. Thanks Valentin Haenel for a test case showing how non-fast-forward merges could result in an ongoing pull/merge/push cycle. While the git-annex branch is fast-forwarded, git-annex's index file is still updated using the union merge strategy as before. There's no other way to update the index that would be any faster. It is possible that a union merge and a fast-forward result in different file contents: Files should have the same lines, but a union merge may change their order. If this happens, the next commit made to the git-annex branch will have some unnecessary changes to line orders, but the consistency of data should be preserved. Note that when the journal contains changes, a fast-forward is never attempted, which is fine, because committing those changes would be vanishingly unlikely to leave the git-annex branch at a commit that already exists in one of the remotes. The real difficulty is handling the case where multiple remotes have all changed. git-annex does find the best (i.e., newest) one and fast forwards to it. If the remotes are diverged, no fast-forward is done at all. It would be possible to pick one, fast forward to it, and make a merge commit to the rest, but I see no benefit to adding that complexity. Determining the best of N changed remotes requires N*2+1 calls to git-log, but these are fast git-log calls, and N is typically small. Also, typically some or all of the remote refs will be the same, and git-log is not called to compare those. In the real world I expect this will almost always add only 1 git-log call to the merge process. (Which already makes N anyway.)	2011-11-06 15:22:40 -04:00
Joey Hess	0556dc812e	releasing version 3.20111105	2011-11-05 15:55:19 -04:00
Joey Hess	0bb798e351	Pass -t to rsync to preserve timestamps.	2011-11-04 19:41:11 -04:00
Joey Hess	ef3457196a	use SHA256 by default. To get old behavior, add a .gitattributes containing "* annex.backend=WORM". I feel that SHA256 is a better default for most people, as long as their systems are fast enough that checksumming their files isn't a problem. git-annex should default to preserving the integrity of data as well as git does. Checksum backends also work better with editing files via unlock/lock. I considered just using SHA1, but since that hash is believed to be somewhat near to being broken, and git-annex deals with large files which would be a perfect exploit medium, I decided to go to a SHA-2 hash. SHA512 is annoyingly long when displayed, and git-annex displays it in a few places (and notably it is shown in ls -l), so I picked the shorter hash. Considered SHA224 as it's even shorter, but feel it's a bit weird. I expect git-annex will use SHA-3 at some point in the future, but probably not soon! Note that systems without a sha256sum (or sha256) program will fall back to defaulting to SHA1.	2011-11-04 15:51:01 -04:00
Joey Hess	1089e85d48	add changelog for bugfix	2011-11-04 15:51:01 -04:00
Joey Hess	eec137f33a	Record uuid when auto-initializing a remote so it shows in status.	2011-11-02 14:18:21 -04:00
Joey Hess	00988bcf36	fixed my build environment	2011-10-31 15:40:57 -04:00
Joey Hess	3d3e1c4c25	better command name	2011-10-31 15:18:41 -04:00
Joey Hess	380839299e	The fromkey command now takes the key as its first parameter. The --key option is no longer used.	2011-10-31 12:56:07 -04:00
Joey Hess	cc1ea8f844	Removed the setkey command, and added a setcontent command with a more useful interface.	2011-10-31 12:33:41 -04:00
Joey Hess	22e9f445ab	unused, dropunused: Now work in bare repositories. Turned out I had already done all the work needed to support this when unused started checking all branches.	2011-10-29 19:16:45 -04:00
Joey Hess	2566eb85fe	fsck: Now works in bare repositories. Checks location log information, and file contents. Does not check that numcopies is satisfied, as .gitattributes information about numcopies is not available in a bare repository. In practice, that should not be a problem, since fsck is also run in a checkout and will check numcopies there.	2011-10-29 18:03:28 -04:00
Joey Hess	ab738a403a	status: Now always shows the current repository, even when it does not appear in uuid.log.	2011-10-28 19:49:01 -04:00
Joey Hess	6c31e3a8c3	drop --from is now supported to remove file content from a remote.	2011-10-28 17:26:38 -04:00
Joey Hess	b955238ec7	Fail if --from or --to is passed to commands that do not support them.	2011-10-27 18:56:54 -04:00
Joey Hess	66194684ac	uninit: Add guard against being run with the git-annex branch checked out.	2011-10-27 15:47:11 -04:00
Joey Hess	83d11c03c4	wording	2011-10-27 15:24:58 -04:00
Joey Hess	f84d66fa15	reap in onLocal: Each onLocal call involves a new Annex state, so needs to clean up after it.	2011-10-27 14:55:07 -04:00
Joey Hess	373cad993d	Sped up some operations on remotes that are on the same host. Specifically, disabled trying to update the git-annex branch on the remote, since that data is never used by operations that act on such remotes. Also, when copying content to such a remote, skip committing the presence information changes to its git-annex branch. Leaving it in the journal there is ok: Any command run on the remote that needs the info will flush the journal. This may partially solve this bug: http://git-annex.branchable.com/bugs/fails_to_handle_lot_of_files/ Although I still see unreaped git processes piling up when doing a copy --to.	2011-10-27 14:55:06 -04:00
Joey Hess	270c1af087	releasing version 3.20111025	2011-10-25 13:46:01 -07:00
Joey Hess	e2853b3fec	update	2011-10-25 11:39:15 -07:00
Joey Hess	52c8244219	git-annex-shell: GIT_ANNEX_SHELL_READONLY and GIT_ANNEX_SHELL_LIMITED environment variables can be set to limit what commands can be run. This could be used by eg, gitolite.	2011-10-15 19:06:35 -04:00
Joey Hess	ec169f84b1	migrate: Copy url logs for keys when migrating.	2011-10-15 16:36:56 -04:00
Joey Hess	9fa9214106	A remote can have an annexUrl configured, that is used by git-annex instead of its usual url. (Similar to pushUrl.)	2011-10-14 18:18:28 -04:00
Joey Hess	205a5b2aaa	typo	2011-10-12 00:29:49 -04:00
Joey Hess	11b154e811	prep release	2011-10-11 23:03:19 -04:00
Joey Hess	402d9c7c5f	oops	2011-10-11 22:54:38 -04:00
Joey Hess	9c04d1e523	fix git 1.7.7 breakage * This version of git-annex only works with git 1.7.7 and newer. The breakage with old versions is subtle, and affects annex.numcopies .gitattributes settings, so be sure to upgrade git to 1.7.7. (Debian package now depends on that version.) * Don't pass absolute paths to git show-attr, as it started following symlinks when that's done in 1.7.7. Instead, use relative paths, which show-attr only handles 100% correctly in 1.7.7. Closes: #645046 Unfortunately I can find no way to work with the old and new gits, as the old had bugs that require absolute paths, while the new doesn't like them at all. And the behavior of git show-attr in 1.7.7 is the same as eg, git add of an absolute path to a symlink, so seems entirely intentional and not likely to change.	2011-10-11 22:53:32 -04:00
Joey Hess	10edaf6dc9	reorder	2011-10-10 16:03:32 -04:00
Joey Hess	81ed7b203d	Now supports git's insteadOf configuration, to modify the url used to access a remote. Note that pushInsteadOf is not used; that and pushurl are reserved for actual git pushes. Closes: #644278	2011-10-09 14:58:32 -04:00
Joey Hess	5414bbce58	git-annex-shell uuid verification * git-annex now asks git-annex-shell to verify that it's operating in the expected repository. * Note that this git-annex will not interoperate with remotes using older versions of git-annex-shell. The reason for this check is to avoid git-annex getting confused about what remote repository actually contains a value. It's a prerequisite for supporting git insteadOf aliases.	2011-10-06 19:24:11 -04:00
Joey Hess	f011033869	add timestamps to remote.log	2011-10-06 16:07:58 -04:00
Joey Hess	f929d0229c	Add timestamps to trust.log.	2011-10-06 15:55:50 -04:00
Joey Hess	3e0d2a0803	add timestamp to uuid.log * New or changed repository descriptions in uuid.log now have a timestamp, which is used to ensure the newest description is used when the uuid.log has been merged. * Note that older versions of git-annex will display the timestamp as part of the repository description, which is ugly but otherwise harmless.	2011-10-06 15:31:25 -04:00
Joey Hess	d357556141	Add locking to avoid races when changing the git-annex branch.	2011-10-03 16:32:36 -04:00
Joey Hess	49f21dd9ba	Contain the zombie hordes. Specifically, when using gpg, a zombie is forked for each file, so waiting until shutdown to reap won't do.	2011-10-02 11:16:34 -04:00
Joey Hess	29032cb70e	When displaying a list of repositories, show git remote names in addition to their descriptions.	2011-09-30 15:02:29 -04:00
Joey Hess	828f3f1b0c	status: List all known repositories.	2011-09-30 03:20:24 -04:00
Joey Hess	a7e7dda55a	Fix referring to remotes by uuid. I think that I broke this in some fairly recent refactoring.	2011-09-30 02:23:24 -04:00
Joey Hess	7ff89ccfee	convert all git read/write functions to use ByteStrings. This yields a second or so speedup in unused, find, etc. Seems that even when the ByteString is immediately split and then converted to Strings, it's faster. I may try to push ByteStrings out into more of git-annex gradually, although I suspect most of the time-critical parts are already covered now, and many of the rest rely on libraries that only support Strings.	2011-09-29 23:48:57 -04:00
Joey Hess	a91c8a15d5	Sped up unused. Added Git.ByteString which replaces Git IO methods with ones using lazy ByteStrings. This can be more efficient when large quantities of data are being read from git. In Git.LsTree, parse git ls-tree output more efficiently, thanks to ByteString. This benchmarks 25% faster, in a benchmark that includes (probably predominately) the run time for git ls-tree itself. In real world numbers, this makes git annex unused 2 seconds faster for each branch it needs to check, in my usual large repo.	2011-09-29 19:04:24 -04:00
Joey Hess	7dddb803a0	releasing version 3.20110928	2011-09-28 19:17:12 -04:00
Joey Hess	d75da353b9	documentation/warning message update for future feature	2011-09-23 18:04:38 -04:00
Joey Hess	9f5c7a246b	status: Massively sped up; remove --fast mode. Using Sets is the right thing; they have constant size lookup like my SizeList, and log n insertion, which beats nub to death. Runs faster than --fast mode did before, and gives accurate counts. 13 seconds total runtime with a warm cache in a repository with 40 thousand keys.	2011-09-20 18:57:05 -04:00
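
The status speedup in 9f5c7a246b comes from replacing nub-style deduplication, which compares each key against every unique key found so far, with a set. The following is a minimal sketch of that contrast, not the git-annex implementation (which is written in Haskell). The function names and the key strings are hypothetical, and Python's set is hash-based rather than the balanced tree behind Haskell's Data.Set, so insertion cost differs from the log n mentioned in the commit; the point about avoiding quadratic comparisons while keeping a constant-time size lookup is the same.

```python
from typing import Hashable, Iterable, List


def count_unique_nub(keys: Iterable[Hashable]) -> int:
    """Quadratic approach: scan a list of already-seen keys for every input key."""
    seen: List[Hashable] = []
    for key in keys:
        if key not in seen:  # linear scan of the list on each iteration
            seen.append(key)
    return len(seen)


def count_unique_set(keys: Iterable[Hashable]) -> int:
    """Set-based approach: cheap membership and insertion, constant-time len()."""
    return len(set(keys))


# Hypothetical key names (assumption, for illustration only); duplicates stand
# in for several files that share the same content.
keys = [f"SHA256-s100--{i % 1000:04d}" for i in range(5000)]

# Both approaches agree on the count; only the running time differs.
assert count_unique_nub(keys) == count_unique_set(keys) == 1000
```

With a few thousand keys the difference is barely noticeable, but the list-based version grows quadratically, which would matter in a repository with tens of thousands of keys like the one the commit describes.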


417 commits