git-annex

Author	SHA1	Message	Date
Joey Hess	29385dc393	Moved list of backends and remote types from status to version command.	2013-10-01 20:50:46 -04:00
Joey Hess	bddfbef8be	git-annex-shell gcryptsetup command This was the least-bad alternative to get dedicated key gcrypt repos working in the assistant.	2013-10-01 17:20:51 -04:00
Joey Hess	0ddf4d3148	Merge branch 'master' of ssh://git-annex.branchable.com into sshgcrypt	2013-10-01 14:40:20 -04:00
Joey Hess	995e1e3c5d	fix transferring to gcrypt repo from direct mode repo recvkey was told it was receiving a HMAC key from a direct mode repo, and that confused it into rejecting the transfer, since it has no way to verify a key using that backend, since there is no HMAC backend. I considered making recvkey skip verification in the case of an unknown backend. However, that could lead to bad results; a key can legitimately be in the annex with a backend that the remote git-annex-shell doesn't know about. Better to keep it rejecting if it cannot verify. Instead, made the gcrypt special remote not set the direct mode flag when sending (and receiving) files. Also, added some recvkey messages when its checks fail, since otherwise all that is shown is a confusing error message from rsync when the remote git-annex-shell exits nonzero.	2013-10-01 14:19:24 -04:00
Joey Hess	7f7dcd315b	fix direct mode switch permissions problem Similar to how a similar problem with indirect was earlier fixed.	2013-09-30 12:48:40 -04:00
Joey Hess	12f6b9693a	Send a git-annex user-agent when downloading urls. Overridable with --user-agent option. Not yet done for S3 or WebDAV due to limitations of libraries used -- nether allows a user-agent header to be specified. This commit sponsored by Michael Zehrer.	2013-09-28 14:35:21 -04:00
Joey Hess	57d49a6d04	remove >=> and >=> ; use <$$> instead I forgot I had <$$> hidden away in Utility.Applicative. It allows doing the same kind of currying as does >=> and I found using it made the code more readable for me. (>=> was not used)	2013-09-27 19:58:48 -04:00
Joey Hess	1550759220	enabling rsync.net gcrypt repos Still need to detect when the user is trying to create a repo that already exists, and jump to the enabling code.	2013-09-26 23:47:30 -04:00
Joey Hess	98fc7e8a19	add, import, assistant: Better preserve the mtime of symlinks, when when adding content that gets deduplicated. Note that this turned out to remove a syscall, not add any expense. Otherwise, I would not have done it.	2013-09-25 16:07:11 -04:00
Joey Hess	c45f5fbdb3	indirect: Better behavior when a file in direct mode is not owned by the user running the conversion.	2013-09-25 15:29:56 -04:00
Joey Hess	b405295aee	hlint test suite still passes	2013-09-25 03:09:06 -04:00
Joey Hess	4c954661a1	git-annex-shell: Added support for operating inside gcrypt repositories. * Note that the layout of gcrypt repositories has changed, and if you created one you must manually upgrade it. See http://git-annex.branchable.com/upgrades/gcrypt/	2013-09-24 17:25:47 -04:00
Joey Hess	f9e438c1bc	factor out more ssh stuff from git remote This has the dual benefits of making Remote.Git shorter, and letting Remote.GCrypt use these utilities.	2013-09-24 13:37:41 -04:00
Joey Hess	55636bf92f	list --allrepos	2013-09-19 21:42:03 -04:00
Joey Hess	006cf7976f	more completely solve catKey memory leak Done using a mode witness, which ensures it's fixed everywhere. Fixing catFileKey was a bear, because git cat-file does not provide a nice way to query for the mode of a file and there is no other efficient way to do it. Oh, for libgit2.. Note that I am looking at tree objects from HEAD, rather than the index. Because I cat-file cannot show a tree object for the index. So this fix is technically incomplete. The only cases where it matters are: 1. A new large file has been directly staged in git, but not committed. 2. A file that was committed to HEAD as a symlink has been staged directly in the index. This could be fixed a lot better using libgit2.	2013-09-19 16:41:21 -04:00
Joey Hess	eb42bde19a	sync, pre-commit, indirect: Avoid unnecessarily catting non-symlink files from git, which can be so large it runs out of memory.	2013-09-19 14:48:42 -04:00
Joey Hess	66b6a9cc4e	credit	2013-09-19 14:27:07 -04:00
Antoine Beaupré	f4e8b70bba	rename remotes to list	2013-09-19 14:16:55 -04:00
Joey Hess	9d366dc638	make --fast disable the numcopies stats Looking up the location log for every key is not the fastest operation..	2013-09-15 19:17:56 -04:00
Joey Hess	a3bbda5bed	status: In local mode, displays information about variance from configured numcopies levels.	2013-09-15 19:10:38 -04:00
Joey Hess	5c71dc1087	addurl: Fix quvi audodetection, broken in last release.	2013-09-15 16:37:03 -04:00
Joey Hess	2407170eaf	sync: Don't fail if the directory it is run in gets removed by the sync.	2013-09-13 14:55:55 -04:00
Joey Hess	65fe2314be	fsck: Fix detection and fixing of present direct mode files that are wrongly represented as standin symlinks on crippled filesystems.	2013-09-13 12:50:29 -04:00
Joey Hess	82759b6a5d	remotes: New command, displays a compact table of remotes that contain files. (Thanks, anarcat for display code and mastensg for inspiration.) Note that it would be possible to extend the display to show all repositories. But there can be a lot of repositories that are not set up as remotes, and it would significantly clutter the display to show them all. Since we're not showing all repositories, it's not worth trying to show numcopies count either. I decided to embrace these limitations and call the command remotes.	2013-09-12 12:21:21 -04:00
Joey Hess	b64f5baf2d	sync: support gcrypt	2013-09-09 10:02:15 -04:00
Joey Hess	ecbb326e9d	Allow building without quvi support.	2013-09-09 02:16:22 -04:00
Joey Hess	7c1a9cdeb9	partially complete gcrypt remote (local send done; rest not) This is a git-remote-gcrypt encrypted special remote. Only sending files in to the remote works, and only for local repositories. Most of the work so far has involved making initremote work. A particular problem is that remote setup in this case needs to generate its own uuid, derivied from the gcrypt-id. That required some larger changes in the code to support. For ssh remotes, this will probably just reuse Remote.Rsync's code, so should be easy enough. And for downloading from a web remote, I will need to factor out the part of Remote.Git that does that. One particular thing that will need work is supporting hot-swapping a local gcrypt remote. I think it needs to store the gcrypt-id in the git config of the local remote, so that it can check it every time, and compare with the cached annex-uuid for the remote. If there is a mismatch, it can change both the cached annex-uuid and the gcrypt-id. That should work, and I laid some groundwork for it by already reading the remote's config when it's local. (Also needed for other reasons.) This commit was sponsored by Daniel Callahan.	2013-09-07 18:38:00 -04:00
Joey Hess	4079f9cfe8	avoid double commit during transition The second commit had some bad refs which resulted in the race detection code running. But that commit was unnecessary anyway, it only was there to merge in the other refs.	2013-09-03 16:33:15 -04:00
Joey Hess	b51dffa46d	fix error propigating when unable to download feed item	2013-09-03 14:39:07 -04:00
Joey Hess	db83cc82d6	Merge branch 'forget' Conflicts: debian/changelog	2013-09-03 14:36:00 -04:00
Joey Hess	d1bacccff4	importfeed: Also ignore transient problems with downloading content from feeds.	2013-09-03 14:32:26 -04:00
Joey Hess	0831e18372	forget --drop-dead: Completely removes mentions of repositories that have been marked as dead from the git-annex branch. Wrote nice pure transition calculator, and ugly code to stage its results into the git-annex branch. Also had to split up several Log modules that Annex.Branch needed to use, but that themselves used Annex.Branch. The transition calculator is limited to looking at and changing one file at a time. While this made the implementation relatively easy, it precludes transitions that do stuff like deleting old url log files for keys that are being removed because they are no longer present anywhere.	2013-08-31 17:51:13 -04:00
Joey Hess	62beaa1a86	refactor git-annex branch log filename code into central location Having one module that knows about all the filenames used on the branch allows working back from an arbitrary filename to enough information about it to implement dropping dead remotes and doing other log file compacting as part of a forget transition.	2013-08-29 19:13:00 -04:00
Joey Hess	6cdac3a003	sync, assistant: Force push of the git-annex branch. Necessary to ensure it gets pushed to remotes after being rewritten by forget. See inline rationalles for why I think this is safe!	2013-08-29 14:27:53 -04:00
Joey Hess	4a915cd3cd	add forget command Works, more or less. --dead is not implemented, and so far a new branch is made, but keys no longer present anywhere are not scrubbed. git annex sync fails to push the synced/git-annex branch after a forget, because it's not a fast-forward of the existing synced branch. Could be fixed by making git-annex sync use assistant-style sync branches.	2013-08-28 16:41:13 -04:00
guilhem	f754779c02	Unused: bugfix Detect staged files that are not in the working tree.	2013-08-26 13:50:09 -04:00
Joey Hess	88e2618e38	fix reversion in unused The reversion was that, if a file was git rm'd, but still in branches, it would not be seen as used. Looking at both the added and the removed (or changed) files from the diff-index is a cheap way to fix that.	2013-08-26 00:19:19 -04:00
Joey Hess	36f5b10065	whitespace	2013-08-25 21:41:10 -04:00
Joey Hess	0963f92984	unnecessary do block	2013-08-25 21:38:01 -04:00
guilhem	f15fda60ed	Speed up the 'unused' command. Instead of populating the second-level Bloom filter with every key referenced in every Git reference, consider only those which differ from what's referenced in the index. Incidentaly, unlike with its old behavior, staged modifications/deletion/... will now be detected by 'unused'. Credits to joeyh for the algorithm. :-)	2013-08-25 21:02:13 -04:00
Joey Hess	824241b6fb	better cases	2013-08-22 23:44:13 -04:00
Joey Hess	46b6d75274	Youtube support! (And 53 other video hosts) When quvi is installed, git-annex addurl automatically uses it to detect when an page is a video, and downloads the video file. web special remote: Also support using quvi, for getting files, or checking if files exist in the web. This commit was sponsored by Mark Hepburn. Thanks!	2013-08-22 18:50:43 -04:00
Joey Hess	6fd2935a5a	unused: Pay attention to symlinks that are not yet staged in the index.	2013-08-22 10:20:03 -04:00
Joey Hess	0f921307e7	mirror: New command, makes two repositories contain the same set of files. This is a simple approach for setting up a mirroring repository. It will work with any type of remotes. Mirror --from is more expensive than mirror --to in general. OTOH, mirror --from will get the file from any remote that has it, not only the named mirror remote. And if the named mirror remote is not the fastest available remote with a file, that can speed things up. It would be possible to make the assistant or watch command do a more dynamic mirroring, that didn't need to scan every time.	2013-08-20 15:46:35 -04:00
Joey Hess	b46afa29ac	implement import --deduplicate and import --clean-duplicates Note that --deduplicate currently checksums each file twice, once to see if it's a known key, and once when importing it. Perhaps this could be revisited and the extra checksum gotten rid of, at the cost of not locking down the file when adding it.	2013-08-20 11:00:52 -04:00
Joey Hess	e240cb99f7	Merge branch 'duplicate' Conflicts: debian/changelog	2013-08-20 10:27:24 -04:00
Joey Hess	a6a047192e	sync, merge: Bug fix: Don't try to merge into master when in a bare repo.	2013-08-17 21:29:44 +02:00
Joey Hess	d69da2bf22	implement import --duplicate The other two options are harder, due to needing to get the key for a file before adding it.	2013-08-11 20:31:54 +02:00
Joey Hess	b28023cb52	importfeed: Fix handling of dots in extensions.	2013-08-03 02:36:38 -04:00
Joey Hess	24c8a6042b	importfeed: Ignores transient problems with feeds. Only exits nonzero when a feed has repeatedly had a problems for at least 1 day.	2013-08-03 01:40:21 -04:00
Joey Hess	dc3e0725f9	improve error message	2013-08-02 13:01:25 -04:00
Joey Hess	93f2371e09	get rid of __WINDOWS__, use mingw32_HOST_OS The latter is harder for me to remember, but avoids build failures in code used by the configure program.	2013-08-02 12:27:32 -04:00
Joey Hess	03c76b5a30	improve importfeed --force; try to match existing files to avoid unncessary duplication	2013-08-01 11:57:05 -04:00
Joey Hess	42ca8aaa61	importfeed --force: re-download urls that have been seen before	2013-07-31 12:19:00 -04:00
Joey Hess	9476355bc3	find: Avoid polluting stdout with progress messages. Closes: #718186	2013-07-30 20:24:27 -04:00
Joey Hess	ddd46db09a	Fix a few bugs involving filenames that are at or near the filesystem's maximum filename length limit. Started with a problem when running addurl on a really long url, because the whole url is munged into the filename. Ended up doing a fairly extensive review for places where filenames could get too large, although it's hard to say I'm not missed any.. Backend.Url had a 128 character limit, which is fine when the limit is 255, but not if it's a lot shorter on some systems. So check the pathconf() limit. Note that this could result in fromUrl creating different keys for the same url, if run on systems with different limits. I don't see this is likely to cause any problems. That can already happen when using addurl --fast, or if the content of an url changes. Both Command.AddUrl and Backend.Url assumed that urls don't contain a lot of multi-byte unicode, and would fail to truncate an url that did properly. A few places use a filename as the template to make a temp file. While that's nice in that the temp file name can be easily related back to the original filename, it could lead to `git annex add` failing to add a filename that was at or close to the maximum length. Note that in Command.Add.lockdown, the template is still derived from the filename, just with enough space left to turn it into a temp file. This is an important optimisation, because the assistant may lock down a bunch of files all at once, and using the same template for all of them would cause openTempFile to iterate through the same set of names, looking for an unused temp file. I'm not very happy with the relatedTemplate hack, but it avoids that slowdown. Backend.WORM does not limit the filename stored in the key. I have not tried to change that; so git annex add will fail on really long filenames when using the WORM backend. It seems better to preserve the invariant that a WORM key always contains the complete filename, since the filename is the only unique material in the key, other than mtime and size. Since nobody has complained about add failing (I think I saw it once?) on WORM, probably it's ok, or nobody but me uses it. There may be compatability problems if using git annex addurl --fast or the WORM backend on a system with the 255 limit and then trying to use that repo in a system with a smaller limit. I have not tried to deal with those. This commit was sponsored by Alexander Brem. Thanks!	2013-07-30 19:18:29 -04:00
Joey Hess	07a9910af7	improve comment	2013-07-28 20:15:20 -04:00
Joey Hess	ac08924ec3	fix bug in makeUnique Returned the possibly non-unique file	2013-07-28 20:14:13 -04:00
Joey Hess	8c55970413	better extension handling When there's no extension, don't use "none", but "". When there is an extension, it starts with a dot, so don't put a redundant dot in the default format.	2013-07-28 19:08:50 -04:00
Joey Hess	8c8488e01a	if a feed cannot be downloaded or has no enclosures, fail	2013-07-28 18:16:24 -04:00
Joey Hess	18541bf3fa	don't crash on encoding issues in feeds filesystem encoding to the rescue once more! IIRC this was the main bug in hpodder.	2013-07-28 17:24:30 -04:00
Joey Hess	66dfeaff44	show a side action when finding known urls	2013-07-28 17:19:21 -04:00
Joey Hess	7e66d260ea	importfeed: git-annex becomes a podcatcher in 150 LOC	2013-07-28 16:55:42 -04:00
Joey Hess	c6100aa5cc	unused: No longer shows as unused tmp files that are actively being transferred.	2013-07-25 19:51:08 -04:00
Joey Hess	822918089e	dropunused behavior change: Now refuses to drop the last copy of a file, unless you use the --force. This was the last place in git-annex that could remove data referred to by the git history, without being forced. Like drop, dropunused checks remotes, and honors the global annex.numcopies setting. (However, .gitattributes settings cannot apply to unused files.)	2013-07-25 19:50:44 -04:00
Joey Hess	5e3a404d4f	Support import in direct mode.	2013-07-22 20:18:00 -04:00
Joey Hess	f353f13c9d	Support unannex and uninit in direct mode. In direct mode, it's best to whenever possible not move direct mode files out of the way, and so I made unannex avoid touching the direct mode file at all. That actually turns out to be easy, because in direct mode, unlike indirect mode, the pre-commit hook won't get confused if the unannexed file later gets added back by git add. So there's no need to commit the unannex right away; it can be staged for the user to commit later. This also means that unannex in direct mode is a lot faster than in indirect mode! Another subtle bit is the bookkeeping that is done when unannexing a direct mode file. The inode cache needs to be removed so that when uninit runs getKeysPresent, it doesn't see the cache and think the key is still present and crash when it's not. This commit is sponsored by Douglas Butts. Thanks!	2013-07-22 17:28:53 -04:00
Joey Hess	3e422cb5fa	fix uninit to delete content from annex when it ended up hard linked back to the work tree	2013-07-18 13:30:12 -04:00
Joey Hess	1d7d3ac325	uninit: Preserve .git/annex/objects at the end, if it still has content, so that old versions of files and deleted files are not deleted. Print a message with some suggested actions.	2013-07-16 15:00:25 -04:00
Joey Hess	c936384164	fix: Preserve the original mtime of fixed symlinks.	2013-07-11 11:39:42 -04:00
Joey Hess	207c9f3c4a	dropunused, addunused: Complain when asked to operate on a number that does not correspond to any unused key.	2013-07-08 16:47:34 -04:00
Joey Hess	74ad3072e4	addurl --pathdepth: Fix failure when the pathdepth specified is deeper than the urls's path.	2013-07-05 12:46:38 -04:00
Joey Hess	7a7e426352	moved AssociatedFile definition	2013-07-04 02:36:02 -04:00
Joey Hess	980e9a15e0	merge: Now also merges synced/master or similar branches, which makes it useful to put in a post-receive hook to make a repository automatically update its working copy when git annex sync or the assistant sync with it.	2013-07-03 15:42:56 -04:00
Joey Hess	04d07f2c1f	--unused: New switch that makes git-annex operate on all data found by the last run of git annex unused. Supported by fsck, get, move, copy.	2013-07-03 15:26:59 -04:00
Joey Hess	b337a8b4c7	--all for get, move, and copy	2013-07-03 13:55:50 -04:00
Joey Hess	def7cb706f	Add --all option, and support it for fsck	2013-07-03 13:12:53 -04:00
Joey Hess	a35bdcb3f2	fsck: Ensures that direct mode is used for files when it's enabled. A common failure mode for direct mode has been for files to end up still stored in indirect mode. While I hope that doesn't happen anymore, fsck should deal with it.	2013-06-24 16:26:00 -04:00
Joey Hess	53d52d57c1	check in configure if ionice -c3 works On old systems, it may need to be run as root.	2013-06-21 13:43:04 -04:00
Joey Hess	d901ba1781	assistant --autostart: Automatically ionices the daemons it starts.	2013-06-21 13:23:20 -04:00
Joey Hess	bf72c2c7fe	make dead output consistent with other trust setting commands	2013-06-18 15:41:19 -04:00
Joey Hess	64f8819ae4	fix build	2013-06-17 21:30:52 -04:00
Joey Hess	9ef09587dc	fsck: Avoid getting confused by Windows path separators	2013-06-17 21:18:43 -04:00
Joey Hess	98be446d02	remove workaround for old bug that was only in one release It's causing some problem on windows, see http://git-annex.branchable.com/bugs/windows_port_-_repo_can__39__t_pull_newly_added_files_/#comment-45df9748bba687d95e3c96b3877ea925 And only affected WORM backend, and for one release well over a year ago, so could well be bitrotted.	2013-06-17 20:51:36 -04:00
Joey Hess	2844e7175e	status: No longer shows dead repositories. This is because people continually whine about it. Seemingly not aware that data generally cannot be deleted from git anyway.	2013-06-17 12:35:33 -04:00
Joey Hess	9666addfaa	sync: Better support for bare git remotes. Now pushes directly to the master branch on such a remote, instead of to synced/master. This makes it easier to clone from a bare git remote that has been populated with git annex sync or by the assistant.	2013-06-12 14:54:23 -04:00
Joey Hess	6dcf21db93	Direct mode: No longer temporarily remove write permission bit of files when adding them. This write permission frobbing is very appropriate in indirect mode, since annexed objects are stored as immutably as can be managed. But not in direct mode, where files should be able to be modified at any time. There are already sufficient guards that there's no need to prevent a file being written to while it's being ingested, in direct mode. The inode cache will detect (most) types of modifications, and the add will fail. Then a re-add should be done. The assistant should get another inotify change event, and automatically add the new version of the file.	2013-06-12 14:02:31 -04:00
Joey Hess	c46b263fde	Android: Make the "Open webapp" menu item open the just created repository when a new repo is made.	2013-06-10 23:55:53 -04:00
Joey Hess	a64106dcef	Supports indirect mode on encfs in paranoia mode, and other filesystems that do not support hard links, but do support symlinks and other POSIX filesystem features.	2013-06-10 13:11:33 -04:00
Joey Hess	92f036fcb4	avoid warnings when built with ghc 7.6	2013-06-02 15:01:58 -04:00
Joey Hess	91c9ae83f1	squash warning	2013-06-02 14:06:17 -04:00
Joey Hess	a48d340abd	Android: Work around Android devices where the `am` command doesn't work.	2013-05-31 21:30:21 -04:00
Joey Hess	cba2942cda	Revert "android dupped stderr workaround" This reverts commit `4cc803c733`. The stderr fd is also trashed after `am` fails to open the web browser.	2013-05-30 16:27:10 -04:00
Joey Hess	4cc803c733	android dupped stderr workaround Avoid using dupped stderr, since http://git-annex.branchable.com/bugs/warning_-_WebApp_crashed:___60__file_descriptor_15__62__:_hPutStr:_illegal_operation___40__handle_is_closed__41___on_Android/#comment-a24c73803fb10bd35afdc10d50e071c8 seems to involve that handle not being dupped originally, or perhaps getting closed when the web browser is started on Android. Using the dupped stdout is known to work before starting the web browser, so it should work after -- unless perhaps starting it closes both handles. In any case, there's no real need to write to stderr here.	2013-05-30 13:55:22 -04:00
Joey Hess	3e2d50a336	Android: Added an "Open WebApp" item to the terminal's menu. Should work for Android devices that cannot auto-open the webapp on start.	2013-05-28 18:25:27 -04:00
Joey Hess	f1cce62283	fix merge conflict resolution when both sides have the same key Still need to git rm the old file so git accepts the merge is resolved.	2013-05-26 18:32:11 -04:00
Joey Hess	2180068e30	correct recent fix fc37456d0fe1fb0fd3e33338223977b3e7a940bb's fix caused it to try to stage a symlink in .git/annex/tmp, oops	2013-05-26 18:10:07 -04:00
Joey Hess	919a7d7316	sync: Fix double merge conflict resolution handling. Ie, when there'a a conflicted merge we may get foo.variant-xxxx created in a merge. If a second merge conflict occurs on that new file, it was not falling back to putting in the whole key (which should stop the merge conflicts happening for good, but is ugly).	2013-05-26 17:42:15 -04:00
Joey Hess	469b3859fc	reduce the amount of subdirectories created by the fuzz tester to saner limit	2013-05-26 16:15:25 -04:00
Joey Hess	9978269b55	make fuzztest honor annex.diskreserve	2013-05-26 16:04:52 -04:00

1 2 3 4 5 ...

1015 commits