git-annex

Author	SHA1	Message	Date
Joey Hess	cae3704a44	export file renaming This is seriously super hairy. It has to handle interrupted exports, which may be resumed with the same or a different tree. It also has to recover from export conflicts, which could cause the wrong content to be renamed to a file. I think this works, or is close to working. See the update to the design for how it works. This is definitely not optimal, in that it does more renames than are necessary. It would probably be worth finding the keys that are really renamed and only renaming those. But let's get the "simple" approach to work first.. This commit was supported by the NSF-funded DataLad project.	2017-09-06 15:44:10 -04:00
Joey Hess	0fa948b402	record incomplete exports in export.log Not yet used, but essential for resuming cleanly. Note that, in normmal operation, only one commit is made to export.log during an export; the incomplete version only gets to the journal and is then overwritten. This commit was supported by the NSF-funded DataLad project.	2017-09-06 13:45:03 -04:00
Joey Hess	1ec3a9eb05	thoughts on handling renames efficiently This gets complicated, but I think this design will work! This commit was supported by the NSF-funded DataLad project.	2017-09-06 13:04:09 -04:00
Joey Hess	8918b7ab09	Merge branch 'master' of ssh://git-annex.branchable.com	2017-09-06 12:26:18 -04:00
Edward Betts	c1b9f718bc	move line break to fix broken link	2017-09-06 11:25:06 -04:00
Joey Hess	fd8392b669	update	2017-09-06 11:23:04 -04:00
karel-de-macil	9a2e687b0d		2017-09-06 09:20:26 +00:00
yarikoptic	3e7d0e0de7	Added datalad "super-dataset".	2017-09-05 17:00:38 +00:00
EskildHustvedt	8755f320f5	removed	2017-09-05 09:17:44 +00:00
EskildHustvedt	70ecf52888	Added a comment: Partial exports	2017-09-05 09:16:59 +00:00
EskildHustvedt	5e15956225	Added a comment: Partial exports	2017-09-05 09:16:26 +00:00
eacousineau	b8b7a9a902		2017-09-05 01:22:19 +00:00
Joey Hess	c7af16eb3a	Merge branch 'master' of ssh://git-annex.branchable.com	2017-09-04 17:03:20 -04:00
Joey Hess	fa4defc9d7	devblog	2017-09-04 17:02:30 -04:00
Joey Hess	a1cc9ec0fd	add export infication to git-annex info	2017-09-04 17:01:38 -04:00
Joey Hess	662f2a5ee7	git annex get from exports Straightforward enough, except for the needed belt-and-suspenders sanity checks to avoid foot shooting due to exports not being key/value stores. * Even when annex.verify=false, always verify from exports. * Only get files from exports that use a backend that supports checksum verification. * Never trust exports, even if the user says to, because then `git annex drop` would drop content if the export seemed to contain a copy. This commit was supported by the NSF-funded DataLad project.	2017-09-04 16:39:56 -04:00
Joey Hess	4da763439b	use export db to correctly handle duplicate files Removed uncorrect UniqueKey key in db schema; a key can appear multiple times with different files. The database has to be flushed after each removal. But when adding files to the export, lots of changes are able to be queued up w/o flushing. So it's still fairly efficient. If large removals of files from exports are too slow, an alternative would be to make two passes over the diff, one pass queueing deletions from the database, then a flush and the a second pass updating the location log. But that would use more memory, and need to look up exportKey twice per removed file, so I've avoided such optimisation yet. This commit was supported by the NSF-funded DataLad project.	2017-09-04 14:39:32 -04:00
Joey Hess	656797b4e8	update for export	2017-09-04 14:25:00 -04:00
Joey Hess	2c90ed1fea	flush queued changes to export db on exit	2017-09-04 14:00:54 -04:00
Joey Hess	42eaa340fe	remove some backtraces on user errors	2017-09-04 13:55:49 -04:00
Joey Hess	7eb9889bfd	track exported files in a sqlite database Went with a separate db per export remote, rather than a single export database. Mostly because there will probably not be a lot of separate export remotes, and it might be convenient to be able to delete a given remote's export database. This commit was supported by the NSF-funded DataLad project.	2017-09-04 13:53:08 -04:00
Joey Hess	28e2cad849	implement exporttree=yes configuration * Only export to remotes that were initialized to support it. * Prevent storing key/value on export remotes. * Prevent enabling exporttree=yes and encryption in the same remote. SetupStage Enable was changed to take the old RemoteConfig. This allowed only setting exporttree when initially setting up a remote, and not configuring it later after stuff might already be stored in the remote. Went with =yes rather than =true for consistency with other parts of git-annex. Changed docs accordingly. This commit was supported by the NSF-funded DataLad project.	2017-09-04 13:09:38 -04:00
vgp	28635f0190	Added a comment	2017-09-01 21:40:11 +00:00
Joey Hess	a4328b49d2	refactor ExportActions This will allow disabling exports for remotes that are not configured to allow them. Also, exportSupported will be useful for the external special remote to probe. This commit was supported by the NSF-funded DataLad project	2017-09-01 13:05:09 -04:00
Joey Hess	f19a45973a	devblog	2017-08-31 18:14:04 -04:00
Joey Hess	5483ea90ec	graft exported tree into git-annex branch So it will be available later and elsewhere, even after GC. I first though to use git update-index to do this, but feeding it a line with a tree object seems to always cause it to generate a git subtree merge. So, fell back to using the Git.Tree interface to maniupulate the trees, and not involving the git-annex branch index file at all. This commit was sponsored by Andreas Karlsson.	2017-08-31 18:06:49 -04:00
Joey Hess	978885247e	implement export.log and resolve export conflicts Incremental export updates work now too. This commit was sponsored by Anthony DeRobertis on Patreon.	2017-08-31 15:47:23 -04:00
Joey Hess	bb08b1abd2	make storeExport atomic This avoids needing to deal with the complexity of partially transferred files in the export. We'd not be able to resume uploading to such a file anyway, so just avoid them. The implementation in Remote.Directory is not completely ideal, because it could leave the temp file hanging around in the export directory. This only happens if it's killed with -9, or there's a power failure; normally viaTmp cleans up after itself, even when interrupted. I could not see a better way to do it though, since the export directory might be the root of a filesystem. Also some design thoughts on resuming, which depend on storeExport being atomic. This commit was sponsored by Fernando Jimenez on Partreon.	2017-08-31 14:24:32 -04:00
Joey Hess	7c7af82578	resuming exports Make a pass over the whole exported tree, and upload anything that has not yet reached the export. Update location log when exporting. Note that the synthesized keys for non-annexed files are stored in the location log too. Some cases involving files in the tree with the same content are not handled correctly yet. This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.	2017-08-31 13:33:50 -04:00
Joey Hess	e662aceeac	improve type	2017-08-31 12:47:08 -04:00
Joey Hess	4694e49158	fix error message when content to export is not locally available	2017-08-31 12:39:10 -04:00
Joey Hess	efe3910c04	remove empty parent dirs when removing from export	2017-08-31 12:32:02 -04:00
Joey Hess	943de657b8	Merge branch 'master' into export	2017-08-31 12:16:22 -04:00
Joey Hess	bdec46ac13	a few tweaks to the design	2017-08-30 13:14:05 -04:00
yarikoptic	b14c4776d6	initial bug report	2017-08-30 14:15:45 +00:00
vgp	71682954f8	Added a comment	2017-08-30 12:42:23 +00:00
Joey Hess	474dd3ae6d	Merge branch 'master' of ssh://git-annex.branchable.com	2017-08-29 17:27:37 -04:00
Joey Hess	74aa4c503b	devblog	2017-08-29 17:26:42 -04:00
Joey Hess	9f3630f4e0	initial export command Very basic operation works, but of course this is only the beginning. This commit was sponsored by Nick Daly on Patreon.	2017-08-29 15:10:01 -04:00
Joey Hess	cca2764f91	provide file with content to export Rather than providing the key to export, provide the file. When exporting a treeish that contains files that are not annexed, this will let the content of those files also be exported. There's still a Key in the interface; it will be used by the external special remote protocol. A SHA1 key can be used when exporting non-annexed files. This commit was sponsored by Brock Spratlen on Patreon.	2017-08-29 13:57:42 -04:00
Joey Hess	8f35c6584d	documentation for export This commit was sponsored by Ole-Morten Duesund on Patreon.	2017-08-29 13:25:48 -04:00
Joey Hess	e55e445a36	add API for exporting Implemented so far for the directory special remote. Several remotes don't make sense to export to. Regular Git remotes, obviously, do not. Bup remotes almost certianly do not, since bup would need to be used to extract the export; same store for Ddar. Web and Bittorrent are download-only. GCrypt is always encrypted so exporting to it would be pointless. There's probably no point complicating the Hook remotes with exporting at this point. External, S3, Glacier, WebDAV, Rsync, and possibly Tahoe should be modified to support export. Thought about trying to reuse the storeKey/retrieveKeyFile/removeKey interface, rather than adding a new interface. But, it seemed better to keep it separate, to avoid a complicated interface that sometimes encrypts/chunks key/value storage and sometimes users non-key/value storage. Any common parts can be factored out. Note that storeExport is not atomic. doc/design/exporting_trees_to_special_remotes.mdwn has some things in the "resuming exports" section that bear on this decision. Basically, I don't think, at this time, that an atomic storeExport would help with resuming, because exports are not key/value storage, and we can't be sure that a partially uploaded file is the same content we're currently trying to export. Also, note that ExportLocation will always use unix path separators. This is important, because users may export from a mix of windows and unix, and it avoids complicating the API with path conversions, and ensures that in such a mix, they always use the same locations for exports. This commit was sponsored by Bruno BEAUFILS on Patreon.	2017-08-29 13:00:41 -04:00
supernaught	15601f2b66	Added a comment	2017-08-28 22:01:23 +00:00
Joey Hess	6ae9d8fe49	simplify Key is needed to use in reply	2017-08-28 15:37:34 -04:00
Joey Hess	ed5d8ee9ea	update proposed external special remote protocol	2017-08-28 15:34:26 -04:00
Joey Hess	792e582a60	fix link	2017-08-28 15:07:23 -04:00
Joey Hess	92ec2d13b5	formatting	2017-08-28 15:07:19 -04:00
Joey Hess	8cad03d7ca	typo	2017-08-28 15:04:25 -04:00
Joey Hess	5c99131b7b	comment	2017-08-28 13:49:16 -04:00
Joey Hess	4e5b3062d9	response	2017-08-28 13:41:19 -04:00

... 4 5 6 7 8 ...

30811 commits