git-annex

Author	SHA1	Message	Date
Joey Hess	f62114e5ad	Merge branch 'remove-esqueleto'	2018-11-20 11:50:04 -04:00
Joey Hess	d65df7ab21	improve messages around export conflicts When an export conflict prevents accessing a special remote, be clearer about what the problem is and how to resolve it. This commit was sponsored by Trenton Cronholm on Patreon.	2018-11-13 15:50:06 -04:00
Joey Hess	635b0b1df6	Merge remote-tracking branch 'seanparsons/remove-esqueleto' into remove-esqueleto	2018-11-10 12:03:52 -04:00
Joey Hess	0db653f12c	remove unused BangPatterns	2018-11-09 13:09:26 -04:00
Joey Hess	f78f97780c	Fix build with persistent-sqlite older than 2.6.3. This commit was sponsored by Jack Hill on Patreon.	2018-11-09 13:09:02 -04:00
Sean Parsons	ef5a735d47	Fixed a not equal condition in addAssociatedFile.	2018-11-06 22:50:00 +00:00
Sean Parsons	42bdc9fa2f	Removed Esqueleto as a dependency.	2018-11-06 22:18:55 +00:00
Joey Hess	6d8b8a3275	remove unused import	2018-10-31 12:58:53 -04:00
Joey Hess	fcc9eea554	avoid closeDb opening the db if it's not already open	2018-10-30 22:19:05 -04:00
Joey Hess	1428568554	retry when sqlite throws ErrorIO I suspect this may be due to SQLITE_IOERR_SHORT_READ, but have not verified. I was able to reproduce it on Linux after running the test suite in a loop for 1-3 hours until it failed. The WAL mode entry change in `3963c5fcf5` may have hidden the problem I was seeing; I have not seen an ErrorIO since then.	2018-10-30 18:06:38 -04:00
Joey Hess	3963c5fcf5	better approach to enabling WAL mode The old approach opened the database an extra time to enable WAL mode, but more recent persistent-sqlite has a better API to enable it.	2018-10-30 13:47:38 -04:00
Joey Hess	a89db2c604	link to bug report blob delete from tree	2018-10-30 12:07:38 -04:00
Joey Hess	57107cf213	sqlite bug report that the developers never responded to Adding to git-annex's history so it doesn't get lost; the gmane archive of it is long gone.	2018-10-30 12:04:28 -04:00
Joey Hess	718915e9fc	improve comments	2018-10-30 11:52:05 -04:00
Joey Hess	def5d8b02c	Fix potential crash in exporttree database due to failure to honor uniqueness constraint I don't know the circumstances, but have a report of this: git-annex: failed to commit changes to sqlite database: Just SQLite3 returned ErrorConstraint while attempting to perform step. All 3 tables in the export db have uniqueness constraints on them, insertUnique is used for all the rest, but this use of insertMany means it doesn't check the constraint. I guess that's what caused the crash, but I have not been able to test it yet. Use putMany when available, as it should be faster than mapM of insertMany. This commit was sponsored by Brock Spratlen on Patreon.	2018-10-09 16:56:33 -04:00
Joey Hess	b8ed97f5d8	add missing space	2018-10-09 16:32:59 -04:00
Joey Hess	a7f0b99a33	fix v6 deadlock with git 2.1.4 I don't know why git diff --raw would run the clean filter, but it did with this version of git. Perhaps it is cleaning the file to generate the diff to search with -G? But then why would newer gits not run the clean filter? It caused git annex to deadlock because the keys database was locked and ran a git command that ran git-annex, which tried to read from the keys database. This commit was sponsored by Brett Eisenberg on Patreon.	2018-09-13 13:55:25 -04:00
Joey Hess	50fa17aee6	v6: recover from race between git mv and git-annex get/drop Update pointer file next time reconcileStaged is run to recover from the race. Note that restagePointerFile causes git to run the clean filter, and that will run reconcileStaged. So, normally by the time the git annex get/drop command finishes, the race has already been dealt with. It may be that, in some case, that won't happen and the race will be dealt with at a later point. git-annex could run reconcileStaged at shutdown if that becomes a problem. This does not handle the situation where the git mv is committed before git-annex gets a chance to run again. git commit does run the clean filter, and that happens to re-inject the content if it was supposed to be dropped but is still populated. But, the case where the file was supposed to be gotten but is not populated is not handled yet. This commit was supported by the NSF-funded DataLad project.	2018-08-22 15:56:43 -04:00
Joey Hess	18ecf41917	avoid running reconcileStaged when the index has not changed This commit was supported by the NSF-funded DataLad project.	2018-08-22 13:04:12 -04:00
Joey Hess	5e56d9b620	v6: Update associated files database when git has staged changes to pointer files This commit was supported by the NSF-funded DataLad project.	2018-08-21 17:02:20 -04:00
Joey Hess	710d6a35ed	fix build with old version of persistent	2017-09-25 09:57:41 -04:00
Joey Hess	129418615b	refactor	2017-09-20 16:22:32 -04:00
Joey Hess	5f9eff3f32	fix bug that prevented db being written to disk in SingleWriter mode The bug occurred when closeDb was not called, and garbage collection of the DbHandle didn't give the workerThread time to shut down. Fixed by exiting the runSqlite action when a commit is made. (MultiWriter mode already forked off a runSqlite action, so avoided the problem.) This commit was sponsored by Brock Spratlen on Patreon.	2017-09-18 19:42:20 -04:00
Joey Hess	f4be3c3f89	merge changes made on other repos into ExportTree Now when one repository has exported a tree, another repository can get files from the export, after syncing. There's a bug: While the database update works, somehow the database on disk does not get updated, and so the database update is run the next time, etc. Wasn't able to figure out why yet. This commit was sponsored by Ole-Morten Duesund on Patreon.	2017-09-18 19:21:41 -04:00
Joey Hess	0ad7e36dc1	update ExportTree table efficiently Use same diff and key lookup except when the whole tree has to be scanned. This commit was sponsored by Peter Hogg on Patreon.	2017-09-18 14:27:50 -04:00
Joey Hess	b03d77c211	add ExportTree table to export db New table needed to look up what filenames are used in the currently exported tree, for reasons explained in export.mdwn. Also, added smart constructors for ExportLocation and ExportDirectory to make sure they contain filepaths with the right direction slashes. And some code refactoring. This commit was sponsored by Francois Marier on Patreon.	2017-09-18 13:59:59 -04:00
Joey Hess	e1f5c90c92	split out Types.Export	2017-09-15 16:46:03 -04:00
Joey Hess	c633144d28	remove empty directories when removing from export The subtle part of this is what happens when the remote fails to remove an empty directory. The removal from the export needs to fail in that case, so the removal will be tried again later. However, removeExportLocation has already been run and changed the export db, so if the next run checks getExportLocation, it might decide nothing remains to be done, leaving the empty directory. Dealt with that by making removeEmptyDirectories, handle a failure by calling addExportLocation, reverting the database changes so the next run will be guaranteed to try deleting the empty directory again. This commit was sponsored by Thomas Hochstein on Patreon.	2017-09-15 15:22:53 -04:00
Joey Hess	e223cf568f	add table to keep track of what subdirectories are populated in the export So empty subdirectories can be identified and removed. This commit was sponsored by Jochen Bartl on Patreon.	2017-09-15 14:35:22 -04:00
Joey Hess	6ab14710fc	fix consistency bug reading from export database The export database has writes made to it and then expects to read back the same data immediately. But, the way that Database.Handle does writes, in order to support multiple writers, makes that not work, due to caching issues. This resulted in export re-uploading files it had already successfully renamed into place. Fixed by allowing databases to be opened in MultiWriter or SingleWriter mode. The export database only needs to support a single writer; it does not make sense for multiple exports to run at the same time to the same special remote. All other databases still use MultiWriter mode. And by inspection, nothing else in git-annex seems to be relying on being able to immediately query for changes that were just written to the database. This commit was supported by the NSF-funded DataLad project.	2017-09-06 17:19:07 -04:00
Joey Hess	4da763439b	use export db to correctly handle duplicate files Removed uncorrect UniqueKey key in db schema; a key can appear multiple times with different files. The database has to be flushed after each removal. But when adding files to the export, lots of changes are able to be queued up w/o flushing. So it's still fairly efficient. If large removals of files from exports are too slow, an alternative would be to make two passes over the diff, one pass queueing deletions from the database, then a flush and the a second pass updating the location log. But that would use more memory, and need to look up exportKey twice per removed file, so I've avoided such optimisation yet. This commit was supported by the NSF-funded DataLad project.	2017-09-04 14:39:32 -04:00
Joey Hess	2c90ed1fea	flush queued changes to export db on exit	2017-09-04 14:00:54 -04:00
Joey Hess	7eb9889bfd	track exported files in a sqlite database Went with a separate db per export remote, rather than a single export database. Mostly because there will probably not be a lot of separate export remotes, and it might be convenient to be able to delete a given remote's export database. This commit was supported by the NSF-funded DataLad project.	2017-09-04 13:53:08 -04:00
Joey Hess	ca0daa8bb8	factor non-type stuff out of Key	2017-02-24 13:42:30 -04:00
Joey Hess	3b22ad9f47	Work around sqlite's incorrect handling of umask when creating databases. Refactored some common code into initDb. This only deals with the problem when creating new databases. If a repo got bad permissions into it, it's up to the user to deal with it. This commit was sponsored by Ole-Morten Duesund on Patreon.	2017-02-13 17:39:16 -04:00
Joey Hess	23d71423e1	work around ghc segfault hSetEncoding of a closed handle segfaults. https://ghc.haskell.org/trac/ghc/ticket/7161 `8484c0c197` introduced the crash. In particular, stdin may get closed (by eg, getContents) and then trying to set its encoding will crash. We didn't need to adjust stdin's encoding anyway, but only stderr, to work around https://github.com/yesodweb/persistent/issues/474 Thanks to Mesar Hameed for assistance related to reproducing this bug.	2016-12-30 18:14:19 -04:00
Joey Hess	8484c0c197	Always use filesystem encoding for all file and handle reads and writes. This is a big scary change. I have convinced myself it should be safe. I hope!	2016-12-24 14:46:31 -04:00
Joey Hess	0a4479b8ec	Avoid backtraces on expected failures when built with ghc 8; only use backtraces for unexpected errors. ghc 8 added backtraces on uncaught errors. This is great, but git-annex was using error in many places for a error message targeted at the user, in some known problem case. A backtrace only confuses such a message, so omit it. Notably, commands like git annex drop that failed due to eg, numcopies, used to use error, so had a backtrace. This commit was sponsored by Ethan Aubin.	2016-11-15 21:29:54 -04:00
Joey Hess	206dab8b44	remove unused imports	2016-10-18 16:16:47 -04:00
Joey Hess	148bd0dbfd	refactor	2016-10-17 14:58:33 -04:00
Joey Hess	e34046de38	slightly more efficient checking of versionUsesKeysDatabase It's a mvar lookup either way, but I think this way will be slightly more efficient. And it reduces the number of places where it's checked to 1.	2016-07-19 14:02:49 -04:00
Joey Hess	2619019630	Avoid any access to keys database in v5 mode repositories, which are not supposed to use that database.	2016-07-19 12:12:19 -04:00
Joey Hess	5f0b551c0c	assistant: Fix race in v6 mode that caused downloaded file content to sometimes not replace pointer files. The keys database handle needs to be closed after merging, because the smudge filter, in another process, updates the database. Old cached info can be read for a while from the open database handle; closing it ensures that the info written by the smudge filter is available. This is pretty horribly ad-hoc, and it's especially nasty that the transferrer closes the database every time.	2016-05-16 14:49:12 -04:00
Joey Hess	d05a75e45a	fix bug in unlocked file scanner that skipped over executable unlocked files	2016-04-14 13:07:46 -04:00
Joey Hess	6702f5a9c6	fix typo	2016-02-14 19:31:40 -04:00
Joey Hess	49215d68ae	devblog	2016-02-14 18:01:35 -04:00
Joey Hess	cf260d9a15	Fix storing of filenames of v6 unlocked files when the filename is not representable in the current locale. This is a mostly backwards compatable change. I broke backwards compatability in the case where a filename starts with double-quote. That seems likely to be very rare, and v6 unlocked files are a new feature anyway, and fsck needs to fix missing associated file mappings anyway. So, I decided that is good enough. The encoding used is to just show the String when it contains a problem character. While that adds some overhead to addAssociatedFile and removeAssociatedFile, those are not called very often. This approach has minimal decode overhead, because most filenames won't be encoded that way, and it only has to look for the leading double-quote to skip the expensive read. So, getAssociatedFiles remains fast. I did consider using ByteString instead, but getting a FilePath converted with all chars intact, even surrigates, is difficult, and it looks like instance PersistField ByteString uses Text, which I don't trust for problem encoded data. It would probably be slower too, and it would make the database less easy to inspect manually.	2016-02-14 16:37:25 -04:00
Joey Hess	9df13e73ae	if keys database cannot be opened due to permissions, ignore This lets readonly repos be used. If a repo is readonly, we can ignore the keys database, because nothing that we can do will change the state of the repo anyway.	2016-02-12 14:16:35 -04:00
Joey Hess	737e45156e	remove 163 lines of code without changing anything except imports	2016-01-20 16:36:33 -04:00
Joey Hess	927e1a067e	fix import warnings	2016-01-14 10:30:54 -04:00

1 2 3

101 commits