git-annex

Author	SHA1	Message	Date
Joey Hess	78da00c7a6	Future proof activity log parsing When the log has an activity that is not known, eg added by a future version of git-annex, it used to be treated as no activity at all, which would make git-annex expire think it should expire the repository, despite it having some kind of recent activity. Hopefully there will be no reason to add a new activity until enough time has passed that this commit is in use everywhere. Sponsored-by: Jake Vosloo on Patreon	2021-06-14 14:18:19 -04:00
yarikoptic	6043a2c7a0	Added a comment	2021-06-14 17:36:16 +00:00
james@06209b7878fcf3b5c46b8028dacb3cec6609369c	9d34a9d013		2021-06-14 17:19:50 +00:00
Joey Hess	372ace599a	comment	2021-06-14 13:13:46 -04:00
Joey Hess	f0cbaa194c	improve docs based on forum feedback	2021-06-14 13:04:58 -04:00
Joey Hess	fbd2f96b2c	comment	2021-06-14 12:56:29 -04:00
Joey Hess	dcd2c95249	fix windows build	2021-06-14 12:43:26 -04:00
Joey Hess	3ac9363c03	comment	2021-06-14 12:42:11 -04:00
Joey Hess	014dc63a55	avoid sometimes expensive operations when annex.supportunlocked = false This will mostly just avoid a DB lookup, so things get marginally faster. But in cases where there are many files using the same key, it can be a more significant speedup. Added overhead is one MVar lookup per call, which should be small enough, since this happens after transferring or ingesting a file, which is always a lot more work than that. It would be nice, though, to move getGitConfig to AnnexRead, which there is an open todo about.	2021-06-14 12:40:41 -04:00
Joey Hess	a02b5c2904	response	2021-06-14 12:36:42 -04:00
yarikoptic	51fede57a2	Added a comment	2021-06-14 16:23:41 +00:00
Ilya_Shlyakhter	35afd58a76	Added a comment: git-annex-add slowdown	2021-06-14 16:00:44 +00:00
Joey Hess	c4f1465a81	check symlink before reading file This is faster because when multiple files are in a directory, it gets cached.	2021-06-14 11:53:51 -04:00
Joey Hess	4163344ed6	retitle	2021-06-14 11:44:55 -04:00
Joey Hess	0eff5a3f71	reproduced	2021-06-14 11:37:21 -04:00
Joey Hess	26a9ea12d1	handle edge case of symlink to something that is not really a pointer file That seems very unlikely to happen, but still, it's possible it could. And with the recent addition of locked files to the keys db, this could be called by places that did not call it before, so it seems even more important it's correct. Adds an extra stat of the file, and is potentially racy, but both problems are fixed by the unix-2.8.0 path. I have not tested that path builds because that package is not yet released and it would be difficult to install it since it's tightly tied to a ghc version.	2021-06-14 11:35:52 -04:00
Joey Hess	673b2feaf3	rename for clarity Associated files are recorded now also for locked files, but this is only needed to populate unlocked files.	2021-06-14 10:55:24 -04:00
yarikoptic	8f66f73fea	Added a comment	2021-06-09 22:28:06 +00:00
yarikoptic	e30f973323	Added a comment: more "mystery resolved" -- identical (empty) keys	2021-06-09 21:00:34 +00:00
Joey Hess	4b09b93a18	Merge branch 'master' of ssh://git-annex.branchable.com	2021-06-09 15:38:58 -04:00
Joey Hess	fad281767a	comment	2021-06-09 15:38:55 -04:00
yarikoptic	714d9f1315	Added a comment	2021-06-08 22:02:34 +00:00
yarikoptic	a8fb61329d	Added a comment	2021-06-08 21:58:20 +00:00
yarikoptic	3985ae3224	Added a comment: OSX mystery resolved. add --batch is effective mitigation	2021-06-08 21:56:53 +00:00
Joey Hess	6cb9113ff5	comments	2021-06-08 17:38:56 -04:00
yarikoptic	c3993a2655	Added a comment	2021-06-08 20:23:09 +00:00
yarikoptic	437d9366b7	Added a comment: getting closer...	2021-06-08 19:21:59 +00:00
Ilya_Shlyakhter	be4a029e1b	Added a comment	2021-06-08 19:08:01 +00:00
jenkin.schibel@286264d9ceb79998aecff0d5d1a4ffe34f8b8421	be173f213d		2021-06-08 18:40:09 +00:00
jenkin.schibel@286264d9ceb79998aecff0d5d1a4ffe34f8b8421	e4cf6cc306	removed	2021-06-08 18:26:30 +00:00
Joey Hess	530c957c3e	Merge branch 'master' of ssh://git-annex.branchable.com	2021-06-08 12:52:08 -04:00
yarikoptic	697921ecd8	Added a comment: all recent builds/logs are fetched to smaug	2021-06-08 16:50:12 +00:00
Joey Hess	7b6deb1109	display scanning message whenever reconcileStaged has enough files to chew on Clear visible progress bar first. Removed showSideActionAfter because it can't be used in reconcileStaged (import loop). Instead, it counts the number of files it processes and displays it after it's seen a sufficient number to know it's taking a while. Sponsored-by: Dartmouth College's Datalad project	2021-06-08 12:48:30 -04:00
Joey Hess	ecbaa52571	clarification	2021-06-08 12:00:01 -04:00
Joey Hess	1a6fa5abc8	add debugging for reconcileStaged calls for benchmarking	2021-06-08 11:57:23 -04:00
Joey Hess	13b9a288d3	scanAnnexedFiles in smudge --update This makes git checkout and git merge hooks do the work to catch up with changes that they made to the tree. Rather than doing it at some later point when the user is not thinking about that past operation. Sponsored-by: Dartmouth College's Datalad project	2021-06-08 11:37:47 -04:00
Joey Hess	c380687aa3	Merge branch 'master' of ssh://git-annex.branchable.com	2021-06-08 11:13:09 -04:00
Joey Hess	7f742589f9	claw back annexed file scan speedup Following commit `c941ab6f5b`, this avoids the second, redundant scan when annex.thin is not set. The benchmark now runs in 35.5 seconds, down from 40 seconds. Note that the inode cache of the annex object has to be passed to addInodeCaches now, because it might not already be in the inode caches, unlike previously. Sponsored-by: Dartmouth College's Datalad project	2021-06-08 11:09:15 -04:00
Joey Hess	ec1f2f246b	improve comment remove obsolete part about a commit preventing it seeing changes	2021-06-08 10:43:48 -04:00
yarikoptic	62758ffb9f	Added a comment: slow down is OSX specific	2021-06-08 14:28:18 +00:00
Joey Hess	d12120739d	comment	2021-06-08 10:19:04 -04:00
Joey Hess	2125367f3f	Merge branch 'master' of ssh://git-annex.branchable.com	2021-06-08 09:42:57 -04:00
Joey Hess	c941ab6f5b	avoid double work in git-annex init, second try reconcileStaged populates the db, so scanAnnexedFiles does not need to do it again. It still makes a pass over the HEAD tree, but populating the db was most of the expensive part. Benchmarking with 100,000 files, git-annex init now takes 40 seconds, vs 37 seconds with the old, buggy version of this fix. It should be possible to win those 3 precious seconds per 100k files back, in the case when annex.thin is not set, with improvements to reconcileStaged that avoid needing this second pass. Sponsored-by: Dartmouth College's Datalad project	2021-06-08 09:36:53 -04:00
Joey Hess	22185b4a4e	stop using addAssociatedFileFast Use addAssociatedFile instead, after recent optimisations it seems just as fast.	2021-06-08 09:23:28 -04:00
Joey Hess	2cb7b7b336	Revert "avoid double work in git-annex init" This reverts commit `0f10f208a7`. The implementation of this turns out to be unsafe; it can lead to a keys db deadlock. scanAnnexedFiles injects a call to inAnnex into reconcileStaged, but inAnnex sometimes needs to read from the keys db, which will try to re-open it when it's in the process of being opened. The exclusive lock of gitAnnexKeysDbLock will then deadlock. This needs to be done in some other way...	2021-06-08 09:11:24 -04:00
Joey Hess	c831a562f5	faster associated file replacement with upsert Rather than first deleting and then inserting, upsert lets the key associated with a file be updated in place. Benchmarked with 100,000 files, and an empty keys database, running reconcileStaged. It improved from 47 seconds to 34 seconds. So this got reconcileStaged to be as fast as scanAssociatedFiles, or faster -- scanAssociatedFiles benchmarks at 37 seconds. (Also checked for other users of deleteWhere that could be sped up by upsert. There are a couple, but they are not in performance critical code paths, eg recordExportTreeCurrent is only run once per tree export.) I would have liked to rename FileKeyIndex to FileKeyUnique since it is being used as a uniqueness constraint now, not just to get an index. But, that gets converted into part of the SQL schema, and the name is used by the upsert, so it can't be changed. Sponsored-by: Dartmouth College's Datalad project	2021-06-08 07:53:36 -04:00
yarikoptic	57b567ac87	Added a comment	2021-06-07 21:39:05 +00:00
yarikoptic	2ffb9cc01b	Added a comment: clarification	2021-06-07 21:20:35 +00:00
Joey Hess	e9a8b48a52	Merge branch 'master' of ssh://git-annex.branchable.com	2021-06-07 17:02:15 -04:00
Joey Hess	2467de4f9b	todo	2021-06-07 16:58:35 -04:00

40132 commits