git-annex/Database
Joey Hess 6a9e923c74
fix handling of linked worktrees on filesystems w/o symlinks
Fix bug in handling of linked worktrees on filesystems not supporting
symlinks, that caused annexed file content to be stored in the wrong
location inside the git directory, and also caused pointer files to not get
populated.

This parameterizes functions in Annex.Locations with a GitLocationMaker.
The uses of standardGitLocationMaker are in cases where the path returned
by a function should not change when in a linked worktree. For example,
gitAnnexLink uses standardGitLocationMaker because symlink targets should
always be to ".git/annex/objects" paths, even when in a linked worktree.
Hopefully I have gotten all uses of standardGitLocationMaker right.

This also assumes that all path construction to the annex directory
is done via the functions in Annex.Locations, and there is no other,
ad-hoc construction elsewhere. Thankfully, Annex.Locations has been around
since the beginning, and has been used consistently. I think.

---

In fixupUnusualRepos, when symlinks are supported, the .git file is replaced
with a symlink to the linked worktree git directory. And in that directory,
an "annex" symlink points to the main annex directory. In that case,
it's not necessary to set mainWorkTreePath. It would be ok to set it,
but not setting it in that case allows an optimisation of avoiding reading
the "commondir" file.

The change to make fixupUnusualRepos set mainWorkTreePath when the
repository is not initialized yet is done in case the initialization itself
writes to the annex directory. If that were the case, without setting
mainWorkTreePath, the annex symlink would not be set up yet, and so
it might have created the annex directory in the wrong place. Currently
that didn't happen, but now that mainWorkTreePath is available, using it
here avoids any such later problem.

---

This commit does not deal with the mess of a worktree that has
experienced this bug before. In particular, if `git-annex get` were
run in such a worktree, it would have stored the object files in the
linked worktree's git directory, rather than in the main git directory.
Such misplaced object files need to be dealt with; the plan is to make
git-annex fsck notice and fix them.

A worktree that has experienced this bug before will contain unpopulated
pointer files. Those may eventually get fixed up in regular usage of
git-annex, but git-annex fsck will also fix them up.

---

Finally, this has me pondering if all of git-annex's state files should
really be stored in one common place across all linked worktrees. Should
perhaps state files that are specific to the worktree be stored per-worktree?
That has not been the case when using git-annex on filesystems supporting
symlinks, but it *has* been the case on filesystems not supporting
symlinks. Perhaps this leads to some other buggy behavior in some cases.
Or perhaps to extra work being done.

For example, the keys database has an associated files table. Which depends
on the worktree. But reconcileStaged updates that table, so when git-annex
is used first in one worktree and then in another one, reconcileStaged will
update the table to reflect the current worktree. Which is extra work each
time a different worktree is used. But also, what if two git-annex
processes are running at the same time, in separate worktrees? Probably
this needs more thought and investigation.

So there is a risk that this commit exposes such buggy behavior in a
situation where it didn't happen before, due to the filesystem not
supporting symlinks. But, given how much this bug crippled using linked
worktrees in such a situation, I doubt that many people have been doing
that.
2025-07-14 13:20:39 -04:00
..
Keys more OsPath conversion 2025-01-27 10:13:43 -04:00
RepoSize remove stale live changes from reposize database 2024-08-28 13:57:25 -04:00
Benchmark.hs more OsPath conversion 2025-01-29 16:24:51 -04:00
ContentIdentifier.hs test suite now passes after OsPath conversion 2025-02-11 12:44:09 -04:00
Export.hs test suite now passes after OsPath conversion 2025-02-11 12:44:09 -04:00
Fsck.hs test suite now passes after OsPath conversion 2025-02-11 12:44:09 -04:00
Handle.hs more OsPath conversion 2025-01-25 11:56:35 -04:00
ImportFeed.hs test suite now passes after OsPath conversion 2025-02-11 12:44:09 -04:00
Init.hs more OsPath conversion 2025-02-11 14:03:20 -04:00
Keys.hs fix handling of linked worktrees on filesystems w/o symlinks 2025-07-14 13:20:39 -04:00
Queue.hs more OsPath conversion 2025-01-25 11:56:35 -04:00
RawFilePath.hs OsPath build flag no longer depends on filepath-bytestring 2025-02-10 16:39:55 -04:00
RepoSize.hs test suite now passes after OsPath conversion 2025-02-11 12:44:09 -04:00
Types.hs sqlite datbase for importfeed 2023-10-23 16:46:22 -04:00
Utility.hs Sped up sqlite inserts 2x when built with persistent 2.14.5.0 2023-03-31 14:38:25 -04:00