From d7bb4040d88efe7a9870ef99ac2b0e395d37b9a4 Mon Sep 17 00:00:00 2001
From: AlbertZeyer
Date: Sat, 2 Jan 2021 23:49:14 +0000
Subject: [PATCH 1/5] Added a comment

---
 ...ment_6_cf89b44d67752edabfbf577d1212e7ad._comment | 13 +++++++++++++
 1 file changed, 13 insertions(+)
 create mode 100644 doc/forum/Adding_files_to_git__58___Very_long___34__recording_state_in_git__34___phase/comment_6_cf89b44d67752edabfbf577d1212e7ad._comment

diff --git a/doc/forum/Adding_files_to_git__58___Very_long___34__recording_state_in_git__34___phase/comment_6_cf89b44d67752edabfbf577d1212e7ad._comment b/doc/forum/Adding_files_to_git__58___Very_long___34__recording_state_in_git__34___phase/comment_6_cf89b44d67752edabfbf577d1212e7ad._comment
new file mode 100644
index 0000000000..ca1f3cebe2
--- /dev/null
+++ b/doc/forum/Adding_files_to_git__58___Very_long___34__recording_state_in_git__34___phase/comment_6_cf89b44d67752edabfbf577d1212e7ad._comment
@@ -0,0 +1,13 @@
+[[!comment format=mdwn
+ username="AlbertZeyer"
+ avatar="http://cdn.libravatar.org/avatar/b37d71961a6a5abf9b7184ed77b5a941"
+ subject="comment 6"
+ date="2021-01-02T23:49:14Z"
+ content="""
+I'm having a very similar issue. Adding files is quite slow, and it hangs for several minutes in `(recording state in git...)` now (that started after adding quite a few files already), and the time seems to increase (I fear that it will soon be hours, not minutes, making it basically unusable...).
+
+I have not really configured anything (i.e. it should use all the defaults). I just did `git init` and `git annex init`, and then started to import files using `git annex import`. That's all.
+
+I don't really know about this smudge thing. Is that enabled by default? If that is causing problems, should I maybe disable it?
+
+"""]]
From 29114d53d9cebe5be31bf49359dd701a65072a67 Mon Sep 17 00:00:00 2001
From: AlbertZeyer
Date: Sun, 3 Jan 2021 21:48:51 +0000
Subject: [PATCH 2/5] Added a comment

---
 ...comment_6_eb62a9020575d89799815f6e4b98b28c._comment | 10 ++++++++++
 1 file changed, 10 insertions(+)
 create mode 100644 doc/git-annex-move/comment_6_eb62a9020575d89799815f6e4b98b28c._comment

diff --git a/doc/git-annex-move/comment_6_eb62a9020575d89799815f6e4b98b28c._comment b/doc/git-annex-move/comment_6_eb62a9020575d89799815f6e4b98b28c._comment
new file mode 100644
index 0000000000..6eb032729f
--- /dev/null
+++ b/doc/git-annex-move/comment_6_eb62a9020575d89799815f6e4b98b28c._comment
@@ -0,0 +1,10 @@
+[[!comment format=mdwn
+ username="AlbertZeyer"
+ avatar="http://cdn.libravatar.org/avatar/b37d71961a6a5abf9b7184ed77b5a941"
+ subject="comment 6"
+ date="2021-01-03T21:48:50Z"
+ content="""
+Renaming files is still somewhat unclear then. So I should just use `git mv` to rename files?
+What if the file extension changes? E.g. I have lots of pictures with uppercase `.JPG` and I might want to change them to `.jpg`. I might also have some files as `.jpeg`, and to unify them, I might change those to `.jpg` as well. But as far as I understand, that would also change the default hash, as the default hash contains the extension? Or will doing `git mv` take care of all of that via pre-commit hooks?
+
+"""]]
From 3ceaf762b9db8ec3a6c514fd8901d41c299d7573 Mon Sep 17 00:00:00 2001
From: AlbertZeyer
Date: Sun, 3 Jan 2021 22:02:02 +0000
Subject: [PATCH 3/5]

---
 doc/forum/Reverse_index_key_to_list_of_file_paths.mdwn | 8 ++++++++
 1 file changed, 8 insertions(+)
 create mode 100644 doc/forum/Reverse_index_key_to_list_of_file_paths.mdwn

diff --git a/doc/forum/Reverse_index_key_to_list_of_file_paths.mdwn b/doc/forum/Reverse_index_key_to_list_of_file_paths.mdwn
new file mode 100644
index 0000000000..d6171bab6f
--- /dev/null
+++ b/doc/forum/Reverse_index_key_to_list_of_file_paths.mdwn
@@ -0,0 +1,8 @@
+I understand from [here](https://git-annex.branchable.com/forum/Import_existing_files/#comment-29ece0290fa1314ca48caf8f435570d2) that there is no reverse index from a key to the list of file paths pointing to that key (i.e. pointing to the value).
+
+`find . -lname '*'` would be an extremely slow operation on a big repo, as it would go through the whole repo. And this is probably a common operation that I would want to do frequently.
+
+What if I wanted to build such an index? How would I make sure that potential moves/renames update the index?
+
+I understand from [here](https://git-annex.branchable.com/git-annex-metadata/) that you can attach meta information to a key (via `git annex metadata`). This sounds as if it would be useful for storing such reverse information, right?
+
From c5dd184d66ecfd3513bcd369ed5ea60611b95138 Mon Sep 17 00:00:00 2001
From: AlbertZeyer
Date: Sun, 3 Jan 2021 22:07:02 +0000
Subject: [PATCH 4/5] Added a comment

---
 ...ent_10_278fca1c579d0acdcce819449aec8eee._comment | 13 +++++++++++++
 1 file changed, 13 insertions(+)
 create mode 100644 doc/git-annex-metadata/comment_10_278fca1c579d0acdcce819449aec8eee._comment

diff --git a/doc/git-annex-metadata/comment_10_278fca1c579d0acdcce819449aec8eee._comment b/doc/git-annex-metadata/comment_10_278fca1c579d0acdcce819449aec8eee._comment
new file mode 100644
index 0000000000..c644f06b36
--- /dev/null
+++ b/doc/git-annex-metadata/comment_10_278fca1c579d0acdcce819449aec8eee._comment
@@ -0,0 +1,13 @@
+[[!comment format=mdwn
+ username="AlbertZeyer"
+ avatar="http://cdn.libravatar.org/avatar/b37d71961a6a5abf9b7184ed77b5a941"
+ subject="comment 10"
+ date="2021-01-03T22:07:02Z"
+ content="""
+From this man page, it's not totally clear how/where the metadata is stored. Is it inside the Git repo (i.e. as a regular file), or inside the Annex, or somewhere else? Is this information synced when you do `git push` (as part of Git), or via `git annex sync`?
+
+How does it resolve any conflicts?
+
+Is the metadata itself under version control? (If it is in Git itself, then clearly yes, but that's not clear to me.)
+
+"""]]
From ebe6c12cc017cbff745f109f3debcadae9b9c5b5 Mon Sep 17 00:00:00 2001
From: AlbertZeyer
Date: Mon, 4 Jan 2021 12:04:04 +0000
Subject: [PATCH 5/5] Added a comment

---
 ...ment_4_9c7677838ad28d540a2a514d718f9f1d._comment | 13 +++++++++++++
 1 file changed, 13 insertions(+)
 create mode 100644 doc/forum/Import_existing_files/comment_4_9c7677838ad28d540a2a514d718f9f1d._comment

diff --git a/doc/forum/Import_existing_files/comment_4_9c7677838ad28d540a2a514d718f9f1d._comment b/doc/forum/Import_existing_files/comment_4_9c7677838ad28d540a2a514d718f9f1d._comment
new file mode 100644
index 0000000000..b0836dd5d2
--- /dev/null
+++ b/doc/forum/Import_existing_files/comment_4_9c7677838ad28d540a2a514d718f9f1d._comment
@@ -0,0 +1,13 @@
+[[!comment format=mdwn
+ username="AlbertZeyer"
+ avatar="http://cdn.libravatar.org/avatar/b37d71961a6a5abf9b7184ed77b5a941"
+ subject="comment 4"
+ date="2021-01-04T12:04:04Z"
+ content="""
+That is the best solution with `find`? There is no reverse index? I made a separate forum entry for this question [here](https://git-annex.branchable.com/forum/Reverse_index_key_to_list_of_file_paths/), to discuss it separately.
+
+Why exactly does `git annex sync` (or other ops) get slower on bigger repos? In principle it could be implemented in a way that does not get slower (basically by never iterating through all objects, which should always be avoidable by keeping indices for any operations that need them).
+
+Does it make sense to split up the repo, but share the Git Annex object files (shared `.git/annex/objects`)?
+
+"""]]