Merge branch 'master' of ssh://git-annex.branchable.com

This commit is contained in:
Joey Hess 2020-07-02 10:00:11 -04:00
commit fe1f4632a4
No known key found for this signature in database
GPG key ID: DB12DB0FF05F8F38
6 changed files with 134 additions and 0 deletions

View file

@ -0,0 +1,8 @@
[[!comment format=mdwn
username="B"
avatar="http://cdn.libravatar.org/avatar/f9b0137cb32be76e5a171bff7ce98da7"
subject="Workaround"
date="2020-07-02T12:51:41Z"
content="""
Is there any workaround in the meantime? (I'm seeing this, too).
"""]]

View file

@ -0,0 +1,55 @@
### Please describe the problem.
Got an error message trying to initialize a git-annex repo on a VirtualBox shared folder (Linux guest, Windows host). The shared folder is on an external USB drive.
### What steps will reproduce the problem?
See log
### What version of git-annex are you using? On what operating system?
[[!format sh """
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$ git annex version
git-annex version: 8.20200617-g02765b8
build flags: Assistant Webapp Pairing S3 WebDAV Inotify DBus DesktopNotify TorrentParser MagicMime Feeds Testsuite
dependency versions: aws-0.22 bloomfilter-2.0.1.0 cryptonite-0.25 DAV-1.3.4 feed-1.2.0.1 ghc-8.6.5 http-client-0.6.4 persistent-sqlite-2.9.3 torrent-10000.1.1 uuid-1.3.13 yesod-1.6.0.1
key/value backends: SHA256E SHA256 SHA512E SHA512 SHA224E SHA224 SHA384E SHA384 SHA3_256E SHA3_256 SHA3_512E SHA3_512 SHA3_224E SHA3_224 SHA3_384E SHA3_384 SKEIN256E SKEIN256 SKEIN512E SKEIN512 BLAKE2B256E BLAKE2B256 BLAKE2B512E BLAKE2B512 BLAKE2B160E BLAKE2B160 BLAKE2B224E BLAKE2B224 BLAKE2B384E BLAKE2B384 BLAKE2BP512E BLAKE2BP512 BLAKE2S256E BLAKE2S256 BLAKE2S160E BLAKE2S160 BLAKE2S224E BLAKE2S224 BLAKE2SP256E BLAKE2SP256 BLAKE2SP224E BLAKE2SP224 SHA1E SHA1 MD5E MD5 WORM URL
remote types: git gcrypt p2p S3 bup directory rsync web bittorrent webdav adb tahoe glacier ddar git-lfs hook external
operating system: linux x86_64
supported repository versions: 8
upgrade supported from repository versions: 0 1 2 3 4 5 6 7
local repository version: 8
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$ uname -a
Linux cg-router1.broadinstitute.org 3.10.0-1127.13.1.el7.x86_64 #1 SMP Tue Jun 23 15:46:38 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$
"""]]
### Please provide any additional information below.
[[!format sh """
# If you can, paste a complete transcript of the problem occurring here.
# If the problem is with the git-annex assistant, paste in .git/annex/daemon.log
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$ git init
Initialized empty Git repository in /mnt/shared/d/WindowsImageBackup/WPAE9-305/.git/
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$ git annex init 'blue wd passport ultra'
init blue wd passport ultra
Detected a filesystem without fifo support.
Disabling ssh connection caching.
Detected a crippled filesystem.
(scanning for unlocked files...)
Entering an adjusted branch where files are unlocked as this filesystem does not support locked files.
Switched to branch 'adjusted/master(unlocked)'
ok
(recording state in git...)
git-annex: .git/annex/othertmp/jlog12400-5: removeLink: resource busy (Text file busy)
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$
# End of transcript or log.
"""]]
### Have you had any luck using git-annex before? (Sometimes we get tired of reading bug reports all day and a lil' positive end note does wonders)
Certainly. Right now trying to organize a new set of backups from multiple places, which without git-annex would be an organizational nightmare.

View file

@ -0,0 +1,42 @@
[[!comment format=mdwn
username="Ilya_Shlyakhter"
avatar="http://cdn.libravatar.org/avatar/1647044369aa7747829c38b9dcc84df0"
subject="adding files also fails"
date="2020-07-01T19:51:34Z"
content="""
Trying to add a file also gives errors; posting here because cause is likely the same:
[[!format sh \"\"\"
# If you can, paste a complete transcript of the problem occurring here.
# If the problem is with the git-annex assistant, paste in .git/annex/daemon.log
(newer3-gdrive-remote-env) [ilya@cg-router1 Backup 2020-06-30 043950]$ git annex add Esp.vhdx
add Esp.vhdx
100% 46 MiB 94 MiB/s 0s
git-annex: failed to commit changes to sqlite database: Just user error (SQLite3 returned ErrorReadOnly while attempting to perform step: attempt to write a readonly database(after successful open))
CallStack (from HasCallStack):
error, called at ./Database/Handle.hs:115:26 in main:Database.Handle
failed
git-annex: thread blocked indefinitely in an MVar operation
(newer3-gdrive-remote-env) [ilya@cg-router1 Backup 2020-06-30 043950]$ echo $?
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$ git config -l
core.repositoryformatversion=0
core.filemode=false
core.bare=false
core.logallrefupdates=true
core.symlinks=false
core.ignorecase=true
annex.uuid=9f2b6628-5921-47d2-a712-ac902a662d04
annex.sshcaching=false
annex.crippledfilesystem=true
annex.version=8
annex.addunlocked=true
annex.thin=true
filter.annex.smudge=git-annex smudge -- %f
filter.annex.clean=git-annex smudge --clean -- %f
(newer3-gdrive-remote-env) [ilya@cg-router1 WPAE9-305]$
# End of transcript or log.
\"\"\"]]
"""]]

View file

@ -0,0 +1,9 @@
[[!comment format=mdwn
username="http://templeofcrom.duckdns.org/"
nickname="Karl"
avatar="http://cdn.libravatar.org/avatar/336975995d2c8652aa98284987d5987e90e1b4d137da415af18a8e04c29edbc3"
subject="comment 5"
date="2020-07-01T21:54:49Z"
content="""
The generic readonly http remote sounds like what I was expecting web special remote to be. It would be great if that gets implemented.
"""]]

View file

@ -0,0 +1,8 @@
[[!comment format=mdwn
username="Ilya_Shlyakhter"
avatar="http://cdn.libravatar.org/avatar/1647044369aa7747829c38b9dcc84df0"
subject="annex.thin for importing from directory special remote"
date="2020-07-01T22:23:58Z"
content="""
As a special case, when importing from a directory special remote, could there be an option to hardlink the files into the repo instead of copying them?
"""]]

View file

@ -0,0 +1,12 @@
[[!comment format=mdwn
username="Lukey"
avatar="http://cdn.libravatar.org/avatar/c7c08e2efd29c692cc017c4a4ca3406b"
subject="comment 6"
date="2020-07-01T20:37:13Z"
content="""
The memory consumption is indeed problematic.
Digging further, I found a even better solution:
\"time (git ls-tree -r git-annex | awk '/SHA256.*.log$/{print $3\" \"$4}' | git cat-file --batch='%(objectname) %(objecttype) %(objectsize) %(rest)' --buffer > /dev/null)\" takes just 7 seconds in my repo.
Note the '%(rest)' in the batch format, awk feeds the objectname and the key (separated by space) to cat-file, which outputs the key in place of '%(rest)'. So a git-annex thread reading from cat-file has all the information (location log and key) readily available. This can be extended to also feed metadata at the same time by making the location log directly followed by metadata.
"""]]