Merge branch 'master' of ssh://git-annex.branchable.com
commit 4836979518
6 changed files with 102 additions and 0 deletions
24
doc/bugs/addurl_+_sync_vs_addurl_+_commit.mdwn
Normal file
@@ -0,0 +1,24 @@
### Please describe the problem.

I think this is what happened; I need to go back and check this again (maybe I was just misreading something) but I want to get it written down first.

I've got a git repository, and I just ran git annex init. Then I ran git annex addurl a bunch of times, followed by git annex sync. The result was apparently a repository where the files downloaded by addurl were added using the SHA256 backend rather than the URL backend. I deleted the branches and tried again, but this time after calling git annex addurl a bunch of times I did a normal git commit. This time everything looked fine; the files were all listed as present in the web remote.

### What steps will reproduce the problem?

[[!format sh """
git annex init
git annex addurl "https://archive.org/download/emularity_engine_jsmess/messnapple2e.js.gz" --file "messnapple2e.js.gz"
git annex sync
"""]]

### What version of git-annex are you using? On what operating system?

git-annex version: 5.20150412-g2be4834
build flags: Assistant Webapp Webapp-secure Pairing Testsuite S3 WebDAV Inotify DBus DesktopNotify XMPP DNS Feeds Quvi TDFA TorrentParser
key/value backends: SHA256E SHA1E SHA512E SHA224E SHA384E SKEIN256E SKEIN512E MD5E SHA256 SHA1 SHA512 SHA224 SHA384 SKEIN256 SKEIN512 MD5 WORM URL
remote types: git gcrypt S3 bup directory rsync web bittorrent webdav tahoe glacier ddar hook external
local repository version: 5
supported repository version: 5
upgrade supported from repository versions: 0 1 2 4
This is on Linux.
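
To compare the two runs described in this report, it may help to look at which backend each file's key actually uses. The sketch below assumes indirect mode, where an annexed file is a symlink whose target ends in the key name, and it assumes the backend is the part of the key before the first `-`. The helper names `backend_of_key` and `backend_of_file` are made up for illustration, and the example keys are invented, not taken from the report.

```shell
#!/bin/sh
# backend_of_key (hypothetical helper): print the backend part of a
# git-annex key, i.e. everything before the first "-".
backend_of_key() {
    printf '%s\n' "${1%%-*}"
}

# backend_of_file (hypothetical helper): print the backend of an annexed
# file in indirect mode, where the file is a symlink to .../KEY/KEY.
backend_of_file() {
    target=$(readlink "$1") || return 1
    backend_of_key "$(basename "$target")"
}

# Invented example keys, for illustration only.
backend_of_key 'SHA256-s1048576--0123abcd'
backend_of_key 'URL--http&c%%example.com%file.gz'
```

After the steps above, `backend_of_file messnapple2e.js.gz` should show which backend was used, and `git annex whereis messnapple2e.js.gz` lists the repositories (including the web special remote, if recorded) that are believed to hold the content.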
@@ -0,0 +1,2 @@
Files only present in remotes show up as broken symlinks. That's great for knowing what files exist, but sometimes I just want to browse the files that are actually present. In this case, the many broken symlinks are just clutter.
Is there a straightforward way to switch to a view that shows only locally present files?
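
Because files whose content is absent appear as broken symlinks, one rough way to separate the two groups is to test whether each symlink's target exists. This is only a sketch under that assumption: `list_present` and `list_missing` are hypothetical helper names, and they report every symlink under the directory, annexed or not. Within git-annex itself, `git annex find` lists annexed files whose content is present by default.

```shell
#!/bin/sh
# list_present (hypothetical helper): print symlinks under a directory
# whose targets exist, skipping any .git directory.
list_present() {
    find "${1:-.}" -name .git -prune -o -type l -exec test -e {} \; -print
}

# list_missing (hypothetical helper): print the broken symlinks instead.
list_missing() {
    find "${1:-.}" -name .git -prune -o -type l ! -exec test -e {} \; -print
}
```

Running `list_present .` at the top of the working tree prints only the links that can be opened right now; it does not change the tree or hide anything, so it is a listing rather than a true filtered view.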
@@ -0,0 +1,9 @@
<video src=http://s.natalian.org/2015-04-15/git-annex-issues.mp4></video>

I removed some archived directories, perhaps foolishly, with `rm -rf`. How do I find the files that I deleted?

I also have an issue whereby I want one command to sync between two hard drives and [github](https://github.com/kaihendry/uploadme). Or do I have to run `git-annex move --to {foo,bar}; git-annex drop; git-annex sync`? Basically I want copies everywhere except on my laptop (X1C3).

I also expected my git dir to be much smaller than 1.4GB after dropping everything. Thanks!
@@ -0,0 +1,37 @@
[[!comment format=mdwn
username="https://www.google.com/accounts/o8/id?id=AItOawlXt6nnNs-3uw61EGYtxr_AVhJqXybwLR8"
nickname="Bruno"
subject="comment 2"
date="2015-04-15T17:51:18Z"
content="""
@CandyAngel Thank you for your **git annex find** tips. As for **git gc**, it does not seem to have helped :)
After I ran **git gc**, **git annex info** took **1h 45m** to return its result:

% time git annex info
repository mode: indirect
trusted repositories: 0
semitrusted repositories: 5
00000000-0000-0000-0000-000000000001 -- web
00000000-0000-0000-0000-000000000002 -- bittorrent
181d4dae-2131-435e-9c00-b8c7f1bfc332 -- [sbackup]
2db1f8e7-0b29-4d61-8875-a4a4a42a79dd -- [dellcomputer]
703df355-73a6-4487-97fd-a3a5d6ae034e -- usbhomebackup [here]
untrusted repositories: 0
transfers in progress: none
available local disk space: 135.24 gigabytes (+1 megabyte reserved)
local annex keys: 275416
local annex size: 780.55 gigabytes
annexed files in working tree: 265888
size of annexed files in working tree: 751.49 gigabytes
bloom filter size: 16 mebibytes (55.1% full)
backend usage:
SHA256E: 541304

git annex info 83,95s user 50,68s system 2% **cpu 1:45:01,70 total**


Can you tell me exactly which git gc or git repack parameters you use to optimize git-annex performance?

Thanks

"""]]
@@ -0,0 +1,16 @@
[[!comment format=mdwn
username="CandyAngel"
subject="comment 3"
date="2015-04-16T07:41:07Z"
content="""
*git annex info* has to check every file (not sure if it traverses *.git/annex/objects* specifically or not) to get \"local annex\" information. You can improve its performance by improving directory traversal in general (different filesystem or [changing the hashing method so it isn't Xx/Yy/KEY/FILE](https://github.com/datalad/datalad/issues/32)).

The repack/gc speeds up operations for the git side of things, like syncing (pull/push), cloning and committing.

Here's what I used:

    git repack -ad
    git gc

This took git actions down from 1 hour+ to ~10 minutes (for a repo with 5.6 million objects).
"""]]
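
To see what the two commands in the comment above change on the git side, the loose and packed object counts can be compared before and after. A minimal sketch: `object_stat` and `repack_report` are hypothetical helper names, and the only git commands used are `git count-objects -v`, `git repack -ad` and `git gc`. As the comment notes, this affects git operations, not the file traversal that *git annex info* performs.

```shell
#!/bin/sh
# object_stat (hypothetical helper): read `git count-objects -v` output on
# stdin and print one field, e.g. "count" (loose objects) or "in-pack".
object_stat() {
    awk -v key="$1:" '$1 == key { print $2 }'
}

# repack_report (hypothetical helper): run the two commands from the
# comment inside the current repository and print the counts around them.
repack_report() {
    stats=$(git count-objects -v) || return 1
    echo "before: $(echo "$stats" | object_stat count) loose, $(echo "$stats" | object_stat in-pack) packed"
    git repack -ad && git gc || return 1
    stats=$(git count-objects -v) || return 1
    echo "after:  $(echo "$stats" | object_stat count) loose, $(echo "$stats" | object_stat in-pack) packed"
}
```

Nothing runs until `repack_report` is called from inside a repository; on a large repository the repack itself can take a long time.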
@@ -0,0 +1,14 @@
[[!comment format=mdwn
username="https://www.google.com/accounts/o8/id?id=AItOawlXt6nnNs-3uw61EGYtxr_AVhJqXybwLR8"
nickname="Bruno"
subject="comment 4"
date="2015-04-16T11:47:50Z"
content="""
Thanks @CandyAngel,

Your tips do indeed reduce the time some git-annex commands take. I will see over the long term whether that keeps working well.

For example, **git annex sync** now runs in **45s**! :)

Thanks
"""]]