git-annex/doc/walkthrough.mdwn

180 lines
5.9 KiB
Text
Raw Normal View History

A walkthrough of the basic features of git-annex.
[[!toc]]
2010-10-20 00:07:50 +00:00
## creating a repository
This is very straightforward. Just tell it a description of the repository.
# mkdir ~/annex
# cd ~/annex
# git init
# git annex init "my laptop"
## adding a remote
This could be a USB drive, or a sshfs or NFS mount to a file server, for
example.
# sudo mount /media/usb
# cd /media/usb
# git clone ~/annex
# cd annex
# git annex init "portable USB drive"
# git remote add home ~/annex
# cd ~/annex
# git remote add usbdrive /media/usb
2010-10-20 00:10:45 +00:00
This is all standard ad-hoc distributed git repository setup. Or you
could have added a centralized bare repository on a server if you prefer
doing things that way.
Anyway, the only git-annex specific part is telling it the name
2010-10-20 00:07:50 +00:00
of the new repository created on the USB drive.
Notice that both repos are set up as remotes of the other one. This lets
2010-10-20 00:10:45 +00:00
either get annexed files from the other. You'll want to do that even
if you are using a centralized bare repository.
2010-10-20 00:07:50 +00:00
## adding files
# cd ~/annex
# cp /tmp/big_file .
# cp /tmp/debian.iso .
# git annex add .
add big_file ok
add debian.iso ok
# git commit -a -m added
Notice you commit at the end, this checks in git-annex's record of the
2010-10-21 20:33:04 +00:00
files but not their actual, large, content.
2010-10-20 00:07:50 +00:00
2010-10-20 00:19:52 +00:00
## renaming files
# cd ~/annex
# git mv big_file my_cool_big_file
# mkdir iso
# git mv debian.iso iso
# git annex fix .
fix iso/debian.iso ok
# git commit -m moved
You can use any normal git operations to move files around, or even
make copies or delete them. `git-annex fix` needs to be run if a file
is moved into a different directory, in order to fix up the symlink
pointing to the file's content.
2010-10-21 21:59:32 +00:00
## getting file content
2010-10-20 00:07:50 +00:00
2010-10-21 21:59:32 +00:00
A repository does not always have all annexed file contents available.
When you need the content of a file, you can use "git annex get" to
make it available.
We can use this to copy everything in the laptop's home annex to the
USB drive.
2010-10-20 00:07:50 +00:00
# cd /media/usb/annex
# git pull home master
# git annex get .
2010-10-20 00:22:37 +00:00
get my_cool_big_file (copying from home...) ok
get iso/debian.iso (copying from home...) ok
2010-10-20 00:07:50 +00:00
Notice that you had to git pull from home first, this lets git-annex know
2010-10-21 21:59:32 +00:00
what has changed in home, and so it knows about the files present there and
can get them. See below for an easier way.
2010-10-20 00:07:50 +00:00
## transferring files: When things go wrong
After a while, you'll have serveral annexes, with different file contents.
You don't have to try to keep all that straight; git-annex does
2010-10-20 16:16:49 +00:00
[[location_tracking]] for you. If you ask it to get a file and the drive
2010-10-20 00:07:50 +00:00
or file server is not accessible, it will let you know what it needs to get
it:
# git annex get video/hackity_hack_and_kaxxt.mov
get video/_why_hackity_hack_and_kaxxt.mov (not available)
I was unable to access these remotes: server
Try making some of these repositories available:
5863d8c0-d9a9-11df-adb2-af51e6559a49 -- my home file server
58d84e8a-d9ae-11df-a1aa-ab9aa8c00826 -- portable USB drive
ca20064c-dbb5-11df-b2fe-002170d25c55 -- backup SATA drive
failed
# sudo mount /media/usb
# git annex get video/hackity_hack_and_kaxxt.mov
get video/hackity_hack_and_kaxxt.mov (copying from usbdrive...) ok
# git commit -a -m "got a video I want to rewatch on the plane"
## removing files
You can always drop files safely. Git-annex checks that some other annex
has the file before removing it.
2010-10-20 00:22:37 +00:00
# git annex drop iso/debian.iso
2010-10-20 00:07:50 +00:00
drop iso/Debian_5.0.iso ok
# git commit -a -m "freed up space"
## removing files: When things go wrong
Before dropping a file, git-annex wants to be able to look at other
remotes, and verify that they still have a file. After all, it could
have been dropped from them too. If the remotes are not mounted/available,
you'll see something like this.
# git annex drop important_file other.iso
drop important_file (unsafe)
Could only verify the existence of 0 out of 1 necessary copies
I was unable to access these remotes: usbdrive
Try making some of these repositories available:
58d84e8a-d9ae-11df-a1aa-ab9aa8c00826 -- portable USB drive
ca20064c-dbb5-11df-b2fe-002170d25c55 -- backup SATA drive
(Use --force to override this check, or adjust annex.numcopies.)
failed
drop other.iso (unsafe)
Could only verify the existence of 0 out of 1 necessary copies
No other repository is known to contain the file.
(Use --force to override this check, or adjust annex.numcopies.)
failed
Here you might --force it to drop `important_file` if you trust your backup.
But `other.iso` looks to have never been copied to anywhere else, so if
it's something you want to hold onto, you'd need to transfer it to
some other repository before dropping it.
2010-10-21 21:59:32 +00:00
## moving file content to another repository
Often you will want to transfer some file contents from a repository to
some other one, and then drop it from the first repository. For example,
your laptop's disk is getting full; time to move some files to an external
disk. Doing that by hand is possible, but a bit of a pain. `git annex move`
makes it very easy.
# git annex move my_cool_big_file --to usbdrive
move my_cool_big_file (to usbdrive...) ok
2010-10-21 20:36:01 +00:00
## using the URL backend
git-annex has multiple key-value [[backends]]. So far this walkthrough has
demonstrated the default, WORM (Write Once, Read Many) backend.
Another handy backend is the URL backend, which can fetch file's content
from remote URLs. Here's how to set up some files in your repository
that use this backend:
# git annex fromkey --backend=URL --key=http://www.archive.org/somefile somefile
2010-10-21 20:35:04 +00:00
fromkey somefile ok
# git commit -m "added a file from the Internet Archive"
Now you if you ask git-annex to get that file, it will download it,
2010-10-21 20:36:01 +00:00
and cache it locally.
# git annex get somefile
get somefile (downloading)
#########################################################################100.0%
ok
You can always drop files downloaded by the URL backend. It is assumed
that the URL is stable; no local backup is kept.
# git annex drop somefile
drop somefile (ok)