added documentation for using the Internet Archive as a remote via S3
Renamed Amazon_S3 page to just S3.
This commit is contained in:
parent
d67998b3d3
commit
647f7cf47c
6 changed files with 52 additions and 3 deletions
|
@ -11,4 +11,5 @@ A suppliment to the [[walkthrough]].
|
|||
walkthrough/untrusted_repositories
|
||||
walkthrough/what_to_do_when_you_lose_a_repository
|
||||
walkthrough/recover_data_from_lost+found
|
||||
walkthrough/Internet_Archive_via_S3
|
||||
"""]]
|
||||
|
|
|
@ -43,7 +43,7 @@ files with git.
|
|||
|
||||
* [[git-annex man page|git-annex]]
|
||||
* [[key-value backends|backends]] for data storage
|
||||
* [[special_remotes]] (including [[special_remotes/Amazon_S3]] and [[special_remotes/bup]])
|
||||
* [[special_remotes]] (including [[special_remotes/S3]] and [[special_remotes/bup]])
|
||||
* [[encryption]]
|
||||
* [[bare_repositories]]
|
||||
* [[internals]]
|
||||
|
|
|
@ -6,7 +6,7 @@ But, git-annex also extends git's concept of remotes, with these special
|
|||
types of remotes. These can be used just like any normal remote by git-annex.
|
||||
They cannot be used by other git commands though.
|
||||
|
||||
* [[Amazon_S3]]
|
||||
* [[S3]] (Amazon S3, and other compatible services)
|
||||
* [[bup]]
|
||||
* [[directory]]
|
||||
* [[rsync]]
|
||||
|
|
48
doc/walkthrough/Internet_Archive_via_S3.mdwn
Normal file
48
doc/walkthrough/Internet_Archive_via_S3.mdwn
Normal file
|
@ -0,0 +1,48 @@
|
|||
[The Internet Archive](http://www.archive.org/) allows members to upload
|
||||
collections using an Amazon S3
|
||||
[compatible API](http://www.archive.org/help/abouts3.txt), and this can
|
||||
be used with git-annex's [[special_remotes/S3]] support.
|
||||
|
||||
So, if you're an archivist, you can locally archive things with git-annex,
|
||||
and define remotes that correspond to "items" at the Internet Archive,
|
||||
and use git-annex to upload your files to there.
|
||||
Of course, your use of the Internet Archive must comply with their
|
||||
[terms of service](http://www.archive.org/about/terms.php).
|
||||
|
||||
## step 0
|
||||
|
||||
Sign up for an account, and get your access keys here:
|
||||
<http://www.archive.org/account/s3.php>
|
||||
|
||||
# export AWS_ACCESS_KEY_ID=blahblah
|
||||
# export AWS_SECRET_ACCESS_KEY=xxxxxxx
|
||||
|
||||
Now go to <http://www.archive.org/create/> and create the item.
|
||||
This allows you to fill in metadata which git-annex cannot provide to the
|
||||
Internet Archive. (It also works around a bug with bucket creation.)
|
||||
|
||||
(Note that there seems to be a bug in either hS3 or the archive that
|
||||
breaks authentication when the item name contains spaces or upper-case
|
||||
letters.. use all lowercase and no spaces.)
|
||||
|
||||
Specify `host=s3.us.archive.org` when doing initremote to set up
|
||||
a remote at the Archive. It does not make sense to use encryption.
|
||||
For the bucket name, specify the item name created in step 1.
|
||||
|
||||
# git annex initremote panama type=S3 encryption=none host=s3.us.archive.org bucket=panama-canal-lock-blueprints
|
||||
initremote archive-panama (checking bucket) (creating bucket in US) ok
|
||||
# git annex describe archive-panama "Internet Archive item for my grandfather's Panama Canal lock design blueprints"
|
||||
describe archive-panama ok
|
||||
|
||||
Then you can annex files and copy them to the remote as usual:
|
||||
|
||||
# git annex add photo1.jpeg
|
||||
add photo1.jpeg ok
|
||||
# git annex copy photo1.jpeg --to archive-panama
|
||||
copy (checking archive-panama...) (to archive-panama...) ok
|
||||
|
||||
Note that it probably makes the most sense to use the WORM backend
|
||||
for files, since that exposes the original filename in the key stored
|
||||
in the Archive, which allows its special processing for sound files,
|
||||
movies, etc to be done. Also, the Internet Archive has restrictions
|
||||
on what is allowed in a filename; particularly no spaces are allowed.
|
|
@ -34,4 +34,4 @@ Now the remote can be used like any other remote.
|
|||
# git annex move video/hackity_hack_and_kaxxt.mov --to cloud
|
||||
move video/hackity_hack_and_kaxxt.mov (checking cloud...) (to cloud...) ok
|
||||
|
||||
See [[special_remotes/Amazon_S3]] for details.
|
||||
See [[special_remotes/S3]] for details.
|
||||
|
|
Loading…
Reference in a new issue