2015-03-23 19:36:10 +00:00
|
|
|
# NAME
|
|
|
|
|
2020-12-17 16:51:49 +00:00
|
|
|
git-annex import - import files from a special remote
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
# SYNOPSIS
|
|
|
|
|
2019-11-19 17:26:27 +00:00
|
|
|
git annex import --from remote branch[:subdir] | `[path ...]`
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
# DESCRIPTION
|
|
|
|
|
2019-11-19 17:26:27 +00:00
|
|
|
This command is a way to import a tree of files from elsewhere into your
|
|
|
|
git-annex repository. It can import files from a git-annex special remote,
|
|
|
|
or from a directory.
|
2019-02-23 19:46:03 +00:00
|
|
|
|
2019-05-14 18:29:40 +00:00
|
|
|
# IMPORTING FROM A SPECIAL REMOTE
|
2019-02-23 19:46:03 +00:00
|
|
|
|
2020-07-03 17:41:57 +00:00
|
|
|
Importing from a special remote first downloads or hashes all new content
|
|
|
|
from it, and then constructs a git commit that reflects files that have
|
|
|
|
changed on the special remote since the last time git-annex looked at it.
|
|
|
|
Merging that commit into your repository will update it to reflect changes
|
|
|
|
made on the special remote.
|
2019-02-23 19:46:03 +00:00
|
|
|
|
|
|
|
This way, something can be using the special remote for file storage,
|
|
|
|
adding files, modifying files, and deleting files, and you can track those
|
|
|
|
changes using git-annex.
|
|
|
|
|
2019-03-01 18:32:45 +00:00
|
|
|
You can combine using `git annex import` to fetch changes from a special
|
|
|
|
remote with `git annex export` to send your local changes to the special
|
|
|
|
remote.
|
|
|
|
|
|
|
|
You can only import from special remotes that were configured with
|
2019-02-23 19:46:03 +00:00
|
|
|
`importtree=yes` when set up with [[git-annex-initremote]](1). Only some
|
2019-11-19 17:26:27 +00:00
|
|
|
kinds of special remotes will let you configure them this way. A perhaps
|
2020-09-30 13:40:06 +00:00
|
|
|
non-exhaustive list is the directory, s3, and adb special remotes.
|
2019-02-23 19:46:03 +00:00
|
|
|
|
|
|
|
To import from a special remote, you must specify the name of a branch.
|
2020-07-03 15:54:21 +00:00
|
|
|
A corresponding remote tracking branch will be updated by `git annex import`.
|
|
|
|
After that point, it's the same as if you had run a `git fetch`
|
2019-08-09 17:21:15 +00:00
|
|
|
from a regular git remote; you can merge the changes into your
|
2019-02-23 19:46:03 +00:00
|
|
|
currently checked out branch.
|
|
|
|
|
|
|
|
For example:
|
|
|
|
|
|
|
|
git annex import master --from myremote
|
2021-07-19 16:08:24 +00:00
|
|
|
git annex merge --allow-unrelated-histories myremote/master
|
|
|
|
|
|
|
|
You could just as well use `git merge --allow-unrelated-histories myremote/master`
|
|
|
|
as the second step, but using `git-annex merge` avoids a couple of gotchas.
|
|
|
|
When using adjusted branches, it adjusts the branch before merging from it.
|
|
|
|
|
|
|
|
The --allow-unrelated-histories option is needed for at least the first
|
|
|
|
merge of an imported remote tracking branch, since the branch's history is
|
|
|
|
not connected. Think of this as the remote being a separate git repository
|
|
|
|
with its own files. If you first `git annex export` files to a remote, and
|
|
|
|
then `git annex import` from it, you won't need that option.
|
2019-03-01 18:32:45 +00:00
|
|
|
|
2019-05-14 18:43:51 +00:00
|
|
|
You can import into a subdirectory, using the "branch:subdir" syntax. For
|
|
|
|
example, if "camera" is a special remote that accesses a camera, and you
|
|
|
|
want to import those into the photos directory, rather than to the root of
|
|
|
|
your repository:
|
2019-02-23 19:46:03 +00:00
|
|
|
|
|
|
|
git annex import master:photos --from camera
|
|
|
|
git merge camera/master
|
|
|
|
|
|
|
|
The `git annex sync --content` command (and the git-annex assistant)
|
|
|
|
can also be used to import from a special remote.
|
|
|
|
To do this, you need to configure "remote.<name>.annex-tracking-branch"
|
|
|
|
to tell it what branch to track. For example:
|
|
|
|
|
|
|
|
git config remote.myremote.annex-tracking-branch master
|
|
|
|
git annex sync --content
|
|
|
|
|
2020-09-30 14:41:59 +00:00
|
|
|
Any files that are gitignored will not be included in the import,
|
|
|
|
but will be left on the remote.
|
|
|
|
|
2020-08-10 19:35:26 +00:00
|
|
|
When the special remote has a preferred content expression set by
|
importtree: support preferred content expressions needing keys
When importing from a special remote, support preferred content expressions
that use terms that match on keys (eg "present", "copies=1"). Such terms
are ignored when importing, since the key is not known yet.
When "standard" or "groupwanted" is used, the terms in those
expressions also get pruned accordingly.
This does allow setting preferred content to "not (copies=1)" to make a
special remote into a "source" type of repository. Importing from it will
import all files. Then exporting to it will drop all files from it.
In the case of setting preferred content to "present", it's pruned on
import, so everything gets imported from it. Then on export, it's applied,
and everything in it is left on it, and no new content is exported to it.
Since the old behavior on these preferred content expressions was for
importtree to error out, there's no backwards compatability to worry about.
Except that sync/pull/etc will now import where before it errored out.
2023-12-18 20:27:26 +00:00
|
|
|
[[git-annex-wanted]](1), that is used to pick which files to import from
|
|
|
|
it. Files that are not preferred content of the remote will not be
|
2020-08-10 19:35:26 +00:00
|
|
|
imported from it, but will be left on the remote.
|
2019-05-21 18:38:00 +00:00
|
|
|
|
importtree: support preferred content expressions needing keys
When importing from a special remote, support preferred content expressions
that use terms that match on keys (eg "present", "copies=1"). Such terms
are ignored when importing, since the key is not known yet.
When "standard" or "groupwanted" is used, the terms in those
expressions also get pruned accordingly.
This does allow setting preferred content to "not (copies=1)" to make a
special remote into a "source" type of repository. Importing from it will
import all files. Then exporting to it will drop all files from it.
In the case of setting preferred content to "present", it's pruned on
import, so everything gets imported from it. Then on export, it's applied,
and everything in it is left on it, and no new content is exported to it.
Since the old behavior on these preferred content expressions was for
importtree to error out, there's no backwards compatability to worry about.
Except that sync/pull/etc will now import where before it errored out.
2023-12-18 20:27:26 +00:00
|
|
|
So for example, a preferred content expression like
|
|
|
|
`"include=*.jpeg or largerthan=100mb"` will make only jpegs and
|
|
|
|
large files be imported.
|
|
|
|
|
|
|
|
Parts of a preferred content expression that relate to the key,
|
|
|
|
such as "copies=" are ignored when importing, because the key
|
|
|
|
is not known before importing.
|
2019-05-14 18:43:51 +00:00
|
|
|
|
2020-08-10 19:35:26 +00:00
|
|
|
Things in the expression like "include=" match relative to the top of
|
|
|
|
the tree of files on the remote, even when importing into a subdirectory.
|
|
|
|
|
2020-07-03 17:41:57 +00:00
|
|
|
# OPTIONS FOR IMPORTING FROM A SPECIAL REMOTE
|
|
|
|
|
|
|
|
* `--content`, `--no-content`
|
|
|
|
|
2020-09-28 17:22:16 +00:00
|
|
|
Controls whether annexed content is downloaded from the special remote.
|
|
|
|
|
2020-07-03 17:41:57 +00:00
|
|
|
The default is to download content into the git-annex repository.
|
|
|
|
|
|
|
|
With --no-content, git-annex keys are generated from information
|
|
|
|
provided by the special remote, without downloading it. Commands like
|
|
|
|
`git-annex get` can later be used to download files, as desired.
|
2020-09-28 19:03:15 +00:00
|
|
|
The --no-content option is not supported by all special remotes.
|
2020-07-03 17:41:57 +00:00
|
|
|
|
2024-01-18 16:41:44 +00:00
|
|
|
* `--message=msg` `-m msg`
|
|
|
|
|
|
|
|
Use this option to specify a commit message for the changes that have
|
|
|
|
been made to the special remote since the last import from it.
|
|
|
|
|
2024-03-27 19:58:27 +00:00
|
|
|
If multiple -m options are given, their values are concatenated
|
|
|
|
as separate paragraphs.
|
|
|
|
|
2019-05-14 18:29:40 +00:00
|
|
|
# IMPORTING FROM A DIRECTORY
|
2019-02-23 19:46:03 +00:00
|
|
|
|
2021-02-17 13:58:00 +00:00
|
|
|
When run with a path, `git annex import` **moves** files from somewhere outside
|
|
|
|
the git working copy, and adds them to the annex. In contrast to importing
|
2022-08-19 17:31:16 +00:00
|
|
|
from a special directory remote, imported files are **deleted from the given
|
|
|
|
path**.
|
2019-02-23 19:46:03 +00:00
|
|
|
|
2019-11-19 17:26:27 +00:00
|
|
|
This is a legacy interface. It is still supported, but please consider
|
|
|
|
switching to importing from a directory special remote instead, using the
|
|
|
|
interface documented above.
|
|
|
|
|
2019-02-23 19:46:03 +00:00
|
|
|
Individual files to import can be specified. If a directory is specified,
|
2021-02-17 13:58:00 +00:00
|
|
|
the entire directory is imported. Please note that the following instruction
|
|
|
|
will **delete all files from the source directory**.
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
git annex import /media/camera/DCIM/*
|
2015-03-26 15:44:20 +00:00
|
|
|
|
|
|
|
When importing files, there's a possibility of importing a duplicate
|
|
|
|
of a file that is already known to git-annex -- its content is either
|
2015-04-29 17:25:10 +00:00
|
|
|
present in the local repository already, or git-annex knows of another
|
|
|
|
repository that contains it, or it was present in the annex before but has
|
|
|
|
been removed now.
|
2015-03-26 15:44:20 +00:00
|
|
|
|
|
|
|
By default, importing a duplicate of a known file will result in
|
|
|
|
a new filename being added to the repository, so the duplicate file
|
|
|
|
is present in the repository twice. (With all checksumming backends,
|
|
|
|
including the default SHA256E, only one copy of the data will be stored.)
|
|
|
|
|
2019-02-23 19:46:03 +00:00
|
|
|
Several options can be used to adjust handling of duplicate files, see
|
|
|
|
`--duplicate`, `--deduplicate`, `--skip-duplicates`, `--clean-duplicates`,
|
|
|
|
and `--reinject-duplicates` documentation below.
|
2015-03-23 19:36:10 +00:00
|
|
|
|
2022-08-19 17:31:16 +00:00
|
|
|
symbolic links in the directory being imported are skipped to avoid
|
|
|
|
accidentially importing things outside the directory that import was ran
|
|
|
|
on. The directory that import is run on can, however inself be a symbolic
|
|
|
|
link, and that symbolic link will be followed.
|
|
|
|
|
2019-02-23 19:46:03 +00:00
|
|
|
# OPTIONS FOR IMPORTING FROM A DIRECTORY
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
* `--duplicate`
|
|
|
|
|
|
|
|
Do not delete files from the import location.
|
|
|
|
|
2017-02-09 19:40:44 +00:00
|
|
|
Running with this option repeatedly can import the same files into
|
|
|
|
different git repositories, or branches, or different locations in a git
|
|
|
|
repository.
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
* `--deduplicate`
|
|
|
|
|
2015-03-26 15:44:20 +00:00
|
|
|
Only import files that are not duplicates;
|
|
|
|
duplicate files will be deleted from the import location.
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
* `--skip-duplicates`
|
|
|
|
|
2017-02-07 20:39:29 +00:00
|
|
|
Only import files that are not duplicates. Avoids deleting any
|
|
|
|
files from the import location.
|
2015-03-23 19:36:10 +00:00
|
|
|
|
|
|
|
* `--clean-duplicates`
|
|
|
|
|
|
|
|
Does not import any files, but any files found in the import location
|
2015-03-26 15:44:20 +00:00
|
|
|
that are duplicates are deleted.
|
2017-02-09 19:40:44 +00:00
|
|
|
|
|
|
|
* `--reinject-duplicates`
|
|
|
|
|
|
|
|
Imports files that are not duplicates. Files that are duplicates have
|
|
|
|
their content reinjected into the annex (similar to
|
2018-07-06 16:51:18 +00:00
|
|
|
[[git-annex-reinject]](1)).
|
2015-03-23 19:36:10 +00:00
|
|
|
|
2015-04-29 17:32:50 +00:00
|
|
|
* `--force`
|
|
|
|
|
|
|
|
Allow existing files to be overwritten by newly imported files.
|
|
|
|
|
|
|
|
Also, causes .gitignore to not take effect when adding files.
|
|
|
|
|
2015-03-23 19:36:10 +00:00
|
|
|
* file matching options
|
|
|
|
|
|
|
|
Many of the [[git-annex-matching-options]](1)
|
|
|
|
can be used to specify files to import.
|
|
|
|
|
|
|
|
git annex import /dir --include='*.png'
|
|
|
|
|
2023-06-23 18:18:16 +00:00
|
|
|
# COMMON OPTIONS
|
2019-02-23 19:46:03 +00:00
|
|
|
|
2015-11-06 19:39:51 +00:00
|
|
|
* `--jobs=N` `-JN`
|
|
|
|
|
|
|
|
Imports multiple files in parallel. This may be faster.
|
|
|
|
For example: `-J4`
|
|
|
|
|
2019-05-10 17:24:31 +00:00
|
|
|
Setting this to "cpus" will run one job per CPU core.
|
|
|
|
|
2022-06-29 17:28:08 +00:00
|
|
|
* `--backend`
|
|
|
|
|
|
|
|
Specifies which key-value backend to use for the imported files.
|
|
|
|
|
2020-09-30 14:41:59 +00:00
|
|
|
* `--no-check-gitignore`
|
|
|
|
|
|
|
|
Add gitignored files.
|
|
|
|
|
2016-01-19 15:54:13 +00:00
|
|
|
* `--json`
|
|
|
|
|
|
|
|
Enable JSON output. This is intended to be parsed by programs that use
|
2023-04-25 21:37:34 +00:00
|
|
|
git-annex. Each line of output is a JSON object.
|
2016-01-19 15:54:13 +00:00
|
|
|
|
2020-05-26 15:27:47 +00:00
|
|
|
* `--json-progress`
|
|
|
|
|
|
|
|
Include progress objects in JSON output.
|
|
|
|
|
2018-02-19 18:28:17 +00:00
|
|
|
* `--json-error-messages`
|
|
|
|
|
2023-04-25 21:37:34 +00:00
|
|
|
Messages that would normally be output to standard error are included in
|
|
|
|
the JSON instead.
|
2018-02-19 18:28:17 +00:00
|
|
|
|
2021-05-10 19:00:13 +00:00
|
|
|
* Also the [[git-annex-common-options]](1) can be used.
|
|
|
|
|
2015-03-23 19:36:10 +00:00
|
|
|
# CAVEATS
|
|
|
|
|
|
|
|
Note that using `--deduplicate` or `--clean-duplicates` with the WORM
|
|
|
|
backend does not look at file content, but filename and mtime.
|
|
|
|
|
2016-01-25 17:41:21 +00:00
|
|
|
If annex.largefiles is configured, and does not match a file, `git annex
|
|
|
|
import` will add the non-large file directly to the git repository,
|
|
|
|
instead of to the annex.
|
|
|
|
|
2015-03-23 19:36:10 +00:00
|
|
|
# SEE ALSO
|
|
|
|
|
|
|
|
[[git-annex]](1)
|
|
|
|
|
2015-05-29 16:12:55 +00:00
|
|
|
[[git-annex-add]](1)
|
|
|
|
|
2017-08-29 17:25:48 +00:00
|
|
|
[[git-annex-export]](1)
|
|
|
|
|
2019-05-14 18:43:51 +00:00
|
|
|
[[git-annex-preferred-content]](1)
|
|
|
|
|
2015-03-23 19:36:10 +00:00
|
|
|
# AUTHOR
|
|
|
|
|
|
|
|
Joey Hess <id@joeyh.name>
|
|
|
|
|
|
|
|
Warning: Automatically converted into a man page by mdwn2man. Edit with care.
|