2024-05-01 15:04:20 +00:00
|
|
|
This is a summary todo covering several subprojects, which would extend
|
|
|
|
git-annex to be able to use proxies which sit in front of a cluster of
|
|
|
|
repositories.
|
|
|
|
|
2024-05-01 16:19:12 +00:00
|
|
|
1. [[design/passthrough_proxy]]
|
2024-05-01 19:26:51 +00:00
|
|
|
2. [[design/p2p_protocol_over_http]]
|
|
|
|
3. [[design/balanced_preferred_content]]
|
|
|
|
4. [[todo/track_free_space_in_repos_via_git-annex_branch]]
|
|
|
|
5. [[todo/proving_preferred_content_behavior]]
|
2024-05-01 15:04:20 +00:00
|
|
|
|
2024-06-04 10:53:59 +00:00
|
|
|
Joey has received funding to work on this.
|
|
|
|
Planned schedule of work:
|
|
|
|
|
|
|
|
* June: git-annex proxy
|
|
|
|
* July, part 1: git-annex proxy support for exporttree
|
|
|
|
* July, part 2: p2p protocol over http
|
|
|
|
* August: balanced preferred content
|
|
|
|
* September: streaming through proxy to special remotes (especially S3)
|
|
|
|
* October: proving behavior of balanced preferred content with proxies
|
|
|
|
|
2024-05-01 15:04:20 +00:00
|
|
|
[[!tag projects/openneuro]]
|
2024-06-04 11:51:33 +00:00
|
|
|
|
|
|
|
# work notes
|
|
|
|
|
2024-06-04 18:55:54 +00:00
|
|
|
In development on the `proxy` branch.
|
|
|
|
|
2024-06-04 11:51:33 +00:00
|
|
|
For June's work on [[design/passthrough_proxy]], implementation plan:
|
|
|
|
|
|
|
|
1. UUID discovery via git-annex branch. Add a log file listing UUIDs
|
|
|
|
accessible via proxy UUIDs. It also will contain the names
|
|
|
|
of the remotes that the proxy is a proxy for,
|
2024-06-04 18:55:54 +00:00
|
|
|
from the perspective of the proxy. (done)
|
|
|
|
|
|
|
|
1. Add `git-annex updateproxy` command and remote.name.annex-proxy
|
|
|
|
configuration. (done)
|
2024-06-04 11:51:33 +00:00
|
|
|
|
2024-06-07 16:35:04 +00:00
|
|
|
2. Remote instantiation for proxies. (done)
|
2024-06-04 18:55:54 +00:00
|
|
|
|
2024-06-11 20:56:52 +00:00
|
|
|
3. Implement git-annex-shell proxying for CONNECT and NOTIFYCHANGES.
|
|
|
|
(For completeness, they will only be used when using tor-annex
|
|
|
|
to access a proxy.)
|
2024-06-04 11:51:33 +00:00
|
|
|
|
2024-06-12 14:10:11 +00:00
|
|
|
3. Proxy should update location tracking information for proxied remotes,
|
|
|
|
so it is available to other users who sync with it.
|
2024-06-11 17:07:53 +00:00
|
|
|
|
|
|
|
4. Either implement proxying for tor-annex remotes, or prevent
|
2024-06-07 16:35:04 +00:00
|
|
|
listProxied from operating on them.
|
|
|
|
|
2024-06-04 11:51:33 +00:00
|
|
|
4. Let `storeKey` return a list of UUIDs where content was stored,
|
|
|
|
and make proxies accept uploads directed at them, rather than a specific
|
|
|
|
instantiated remote, and fan out the upload to whatever nodes behind
|
|
|
|
the proxy want it. This will need P2P protocol extensions.
|
|
|
|
|
|
|
|
5. Make `git-annex copy --from $proxy` pick a node that contains each
|
|
|
|
file, and use the instantiated remote for getting the file. Same for
|
|
|
|
similar commands.
|
|
|
|
|
|
|
|
6. Make `git-annex drop --from $proxy` drop, when possible, from every
|
|
|
|
remote accessible by the proxy. Communicate partial drops somehow.
|
|
|
|
|
|
|
|
7. Make commands like `git-annex push` not iterate over instantiate
|
|
|
|
remotes, and instead just send content to the proxy for fanout.
|
|
|
|
|
|
|
|
8. Optimise proxy speed. See design for idea.
|
|
|
|
|
2024-06-11 19:15:58 +00:00
|
|
|
8. Use `sendfile()` to avoid data copying overhead when
|
|
|
|
`receiveBytes` is being fed right into `sendBytes`.
|
|
|
|
|
2024-06-04 11:51:33 +00:00
|
|
|
9. Encryption and chunking. See design for issues.
|
|
|
|
|
|
|
|
10. Cycle prevention. See design.
|
|
|
|
|
|
|
|
11. indirect uploads (to be considered). See design.
|
|
|
|
|
2024-06-06 21:23:51 +00:00
|
|
|
12. Support using a proxy when its url is a P2P address.
|