blog for the day
This commit is contained in:
parent
cfdd4d6602
commit
61695f9f41
3 changed files with 78 additions and 7 deletions
66
doc/design/assistant/blog/day_39__twice_is_enemy_action.mdwn
Normal file
66
doc/design/assistant/blog/day_39__twice_is_enemy_action.mdwn
Normal file
|
@ -0,0 +1,66 @@
|
|||
Beating my head against the threaded runtime some more. I can reproduce
|
||||
one of the hangs consistently by running 1000 git annex add commands
|
||||
in a loop. It hangs around 1% of the time, reading from `git cat-file`.
|
||||
|
||||
Interestingly, `git cat-file` is not yet running at this point --
|
||||
git-annex has forked a child process, but the child has not yet exec'd it.
|
||||
Stracing the child git-annex, I see it stuck in a futex. Adding tracing,
|
||||
I see the child never manages to run any code at all.
|
||||
|
||||
This really looks like the problem is once again in MissingH, which uses
|
||||
`forkProcess`. Which happens to come with a big warning about being very
|
||||
unsafe, in very subtle ways. Looking at the C code that the newer `process`
|
||||
library uses when sparning a pipe to a process, it messes around with lots of
|
||||
things; blocking signals, stopping a timer, etc. Hundreds of lines of C
|
||||
code to safely start a child process, all doing things that MissingH omits.
|
||||
|
||||
That's the second time I've seemingly isolated a hang in the GHC threaded
|
||||
runtime to MissingH.
|
||||
|
||||
And so I've started converting git-annex to use the new `process` library,
|
||||
for running all its external commands. John Goerzen had mentioned `process`
|
||||
to me once before when I found a nasty bug in MissingH, as the cool new
|
||||
thing that would probably eliminate the `System.Cmd.Utils` part of MissingH,
|
||||
but I'd not otherwise heard much about it. (It also seems to have the
|
||||
benefit of supporting Windows.)
|
||||
|
||||
This is a big change and it's early days, but each time I see a hang, I'm
|
||||
converting the code to use `process`, and so far the hangs have just gone
|
||||
away when I do that.
|
||||
|
||||
---
|
||||
|
||||
Hours later... I've converted *all* of git-annex to use `process`.
|
||||
|
||||
In the er, process, the `--debug` switch stopped printing all the commands
|
||||
it runs. I may try to restore that later.
|
||||
|
||||
I've not tested everything, but the test suite passes, even when
|
||||
using the threaded runtime. **MILESTONE**
|
||||
|
||||
Looking forward to getting out of these weeds and back to useful work..
|
||||
|
||||
---
|
||||
|
||||
Hours later yet.... The `assistant` branch in git now uses the threaded
|
||||
runtime. It works beautifully, using proper threads to run file transfers
|
||||
in.
|
||||
|
||||
That should fix the problem I was seeing on OSX yesterday. Too tired to
|
||||
test it now.
|
||||
|
||||
--
|
||||
|
||||
Amazingly, all the assistant's own dozen or so threads and thread
|
||||
synch variables etc all work great under the threaded runtime. I had
|
||||
assumed I'd see yet more concurrency problems there when switching to it,
|
||||
but it all looks good. (Or whatever problems there are are subtle ones?)
|
||||
|
||||
I'm very relieved. The threaded logjam is broken! I had been getting
|
||||
increasingly worried that not having the threaded runtime available would
|
||||
make it very difficult to make the assistant perform really well, and cause
|
||||
problems with the webapp, perhaps preventing me from using Yesod.
|
||||
|
||||
Now it looks like smooth sailing ahead. Still some hard problems, but
|
||||
it feels like with inotify and kqueue and the threaded runtime all
|
||||
dealt with, the really hard infrastructure-level problems are behind me.
|
|
@ -10,13 +10,6 @@ all the other git clones, at both the git level and the key/value level.
|
|||
on remotes, and transfer. But first, need to ensure that when a remote
|
||||
receives content, and updates its location log, it syncs that update
|
||||
out.
|
||||
* Transfer watching has a race on kqueue systems, which makes finished
|
||||
fast transfers not be noticed by the TransferWatcher. Which in turn
|
||||
prevents the transfer slot being freed and any further transfers
|
||||
from happening. So, this approach is too fragile to rely on for
|
||||
maintaining the TransferSlots. Instead, need [[todo/assistant_threaded_runtime]],
|
||||
which would allow running something for sure when a transfer thread
|
||||
finishes.
|
||||
|
||||
## longer-term TODO
|
||||
|
||||
|
@ -106,3 +99,10 @@ anyway.
|
|||
Annex state monad. **done**
|
||||
* Write transfer control thread, which decides when to launch transfers.
|
||||
**done**
|
||||
* Transfer watching has a race on kqueue systems, which makes finished
|
||||
fast transfers not be noticed by the TransferWatcher. Which in turn
|
||||
prevents the transfer slot being freed and any further transfers
|
||||
from happening. So, this approach is too fragile to rely on for
|
||||
maintaining the TransferSlots. Instead, need [[todo/assistant_threaded_runtime]],
|
||||
which would allow running something for sure when a transfer thread
|
||||
finishes. **done**
|
||||
|
|
|
@ -13,6 +13,11 @@ When pulling, pushing, and merging, the assistant runs external git
|
|||
commands, and this does block all other threads. The threaded runtime would
|
||||
really help here.
|
||||
|
||||
[[done]]; the assistant now builds with the threaded runtime.
|
||||
Some work still remains to run certian long-running external git commands
|
||||
in their own threads to prevent them blocking things, but that is easy to
|
||||
do, now. --[[Joey]]
|
||||
|
||||
---
|
||||
|
||||
Currently, git-annex seems unstable when built with the threaded runtime.
|
||||
|
|
Loading…
Reference in a new issue