git-annex

Author	SHA1	Message	Date
Joey Hess	f768cddf3a	fix build on OSX	2012-07-19 20:44:58 -04:00
Joey Hess	107a7b9388	try to make Utility.Mounts portable This is an unholy mashup, but it just might work. It works on Linux, that's all I've tested. :)	2012-07-19 20:38:58 -04:00
Joey Hess	e2c86a4b58	extacted Mounts.hsc from hsshellscript Converted from using c2hs to using hsc2hs, just because other code in git-annex uses hsc2hs. Various cleanups. This code is LGPLed, so I had to include that licence.	2012-07-19 12:53:39 -04:00
Joey Hess	1e0b7dda8c	foo	2012-07-19 01:02:22 -04:00
Joey Hess	9fc94d780b	better readProcess	2012-07-19 00:57:40 -04:00
Joey Hess	1db7d27a45	add back debug logging Make Utility.Process wrap the parts of System.Process that I use, and add debug logging to them. Also wrote some higher-level code that allows running an action with handles to a processes stdin or stdout (or both), and checking its exit status, all in a single function call. As a bonus, the debug logging now indicates whether the process is being run to read from it, feed it data, chat with it (writing and reading), or just call it for its side effect.	2012-07-19 00:46:52 -04:00
Joey Hess	2edb5d145c	rewrote to not use forkProcess That can make the threaded runtime stall.. But it can use threads now!	2012-07-18 19:25:46 -04:00
Joey Hess	f520a2c103	add missing imports	2012-07-18 18:29:33 -04:00
Joey Hess	f2ed3d6c8e	Merge branch 'threaded' into assistant	2012-07-18 18:17:33 -04:00
Joey Hess	d1da9cf221	switch from System.Cmd.Utils to System.Process Test suite now passes with -threaded! I traced back all the hangs with -threaded to System.Cmd.Utils. It seems it's just crappy/unsafe/outdated, and should not be used. System.Process seems to be the cool new thing, so converted all the code to use it instead. In the process, --debug stopped printing commands it runs. I may try to bring that back later. Note that even SafeSystem was switched to use System.Process. Since that was a modified version of code from System.Cmd.Utils, it needed to be converted too. I also got rid of nearly all calls to forkProcess, and all calls to executeFile, which I'm also doubtful about working well with -threaded.	2012-07-18 18:00:24 -04:00
Joey Hess	05310538ef	more debugging	2012-07-18 13:31:00 -04:00
Joey Hess	fb85d8e563	cleanup	2012-07-17 18:36:51 -04:00
Joey Hess	62a35162a0	bugfix	2012-07-17 18:35:56 -04:00
Joey Hess	32ac773934	kqueue: properly call delHook for file deletion, not delDirHook	2012-07-17 18:32:55 -04:00
Joey Hess	30ac6d7be0	robustness fix Don't fall over symlinks, and avoid crashing if the file goes away.	2012-07-17 16:29:49 -04:00
Joey Hess	7d89cf0eb9	cleanup	2012-07-17 16:02:29 -04:00
Joey Hess	e816776a62	add inodes to kqueue's directory cache This is necessary to generate events when a file is deleted and immediately replaced. Otherwise, the cache will have the old file, and so no event would be generated. Use of getFileStatus is suboptimal, it would be faster to use the inode returned by readdir -- but getDirectoryContents does not expose it, so I'd have to copy and modify a lot of low-level code.	2012-07-17 15:57:49 -04:00
Joey Hess	182526ff68	add debugging	2012-07-17 14:40:05 -04:00
Joey Hess	3ea708e03b	Merge branch 'master' into assistant	2012-07-02 15:45:20 -04:00
Joey Hess	7daa269853	better pid file locking code	2012-07-02 13:47:32 -04:00
Joey Hess	74f0d67aa3	avoid untrappable exception if dirContentsRecursive is run on a directory that doesn't exist, or cannot be read The problem is its use of unsafeInterleaveIO, which causes its IO code to run when the thunk is forced, outside any exception trapping the caller may do.	2012-07-02 10:56:26 -04:00
Joey Hess	bea0ac0274	record transfers for git-annex-shell Not yet tested and places git-annex-shell is run need to be modified to pass the new field settings. Note that rsyncServerSend was changed to fork, rather than directly exec rsync, because it needs to keep the transfer lock held, and clean up the transfer log when done.	2012-07-02 01:31:10 -04:00
Joey Hess	d1f49b0ad0	add fields to git-annex-shell	2012-07-02 00:53:00 -04:00
Joey Hess	7625319c2c	Merge branch 'master' into assistant	2012-07-01 21:00:43 -04:00
Joey Hess	29335bf326	pointlessness	2012-06-29 10:00:05 -04:00
Joey Hess	638a321ca5	typo	2012-06-28 14:15:49 -04:00
Joey Hess	421f9ce0e2	fix kqueue build	2012-06-28 14:13:15 -04:00
Joey Hess	4888c5b042	improve thread termination handling The reason the DirWatcher had to wait for program termination was because it used withINotify, so when it finished, its watcher threads were killed. But since I have two DirWatcher threads now, that was not good, and could perhaps explain the MVar problem I saw yesterday. In any case, fixed this part of the code by making the DirWatcher return a handle that can be used to stop it, and now the main Assistant thread is the only one calling waitForTermination.	2012-06-28 13:37:03 -04:00
Joey Hess	67c8ef7de2	use a TMVar SampleMVar won't work; between getting the current value and changing it, another thread could made a change, which would get lost. TMVar works well; this update situation is handled by atomic transactions.	2012-06-26 19:21:44 -04:00
Joey Hess	5cfe91f06d	add a push retry thread	2012-06-25 16:38:12 -04:00
Joey Hess	a71e7161fc	golfing	2012-06-23 01:33:10 -04:00
Joey Hess	e699ce1841	added a merger thread Wow! I can create a file in repo a, and it instantly* shows up in repo b! * under 1 second anyway	2012-06-22 17:01:08 -04:00
Joey Hess	e9630e90de	the syncer now pushes out changes to remotes, in parallel Note that, since this always pushes branch synced/master to the remote, it assumes that master has already gotten all the commits that are on the remote merged in. Otherwise, fast-forward prevention may prevent the push. That's probably ok, because the next stage is to automatically detect incoming pushes and merge.	2012-06-22 15:49:48 -04:00
Joey Hess	28e28bc043	stub syncer thread and commit channel	2012-06-22 14:10:25 -04:00
Joey Hess	ad11de94e5	typo	2012-06-20 15:53:56 -04:00
Joey Hess	483b1b08c6	Merge branch 'master' into watch	2012-06-20 13:15:59 -04:00
Joey Hess	75b6ee81f9	avoid ByteString.Char8 where not needed Its truncation behavior is a red flag, so avoid using it in these places where only raw ByteStrings are used, without looking at the data inside.	2012-06-20 13:13:40 -04:00
Joey Hess	ee2acd474f	[Word8] to filesystem encoded String My, GHC makes this hard.	2012-06-20 12:51:25 -04:00
Joey Hess	5580af5789	add closingTracked flag	2012-06-19 10:22:36 -04:00
Joey Hess	e68b3c99f4	kqueue synthetic add events on startup	2012-06-19 10:08:06 -04:00
Joey Hess	2a61df23e7	kqueue recursive directory adding	2012-06-19 09:56:03 -04:00
Joey Hess	4ab9449cee	add eventsCoalesce	2012-06-19 02:23:45 -04:00
Joey Hess	fd3e945932	fix prototype	2012-06-19 01:57:19 -04:00
Joey Hess	03b9341356	fix scheduling Handle kevent interruptions in the haskell code, so it can yield to other threads	2012-06-19 04:52:55 +00:00
Joey Hess	02e9fdb0a5	kqueue build fix new event dispatch seems a bit broken though	2012-06-19 04:07:48 +00:00
Joey Hess	7a09d74319	lifted out the kqueue and inotify to a generic DirWatcher interface Kqueue code for dispatching events is not tested and probably doesn't build.	2012-06-18 23:49:07 -04:00
Joey Hess	e164553272	robustness fixes	2012-06-19 02:13:26 +00:00
Joey Hess	5e9fdac92f	update kqueue when new directories are added	2012-06-18 21:46:04 -04:00
Joey Hess	2bfcc0b09c	kqueue: add directory content tracking, and change determination This may now return Add or Delete Changes as appropriate. All I know for sure is that it compiles. I had hoped to avoid maintaining my own state about the content of the directory tree, and rely on git to check what was changed. But I can't; I need to know about new and deleted subdirectories to add them to the watch list, and git doesn't deal with (empty) directories. So, wrote all the code to scan directories, remember their past contents, compare with current contents, generate appropriate Change events, and update bookkeeping info appropriately.	2012-06-18 21:29:30 -04:00
Joey Hess	ae7d07ddcb	close fds	2012-06-18 19:14:58 -04:00
Joey Hess	b141d9bcc8	retry interrupted kevent calls Many thanks to geekosaur in #haskell for help with this.	2012-06-18 18:03:50 -04:00
Joey Hess	a11825a1f1	add test stub	2012-06-18 16:56:05 -04:00
Joey Hess	d680ff7ef0	kqueue code compiles on debian kfreebsd	2012-06-18 16:34:08 -04:00
Joey Hess	90d565149a	flesh out kqueue library Have not tried to build this yet. But barring minor mistakes, I think it's good.	2012-06-18 16:18:59 -04:00
Joey Hess	89fcee03d0	add some utility functions for later Will need to update the DirMap to add or remove subdirs.	2012-06-18 13:22:36 -04:00
Joey Hess	a39b73d118	recurse dirTree and open the directories for kqueue to watch	2012-06-18 13:01:58 -04:00
Joey Hess	dc3d9d1e98	added dirTree	2012-06-18 12:53:57 -04:00
Joey Hess	3c8a9043b6	skeleton C library for calling kqueue	2012-06-18 12:25:20 -04:00
Joey Hess	66344a3613	Enable diskfree on kfreebsd, using statvfs. Could not reproduce the build failure I had seen related to this, but the numbers were wrong with statfs64. Probably pulling from the wrong place in the structure. statvfs seems to work..	2012-06-17 18:10:57 -04:00
Joey Hess	e84b78f40c	reorg	2012-06-17 14:02:40 -04:00
Joey Hess	0052cec2b7	add lsof build deps Check for it in configure; and add a --force option for people without it who want to live dangerously.	2012-06-15 23:29:39 -04:00
Joey Hess	bb6074dfea	work around a wrinkle in how lsof handles hard links to files that are open elsewhere +d is probably more expensive, but I need it	2012-06-15 22:34:42 -04:00
Joey Hess	d91950ecba	cleanup	2012-06-15 22:16:13 -04:00
Joey Hess	96ac25094b	fix pid file writing need to truncate, or part of previous longer pid may be left after writing	2012-06-15 20:42:53 -04:00
Joey Hess	85db1720e9	add lsof interface Uses lsof -F0 to get machine-readable output	2012-06-15 15:13:31 -04:00
Joey Hess	4b2b5e820e	add	2012-06-15 14:11:09 -04:00
Joey Hess	36d73b0017	slightly higher-level thread scheduling code Including support for unbound thread sleeping. Haskell's max thread sleep is 37 minutes, due to maxBound Int!	2012-06-13 17:53:19 -04:00
Joey Hess	12dbb9d1d0	plumb file status through to event handlers The idea, not yet done, is to use this to detect when a file has an old change time, and avoid expensive restaging of the file. If git-annex watch keeps track of the last time it finished a full scan, then any symlink that is older than that time must have been scanned before, so need not be added. (Relying on moving, copying, etc of a file all updating its change time.) Anyway, this info is available for free since inotify already checks it, so it might as well make it available.	2012-06-13 01:20:37 -04:00
Joey Hess	b240418acc	better optimisation of add check Now really only done in the startup scan. It turns out to be quite hard for event handlers to know when the startup scan is complete. I tried to make addWatch pass that info, but found threading the state very difficult. For now, a quick hack, using the fast flag. Note that it's actually possible for inotify events to come in while the startup scan is still ongoing. Due to my hack, the expensive check will be done for files added in such inotify events.	2012-06-12 16:24:06 -04:00
Joey Hess	7d2c813396	fix bug that turned files already in git into symlinks This requires a relatively expensive test at file add time to see if it's in git already. But it can be optimised to only happen during the startup scan.	2012-06-12 15:57:24 -04:00
Joey Hess	535d9e4998	add a flag indicating if an event was synthesized during initial dir scan	2012-06-12 14:34:09 -04:00
Joey Hess	942d8f7298	hlint	2012-06-12 11:32:06 -04:00
Joey Hess	433ff41496	bugfix	2012-06-11 02:06:22 -04:00
Joey Hess	d0a0a6ae21	git annex watch --stop	2012-06-11 02:01:20 -04:00
Joey Hess	8539a7bde8	fix pid file locking Ok, that's odd.. opening it before fork breaks the locking. I don't understand why.	2012-06-11 01:37:25 -04:00
Joey Hess	0b3e2bed78	add a pid file Writes pid to a file. Is supposed to take an exclusive lock, but that's not working, and it's too late for me to understand why.	2012-06-11 01:20:19 -04:00
Joey Hess	d5884388b0	daemonize git annex watch	2012-06-11 00:39:09 -04:00
Joey Hess	5d4e09199c	update message based on user feedback	2012-06-07 00:47:09 -04:00
Joey Hess	b8ae9528ab	refactor	2012-06-06 23:20:09 -04:00
Joey Hess	b8f85f7a82	build watch on non-linux, just don't do anything	2012-06-06 22:49:32 -04:00
Joey Hess	c5b11561f0	handle running out of watch descriptors	2012-06-06 16:50:28 -04:00
Joey Hess	db8effb8f3	ignore .gitignore and .gitattributes	2012-06-06 15:50:12 -04:00
Joey Hess	993e6459a3	factor out nukeFile	2012-06-06 13:13:13 -04:00
Joey Hess	cbf16f1967	refactor	2012-06-04 19:43:29 -04:00
Joey Hess	5b4e5ce7e5	deletion When a new file is annexed, a deletion event occurs when it's moved away to be replaced by a symlink. Most of the time, there is no problimatic race, because the same thread runs the add event as the deletion event. So, once the symlink is in place, the deletion code won't run at all, due to existing checks that a deleted file is really gone. But there is a race at startup, as then the inotify thread is running at the same time as the main thread, which does the initial tree walking and annexing. It would be possible for the deletion inotify to run in a perfect race with the addition, and remove the newly added symlink from the git cache. To solve this race, added event serialization via a MVar. We putMVar before running each event, which blocks if an event is already running. And when an event finishes (or crashes!), we takeMVar to free the lock. Also, make rm -rf not spew warnings by passing --ignore-unmatch when deleting directories.	2012-06-04 18:09:18 -04:00
Joey Hess	529a3721e1	refactor	2012-06-04 17:17:05 -04:00
Joey Hess	47f8f43715	workaround other part of moved directory problem This fixes the scenario where: * directory foo is moved away (and still watched) * a new directory foo is made * file (or directory) foo/bar is created * the old directory's file (or directory) "bar" is deleted We don't want a deletion event for foo/bar in this case.	2012-06-04 14:54:14 -04:00
Joey Hess	fa9d479fd1	add explict test that a closed file even is on a regular file There are two reasons for this test. First, there could be a fifo or other non-regular file that was closed. Second, this test avoids ugliness when a subdirectory is moved out of the top of the watch directory to elsewhere, and a file added to it. Since the subdirectory has moved, the file won't be present under the old location, and nothing will be done. I cannot find a way to stop watching such directories, at least not without a lot of pain. The inotify interface in Haskell doesn't make it easy to stop watching a given subdirectory when it's moved out; it would require keeping a map of all watch handles that is shared between threads. This workaround avoids the problem in most cases; the only remaining case being deletion of a file from a moved subdirectory.	2012-06-04 14:31:06 -04:00
Joey Hess	23dbff4b43	add events for symlink creation and directory removal Improved the inotify code, so it will also notice directory removal and symlink creation. In the watch code, optimised away a stat of a file that's being added, that's done by Command.Add.start. This is the reason symlink creation is handled separately from file creation, since during initial tree walk at startup, a stat was already done, and can be reused.	2012-06-04 13:22:56 -04:00
Joey Hess	3b09281b44	add dirContentsRecursive	2012-05-31 19:25:33 -04:00
Joey Hess	bb4f31a0ee	Clean up handling of git directory and git worktree. Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.	2012-05-18 17:03:12 -04:00
Joey Hess	e36808e167	Pass -a to cp even when it supports --reflink=auto, to preserve permissions. Amoung other things, this makes unlocking a WORM backed file and then re-adding it without making any changes not add a new object, as the timestamp is preserved.	2012-05-15 14:18:51 -04:00
Joey Hess	913775410b	update	2012-05-02 11:43:30 -04:00
Joey Hess	5337c4e0c4	rsync protocol?	2012-05-02 11:38:27 -04:00
Joey Hess	0c9c14b52f	percentage library	2012-04-29 17:48:07 -04:00
Joey Hess	1c16f616df	Added shared cipher mode to encryptable special remotes. This option avoids gpg key distribution, at the expense of flexability, and with the requirement that all clones of the git repository be equally trusted.	2012-04-29 14:02:43 -04:00
Joey Hess	f766774209	unbreak code inside ifdef	2012-04-22 11:22:20 -04:00
Joey Hess	42e4145a17	bugfixes	2012-04-22 01:20:17 -04:00
Joey Hess	84ac8c58db	Add annex.httpheaders and annex.httpheader-command config settings Allow custom headers to be sent with all HTTP requests. (Requested by the Internet Archive)	2012-04-22 01:13:09 -04:00
Joey Hess	ed79596b75	noop	2012-04-21 23:32:33 -04:00
Joey Hess	bee420bd2d	in which I discover void void :: Functor f => f a -> f () -- ah, of course that's useful :)	2012-04-21 23:06:19 -04:00
Joey Hess	b98b69e8c6	honor core.sharedRepository when making all the other files in the annex Lock files, directories, etc.	2012-04-21 19:36:03 -04:00
Joey Hess	7e45712d19	better file mode setting code	2012-04-21 16:01:56 -04:00
Joey Hess	b4a5e39ee6	Support git's core.sharedRepository configuration This is incomplete, it does not honor it yet for hash directories and other annex bookkeeping files. Some of that is not needed for a bare repo; some of it may be.	2012-04-21 15:36:52 -04:00
Joey Hess	cbf9a9420e	avoid chown call if the current file mode is same as new Not only an optimisation. fsck always tried to preventWrite to make sure file modes are good, and in a shared repo, that will fail on directories not owned by the current user. Although there may be other problems with such a setup.	2012-04-21 11:59:50 -04:00
Joey Hess	37061c019d	tweak	2012-04-18 13:23:46 -04:00
Joey Hess	d75771b0ab	clear errno after successful call When preparing the debian stable backport, I am seeing a call to statvfs() succeed, but also set errno to 2 (ENOENT). Not sure why this happens; I am in a schroot when it does happen, or perhaps stable's libc is a little broken and sets errno incorrectly. Anyway, it should be perfectly fine to clear errno after the successful call, rather than before it.	2012-04-18 13:22:50 -04:00
Joey Hess	142bde13cd	import juggling	2012-04-14 12:33:32 -04:00
Joey Hess	3642c72320	Renamed diskfree.c to avoid OSX case insensativity bug.	2012-04-13 11:26:39 -04:00
Joey Hess	6464a576cd	allow add or del events to be ignored	2012-04-12 17:28:40 -04:00
Joey Hess	be4edbaaf1	allow excluding directories from being watched	2012-04-12 16:59:33 -04:00
Joey Hess	4ccc86669a	don't fall over broken links	2012-04-12 16:46:57 -04:00
Joey Hess	d08a67a78e	put in a warning so I remember to look at this	2012-04-12 15:48:59 -04:00
Joey Hess	1f34bf9acb	add waitForTermination	2012-04-11 20:28:01 -04:00
Joey Hess	b133a76f96	recursive inotify thing Recursive inotify has beaten me before, with its bad design and races, but not this time! (I think.) This is able to follow the strongest filesystem traffic I can throw at it, and robustly notices every file add and delete. Mostly that's down to Haskell having a quite nice threaded inotify library (that does its own buffering). A key insight was realizing that the inotify directory add race could be dealt with by scanning for files inside newly added directories. TODO: Add support for freebsd/osx kqueue; see http://hackage.haskell.org/package/kqueue Can a git-annex-monitor be far off?	2012-04-11 20:09:38 -04:00
Joey Hess	62c69e7e25	Disable diskfree on kfreebsd, as I have a build failure on kfreebsd-i386 that is quite likely caused by it.	2012-04-07 15:50:34 -04:00
Joey Hess	42cc85a82f	use struct statfs64 on OSX	2012-03-23 12:05:05 -04:00
Joey Hess	0f2052446f	tweak	2012-03-22 23:02:20 -04:00
Joey Hess	6f514155f1	update	2012-03-22 23:02:20 -04:00
Joey Hess	d7d2fefbf2	break out kfreebsd Now tested on linux and freebsd (amd64).	2012-03-22 17:32:47 -04:00
Joey Hess	e38a839a80	Rewrote free disk space checking code Moving the portability handling into a small C library cleans up things a lot, avoiding the pain of unpacking structs from inside haskell code.	2012-03-22 17:32:47 -04:00
Joey Hess	a362c46b70	fun with symbols Nothing at all on hackage is using <&&> or <\|\|>. (Also, <&&> should short-circuit on failure.)	2012-03-17 00:38:40 -04:00
Joey Hess	771052a85e	optimize monadic \|\| (\|\|) used applicative style runs both conditions rather than short circuiting. Add an orM that properly short-circuits.	2012-03-16 12:28:17 -04:00
Joey Hess	b06336fa3a	simplify	2012-03-16 02:12:56 -04:00
Joey Hess	c0c9991c9f	nukes another 15 lines thanks to ifM	2012-03-15 20:39:25 -04:00
Joey Hess	60ab3d84e1	added ifM and nuked 11 lines of code no behavior changes	2012-03-14 17:43:34 -04:00
Joey Hess	95a1f6b2ac	check hook executability	2012-03-14 12:17:38 -04:00
Joey Hess	6fd0c0bfec	move	2012-03-11 18:12:36 -04:00
Joey Hess	f9d44cccd9	perhaps more clear type	2012-03-10 11:38:38 -04:00
Joey Hess	10d9315b59	cleanup	2012-03-09 20:43:50 -04:00
Joey Hess	bca3fd65b9	fix key directory hash calculation code Fix Key directory hash calculation code to behave as it did before version 3.20120227 when a key contains non-ascii. The hash directories for a given Key are based on its md5sum. Prior to ghc 7.4, Keys contained raw, undecoded bytes, so the md5sum was taken of each byte in turn. With the ghc 7.4 filename encoding change, keys contains decoded unicode characters (possibly with surrigates for undecodable bytes). This changes the result of the md5sum, since the md5sum used is pure haskell and supports unicode. And that won't do, as git-annex will start looking in a different hash directory for the content of a key. The surrigates are particularly bad, since that's essentially a ghc implementation detail, so could change again at any time. Also, changing the locale changes how the bytes are decoded, which can also change the md5sum. Symptoms would include things like: * git annex fsck would complain that no copies existed of a file, despite its symlink pointing to the content that was locally present * git annex fix would change the symlink to use the wrong hash directory. Only WORM backend is likely to have been affected, since only it tends to include much filename data (SHA1E could in theory also be affected). I have not tried to support the hash directories used by git-annex versions 3.20120227 to 3.20120308, so things added with those versions with WORM will require manual fixups. Sorry for the inconvenience!	2012-03-09 20:03:51 -04:00
Joey Hess	d6e77595ba	factor out Utility.FileSystemEncoding	2012-03-09 19:08:10 -04:00
Joey Hess	789254747b	refactor	2012-03-09 18:52:03 -04:00
Joey Hess	51338486dc	Fix a bug in symlink calculation code, that triggered in rare cases where an annexed file is in a subdirectory that nearly matched to the .git/annex/object/xx/yy subdirectories. This is a straight up pure-code stinker. The relative path calculation looked for common subdirectories in the two paths, but failed to stop after the paths diverged. When a later pair of subdirectories were the same, the resulting relative path was wrong. Added regression test for this.	2012-03-05 12:42:52 -04:00
Joey Hess	6c0155efb7	refactor	2012-02-20 15:22:21 -04:00
Joey Hess	a1e52f0ce5	hlint	2012-02-16 00:44:51 -04:00
Joey Hess	ecfcb41abe	work around Network.Browser bug that converts a HEAD to a GET when following a redirect The code explicitly switches from HEAD to GET for most redirects. Possibly because someone misread a spec (which does require switching from POST to GET for 303 redirects). Or possibly because the spec really is that bad. Upstream bug: https://github.com/haskell/HTTP/issues/24 Since we absolutely don't want to download entire (large) files from the web when checking that they exist with HEAD, I wrote my own redirect follower, based closely on the one used by Network.Browser, but without this misfeature. Note that Network.Browser checks that the redirect url is a http url and fails if not. I don't, because I want to not need to change this code when it gets https support (related: I'm surprised to see it doesn't support https yet..). The check does not seem security significant; it doesn't support file:// urls for example. If a http url is redirected to https, the Network.Browser will actually make a http connection again. This could loop, but only up to 5 times.	2012-02-10 21:54:25 -04:00
Joey Hess	9030f68452	When checking that an url has a key, verify that the Content-Length, if available, matches the size of the key. If there's no Content-Length, or the key has no size, this check is not done, but it should happen most of the time, and protect against web content that has changed.	2012-02-10 19:23:41 -04:00
Joey Hess	56470ce3e5	really fix foreign C functions filename encodings GHC should probably export withFilePath.	2012-02-04 14:30:28 -04:00
Joey Hess	f1c7dc1212	fix touch and statfs to work on any files in any locale Use withCAString rather than withCString. XXX Actually, this only works in non-unicode locales when presented with unicode characters. Help?	2012-02-04 12:44:51 -04:00
Joey Hess	44b115e0b1	Merge branch 'master' into ghc7.4 Conflicts: Utility/Misc.hs	2012-02-03 16:48:40 -04:00
Joey Hess	146c36ca54	IO exception rework ghc 7.4 comaplains about use of System.IO.Error to catch exceptions. Ok, use Control.Exception, with variants specialized to only catch IO exceptions.	2012-02-03 16:47:24 -04:00
Joey Hess	d8fb97806c	support all filename encodings with ghc 7.4 Under ghc 7.4, this seems to be able to handle all filename encodings again. Including filename encodings that do not match the LANG setting. I think this will not work with earlier versions of ghc, it uses some ghc internals. Turns out that ghc 7.4 has a special filesystem encoding that it uses when reading/writing filenames (as FilePaths). This encoding is documented to allow "arbitrary undecodable bytes to be round-tripped through it". So, to get FilePaths from eg, git ls-files, set the Handle that is reading from git to use this encoding. Then things basically just work. However, I have not found a way to make Text read using this encoding. Text really does assume unicode. So I had to switch back to using String when reading/writing data to git. Which is a pity, because it's some percent slower, but at least it works. Note that stdout and stderr also have to be set to this encoding, or printing out filenames that contain undecodable bytes causes a crash. IMHO this is a misfeature in ghc, that the user can pass you a filename, which you can readFile, etc, but that default, putStr of filename may cause a crash! Git.CheckAttr gave me special trouble, because the filenames I got back from git, after feeding them in, had further encoding breakage. Rather than try to deal with that, I just zip up the input filenames with the attributes. Which must be returned in the same order queried for this to work. Also of note is an apparent GHC bug I worked around in Git.CheckAttr. It used to forkProcess and feed git from the child process. Unfortunatly, after this forkProcess, accessing the `files` variable from the parent returns []. Not the value that was passed into the function. This screams of a bad bug, that's clobbering a variable, but for now I just avoid forkProcess there to work around it. That forkProcess was itself only added because of a ghc bug, #624389. I've confirmed that the test case for that bug doesn't reproduce it with ghc 7.4. So that's ok, except for the new ghc bug I have not isolated and reported. Why does this simple bit of code magnet the ghc bugs? :) Also, the symlink touching code is currently broken, when used on utf-8 filenames in a non-utf-8 locale, or probably on any filename containing undecodable bytes, and I temporarily commented it out.	2012-02-03 16:23:20 -04:00
Joey Hess	a964012fc3	switch to the strict state monad I had not realized what a memory leak the lazy state monad could be, although I have not seen much evidence of actual leaking in git-annex. However, if running git-annex on a great many files, this could matter. The additional Utility.State.changeState adds even more strictness, avoiding a problem I saw in github-backup where repeatedly modifying state built up a huge pile of thunks.	2012-01-29 22:55:06 -04:00
Joey Hess	ce5637498f	remove Utility.Conditional and use IfElse This drops the >>! and >>? with the nice low fixity. IfElse does have undocumented >>=>>! and >>=>>? operators, but I deem that too fishy. Anyway, using whenM and unlessM is easier; I sometimes mixed the operators up.	2012-01-24 16:22:07 -04:00
Joey Hess	ba6088b249	rename readMaybe to readish a stricter (but also partial) readMaybe is getting added to base	2012-01-23 17:00:10 -04:00
Joey Hess	5e172b43c4	a few things available elsewhere...	2012-01-23 16:57:45 -04:00
Joey Hess	183bdacca2	treak	2012-01-21 02:49:32 -04:00
Joey Hess	25f998679c	typo	2012-01-20 15:06:17 -04:00
Joey Hess	0eed604446	Add a sanity check for bad StatFS results. git-annex FTBFS on s390, mips, powerpc, sparc. That StatFS code is failing on all of them. At least on s390, the failure appears as: Just (FileSystemStats {fsStatBlockSize = 4096, fsStatBlockCount = 0, fsStatByteCount = 0, fsStatBytesFree = 0, fsStatBytesAvailable = 0, fsStatBytesUsed = 0}) While I don't understand why this is happening, or how to fix it, bandaid over it by checking for obviously bad values and returning Nothing. That disables disk free space checking, but at least git-annex will work. Upstream bug: http://code.google.com/p/xmobar/issues/detail?id=70	2012-01-14 17:17:20 -04:00
Joey Hess	9ffd97442b	don't use GPG_AGENT_INFO to force batch mode in test suite Fails with gpg 2. Instead, use a different environment variable. The clean fix would instead be to add an annex.gpg-options configuration. But, that would be rather a lot of work and it's unlikely it would be useful for much else.	2012-01-09 18:19:29 -04:00
Joey Hess	60c1aeeb6f	Fix overbroad gpg --no-tty fix from last release. Only set --no-tty when GPG_AGENT_INFO is set and batch mode is used. In the test suite, set GPG_AGENT_INFO to /dev/null to avoid the test suite relying on /dev/tty.	2012-01-07 12:38:08 -04:00
Joey Hess	769edd6b08	Run gpg with --no-tty. Closes: #654721	2012-01-05 13:44:09 -04:00
Joey Hess	1e929c022d	typo	2012-01-03 18:40:47 -04:00
Joey Hess	ee554542c1	after is a better name for observe_	2012-01-03 00:29:27 -04:00
Joey Hess	34abd7bca8	no implicit dotfiles in add Dotfiles, and files inside dotdirs are not added by "git annex add" unless the dotfile or directory is explicitly listed. So "git annex add ." will add all untracked files in the current directory except for those in dotdirs. One reason for this is that it will make git-annex more usable with vcsh, where you don't want "vcsh big annex add" to check in all the dotfiles that are already versioned in other repositories. (If you're using vcsh for repos that contain non-dotfiles, this won't help, and you'll need to .gitignore such things, but this will cover the common case.) A more general reason why this seems like a good idea is the same reason ls ignores dotfiles, just the unix convention that they are cruft that is kept out of the way most of the time. All the other git-annex commands still do deal with any dotfiles that do get into the annex. This seemed right because if I've gone to the trouble to add a dotfile, I will want "git annex get ." to get it along with everything else.	2012-01-03 00:11:00 -04:00
Joey Hess	fc80b8d96b	factor observe_	2012-01-03 00:11:00 -04:00
Joey Hess	aa0882691b	Added remote.name.annex-web-options configuration setting, which can be used to provide parameters to whichever of wget or curl git-annex uses (depends on which is available, but most of their important options suitable for use here are the same).	2012-01-02 14:20:20 -04:00
Joey Hess	442202dd6d	add a new useful thing	2012-01-02 11:18:33 -04:00
Joey Hess	f015ef5fde	cleanup	2011-12-23 01:08:19 -04:00
Joey Hess	7227dd8f21	add escape_var hack Makes it easy to find files with duplicate contents, anyway.. :)	2011-12-23 01:08:19 -04:00
Joey Hess	db964e358f	reorg	2011-12-23 01:08:18 -04:00
Joey Hess	cba3ce08df	handle C-style escapes in Format I was happily able to repurpose some code from Git.Filename to handle this. I remember writing that code... a whole afternoon at a coffee shop, after which I felt I'd struggled with Haskell and git, and sorta lost, in needing to write this nasty peice of code. But was also pleased at the use of a pair of functions and quickcheck that allowed me to get it 100% right. So, turns out I not only got it right, but the code wasn't as special-purpose as I'd feared. Yay!	2011-12-23 01:05:16 -04:00
Joey Hess	a0872a8ec3	better data type	2011-12-22 19:56:31 -04:00
Joey Hess	06bafae9e0	Format strings can be specified using the new --find option, to control what is output by git annex find.	2011-12-22 18:31:44 -04:00
Joey Hess	cf496f09ab	add a text formatter This is built for speed; a format string is parsed once, generating a Format, that can be applied repeatedly to different sets of variables to generate output.	2011-12-22 17:59:14 -04:00
Joey Hess	82a145df91	test encrypted special remote This involved adding a test harness to run gpg with a dummy key, and lots of fun.	2011-12-20 23:24:06 -04:00
Joey Hess	c11cfea355	split out Utility.Gpg with the generic gpg interface, from Crypto	2011-12-20 23:24:06 -04:00
Joey Hess	8e2f74f7ab	update	2011-12-20 23:24:05 -04:00
Joey Hess	815d1318d7	comment	2011-12-20 23:24:05 -04:00
Joey Hess	dc5ed8d3b6	amusing name This is both a partial Prelude that conflicts with the real one, and a way to guard against the Prelude's partial functions.	2011-12-20 11:01:50 -04:00
Joey Hess	718a278f97	simplify	2011-12-15 22:19:05 -04:00
Joey Hess	9901fc04a0	move	2011-12-15 18:23:07 -04:00
Joey Hess	95d2391f58	more partial function removal Left a few Prelude.head's in where it was checked not null and too hard to remove, etc.	2011-12-15 18:19:36 -04:00
Joey Hess	817b1936e6	add beginning, end Safe versions of init and last	2011-12-15 16:59:18 -04:00
Joey Hess	e7a555bf21	fix types	2011-12-15 15:39:33 -04:00
Joey Hess	2332afb4bc	cleanup	2011-12-12 02:04:48 -04:00
Joey Hess	4b3c4c0c2b	avoid some read	2011-12-09 18:57:09 -04:00
Joey Hess	28699c95a7	some work on avoiding partial functions There are still hundreds of places that use partial functions head, tail, init, and last.	2011-12-09 18:10:41 -04:00
Joey Hess	d64132a43a	hslint	2011-12-09 01:57:13 -04:00
Joey Hess	64672c6262	refactor	2011-12-03 09:10:23 -04:00
Joey Hess	e19dc85547	factor out untilTrue	2011-12-02 16:12:31 -04:00
Joey Hess	6869e6023e	support .git/annex on a different disk than the rest of the repo The only fully supported thing is to have the main repository on one disk, and .git/annex on another. Only commands that move data in/out of the annex will need to copy it across devices. There is only partial support for putting arbitrary subdirectories of .git/annex on different devices. For one thing, but this can require more copies to be done. For example, when .git/annex/tmp is on one device, and .git/annex/journal on another, every journal write involves a call to mv(1). Also, there are a few places that make hard links between various subdirectories of .git/annex with createLink, that are not handled. In the common case without cross-device, the new moveFile is actually faster than renameFile, avoiding an unncessary stat to check that a file (not a directory) is being moved. Of course if a cross-device move is needed, it is as slow as mv(1) of the data.	2011-11-28 16:17:55 -04:00
Mark Wright	041d324125	Remove haskell98 to build with ghc 7.2.2, also built with ghc 7.0.4 Signed-off-by: Joey Hess <joey@kitenet.net>	2011-11-26 12:05:08 -04:00
Joey Hess	1326bb8635	Avoid excessive escaping for rsync special remotes that are not accessed over ssh. This is actually tricky, `45bbf210a1` added the escaping because it's needed for rsync that does go over ssh. So I had to detect whether the remote's rsync url will use ssh or not, and vary the escaping.	2011-11-18 12:53:48 -04:00
Joey Hess	49d2177d51	factored out some useful error catching methods	2011-11-10 20:57:28 -04:00
Joey Hess	2934a65ac5	add safeSystem This is more safe than System.Cmd.Utils.safeSystem, since it does not throw an error on nonzero exit status.	2011-11-09 17:28:35 -04:00
Joey Hess	fdf988be6d	indent	2011-11-07 21:27:43 -04:00
Joey Hess	0bb798e351	Pass -t to rsync to preserve timestamps.	2011-11-04 19:41:11 -04:00
Joey Hess	c643136e32	playing with >=> Apparently in haskell if you teach a man to fish, he'll write more pointfree code.	2011-10-31 23:39:55 -04:00
Joey Hess	66fa4c947c	correct spelling of "gibibyte"	2011-10-16 01:03:38 -04:00
Joey Hess	23f2a12816	broke up Utility	2011-10-16 00:50:12 -04:00
Joey Hess	91366c896d	clean Annex stuff out of Utility/	2011-10-16 00:04:26 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00
Joey Hess	b505ba83e8	minor syntax changes	2011-10-11 14:43:45 -04:00
Joey Hess	5414bbce58	git-annex-shell uuid verification * git-annex now asks git-annex-shell to verify that it's operating in the expected repository. * Note that this git-annex will not interoperate with remotes using older versions of git-annex-shell. The reason for this check is to avoid git-annex getting confused about what remote repository actually contains a value. It's a prerequisite for supporting git insteadOf aliases.	2011-10-06 19:24:11 -04:00
Joey Hess	8ef2095fa0	factor out common imports no code changes	2011-10-03 23:29:48 -04:00
Joey Hess	297bc648b9	make unused check branches and tags too needs time and space optimisation	2011-09-28 16:43:10 -04:00
Joey Hess	4bf1a5ef59	refactor	2011-09-23 18:13:24 -04:00
Joey Hess	9f6b7935dd	go go gadget hlint	2011-09-20 23:24:48 -04:00
Joey Hess	b62123c378	simplify	2011-09-20 00:59:13 -04:00
Joey Hess	5253379953	convert Token to have separate constructors for each peice of syntax	2011-09-20 00:49:40 -04:00
Joey Hess	dcded89129	reorg	2011-09-19 01:38:01 -04:00
Joey Hess	8d1e8c0760	golfing with curry	2011-09-18 21:02:40 -04:00
Joey Hess	b516cecff2	probably better to error on unknown token	2011-09-18 20:58:34 -04:00
Joey Hess	33cd1ffbfe	make find show files meeting limits, even when not present find: Rather than only showing files whose contents are present, when used with --exclude --copies or --in, displays all files that match the specified conditions. Note that this is a behavior change for find --exclude! Old behavior can be gotten with find --in . --exclude=...	2011-09-18 20:42:15 -04:00
Joey Hess	38c0f3eaf8	add a value to match against to match and matchM	2011-09-18 17:47:24 -04:00
Joey Hess	3e15187ac1	move to Utility	2011-09-18 16:36:30 -04:00
Joey Hess	2f4d4d1c45	basic json support This includes a generic JSONStream library built on top of Text.JSON (somewhat hackishly). It would be possible to stream out a single json document describing all actions, but it's probably better for consumers if they can expect one json document per line, so I did it that way instead. Output from external programs used for transferring files is not currently hidden when outputting json, which probably makes it not very useful there. This may be dealt with if there is demand for json output for --get or --move to be parsable. The version, status, and find subcommands have hand-crafted output and don't do json. The whereis subcommand needs to be modified to produce useful json.	2011-09-01 15:22:06 -04:00
Joey Hess	a64c16bf7d	tweak	2011-08-30 13:23:21 -04:00
Joey Hess	6e750764b7	The wget command will now be used in preference to curl, if available. Got tired of curl's various ugly progress bars.	2011-08-27 12:31:50 -04:00
Joey Hess	678726c10c	code simplification thanks to applicative functors	2011-08-25 01:27:19 -04:00
Joey Hess	203148363f	split groups of related functions out of Utility	2011-08-22 16:14:12 -04:00
Joey Hess	737b5d14c9	moved files around	2011-08-20 16:11:42 -04:00
Joey Hess	6c396a256c	finished hlint pass	2011-07-15 12:47:14 -04:00
Joey Hess	cab4ac247c	rename	2011-07-05 20:36:43 -04:00
Joey Hess	c98b5cf36e	rename	2011-07-05 20:24:10 -04:00

... 8 9 10 11 12 ...

668 commits