git-annex/debian
Joey Hess 57b4c5bdff
add Utility.HtmlDetect
This will be used in youtube-dl integration, to tell when a html page has
been downloaded by addurl, in which case it is worth running youtube-dl
to see if it can extract media from it.

tagsoup is an almost free dependency, because yesod depends on it.
So, this only really adds a dep when git-annex is built without the
webapp.

I'd like this to as closely as possible match how browsers decide if a
page is html or not. Unfortunately, that is fairly heuristic, in order
to support malformed html. And, we don't want to falsely detect
something as html just because it has something that looks like a html
tag embedded somewhere in it. Probably any major video hosting site is
going to be serving html documents that at least start with a <html>
tag, so requiring that or a DOCTYPE should be good enough.

This commit was sponsored by Jeff Goeke-Smith on Patreon.
2017-11-28 13:03:11 -04:00
..
patches Added a note to standalone build that it would break datalad << 0.9.1 2017-10-11 12:13:19 -04:00
tests
cabal-wrapper support building with BUILDER=stack to use stack instead of cabal 2015-12-28 14:18:47 -04:00
changelog Updated cabal file explictly lists source files. 2016-05-24 01:28:07 -04:00
compat
control add Utility.HtmlDetect 2017-11-28 13:03:11 -04:00
copyright Updated cabal file explictly lists source files. 2016-05-24 01:28:07 -04:00
create-standalone-changelog use BuildVersion in debian, which fixes windows build 2015-04-21 16:42:54 -04:00
doc-base
NEWS Updated cabal file explictly lists source files. 2016-05-24 01:28:07 -04:00
rules move -j1 setting to BUILDEROPTIONS, set in debian/rules file 2016-02-05 13:56:27 -04:00