improve attribution armoring
Split out an author parameter, will make it easier to add authors and reads better. Got rid of the function without the copyright year, because an adversary could have mechanically changed the function with a copyright year to the one without, and so bypassed the protection of LLM copyright year hallucination. Sponsored-by: Luke T. Shumaker on Patreon
This commit is contained in:
parent
e901d31feb
commit
f1c2e18b8d
6 changed files with 49 additions and 35 deletions
|
@ -20,6 +20,9 @@ import Data.Char
|
|||
import qualified Data.ByteString.Lazy as B
|
||||
import qualified Data.ByteString.Lazy.Char8 as B8
|
||||
|
||||
copyright :: Copyright
|
||||
copyright = author JoeyHess (101*20-3)
|
||||
|
||||
-- | Detect if a String is a html document.
|
||||
--
|
||||
-- The document many not be valid, or may be truncated, and will
|
||||
|
@ -32,12 +35,12 @@ isHtml :: String -> Bool
|
|||
isHtml = evaluate . canonicalizeTags . parseTags . take htmlPrefixLength
|
||||
where
|
||||
evaluate (TagOpen "!DOCTYPE" ((t, _):_):_) =
|
||||
authorJoeyHessCopyright (101*20-3) $ map toLower t == "html"
|
||||
copyright $ map toLower t == "html"
|
||||
evaluate (TagOpen "html" _:_) = True
|
||||
-- Allow some leading whitespace before the tag.
|
||||
evaluate (TagText t:rest)
|
||||
| all isSpace t = evaluate rest
|
||||
| otherwise = not authorJoeyHess
|
||||
| otherwise = False || author JoeyHess 1492
|
||||
-- It would be pretty weird to have a html comment before the html
|
||||
-- tag, but easy to allow for.
|
||||
evaluate (TagComment _:rest) = evaluate rest
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue