gist/irc/limnoria/titlefetching.md

1.6 KiB
Raw Blame History

A bit opinionated titlefetching

Preparation

load Web
config plugins.web.snarfMultipleUrls True
config plugins.web.snarferShowDomain False
config plugins.web.snarferShowTargetDomain False
config supybot.protocols.http.userAgents "Limnoria UrlPreviewBot"
config supybot.protocols.http.peekSize 1048576
  • enables the plugin (shipped with Limnoria)
  • enables titlefetching for all links on line, not just the first one
  • disables showing domain (small protection against multiple titlesfetcherrs entering a loop or simply not annoying users with clientside link previews (Matrix/Telegram bridges/relays included))
  • disables showing redirect target -||-
  • sets user-agent to “Limnoria UrlPreviewBot” instead of Mozilla/5.0 (compatible; utils.web python module) from 2005
    • I have heard that its bad to pretend to be something you arent and Twitter will only give you HTMl <title>s if your user-agent contains UrlPreviewBot
  • search for html titles from the first MEGABYTE of the webpage as modern web is horrible (looking at you hs.fi & youtube.com)

Actually enabling it

config channel #CHAN plugins.web.titleSnarfer True
  • enables titlefetching per-channel, on #CHAN to be accurate (avoiding unwanted channels in case of botloop)
    • "channel #CHAN" could also be replaced with network NETWORKNAME for every channel on network or config (or omitted entirely) for everywhere (channel takes priority over network which probably takes priority over global)