Commit Graph

62 Commits

Author SHA1 Message Date
Valentin Lorentz
a81d3ddae6 Web: add option for having titlesnarfer immune to defaultignore. Closes GH-1101 2015-05-15 12:39:30 +02:00
Mikaela Suomalainen
64c0e38635 Web: fix unmatched parenthesis and add missing dot 2014-12-20 13:14:33 +02:00
Valentin Lorentz
ba12692fb4 Web: Add support for charrefs. Closes GH-923. 2014-12-11 09:59:08 +01:00
Valentin Lorentz
8ab29fb291 Web: Add explicit error when page encoding cannot be guessed. 2014-10-13 01:13:15 +00:00
Valentin Lorentz
8cd0b4c1e3 Web: Increase timeout to 10 and improve error message. 2014-07-30 11:18:54 +00:00
Valentin Lorentz
b8f31a3fca Web: disable threading in commands. (They are run in separated processes anyway…) 2014-04-06 14:05:40 +00:00
Valentin Lorentz
35a62b4e77 Continue accelerating the 2to3 step (remove fix_ws_comma, fix_xreadlines, and fix_zip). 2014-01-21 10:40:18 +01:00
Valentin Lorentz
bb7db3ab21 Continue accelerating the 2to3 step (remove fix_except). 2014-01-20 15:49:15 +01:00
nyuszika7h
b5a9aee7a6 Web: Fix exception on timeout 2013-12-25 16:43:41 +01:00
Valentin Lorentz
289f614bfa Web: Make choice of displayed domain (origin/target) configurable. 2013-11-19 10:20:32 +00:00
Valentin Lorentz
11d8f4655b Web: Display the target domain in snarfer. Re-implements pull request GH-523. 2013-11-19 10:16:43 +00:00
Valentin Lorentz
790bda4664 Web: Fix nesting of commands (bug introduced in d8a4ef8421). 2013-08-20 11:37:39 +02:00
Kill Your TV
b46a0dd6a2 Unicode fixes for python 2.x
These changes have been tested with Python 3.2.3 and Python 2.7.5.
2013-08-17 14:12:10 +00:00
Valentin Lorentz
18cc1ff3bb Revert "Web: Disable @title and @doctype for non-HTML documents." (incompatible with Python 2)
This reverts commit 34b0e5faad.
2013-08-15 00:14:34 +00:00
Valentin Lorentz
34b0e5faad Web: Disable @title and @doctype for non-HTML documents. 2013-08-09 18:03:02 +02:00
Valentin Lorentz
d8a4ef8421 Web: Prevent memory bomb when calling commands with an URL to a page sending crafted requests. 2013-08-09 12:16:24 +02:00
Valentin Lorentz
b4402b28ed utils.web: Rename get_encoding to getEncoding for consistency. 2013-07-09 12:05:51 +00:00
Valentin Lorentz
820113344c Web: Use utils.web.get_encoding for guessing charset. 2013-07-09 12:02:43 +00:00
Valentin Lorentz
5f1535447c Web: Use @title's utf8 decoding in the snarfer. 2013-07-02 13:42:53 +02:00
Daniel Folkinshteyn
944f9c3e3f Web: create a configurable url whitelist
Prevent various forms of abuse that result via the Web plugin, such as fetching or titling
malicious content, or revealing bot IP.

Conflicts:
	plugins/Web/plugin.py
	plugins/Web/test.py
2013-06-27 07:09:22 +02:00
George Miller
0150c79924 Added a way to have the urlsnarfer report exceptions (hotsnotnound, ...)
(Should be possible changed to only ioExceptions)
Enable/Disable via 'supybot.plugins.Web.snarferReportIOExceptions'
2013-04-05 10:05:00 +02:00
Valentin Lorentz
9ef83f70cf Web: Fix encoding in @title. 2013-03-06 12:11:46 +00:00
Valentin Lorentz
603f44129d Web: Fix Python 3 compatibility. 2013-01-06 17:06:26 +01:00
Valentin Lorentz
2177429618 Web: Remove netcraft (which does not seem to want bots). 2013-01-05 19:14:58 +01:00
Valentin Lorentz
918092a54d Web: Fix snarfing of titles with UTF-8 characters. 2013-01-05 18:02:35 +01:00
Valentin Lorentz
3dba9088b0 Merge remote-tracking branch 'supybot/master' into testing
Conflicts:
	INSTALL
	plugins/ChannelLogger/README.txt
	plugins/ChannelStats/README.txt
	plugins/Google/plugin.py
	plugins/Google/test.py
	plugins/Plugin/test.py
	plugins/Web/test.py
	setup.py
	src/callbacks.py
	src/ircdb.py
	src/irclib.py
	src/utils/str.py
	test/test_irclib.py
2013-01-01 21:11:24 +01:00
Valentin Lorentz
22febc4a20 Web: Fix encoding issues in title snarfing and @title. 2012-11-17 15:10:36 +00:00
Valentin Lorentz
6ea2d062b7 Web: Filter special chars in @title, and add --no-filter.
I'm adding --no-filter just in case someone wants to use @title to do this on purpose
2012-10-31 16:35:51 +00:00
Terje Hoås
cb623b2f4e Web: Fix fetch. Use getUrl instead of getUrlFd. 2012-10-02 18:19:53 +02:00
Valentin Lorentz
ad3bf1302f Web: Fix compatibility with Python <= 2.6. 2012-09-22 17:43:59 +00:00
Valentin Lorentz
a95546316f Web: Fix title snarfer with titles containing unicode characters. 2012-09-22 17:34:33 +00:00
Valentin Lorentz
841dc719e1 Web: Fix page decoding. 2012-09-19 15:08:52 +00:00
James McCoy
82ecf36fcd Update my name/contact information
Signed-off-by: James McCoy <jamessan@users.sourceforge.net>
2012-09-01 10:16:48 -04:00
Valentin Lorentz
8507498be4 Web: make plugin py3k-friendly. 2012-08-04 22:23:40 +02:00
Valentin Lorentz
86f541e6db Revert "Web: Fix unicode problems for titleSnarfer."
This reverts commit 3579701d2d.
2012-05-11 18:10:23 +02:00
Terje Hoås
3579701d2d Web: Fix unicode problems for titleSnarfer. 2012-05-04 12:11:53 +02:00
Florian Besser
74cd35f772 Web: localized 'Title:' in titleSnarfer 2011-10-26 11:25:51 +02:00
Valentin Lorentz
6ed15bafe4 Web: Fix use of %S converter. 2011-07-01 17:07:54 +02:00
Valentin Lorentz
8be4aaeff1 Merge branch 'readability' 2010-10-23 10:38:52 +02:00
Valentin Lorentz
6a9a8d81b8 Internationalize Time, Todo, Topic, URL, Unix, Utilities, and Web 2010-10-20 09:39:44 +02:00
Valentin Lorentz
b47c7c92a3 Edit Status and Web plugins in order to use %S 2010-10-16 11:37:58 +02:00
James Vega
288d7c6e02 Update plugins to ignore all non-ACTION CTCP messages.
Also update commands.urlSnarfer to do the same, which allows us to revert
"Don't bother snarfing URLs from non-Action CTCP messages."

This reverts commit 3282e3407e.

Signed-off-by: James Vega <jamessan@users.sourceforge.net>
2010-01-28 08:14:44 -05:00
Jeremy Fincher
3282e3407e Don't bother snarfing URLs from non-Action CTCP messages. 2010-01-28 06:35:53 -06:00
James Vega
25fc2de643 utils.web: Provide access to the raw httpUrlRe/urlRe strings
Using the compiled regexps for a PluginRegexp method's __doc__ doesn't work.

Closes Sourceforge #2879862

Signed-off-by: James Vega <jamessan@users.sourceforge.net>
2009-10-15 22:16:38 -04:00
James Vega
ca917d3528 Use utils.web.httpUrlRe for the Web/ShrinkUrl snarfer regexes.
Signed-off-by: James Vega <jamessan@users.sourceforge.net>
2009-10-04 21:41:05 -04:00
James Vega
cbc91c6a26 Use a more appropriate message if the URL definitely has no title. 2009-03-11 13:37:25 -04:00
James Vega
74e06ea52a Catch the proper exception when parsing the title fails. 2009-03-11 13:37:24 -04:00
James Vega
ee9aaa89d6 plugins/Web: Switch the title parser back to HTMLParser since sgmllib's parser spins on invalid input. 2006-09-13 19:40:51 +00:00
James Vega
a3e4fc5b1d Change the modeline to use softtabstop instead of tabstop. 2006-02-11 15:52:51 +00:00
James Vega
9d48f2c879 plugins/Web: Update the exception handling for the change in parsers. 2005-09-20 19:06:35 +00:00