88 Commits

Author SHA1 Message Date
James Lu
97fd1e7897 Bump version to 2022.02.01 2022-02-01 22:44:49 -08:00
James Lu
62387421d8 Bump version to 2021.08.01+git 2021-08-01 13:50:01 -07:00
James Lu
ae7646a274 Bump version to 2021.08.01 2021-08-01 13:49:55 -07:00
James Lu
7e959b9f24 Remove references to my old nick & emails 2021-07-03 16:36:43 -07:00
James Lu
c772ed1316 SysDNS, Wikifetch: remove broken tests 2021-05-29 13:15:31 -07:00
James Lu
5062d3c935 Wikifetch: remove broken Wikimedia commons test 2020-06-09 10:06:13 -07:00
James Lu
17b3e86d1b Bump version to 2020.01.05+git 2020-01-05 21:12:06 -08:00
James Lu
5675a97857 Bump version to 2020.01.05 2020-01-05 21:11:31 -08:00
James Lu
cbd953b32c Update plugin author/maintainer data 2019-11-16 12:29:03 -08:00
James Lu
2ae51939b3 Update my email & repo link references 2019-10-11 09:58:50 -07:00
James Lu
eaea9ebd5f Bump version to 2019.07.09+git 2019-07-09 17:39:53 -07:00
James Lu
653d64d561 Bump version to 2019.07.09 2019-07-09 17:39:50 -07:00
James Lu
9504ee446a Bump version to 2019.03.03+git 2019-03-03 22:26:12 -08:00
James Lu
07f860c735 Bump version to 2019.03.03 2019-03-03 22:25:50 -08:00
James Lu
831a0af53c Wikifetch: escape regexps as r'' strings (#75) 2019-01-05 19:40:08 -08:00
James Lu
a96e340dca Add some more dependency notes
[skip ci]
2018-12-03 17:59:57 -08:00
James Lu
0a3e6d9012 Wikifetch: fix tests 2018-12-03 17:47:18 -08:00
James Lu
b1e010d723 Wikifetch: fix testDisambig for c4c7f525410d68a84303595f6c2da6ffc32d25b1 2018-07-19 18:08:51 +00:00
James Lu
9221d87c29 Wikifetch: skip looking at empty leading paragraphs 2018-07-19 18:04:56 +00:00
James Lu
c4c7f52541 Wikifetch: separate disambiguation results by semicolons
This enhances readability, especially when individual results already contain semicolons (e.g. places).
2018-05-10 23:52:10 -07:00
James Lu
ca3b9323e7 Bump version to 2018.05.03+git 2018-05-03 12:09:23 -07:00
James Lu
f4bc06f0d3 Bump version to 2018.05.03 2018-05-03 12:01:10 -07:00
James Lu
d147207ad1 Wikifetch: ignore GPS coordinates from articles for countries, etc. 2017-11-12 01:39:28 -08:00
James Lu
d000140891 Wikifetch: roughly sort Wikipedia tests by theme 2017-11-12 01:39:28 -08:00
James Lu
400ffd7899 Wikifetch: fix quote_plus import 2017-09-07 19:20:21 -07:00
James Lu
6011742299 Wikifetch: remove Python 2 compatibility code 2017-09-01 18:09:19 -07:00
James Lu
08d8f48db5 Wikifetch: refactor text fetching, fix listing disambig results 2017-09-01 18:04:20 -07:00
James Lu
9986babd2e Wikifetch: strip inline notes in the form "text[note 1]" from IRC 2017-09-01 18:02:52 -07:00
James Lu
1dbcdb746d Wikifetch: declare encoding for ancient Python support 2017-06-03 18:40:24 -07:00
James Lu
71458857f9 Wikifetch: more tests for --site and foreign Wikipedia 2017-06-03 18:35:02 -07:00
James Lu
b6231f56ef Revert "Wikifetch: intelligently filter out <p> lines with little or no content"
This broke parsing for CJK languages (e.g. Chinese and Japanese), which don't use traditional spaces...

(but I should've known that)

This reverts commit 91cfa7acb0975fd5b5bab6e6f2c760781ccd84e2.
2017-06-03 18:30:25 -07:00
James Lu
b8e04f167e Wikifetch: add tests for non-English Wikipedia & articles with symbols in their title 2017-06-03 18:11:55 -07:00
James Lu
670b41950b Wikifetch: rm broken Commons test 2017-06-03 18:11:48 -07:00
James Lu
346f72d816 Wikifetch: fix lookup of articles with symbols (e.g. "/") in their title
The normalization for the special cases was previously ignored if the query matched a "/"; why was this added in the first place?
2017-06-03 18:09:20 -07:00
James Lu
b79ddf2f7e Wikifetch: update URL for commons.wikimedia.org test, as the old one has been removed 2017-06-03 17:47:06 -07:00
James Lu
092055d491 Wikifetch: fix Wikipedia parsing again
As of 2017-06-03, Wikipedia has put its text content under a new "mw-parser-output" div, while# other sites (e.g. Wikia) still have it directly under "mw-content-text".
2017-06-03 17:46:30 -07:00
James Lu
11a03ad9a0 Bump version to 2017.05.31 2017-05-31 13:10:47 -07:00
James Lu
394158bea5 Wikifetch: fix Wikipedia test 2017-04-16 16:53:37 -07:00
James Lu
7611f0fa9c Wikifetch: fix 'random' help text syntax 2017-03-24 19:10:47 -07:00
James Lu
001b49b6c3 Wikifetch: prefer <link rel="canonical"> links again when available 2017-03-24 19:09:38 -07:00
James Lu
d5f498bfcb Wikifetch: switch to a different article for testMediaWiki 2017-03-18 23:52:38 -07:00
James Lu
91cfa7acb0 Wikifetch: intelligently filter out <p> lines with little or no content
More specifically, this skips lines that have a lower word count than the search query (e.g. page titles, some navigation links).
This allows some pages on https://wiki.ubuntu.com/ to work, for example
2017-03-18 19:07:56 -07:00
James Lu
819fcc6c09 Wikifetch: add a --no-mw-parsing option in an attempt to support non-MediaWiki sites 2017-03-18 18:47:20 -07:00
James Lu
194ac4d7be Wikifetch: clarify _get_article_tree docstring 2017-03-18 18:23:00 -07:00
James Lu
a9dfb1009d Wikifetch: add a three second timeout in fetch 2017-02-04 18:28:11 -08:00
James Lu
2fbfc37f98 Wikifetch: leave a fallback reply if paragraph parsing failed 2017-01-27 18:16:00 -08:00
James Lu
2bd06a39a9 Wikifetch: return the address in _get_article_tree as well 2017-01-27 18:10:52 -08:00
James Lu
18493a5e23 Wikifetch: revamp tests to be more complete
This now tests different combinations of --site, and tries to parse some other common wikis.
2017-01-27 18:01:13 -08:00
James Lu
100f503783 Wikifetch: support wikimedia.org and mediawiki.org 2017-01-27 18:00:48 -08:00
James Lu
8d586dad47 Wikifetch: fix NameError on redirect parsing 2017-01-27 17:38:37 -08:00