Module pywikipedia.weblinkchecker
This bot is used for checking external links found at the wiki. It checks
several pages at once, with a limit set by the config variable
max_external_links.
The bot won't change any wiki pages, it will only report dead links such that
people can fix or remove the links themselves.
The bot will store all links found dead in a .dat file in the deadlinks
subdirectory. To avoid the removing of links which are only temporarily
unavailable, the bot only reports links which were reported dead at least
two times, with a time lag of at least one week. Such links will be stored
in a .txt file in the deadlinks subdirectory.
When a link is found alive, it will be removed from the .dat file.
Syntax examples:
python weblinkchecker.py
Loads all wiki pages in alphabetical order using the Special:Allpages
feature.
python weblinkchecker.py -start:Example_page
Loads all wiki pages using the Special:Allpages feature, starting at
"Example page"
python weblinkchecker.py Example page
Only checks links found in the wiki page "Example page"
python weblinkchecker.py -sql:20050516.sql
Checks all links found in an SQL cur dump.
- Imported modules:
-
codecs
,
pywikipedia.config
,
httplib
,
pywikipedia.pagegenerators
,
pickle
,
re
,
socket
,
sys
,
threading
,
time
,
urlparse
,
pywikipedia.wikipedia