|
2008 / urlwatch - a tool for monitoring webpages for updates

This script is intended to help you watch URLs and get notified (via email or in your terminal) of any changes. The change notification will include the URL that has changed and a unified diff of what has changed. The script supports the use of a filtering hook function to strip trivially-varying elements of a webpage.

Basic features
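To illustrate what a notification containing "a unified diff of what has changed" might look like, here is a minimal sketch using Python's standard difflib module. It is not taken from the urlwatch source; the helper name unified_diff_text, the example URL and the page contents are assumptions made up for this illustration.

```python
import difflib


def unified_diff_text(old, new, url):
    """Return a unified diff between two versions of a page as one string.

    Hypothetical helper for illustration; not part of urlwatch itself.
    """
    diff = difflib.unified_diff(
        old.splitlines(),
        new.splitlines(),
        fromfile=url + " (old)",
        tofile=url + " (new)",
        lineterm="",  # lines were split without newlines, so add none back
    )
    return "\n".join(diff)


# Two made-up versions of the same page.
old_page = "Welcome\nVersion 1.0\nContact us"
new_page = "Welcome\nVersion 1.1\nContact us"

report = unified_diff_text(old_page, new_page, "http://example.org/")
print(report)
```

Lines prefixed with "-" come from the old version and lines prefixed with "+" from the new one, so the changed part of the page can be spotted at a glance.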
Download

Official Debian package (by Franck Joncourt)

Package information: http://packages.debian.org/urlwatch

If you have sid repositories enabled, you can install urlwatch via:

    apt-get install urlwatch

Source tarball

You can download the source tarball of urlwatch here:
Python Package Index

urlwatch is also indexed in the Python Package Index as "urlwatch":

Advanced features
3rd party patches / Contributions
License

urlwatch is released under the terms of the BSD license.

Code repository

The Git repository of urlwatch now has a more permanent home over at repo.or.cz/w/urlwatch.git. To check out the code using git, use this command:

    git clone git://repo.or.cz/urlwatch.git

How do I...

...watch only an element on a website?

If you are lucky, the element has an "id" attribute (other attributes work just as well) that you can use with the BeautifulSoup library to extract that part of the HTML document:

    from BeautifulSoup import BeautifulSoup

    # "data" holds the HTML of the page being checked
    soup = BeautifulSoup(data)
    # keep only the element with the given id
    data = str(soup.find(id='tisiDocumentBody'))
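The snippet above works on a variable named data. As a rough sketch of how such per-page processing might be wrapped in a filtering hook function, the example below assumes the hook receives the URL and the page content and returns the filtered content. The name filter_page, its signature, the example URL and the "Page generated at" pattern are all hypothetical and not taken from this page. To stay dependency-free, it uses a regular expression from the standard library instead of BeautifulSoup.

```python
import re

# Hypothetical URL, used only for this illustration.
EXAMPLE_URL = "http://example.org/status"


def filter_page(url, data):
    """Return the page content with trivially-varying parts removed.

    Name and signature are assumptions for this sketch.
    """
    if url == EXAMPLE_URL:
        # Drop a timestamp that changes on every request, so that it
        # does not show up as a change on its own.
        data = re.sub(r"Page generated at [^<\n]*", "", data)
    # Pages without a special rule are returned unchanged.
    return data


first = filter_page(EXAMPLE_URL, "<p>News</p>\n<p>Page generated at 12:00:01</p>")
second = filter_page(EXAMPLE_URL, "<p>News</p>\n<p>Page generated at 12:00:02</p>")
print(first == second)
```

Dispatching on the URL keeps each rule local to the page it was written for; all other pages pass through untouched.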
Information about the User-Agent

Since version 1.3, urlwatch sends a better User-Agent string. More information about this User-Agent string can be found on this page.

Thomas Perl (thp at this domain), jabber: thp@jabber.org |
|