RFC: make "--html-extension" a default for "wget -r -k". (msg#00025, web.wget.general)
Hi!

RFC: make "--html-extension" a default for "wget -r -k".

I used wget to download some pages from a PHP site that passes parameters in the URL, using "wget -r -k" but without "--html-extension". wget correctly saved a page like "http://www.XXX.de/view.php?page=a.html" as the file "www.XXX.de/view.php?page=a.html". But when this HTML page is href'ed from any other HTML page, I ran into problems. The resulting link was something like

<a href="view.php?page=a.html">aa</a>

This href does not work when it is

(a) put on a (non-PHP) web server, because the web server will try to deliver "view.php" instead of "view.php?page=a.html", or
(b) opened by a browser from the local disk with "file:///...", for the same reason as (a) (see RFC 2396, section 3.3).

Giving the option "--html-extension" solved this problem by escaping the reserved characters (e.g. "?" and "="), but this is hard to find out. So "wget -r -k" is totally useless without "--html-extension", at least when the downloaded filenames contain reserved characters.

I would like to make "--html-extension" the default if "-k" and "-r" are given. What do you think?

If you don't like this, then we should at least print an error message when we detect a filename containing a reserved character and no "--html-extension" was given, saying something like:

"the downloaded filenames contained reserved characters like '?'. Please rerun the program with '--html-extension' otherwise the downloaded pages will be useless."

What do you think?

Till

--
dr. tilmann bubeck
reinform medien- und informationstechnologie AG
koenigstrasse 80, 70173 stuttgart / germany
cell.: +49 (172) 8 84 29 72
fax : +49 (711) 7 22 77 34
email: t.bubeck@xxxxxxxxxxx
http://www.reinform.de
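
To illustrate the problem described in the message: a URI reference is split at the first "?", so everything after it is treated as the query and not as part of the file name. The following is only a minimal sketch using Python's standard urllib.parse module. It is not how wget works internally, and the helper names split_href and escape_href are made up for this example.

```python
from urllib.parse import urlsplit, quote


def split_href(href):
    """Return (path, query) the way a generic URI parser sees the link."""
    parts = urlsplit(href)
    return parts.path, parts.query


def escape_href(href):
    """Percent-encode every reserved character so the whole string
    is treated as a single path segment (hypothetical helper)."""
    return quote(href, safe="")


if __name__ == "__main__":
    link = "view.php?page=a.html"

    # Unescaped: the parser looks for a file called "view.php".
    print(split_href(link))

    # Escaped: "?" and "=" are encoded, so the full name stays in the path.
    escaped = escape_href(link)
    print(escaped)
    print(split_href(escaped))
```

With the unescaped link, the path component is only "view.php", which matches cases (a) and (b) above. Once "?" and "=" are percent-encoded, the complete saved file name remains in the path component.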