wget checks first HTML-document against -A: msg#00055

Subject: wget checks first HTML-document against -A
Hello,

I am reporting this as a bug to underline that the default behaviour
should be the opposite.

For example: if I want to grab a series of pdf's from a list that is
part of an HTML-document, I want to just set -Apdf. This does not work,
though, because the HTML-document gets rejected. I have to set
-Ahtml,pdf.

This is bad for two reasons:

1. wget downloads and keeps the HTML-document on my media though I'm not
interested in it.

2. I have to set -r to reach the pdf's (they are only linked from the
HTML-document), which results in wget also following links to other
HTML-documents and storing the whole WWW on my media.

From my point of view, the first HTML-document is always valid because
it is part of the start address. It is targeted directly, and the user
knows why. It should never be rejected--even if its type is not listed
with -A. It should always be parsed and evaluated to have a chance to
get the links to the other types listed with -A processed. It does not
need to be stored persistently on the media, though.
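
To make the proposal concrete, here is a minimal Python sketch of the
behaviour I have in mind. It is not wget's code and says nothing about how
wget works internally; the names LinkCollector and accepted_links and the
example.com address are made up for illustration. The start document is
always parsed, and the accept list is applied only to the links found in it.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href targets of all <a> tags in a document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def accepted_links(start_url, html_text, accept_suffixes):
    """Return absolute links from the start document that match the accept list.

    The start document is parsed unconditionally (hypothetical behaviour);
    only the links found in it are checked against the suffixes.
    """
    collector = LinkCollector()
    collector.feed(html_text)
    # Normalise "pdf" and ".pdf" to the same lower-case ".pdf" suffix.
    suffixes = tuple("." + s.lower().lstrip(".") for s in accept_suffixes)
    result = []
    for href in collector.links:
        absolute = urljoin(start_url, href)
        # Compare against the URL path only, ignoring query and fragment.
        if urlparse(absolute).path.lower().endswith(suffixes):
            result.append(absolute)
    return result


# The start page is never stored; only the matching links are reported.
page = '<a href="a.pdf">A</a> <a href="other.html">B</a>'
print(accepted_links("http://example.com/list/index.html", page, ["pdf"]))
```

With this, the link to other.html is dropped and never followed, so
nothing beyond the start document and the listed pdf's is touched.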

Regards,

Dennis