• Download All HTML within a domain

    Author
    Topic
    #474373

    I was wondering if anyone knows of a program that can download a copy of a website. Essentially I was thinking of going to an e-comic site and downloading all the pages in it’s archive, this way I could read the comic offline (I’m usually offline). It would also be nice even when I do get home internet since I’d only have to do one huge download then my bandwidth would be free to do whatever else I (or anyone else in my house) wants to do.

    If possible it would be nice if said program could also remember what it’s downloaded before and only get updated pages.

    Viewing 0 reply threads
    Author
    Replies
    • #1264683

      Well, the lawyer in me reminds you that many sites have a terms of service page saying you shouldn’t use automated tools to scrape their sites. Setting that aside, there is a category of products sometimes called an offline browser which build local copy of a web site for you to browse offline. Many choices; I haven’t tried any of them.

      http://www.google.com/search?q=offline+browser

      • #1264773

        Well, the lawyer in me reminds you that many sites have a terms of service page saying you shouldn’t use automated tools to scrape their sites. Setting that aside, there is a category of products sometimes called an offline browser which build local copy of a web site for you to browse offline. Many choices; I haven’t tried any of them.

        http://www.google.com/search?q=offline+browser

        Thanks. The first option was something called HTTrack which I’ve actually heard of before but just couldn’t remember the name of. I’ll have to check but, chances are they don’t really care if I use a scraper to collect there comics. Chances are I’ll distribute my copies to other people which will bring more attention to the sites thus increasing revenue. Still I’ll have to check.

    Viewing 0 reply threads
    Reply To: Download All HTML within a domain

    You can use BBCodes to format your content.
    Your account can't use all available BBCodes, they will be stripped before saving.

    Your information: