Sciencemadness Discussion Board
Not logged in [Login ]
Go To Bottom

Printable Version  
 Pages:  1  2
Author: Subject: Harvesting scanned books from HathiTrust
arkoma
Redneck Overlord
*******




Posts: 1761
Registered: 3-2-2014
Location: On a Big Blue Marble hurtling through space
Member Is Offline

Mood: украї́нська

[*] posted on 26-6-2014 at 18:02


oh wow--libgen is cool. thanx



"We believe the knowledge and cultural heritage of mankind should be accessible to all people around the world, regardless of their wealth, social status, nationality, citizenship, etc" z-lib

View user's profile View All Posts By User
German
Harmless
*




Posts: 44
Registered: 13-5-2009
Member Is Offline

Mood: No Mood

[*] posted on 26-6-2014 at 18:11


Yep libgen.info has any book you could ever want completely free. Millions of them. From brand new to very old. Even graduate level textbooks. Best site on the internet barnone.
View user's profile View All Posts By User
arkoma
Redneck Overlord
*******




Posts: 1761
Registered: 3-2-2014
Location: On a Big Blue Marble hurtling through space
Member Is Offline

Mood: украї́нська

[*] posted on 26-6-2014 at 18:23


shit homeboy--don't be such a stranger around here......you upped my game already LOL



"We believe the knowledge and cultural heritage of mankind should be accessible to all people around the world, regardless of their wealth, social status, nationality, citizenship, etc" z-lib

View user's profile View All Posts By User
chris893
Harmless
*




Posts: 1
Registered: 16-3-2012
Member Is Offline

Mood: Motivated

[*] posted on 24-9-2014 at 16:51


how do you use it? Every time I click on a book, I get an error message saying that it can't be found?
View user's profile View All Posts By User
Denize1
Harmless
*




Posts: 1
Registered: 24-4-2016
Member Is Offline

Mood: No Mood

[*] posted on 24-4-2016 at 19:45


HathiTrust Downloader is not working. Is there a fix?
View user's profile View All Posts By User
Mush
National Hazard
****




Posts: 633
Registered: 27-12-2008
Member Is Offline

Mood: No Mood

[*] posted on 9-12-2017 at 05:52


Quote: Originally posted by Denize1  
HathiTrust Downloader is not working. Is there a fix?


https://sourceforge.net/projects/hathidownloadhelper/


Description

*************************
2017-07-20 PLEASE NOTE:
Due to an update to hathitrust website Hathi Download Helper 1.1.3 is not operable anymore.
Please update to version 1.1.4
*************************
View user's profile View All Posts By User
Mitigator
banned
*




Posts: 14
Registered: 23-5-2018
Member Is Offline


[*] posted on 25-6-2018 at 05:19


Yeah, libgen is cool and (somewhere) illegal. But they lost few domains and current are:
https://libgen.pw/ (i used this last year, only here could find updated "crc handbook of chemistry and physics 2016-2017")
http://libgen.io/ (wow, this one looks better, offers more options, gonna use this now instead of .pw)

The above version of book has only chapters bookmarked, but version 2015-2016 has even each subchapters bookmarked, much easier to browse.

Also probably many of you have noticed that archive.org offers so many scanned legal books but they don't give them for free download but only for borrowing (for 2 week online preview or temporary encrypted download viewable using adobe digital editions).

So I figured 2 workarounds how to get those book downloaded.

First method is discovered by me and is only recommended either if you wanna high quality book (high resolution pages) or if 2nd method doesn't work (adobe digital editions doesn't work on some vpn or ip or proxy or weird configurations).

For 1st method you simply browse borrowed book online and using nirsoft chromecacheview you can see al those images stored as cache jpg or jp2 files in cache folder. Just copy them using that program. Depending on your online preview size images quality will vary. So to get highest quality just use extension Resource Override, and let it automatically replace any letters showing resolution in url with it removed, something like "aaa-200x200.jpg" with "aaa.jpg" of course using pattern like "aaa-*.jpg" to be replaced with aaa.jpg. To find real images direct url just use inspect element - network tab and try loading next page and it will appear on list as jpg, or that cache viewer and search for something like name of book or domain archive.org, eventually you'll find one image sample and see its url. Only patter for end of such urls have to be replaced. Something like that. But this is slow, you have to manually broswe whole book to cache all those pages, and sometimes they may dissapear. Huh...

For 2nd method you simply download encrypted pdf and decrypt using any pdf digital license removers, or adobe digital editions removers, or whatever they are called, like epubsoft. But these books take about 20 MB size, while same jp2 files downloaded manually take about 200 MB size for each book. Difference: resolution (quality).

Of course one account is needed to be able to borrow anything from archive.org or operlibrary.org.

[Edited on 25-6-2018 by Mitigator]
View user's profile View All Posts By User
 Pages:  1  2

  Go To Top