Reads OpenLibrary books dump as Packets from the compressed .gz format
PHP Library that reads OpenLibrary books dump as Packets from the compressed .gz format
OpenLibrary Dumps are available at
Download Dump link
add folowining code inside your composer.json
"repositories": [
{
"type": "vcs",
"url": "https://github.com/manindersingh030/OLDumpReader.git"
}
...
],
"require": {
"taggar/ol-dump-reader": "dev-master",
...
}
`
and run $composer update
<?php
require __DIR__.'/../vendor/autoload.php';
use OLDumpReader\OlReader;
$dumpLocation ='<dump_full_path>';
$reader = new OLReader($dumpLocation);
$reader->seekToLastLocation();
var_dump($reader->getNextPacket());
returns next packet as array
note: includes starting_location and ending_location key which represents locations inside compressed file of starting and ending of packet respectively
returns next packet as raw string
each of following is seprated by a tab ‘\t’
<packet_starting_inside_compressed_file> <packet_ending_inside_compressed_file> <type> <key> <revision> <last_modified> <json>
moves currunt pointer to last known packet location.
note: this doesnot goes at the end of the file but seeks to location of last packet that was parsed.
moves pointer to $seekLocation