Book SP


  • Islandora Paged Content
gs -v
GPL Ghostscript 9.10 (2013-08-30)
Copyright (C) 2013 Artifex Software, Inc.  All rights reserved.

apt-get install poppler-utils

cd ~/github/islandora
git clone https://github.com/Islandora/islandora_paged_content
mv islandora_paged_content /var/www/fabb.to.cnr.it/sites/all/modules/
cd /var/www/fabb.to.cnr.it
drush -u 1 en islandora_paged_content

Browse to config: admin/islandora/solution_pack_config/paged_content
        gs (GhostScript) = /usr/bin/gs
        pdfinfo = /usr/bin/pdfinfo
        pdftotext = /usr/bin/pdftotext
        Allow Extraction of Raw Text = NO
        djatoka URL = http://fabb.to.cnr.it/adore-djatoka/
        Solr page sequence number field = RELS_EXT_isSequenceNumber_literal_ms
        Set page labels to sequence numbers = NO
        Hide Page Objects From Search Results = NO
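
Before saving the form, it can help to confirm that the binaries named above are really installed at those paths. A minimal sketch, assuming a POSIX shell; the helper name check_tools is hypothetical and is not part of Islandora or Drush:

```shell
# Report whether each given path is an executable file.
# Prints one line per path and returns 0 only if every path passes.
check_tools() {
    status=0
    for tool in "$@"; do
        if [ -x "$tool" ] && [ ! -d "$tool" ]; then
            echo "ok: $tool"
        else
            echo "missing: $tool"
            status=1
        fi
    done
    return "$status"
}

# Usage with the paths from the settings above:
# check_tools /usr/bin/gs /usr/bin/pdfinfo /usr/bin/pdftotext
```

A "missing" line usually means the corresponding package (ghostscript or poppler-utils) is not installed, or the binary lives in a different directory.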
  • Tesseract
apt-get install tesseract-ocr tesseract-ocr-eng tesseract-ocr-ita tesseract-ocr-fra tesseract-ocr-spa tesseract-ocr-deu

tesseract -v
tesseract 3.03
 leptonica-1.70
  libgif 4.1.6(?) : libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 : webp 0.4.0

tesseract --list-langs
List of available languages (7):
eng
fra
equ
ita
osd
spa
deu
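
A minimal sketch for checking that the language packs the OCR module will use (English and Italian in this setup) appear in that listing. The helper name has_langs is hypothetical. It reads the listing from standard input, and the usage line merges stderr into stdout on the assumption that some Tesseract 3.x builds print the listing on stderr:

```shell
# Read a `tesseract --list-langs` listing on stdin and check that every
# language code passed as an argument appears on a line of its own.
# Prints the missing codes and returns 1 if any are absent.
has_langs() {
    available=$(cat)
    missing=0
    for lang in "$@"; do
        if ! printf '%s\n' "$available" | grep -qxF -- "$lang"; then
            echo "missing language: $lang"
            missing=1
        fi
    done
    return "$missing"
}

# Usage:
# tesseract --list-langs 2>&1 | has_langs eng ita
```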
  • Islandora OCR
cd ~/github/islandora
git clone https://github.com/Islandora/islandora_ocr
mv islandora_ocr /var/www/fabb.to.cnr.it/sites/all/modules/
cd /var/www/fabb.to.cnr.it
drush -u 1 en islandora_ocr

Browse to config: admin/islandora/tools/ocr
     Tesseract = /usr/bin/tesseract
     Languages available for OCR 
       English = YES
       Italian = YES
     Enable Solr Fast Vector Highlighting = YES
     Solr field containing OCR text = text_nodes_HOCR_hlt
     Maximum number of results to return in a Solr query = 32
  • Islandora Internet Archive Bookreader
cd ~/github/islandora
git clone https://github.com/Islandora/internet_archive_bookreader
mv internet_archive_bookreader /var/www/fabb.to.cnr.it/sites/all/libraries/bookreader

nano -w sites/all/libraries/bookreader/BookReader/BookReader.css

div#BRpage {
    float: right;
-    width: 280px;
+    width: 300px;
    padding-left:12px;
    text-align: right;
}
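
The manual edit above can also be applied non-interactively, which is convenient when the library is re-cloned. A sketch under the assumption that the rule still reads `width: 280px;` inside `div#BRpage`; the helper name widen_brpage is hypothetical:

```shell
# Change "width: 280px;" to "width: 300px;" inside the div#BRpage rule only.
# Other rules in the stylesheet are left untouched.
widen_brpage() {
    css_file=$1
    tmp_file=$(mktemp) || return 1
    # The address range limits the substitution to the lines from
    # "div#BRpage {" up to the next closing brace.
    sed '/div#BRpage[[:space:]]*{/,/}/ s/width:[[:space:]]*280px;/width: 300px;/' \
        "$css_file" > "$tmp_file" && cat "$tmp_file" > "$css_file"
    status=$?
    rm -f "$tmp_file"
    return "$status"
}

# Usage (run from the Drupal root):
# widen_brpage sites/all/libraries/bookreader/BookReader/BookReader.css
```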

cd ~/github/islandora
git clone https://github.com/Islandora/islandora_internet_archive_bookreader
mv islandora_internet_archive_bookreader /var/www/fabb.to.cnr.it/sites/all/modules/

cd /var/www/fabb.to.cnr.it
drush dl colorbox
drush en colorbox
cd ~
wget https://github.com/jackmoore/colorbox/archive/1.x.zip
unzip 1.x.zip
mv colorbox-1.x colorbox
mv colorbox /var/www/fabb.to.cnr.it/sites/all/libraries/

Configure colorbox: admin/config/media/colorbox

  • Enable Colorbox inline = YES
cd /var/www/fabb.to.cnr.it
drush -u 1 en islandora_internet_archive_bookreader
  • Islandora Book SP
cd ~/github/islandora/
git clone https://github.com/Islandora/islandora_solution_pack_book
mv islandora_solution_pack_book /var/www/fabb.to.cnr.it/sites/all/modules/
cd /var/www/fabb.to.cnr.it
drush -u 1 en islandora_book

Browse to config: admin/islandora/solution_pack_config/book

  • PDF datastream. Requires ImageMagick = YES
  • Image datastreams (TN, JPEG, JP2). Requires Large Image Solution Pack = YES
  • OCR datastreams (OCR, HOCR). Requires Islandora OCR module = YES
  • Parent Solr Field = RELS_EXT_isMemberOf_uri_ms
  • Display object metadata = YES
  • Book Viewers = Internet Archive BookReader
  • Page Viewers = OpenSeadragon
 
 
isla7x/book.txt · Last modified: 2016/05/04 11:43 by giancarlo

Developers: CNR IRCrES IT Office and Library
Giancarlo Birello (giancarlo.birello _@_ ircres.cnr.it) and Anna Perin (anna.perin _@_ ircres.cnr.it)
FAbb@TO.CNR is licensed under: Creative Commons License