Sample from 3 OCR-ed pages from Soros Media Center in Sarajevo. Media Center has digitalized one of the most important daily newspapers in the last century. They have a full text search database of the "Oslobodjenje" from 1991-1995 and some old Bosnian newspapers from the 19-th century. Thank you to Mr. Dragan Golubovic who provided me with the scanned files.
Made for Internationaal Instituut voor Sociale Geschiedenis in Amsterdam (International Institute for Social History). I consider the brochure to be of my best works so far having in mind that it's fully text searchable and compressed from 400MB to less than 800kb.
This is one of my best full text searchable files for the Macedonian branch of the PRISTOP communications agency http://www.pristop.si/ The Daily newspaper is UTRINSKI VESNIK dated from 14-th October 2006. It is consisted of 8 color and 24 Black & White TIF files scanned at 300 dpi with total 459 MB. The PDF that I have produced is OCR-ed and compressed, full text searchable at just 6.4 MB!
The "record" book that I have done with the permission of the author. It is scanned at B&W at 600 dpi with an automated feeder on CANON IR 2800. The scan is around 7GB and the actual PDF book is only 7 MB!!! It contains 4 plays by William Shakespeare translated in Macedonian by prof. Dragi Mihajlovski (one of the best English translators in Macedonia). He also has a publishing company KAPRIKORNUS http://www.kaprikornus.com.mk Please have in mind that the book have the digital rights. If you want to put it in the digital library or use it for presentation purposes please contact me or the author Dragi Mihajlovski.
This is the most important book in the history of the Republic of Macedonia and in the History of the Macedonian people. Also, It a very rare book and It's very difficult to be found anywhere. After the release from print in 1903 it was burned and there are only few copies that can be found. I have found this copy at my Faculty and made a "digital camera" scan and some experiments with pattern recognition OCR. It seems that it gone pretty well cause I've made the whole book in just 3 MB. The above sample is done with the color scan at 300dpi. Write for the full version of the book.