Electronic Content Management System (eCMS)

Electronic Content Management System (eCMS) is a research project which its main task is extracting metadata such as title, authors, ISBN, publisher, year, and etc. from ebook files (pdf, djvu, djv, epub, mobi, ...). ‎ECMS can extract metadata from ebook files and check them with LoC website for validity, then stores these information in database in order to be used by eLibrary. It also extracts covers from ebooks and uses Tesseract OCR technology for scanned ebooks..

Grant & Certification:

  • Isfahan Science & Technology Town (ISTT) Reseach Grant, Isfahan, Iran, 2011.
  • Isfahan Science & Technology Town (ISTT) Accomplishment Certification, Isfahan, Iran, 2013

Specifications:

  • Programming Languages: Java.
  • Programming Design: Object Oriented Programming, Socket Programming, Data Crawling and Data Scraping, Multithreaded Programming.
  • Technologies & Frameworks: PDFBox, iText, PDFRenderer, ICEpdf, Ghost4J, JavaDJVU, Swing, Java Sockets, JCF, NIO, J4L OCR, RegEx, Log4j, JDBC.
  • Database: MySQL.
Date
Year: 
2013
Month: 
JANUARY