Optical Character Recognition (OCR)

  1. OCR-fileformat validates and transforms various OCR file formats (hOCR, ALTO, PAGE, FineReader): [code], [GUI], [MIT License].
  2. ocr-gt-tools is a tool for ergonomic line-by-line transcription of scanned text: [code], [GNU Affero General Public License v3.0].
  3. Zotero OCR adds the ability to run OCR on the PDFs selected in Zotero: [code], [GNU Affero General Public License v3.0].
  4. ocrd-pagetopdf is an OCR-D wrapper for prima-pagetopdf. It converts PAGE-XML files and their images to PDFs with a text layer and (optionally) polygon outlines: [code], [Apache License 2.0].
  5. TesseractXplore is an easy-to-use graphical interface to Tesseract that offers full control: [code], [MIT License].
  6. ocromore is a tool for processing, enhancing and evaluating multiple OCR outputs: [code], [Apache License 2.0].
  7. crass is a tool for cropping and splicing segments of scanned pages: [code], [Apache License 2.0].
  8. Mocrin coordinates multiple OCR engines to create a uniform workflow and folder structure: [code], [Apache License 2.0].

Data management & bibliographic tools

  1. Zotkat is a Zotero extension for cataloguing in a broad sense; it also contains some experimental approaches: [Zotkat], [GNU Affero General Public License v3.0].
  2. malibu (Mannheim library utilities) is a collection of lightweight web-based tools for working with bibliographic metadata from various web sources, aimed at supporting the workflows of subject librarians and acquisitions librarians: [code], [GUI].
  3. uma_publist is a TYPO3 extension that includes publication lists from an EPrints repository in TYPO3 websites; the lists are generated and synced automatically: [code], [GNU General Public License v2.0].

Screen sharing & team work

  1. PalMA enables people to share multiple pieces of content on a single monitor: [code], [GNU General Public License].

Wikibase knowledge graphs & Natural Language Processing (NLP)

  1. bbw is an automatic semantic annotator for tabular data using a Wikibase instance and metasearch in SearX: [code], [GUI], [tutorial], [MIT License].
  2. RaiseWikibase is a tool for fast data import and knowledge graph construction with Wikibase: [code], [docs], [poster], [MIT License].