Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.
Browse Packages by Type
- Archival Record Manager and Editor (6)
- Bibliography (6)
- Content Management Framework (4)
- Content Management System (12)
- Course Management (1)
- Digital Repository (27)
- Discovery Interface (30)
- Electronic Reserves (2)
- Electronic Resource Management (12)
- Image Display and Manipulation (18)
- Integrated Library System (17)
- Interlibrary Loan (4)
- Journal Publishing (1)
- Knowledge Base (6)
- Link Resolver (4)
- Metadata Manipulation (17)
- Online Public Access Catalog (10)
- Preservation Repository (1)
- Reference (4)
- Subject Page Curation (2)
JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to the format's specifications. Additionally jpylyzer is able to extract the technical characteristics of each image.
The File Information Tool Set (FITS) identifies, validates, and extracts technical metadata for various file formats. It wraps several third-party open source tools, normalizes and consolidates their output, and reports any errors. FITS was created by the Harvard Library for use in its Digital Repository Service (DRS).
The current tools used are:
dspace-cris is a new additional open source module for the DSpace platform. It extends the DSpace data model giving the ability to manage, collect and expose data about all the Research aspects (people, organization units, prize, project, grants, etc.).
Pdf-extract is an open source set of tools and libraries for identifying and extracting semantically significant regions of a scholarly journal article (or conference proceeding) PDF. The pdf-extract tools allow you to identify and extract the individual references from a scholarly journal article.
Since 1998, the Public Knowledge has been releasing and upgrading software intended to improve the public and scholarly quality of research, and with Open Monograph Press (OMP), it seeks to support a format that remains critical to the advancement of learning.
The Open Harvester Systems is a free metadata indexing system developed by the Public Knowledge Project through its federally funded efforts to expand and improve access to research.