Xena attempts to avoid digital obsolescence by converting files into an openly specified format, such as ODF or PNG. If the file format is not supported or the Binary Normalisation option is selected, Xena will perform ASCII Base64 encoding on binary files and wrap the output in XML metadata. The resulting .xena file is plain text, although the content of the data itself is not directly human-readable. The exact original file can be retrieved by stripping the metadata and reversing the Base64 encoding, using an internal viewer.
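The round trip described above can be illustrated with a minimal sketch. The element names (`wrapper`, `meta`, `source-name`, `content`) and both function names are hypothetical and chosen only for illustration; they are not Xena's actual XML schema or API.

```python
import base64
import xml.etree.ElementTree as ET


def binary_normalise(data: bytes, original_name: str) -> str:
    """Base64-encode raw bytes and wrap them in a small XML envelope.

    The element names are hypothetical, not Xena's real schema.
    """
    root = ET.Element("wrapper")
    meta = ET.SubElement(root, "meta")
    ET.SubElement(meta, "source-name").text = original_name
    content = ET.SubElement(root, "content", encoding="base64")
    content.text = base64.b64encode(data).decode("ascii")
    return ET.tostring(root, encoding="unicode")


def recover_original(xml_text: str) -> bytes:
    """Strip the metadata and reverse the Base64 encoding."""
    root = ET.fromstring(xml_text)
    payload = root.find("content").text or ""  # empty payload parses as None
    return base64.b64decode(payload)
```

The wrapped output is plain ASCII text, yet decoding it returns the original bytes exactly, which is the property the binary normalisation fallback relies on.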
Features
Platforms supported by Xena are Microsoft Windows, Linux and Mac OS X. Xena uses a series of plugins to identify file formats and convert them to an appropriate openly specified format. Xena has an application programming interface which allows any reasonably skilled Java developer to develop a plugin to cover a new file type. Xena can process individual files or whole directories. When processing a whole directory, it can preserve the original directory structure of the converted records. Xena can create plain text versions of file formats such as TIFF, Word and PDF, with the use of Tesseract. The Xena interface or Xena Viewer can be used to view or export a Xena file in its target file format. Xena files contain the normalised file as well as any extra information relevant to the normalisation process. The Xena Viewer supports bulk export of Xena files to target file formats. Xena can be used via its graphical user interface or the command line. For Xena to be fully functional, it requires a local installation of the following external software:
LibreOffice suite - to convert office documents to OpenDocument format
Tesseract - to create plain text versions of file formats
ImageMagick - to convert a subset of image files to PNG
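Whether these helper programs are available on a machine can be checked with a short sketch like the one below. The executable names are assumptions that vary by platform and version, and searching the system path is only one possibility: Xena itself may be pointed at each program through its own settings instead.

```python
import shutil

# Candidate executable names are assumptions; adjust them for the local system.
# Note that "convert" is also the name of an unrelated Windows system utility.
HELPER_COMMANDS = {
    "LibreOffice": ("soffice", "libreoffice"),
    "Tesseract": ("tesseract",),
    "ImageMagick": ("magick", "convert"),
}


def find_helpers(commands=HELPER_COMMANDS):
    """Map each helper name to the first executable found on PATH, or None."""
    found = {}
    for name, candidates in commands.items():
        paths = (shutil.which(candidate) for candidate in candidates)
        found[name] = next((path for path in paths if path), None)
    return found


if __name__ == "__main__":
    for name, path in find_helpers().items():
        print(f"{name}: {path or 'not found on PATH'}")
```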
Xena will recognise and process the file types listed below, plus a few others of minor importance. Unsupported file types will automatically undergo binary normalisation.
Office file formats:
Microsoft Office files are converted to the corresponding OpenDocument files
Microsoft Outlook PST files are parsed for their individual messages, which are converted to XML files and a Xena index file is created
Microsoft Project MPP files are converted to XML
OpenOffice.org XML files are converted to the corresponding OpenDocument formats
WordPerfect WPD files are converted to OpenDocument ODT
Mailbox files are converted to individual XML files
Graphics:
BMP, GIF, PSD, PCX, RAS, and the X Window System XBM and XPM bitmap files are converted to PNG; TIFF files additionally get embedded metadata stored in Xena XML. If the Tesseract OCR software is installed, text will be extracted from TIFF files.
OpenDocument Drawings and SVG files are wrapped in Xena XML
JPG and PNG files are stored unchanged
Archive files:
Files are extracted from archives and normalised into a separate Xena file. A Xena index file is created, which when opened in the internal Xena viewer will display the files in a table.
Audio files:
MP3, WAV, AIFF, and OGG formats are converted to FLAC files.
Databases:
SQL files are processed as plain text wrapped in XML
Other file types:
HTML is converted to XHTML
TXT text files are stored as plain text wrapped in XML; CSS files are stored as plain text wrapped in XML
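The mapping in the list above, together with the binary normalisation fallback, can be summarised as a lookup table. This is a partial sketch: the table covers only some of the listed types, `target_for` is a hypothetical helper, and keying on the file extension is a simplifying assumption, since Xena identifies formats through its plugins.

```python
from pathlib import Path

# Partial extension-to-target table taken from the list above.
# Keying on the extension is an assumption made for illustration only.
TARGETS = {
    # Graphics converted to PNG (TIFF also keeps embedded metadata in Xena XML)
    ".bmp": "PNG", ".gif": "PNG", ".psd": "PNG", ".pcx": "PNG",
    ".ras": "PNG", ".xbm": "PNG", ".xpm": "PNG",
    ".tif": "PNG", ".tiff": "PNG",
    # Graphics stored unchanged
    ".jpg": "unchanged", ".png": "unchanged",
    # Audio
    ".mp3": "FLAC", ".wav": "FLAC", ".aiff": "FLAC", ".ogg": "FLAC",
    # Office and project files
    ".wpd": "ODT", ".mpp": "XML",
    # Text-based formats
    ".html": "XHTML",
    ".txt": "plain text wrapped in XML",
    ".css": "plain text wrapped in XML",
    ".sql": "plain text wrapped in XML",
}

FALLBACK = "binary normalisation"


def target_for(filename: str) -> str:
    """Return the target format for a file name, or the binary fallback."""
    return TARGETS.get(Path(filename).suffix.lower(), FALLBACK)
```

For example, `target_for("song.mp3")` returns `"FLAC"`, while a video file such as `clip.mkv` is absent from the table and falls through to `"binary normalisation"`.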
Reviews
An April 22, 2010 review in Practical e-Records rated Xena at 82/100 points. At present Xena has no target preservation format for video files.