The original version of file originated in Unix Research Version 4 in 1973. System V brought a major update with several important changes, most notably moving the file type information into an external text file rather than compiling it into the binary itself. Most major BSD and Linux distributions use a free, open-source reimplementation which was written in 1986-87 by Ian Darwinfrom scratch. It was expanded by Geoff Collyer in 1989 and since then has had input from many others, including Guy Harris, Chris Lowth and Eric Fischer; from late 1993 onward its maintenance has been organized by Christos Zoulas. The OpenBSD system has its own subset implementation written from scratch, but still uses the Darwin/Zoulas collection of magic file formatted information.
if the file cannot be read, or its Unix file type is undetermined, the file program will indicate that the file was processed but its type was undetermined.
an initial part of file is considered and file is to use position-sensitive tests
the entire file is considered and file is to use context-sensitive tests
the file is identified as a data file
file's position-sensitive tests are normally implemented by matching various locations within the file against a textual database of magic numbers. This differs from other simpler methods such as file extensions and schemes like MIME. In most implementations, the file command uses a database to drive the probing of the lead bytes. That database is implemented in a file called magic, whose location is usually in /etc/magic, /usr/share/file/magic or a similar location.
Usage
The SUS mandates the following options: Other Unix and Unix-like operating systems may add extra options than these, such as -s 'special files', -k 'keep-going' or -r 'raw'. The command tells only what the file looks like, not what it is. It is easy to fool the program by putting a magic number into a file the content of which does not match it. Thus the command is not usable as a security tool other than in specific situations.
Examples
$ file file.c file.c: C programtext $ file program program: ELF32-bitLSBexecutable, Intel 80386, version 1, dynamically linked , stripped $ file /dev/hda1 /dev/hda1: block special $ file -s /dev/hda1 /dev/hda1: Linux/i386 ext2filesystem Note that -s is a non-standard option available only on some platforms, which tells file to read device files and try to identify their contents rather than merely identifying them as device files. Normally file does not try to read device files since reading such a file can have undesirable side effects. $ file -k -r libmagic-dev_5.35-4_armhf.deb # libmagic-dev_5.35-4_armhf.deb: Debian binary package - current ar archive - data Through the non-standard option -k the program does not stop after the first hit found, but looks for other matching patterns. The -r option, which is available in some versions, causes the unprintable new line character to be displayed in its raw form rather than in its octal representation. $ file compressed.gz compressed.gz: gzipcompressed data, deflated, original filename, `compressed', last modified: Thu Jan 26 14:08:23 2006, os: Unix $ file -i compressed.gz # compressed.gz: application/x-gzip; charset=binary $ file data.ppm data.ppm: Netpbm PPM "rawbits" image data $ file /bin/cat /bin/cat: Mach-Ouniversal binary with 2 architectures /bin/cat architecture ppc7400): Mach-O executable ppc /bin/cat : Mach-O executable [i386">Instruction set architecture">architecture ppc7400): Mach-O executable ppc /bin/cat : Mach-O executable [i386
As of version 4.00 of the Ian Darwin/Christos Zoulas version of file, the functionality of file is incorporated into a libmagic library that is accessible via C linking; file is implemented using that library.
Manual pages
Other
- homepage for version of file used in major BSD and Linux distributions.
- webpage of native GnuWin32 port of file for 32 bit Windows.
The on
, an alternative providing ranked answers based on statistics.