Email address
An email address identifies an email box to which email messages are delivered. A wide variety of formats were used in early email systems, but only a single format is used today, following the specifications developed for Internet mail systems since the 1980s. This article uses the term email address to refer to the addr-spec defined in
An email address such as John.Smith@example.com is made up of a [|local-part], an @ symbol, then a case-insensitive domain. Although the standard requires
the local part to be case-sensitive, it also urges that receiving hosts deliver messages in a case-independent fashion,
e.g., that the mail system at example.com treat John.Smith as equivalent to john.smith; some mail systems even treat them as equivalent to johnsmith. Mail systems often limit their users' choice of name to a subset of the technically valid characters, and in some cases also limit which addresses it is possible to send mail to.
With the introduction of internationalized domain names, efforts are progressing to permit non-ASCII characters in email addresses.
Overview
The transmission of electronic mail within the Internet uses the Simple Mail Transfer Protocol, defined in, and extensions likeThe general format of an email address is local-part@domain, and a specific example is jsmith@example.com. An address consists of two parts. The part before the @ symbol identifies the name of a mailbox. This is often the username of the recipient, e.g., jsmith. The part after the @ symbol is a domain name that represents the administrative realm for the mail box, e.g., a company's domain name, example.com.
When delivering email, an SMTP client, e.g., Mail User Agent, Mail Transfer Agent, uses the domain name system to look up a Resource Record for the recipient's domain ; if there is a mail exchange Resource Record then the returned MX record contains the name of the recipient's mailserver, otherwise the SMTP client uses an address record. The MTA next connects to this server as an SMTP client.
The local part of an email address has no significance for intermediate mail relay systems other than the final mailbox host. Email senders and intermediate relay systems must not assume it to be case-insensitive, since the final mailbox host may or may not treat it as such. A single mailbox may receive mail for multiple email addresses, if configured by the administrator. Conversely, a single email address may be the alias to a distribution list to many mailboxes. Email aliases, electronic mailing lists, sub-addressing, and catch-all addresses, the latter being mailboxes that receive messages regardless of the local part, are common patterns for achieving a variety of delivery goals.
The addresses found in the header fields of an email message are not directly used by mail exchanges to deliver the message. An email message also contains a message envelope that contains the information for mail routing. While envelope and header addresses may be equal, forged email addresses are often seen in spam, phishing, and many other Internet-based scams. This has led to several initiatives which aim to make such forgeries easier to spot.
To indicate the message recipient, an email address also may have an associated display name for the recipient, which is followed by the address specification surrounded by angled brackets, for example: John Smith
Earlier forms of email addresses on other networks than the Internet included other notations, such as that required by X.400, and the UUCP bang path notation, in which the address was given in the form of a sequence of computers through which the message should be relayed. This was widely used for several years, but was superseded by the Internet standards promulgated by the Internet Engineering Task Force.
Syntax
The format of email addresses islocal-part@domain
where the local part may be up to 64 octets long and the domain may have a maximum of 255 octets. The formal definitions are in Note that unlike the syntax of
Local-part
The local-part of the email address may be unquoted or may be enclosed in quotation marks.If unquoted, it may use any of these ASCII characters:
- uppercase and lowercase Latin letters
A
toZ
anda
toz
- digits
0
to9
- printable characters
!#$%&'*+-/=?^_`~
- dot
.
, provided that it is not the first or last character and provided also that it does not appear consecutively .
".John.Doe"@example.com
, "John.Doe."@example.com
and "John..Doe"@example.com
are allowed.The maximum total length of the local-part of an email address is 64 octets.
Note that some mail servers support wildcard recognition of local parts, typically the characters following a plus and less often the characters following a minus, so fred+bah@domain and fred+foo@domain might end up in the same inbox as fred+@domain or even as fred@domain. This can be useful for tagging emails for sorting, and for spam control. Braces
are also used in that fashion, although less often. - space and special characters
",:;<>@
are allowed with restrictions ; - comments are allowed with parentheses at either end of the local-part; e.g.,
john.smith@example.com
andjohn.smith@example.com
are both equivalent tojohn.smith@example.com
.
A local part is either a Dot-string or a Quoted-string; it cannot be a combination.
Quoted strings and characters, however, are not commonly used.
The local-part
postmaster
is treated specially—it is case-insensitive, and should be forwarded to the domain email administrator. Technically all other local-parts are case-sensitive, therefore jsmith@example.com
and JSmith@example.com
specify different mailboxes; however, many organizations treat uppercase and lowercase letters as equivalent. Indeed, Despite the wide range of special characters which are technically valid, organisations, mail services, mail servers and mail clients in practice often do not accept all of them. For example, Windows Live Hotmail only allows creation of email addresses using alphanumerics, dot, underscore and hyphen. Common advice is to avoid using some special characters to avoid the risk of rejected emails.
Domain
The domain name part of an email address has to conform to strict guidelines: it must match the requirements for a hostname, a list of dot-separated DNS labels, each label being limited to a length of 63 characters and consisting of:- uppercase and lowercase Latin letters
A
toZ
anda
toz
; - digits
0
to9
, provided that top-level domain names are not all-numeric; - hyphen
-
, provided that it is not the first or last character.
, such as jsmith@
or jsmith@
, although this is rarely seen except in email spam. Internationalized domain names allow for presentation of non-ASCII domains. In mail systems compliant with Comments are allowed in the domain as well as in the local-part; for example,
john.smith@example.com
and john.smith@example.com
are equivalent to john.smith@example.com
.Reserved domains
specifies that certain domains intended for, e.g., documentation, testing, should not be resolvable and that as a result mail addressed to mailboxes in them and their subdomains should be non-deliverable. Of note for e-mail are- .example
- .invalid
- example.com
- example.net
- example.org
Examples
; Invalid email addresses
Common local-part semantics
According toThis means that no assumptions can be made about the meaning of the local-part of another mail server. It is entirely up to the configuration of the mail server.
Local-part normalization
Interpretation of the local part of an email address is dependent on the conventions and policies implemented in the mail server. For example, case sensitivity may distinguish mailboxes differing only in capitalization of characters of the local-part, although this is not very common. Gmail ignores all dots in the local-part of a @gmail.com address for the purposes of determining account identity.Subaddressing
Some mail services support a tag included in the local-part, such that the address is an alias to a prefix of the local part. For example, the address joeuser+tag@example.com denotes the same delivery address as joeuser@example.com.Addresses of this form, using various separators between the base name and the tag, are supported by several email services, including Runbox, Gmail, Rackspace, Yahoo! Mail Plus, Apple's iCloud, Outlook.com, ProtonMail, FastMail, , MMDF, Qmail and Courier Mail Server. Postfix and Exim allow configuring an arbitrary separator from the legal character set.
The text of the tag may be used to apply filtering, or to create single-use, or disposable email addresses.
In practice, the form validation of some web sites may reject characters such as "+" in an email address – treating them, incorrectly, as invalid characters. This can lead to an incorrect user receiving an e-mail if the "+" is silently stripped by a website without any warning or error messages. For example, an email intended for the user-entered email address foo+bar@example.com could be incorrectly sent to foobar@example.com. In other cases a poor user experience can occur if some parts of a site, such as a user registration page, allow the "+" character whilst other parts, such as a page for unsubscribing from a site's mailing list, do not.
Validation and verification
Email addresses are often requested as input to website as user identification for the purpose of data validation. Other validation methods are available, such as cell phone number validation, postal mail validation, and fax validation.An email address is generally recognized as having two parts joined with an at-sign, although technical specification detailed in
Syntactically correct, verified email addresses do not guarantee that an email box exists. Thus many mail servers use other techniques and check the mailbox existence against relevant systems such as the Domain Name System for the domain or using callback verification to check if the mailbox exists. Callback verification is an imperfect solution, as it may be disabled to avoid a directory harvest attack.
Several validation techniques may be utilized to validate an email address. For example,
- Verification links: Email address validation is often accomplished for account creation on websites by sending an email to the user-provided email address with a special temporary hyperlink. On receipt, the user opens the link, immediately activating the account. Email addresses are also useful as means of forwarding messages from a website, e.g., user messages, user actions, to the email inbox.
- Formal and informal standards:
RFC 3696 provides specific advice for validating Internet identifiers, including email addresses. Some websites instead attempt to evaluate the validity of email addresses through arbitrary standards, such as by rejecting addresses containing valid characters, such as + and /, or enforcing arbitrary length limitations. Email address internationalization provides for a much larger range of characters than many current validation algorithms allow, such as all Unicode characters above U+0080, encoded as UTF-8. - Algorithmic tools: Large websites, bulk mailers and spammers require efficient tools to validate email addresses. Such tools depend upon heuristic algorithms and statistical models.
- Sender reputation: An email sender's reputation may used to attempt to verify whether the sender is trustworthy or a potential spammer. Factors that may be incorporated into an assessment of sender reputation include the quality of past contact with or content provided by, and engagement levels of, the sender's IP address or email address.
- Browser-based verification: HTML5 forms implemented in many browsers allow email address validation to be handled by the browser.
Internationalization
The IETF conducts a technical and standards working group devoted to internationalization issues of email addresses, entitled Email Address Internationalization. This group produced, and continues to work on additional EAI-related RFCs.The IETF's EAI Working group published
The basic EAI concepts involve exchanging mail in UTF-8. Though the original proposal included a downgrading mechanism for legacy systems, this has now been dropped. The local servers are responsible for the local-part of the address, whereas the domain would be restricted by the rules of internationalized domain names, though still transmitted in UTF-8. The mail server is also responsible for any mapping mechanism between the IMA form and any ASCII alias.
EAI enables users to have a localized address in a native language script or character set, as well as an ASCII form for communicating with legacy systems or for script-independent use. Applications that recognize internationalized domain names and mail addresses must have facilities to convert these representations.
Significant demand for such addresses is expected in China, Japan, Russia, and other markets that have large user bases in a non-Latin-based writing system.
For example, in addition to the.in top-level domain, the government of India in 2011 got approval for ".bharat",, written in seven different scripts for use by Gujrati, Marathi, Bangali, Tamil, Telugu, Punjabi and Urdu speakers. Indian company XgenPlus.com claims to be the world's first EAI mailbox provider, and the Government of Rajasthan now supplies a free email account on domain राजस्थान.भारत for every citizen of the state. A leading media house Rajasthan Patrika launched their IDN domain पत्रिका.भारत with contactable email.
Internationalization examples
The example addresses below would not be handled by- Latin alphabet with diacritics: Pelé@example.com
- Greek alphabet: δοκιμή@παράδειγμα.δοκιμή
- Traditional Chinese characters: 我買@屋企.香港
- Japanese characters: 二ノ宮@黒川.日本
- Cyrillic characters: медведь@с-балалайкой.рф
- Devanagari characters: संपर्क@डाटामेल.भारत
Internationalization support
- Postfix mailer supports internationalized mail since 2015-02-08 with a stable release 3.0.0.
- Google has support for sending emails to and from internationalized domains, but does not allow the registration of non-ASCII email addresses.
- Microsoft added similar functionality in Outlook 2016
- DataMail launches internationalized email support for 8 Indian languages using the XgenPlus email platform in India.
Standards documents
- RFC 821 – Simple Mail Transfer Protocol
- RFC 822 – Standard for the Format of ARPA Internet Text Messages
- RFC 1035 – Domain names, Implementation and specification
- RFC 1123 – Requirements for Internet Hosts, Application and Support
- RFC 2142 – Mailbox Names for Common Services, Roles and Functions
- RFC 2821 – Simple Mail Transfer Protocol
- RFC 2822 – Internet Message Format
- RFC 3696 – Application Techniques for Checking and Transformation of Names
- RFC 4291 – IP Version 6 Addressing Architecture
- RFC 5321 – Simple Mail Transfer Protocol
- RFC 5322 – Internet Message Format
- RFC 5952 – A Recommendation for IPv6 Address Text Representation
- RFC 6530 – Overview and Framework for Internationalized Email
- RFC 6531 – SMTP Extension for Internationalized Email
- RFC 6854 – Update to Internet Message Format to Allow Group Syntax in the "From:" and "Sender:" Header Fields