Raj Krishnamurthy and Deepak Alur started working on EMML in 2006. Their objective was to enable user-oriented and user-enabled mashups by creating what was then a new type of middleware called an Enterprise Mashup Platform. Raj Krishnamurthy became the chief language designer and implementer of EMML and also led the team to create an Eclipse-based EMML IDE called Mashup Studio. This work evolved into the EMML reference implementation that was donated to the Open Mashup Alliance. Raj Krishnamurthy continues to be one of the key contributors to EMML through the Open Mashup Alliance.
EMML features
EMML language provides a rich set of high-level mashup-domain vocabulary to consume and mash a variety of Web data-sources in flexible ways. EMML provides a uniform syntax to invoke heterogeneous service styles: REST, WSDL, RSS/ATOM, RDBMS, and POJO. The EMML language also provides ability to mix diverse data formats: XML, JSON, JDBC, JavaObjects, and primitive types. High-level EMML language features include:
Filter and sort data coming from heterogeneous services
Join data across heterogeneous services and data formats
EMML is primarily a XML-based declarative language, but also provides ability to encode complex logic using embedded scripting engines. XPath is the expression language used in EMML.
Directinvoke statement
directinvoke provides ability to invoke and consume a variety of data services. These data services may be REST, RSS/ATOM, or SOAP services. directinvoke also supports Web clipping by allowing HTML pages to be specified as service endpoints. HTTP GET, POST, PUT, and DELETE protocols are supported in directinvoke. HTTP Header and cookie support is also available thus providing capability to consume a wide variety of REST/SOAP Web services. It is possible to use directinvoke with a proxy server. Code sample of passing attributes as parameters to a service: method="GET" outputvariable="$result" query="items=all" appID="67GYH30N25" /> method="GET" outputvariable="$news" xmlns:sc="http://www.svcltd.com/" sc:date="20070515" sc:nights="3"/>
Filter statement
The filter statement filters the content of a variable using an XPath expression and places the result in a new variable. Code sample for filtering west-coast customers using region data-item:
Sort statement
The sort statement sorts the content of a document-type variable or variable fragment based on key expressions and places the result in another variable. Code sample that sorts tickets based on created date and customer: sortexpr="ticket" sortkeys="xs:date descending, customer ascending" outputvariable="$troubleTickets"/>
Groupby statement
groupby provides the ability to group and aggregate data sets. Standard XPath aggregation operations can be used and there is an extension mechanism for adding user-defined functions. Nested Grouping of hierarchical data sets are also supported. There is a Having clause to filter Group attributes. Code sample that groups books by genre and computes total copies for each genre:
Merge statement
merge provides ability to combine various data sources including RSS/ATOM feeds, XML, JSON payload formats. The merge feature is similar to SQL UNION functionality but merges hierarchical document structures. Code sample that merges Yahoo! News, Financial News, and Reuters feeds: outputvariable="$NewsAggregate"/>
Annotate statement
annotate provides ability to enrich the semantic meaning of source service data with microformat-like elements/attributes. These data annotations can be used by mashlets or gadgets to provide richer visual user interfaces. Code sample for annotating vendor payload with geo-coordinates: element geo:lat, element geo:long
Join statement
The join statement defines how disparate, hierarchical data formats are joined and is comparable to inner joins for relational databases. Code sample where output variable contains a element with a repeating set of children, which are the repeating items. Each contains a child with data from the variable named movies and and children with data from the variable named reviews: joincondition="$movies/movie/@id = $reviews/review/movie/title">
Scripting in EMML
EMML is a declarative language, but provides programmatic scripting extensions for performing complex mashup logic. JavaScript, JRuby, Groovy, POJO, XQuery scripting environments are supported. Data flows seamlessly between EMML and scripting environments. Code sample where JavaScript snippet is used to extract authentication token that is required for subsequent calls "result" variable that gets propagated to JavaScript environment: