Re: [wcsplus] Design of asynchronous request in DEWS WCS

To: "Jon Blower" <jdb@xxxxxxxxxxxxxxxxxxxx>, "Simon.Cox@xxxxxxxx" <Simon.Cox@xxxxxxxx>
Subject: Re: [wcsplus] Design of asynchronous request in DEWS WCS
From: Stefano Nativi <nativi@xxxxxxxxxxx>
Date: Fri, 02 Nov 2007 18:36:10 +0100

Dear all,

I really appreciate this discussion, which touches several of the issues we have been discussing and facing in our research and development activity.

We have been developing OWS on SOAP; recently, we decided to play with some REST implementations (especially for asynch interactions). Therefore, I'd like to add some comments stemming from our understanding of REST and experience with it. Please forgive the length of this email; I have actually put together Paolo's and my comments :-) .

Let me distinguish between the REST approach (the architectural style) and the RESTful implementation (the current technological solutions for implementing REST).


The REST approach has proved to be highly scalable and sufficiently flexible in many contexts, primarily the WEB infrastructure but also DB and filesystem access. In all these cases we have resources that are individually addressed through a uniform interface.

Indeed, the possible REST actions are limited by the uniform interface, which typically maps to the simple CRUD (create, retrieve, update and delete) paradigm. Often simplicity means generality and flexibility (see the netCDF data model case); in fact, this simplicity was one of the reasons for the pervasive success of the WEB and for its scalability. On the other hand, advanced semantic actions (e.g. resource processing actions) must be mapped to the basic CRUD vocabulary.

For example, in the DB domain we can use SQL: a DB is the resource domain; the uniform interface is made of the SELECT/INSERT/CREATE/UPDATE/DELETE methods; resource-IDs are all the possible SQL "WHERE" clauses. For the WEB (which may be seen as a globally distributed DB), resource-IDs are the WEB URIs (i.e. the "WEB clauses").

In both cases the resource-ID may become really complex (i.e. very long KVP strings, or complex SQL JOIN SELECTs) and, hence, it may be difficult to manage these IDs efficiently. For a REST WCS implementation (at the abstract level; no implementation details), resource-IDs are the GetCoverage clauses (analogous to the "SELECT" request content).
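The correspondence described above between the CRUD vocabulary, the HTTP methods and the SQL statements can be written down as a small lookup table. This is only an illustrative sketch: the pairing of HTTP methods with CRUD actions is the usual convention rather than anything defined by REST or WCS, and the names CRUD_MAP and vocabulary_for are hypothetical.

```python
# Illustrative only: the conventional pairing of CRUD actions with HTTP
# methods and SQL statements. Nothing here is defined by WCS itself.
CRUD_MAP = {
    "create":   ("POST",   "INSERT"),
    "retrieve": ("GET",    "SELECT"),
    "update":   ("PUT",    "UPDATE"),
    "delete":   ("DELETE", "DELETE"),
}


def vocabulary_for(action):
    """Return the (HTTP method, SQL statement) pair for a CRUD action."""
    return CRUD_MAP[action.lower()]
```

In this view a GetCoverage request is simply the "retrieve" row: an HTTP GET on the WEB side, a SELECT on the DB side.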

In our opinion, this is the real asset/limitation of REST: the application business logic must be faced and partially addressed at the interaction level (the protocol level), leaving the rest of the business logic to the server, which may consequently be simpler (almost any institution can manage a WEB server today). With the service-oriented approach, the entire application business logic is left to the server (i.e. the service provider), which implements an even simpler interaction: Exchange/Send an Electronic Document. Thus, SOA guarantees high flexibility, but the server (the service provider) has to face all the resource-related issues (e.g. resource caching, ID, creation, encoding, etc.) anyway.

Thus the focus of REST is on the uniform interface and on resource addressing, not on the nature of the resources (discrete, existing, etc.). If we can provide a uniform interface and complete resource addressing, we can adopt a REST architecture. In our opinion, WCS seems to be implicitly based on a uniform interface (since we GET coverages, GET coverage descriptions and GET server capabilities, and we do not explicitly define other actions like INTERPOLATE, SUBSET, etc.), which allows each resource to be addressed. Hence, a REST architecture seems an effective choice for this domain.

As to the RESTful implementation for geospatial resources, several issues must be considered.

First of all, we should define what "resource" and "resource representation" are in this domain. We could decide that a dataset is the resource and that all the features extracted from the dataset through interpolation, subsetting and resampling are simply different representations. In such a case we should only address the dataset with a known URI and possibly create new resources if required. On the other hand, we could consider each feature extracted from a dataset as a different resource. In such a case we should address each feature with a different URI.

Presently, we are working on this second approach for two reasons: theoretical consistency (according to the Web architecture, a representation should only affect formats) and implementation (different URIs could support server-side caching).

Concerning the addressing problem, we do not need to explicitly define URIs for each possible feature. We can simply provide a functional mapping between a URI-space and resource representations. In OWS, the URL-encoding of the KVP string in a GET request IS the resource addressing. The fact that the feature is dynamically created is not an architectural problem but an implementation issue, which might require smart caching servers.
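To make the "functional mapping" and the caching remark concrete, here is a minimal sketch of how a server might normalise a KVP request URI into one canonical resource-ID and use it as a cache key. It assumes that parameter names are case-insensitive and that parameter order carries no meaning; resource_key, get_feature and the extract callback are hypothetical names, not part of any WCS specification.

```python
from urllib.parse import urlsplit, parse_qsl, urlencode


def resource_key(uri):
    """Map a KVP request URI to a canonical resource-ID.

    Assumption: parameter names are case-insensitive and their order is
    irrelevant, so names are lower-cased and the pairs are sorted.
    """
    parts = urlsplit(uri)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    canonical = sorted((name.lower(), value) for name, value in pairs)
    # Keep commas readable so bbox values stay in their familiar form.
    return f"{parts.path}?{urlencode(canonical, safe=',')}"


_cache = {}


def get_feature(uri, extract):
    """Return the feature addressed by uri, creating it only on first use.

    extract is a caller-supplied function (hypothetical) that performs the
    actual subsetting/interpolation for a canonical resource-ID.
    """
    key = resource_key(uri)
    if key not in _cache:
        _cache[key] = extract(key)
    return _cache[key]
```

With this mapping, two requests that differ only in parameter order or in the case of parameter names address the same resource, so the dynamically created feature is computed once and then served from the cache.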


For example:

http://someserver.net/wcs?name=foo&bbox=-180,-90,180,90&...

is the URI for the feature extracted from the coverage named "foo" with the interpolation, subsetting and resampling defined by the bbox (and other) parameters. (A better URI could be defined by leaving only the non-hierarchical parameters in the query part of the URI. Something like:


http://someserver.net/coverages/foo?bbox=-180,-90,180,90&...

)
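The rewriting between the two URI forms above can be sketched as follows. This is only an illustration: the /coverages/ path prefix is taken from the example, while the function name to_hierarchical and the choice of "name" as the only hierarchical parameter are assumptions.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode, quote


def to_hierarchical(uri):
    """Move the coverage name from the query string into the URI path.

    Assumption: "name" is the only hierarchical parameter; everything else
    stays in the query part. Raises KeyError if "name" is missing.
    """
    parts = urlsplit(uri)
    params = dict(parse_qsl(parts.query, keep_blank_values=True))
    name = params.pop("name")
    path = "/coverages/" + quote(name, safe="")
    query = urlencode(params, safe=",")
    return urlunsplit((parts.scheme, parts.netloc, path, query, ""))
```

For instance, to_hierarchical("http://someserver.net/wcs?name=foo&bbox=-180,-90,180,90") returns "http://someserver.net/coverages/foo?bbox=-180,-90,180,90".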

When the request is encoded in a POST, it should be considered as a query to the root resource, which responds with the representation of the target resource. This could also be viewed as an extraction-from-dataset service; however, this may introduce useless complexity since the request is still a GET action. In fact, there exists an implicit hierarchy of our features, and the root feature (the "foo" coverage in our example) supports not only its own GET operation, but also the selection of its children via a POST operation.

These considerations seem to be valid not only for WCS but for all the data access services (e.g. WCS, WFS and WMS). They conform to a resource-oriented approach and can be implemented in a RESTful architecture with "minimal" modifications of existing specifications. Besides, the RESTful implementation might be easily adopted by data providers, since it should be based on well-known technologies.

The case of WPS and WCTS seems to be different. In fact, they don't define a uniform interface for the many operations they should support; on the contrary, they introduce a uniform interface to receive a message which contains specific operation requests. In this case we should use the POST method as the extension point for interaction with HTTP-based services which create new addressable resources (a sort of endpoint in the SOA view). In such a way we should have the advantages of pervasive and scalable data provision (through the RESTful implementation) and of modular and composable processing (through the service-oriented architecture).
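The idea of POST as an extension point that creates new addressable resources can be sketched with an in-memory stand-in for such a service. Everything here is hypothetical (the class name, the /results/ URI layout, running the operation synchronously); a real service would sit behind HTTP and might run the operation in the background.

```python
import itertools


class ProcessingEndpoint:
    """In-memory stand-in for an HTTP endpoint that accepts POSTed requests."""

    def __init__(self, base_uri):
        self.base_uri = base_uri.rstrip("/")
        self._results = {}
        self._ids = itertools.count(1)

    def post(self, operation, payload):
        """Run the requested operation and return the URI of the new resource.

        Simplification: the operation runs synchronously here.
        """
        uri = f"{self.base_uri}/results/{next(self._ids)}"
        self._results[uri] = operation(payload)
        return uri

    def get(self, uri):
        """Retrieve a previously created resource through the uniform interface."""
        return self._results[uri]
```

The processing request goes through POST, while the result becomes an ordinary resource with its own URI that any client can GET afterwards.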



Some possible conclusions:

A RESTful implementation is valuable for scalability and extensibility (derived from the REST architectural style) as well as for simplicity (the implementation is simple since it is based on well-known technology and only simple operations must be supported server-side).

The RESTful implementation seems feasible for data access services because they are typically resource-based.

The RESTful architecture must interact with a service-oriented architecture for basic and advanced processing. XML and HTTP are the key technologies for bridging.





Thank you for your patience,

Stefano and Paolo
