gcat/docs/index.rst

399 lines
21 KiB
ReStructuredText
Raw Normal View History

2022-09-21 11:24:15 +02:00
#############################
Welcome to gCat documentation
#############################
2022-09-13 19:00:30 +02:00
2022-09-13 17:28:21 +02:00
gCat is a RESTful application which exposes operations ...
See the available REST-API on `its API docs <../api-docs/index.html>`_.
2022-09-21 11:24:15 +02:00
**********
Base URL
**********
2022-09-13 17:28:21 +02:00
2022-09-13 17:36:05 +02:00
In the production environment, its current value is https://api.d4science.org/gcat
2022-09-13 17:28:21 +02:00
2022-09-21 11:24:15 +02:00
*************
2022-09-13 17:28:21 +02:00
Authorization
2022-09-21 11:24:15 +02:00
*************
2022-09-13 17:28:21 +02:00
D4Science adopts state-of-the-art industry standards for authentication and authorization.
2022-09-16 18:01:30 +02:00
Specifically, the implementation fully adopts `OIDC (OpenID Connect) <https://openid.net/connect>`_ for authentication and UMA 2 (User-Managed Authorization) for authorization flows.
2022-09-13 17:28:21 +02:00
`JSON Web Token (JWT) Access token <https://jwt.io/>`_ are used for both authentication and authorization.
Obtain your Bearer token here: https://dev.d4science.org/how-to-access-resources
2022-09-21 11:24:15 +02:00
*******
2022-09-13 19:00:30 +02:00
Service
2022-09-21 11:24:15 +02:00
*******
2022-09-13 17:28:21 +02:00
2022-09-16 18:01:30 +02:00
You can call the methods of the Web Service by writing your own REST client application or using existing REST client plugins.
2022-09-13 19:00:30 +02:00
2022-09-21 11:24:15 +02:00
=============
2022-09-15 16:51:17 +02:00
HTTP Statuses
2022-09-21 11:24:15 +02:00
=============
2022-09-15 16:51:17 +02:00
2022-09-21 11:24:15 +02:00
Any successful operation returns *200 OK* HTTP status code.
2022-09-19 12:45:30 +02:00
The create operation returns *201 Created*.
2022-09-21 11:24:15 +02:00
Any Background operation returns *202 Accepted*.
Any operation which does not provide any content return *204 No Content*.
2022-09-15 16:51:17 +02:00
The most common error status a client can obtain are:
* **400 Bad Request** used to indicate a clients error `<https://tools.ietf.org/html/rfc7231#section-6.5.1>`_;
2022-09-16 18:01:30 +02:00
* **401 Unauthorized** used to indicate that the client does not provide the authorization token in the HTTP Header or the client has not enough right to perform such request `<https://tools.ietf.org/html/rfc7235#section-3.1>`_;
* **404 Not Found** used to indicate that the requested instance does not exist `<https://tools.ietf.org/html/rfc7231#section-6.5.4>`_;
2022-09-15 16:51:17 +02:00
* **405 Method Not Allowed** the used HTTP method is not supported for the requested URL `<https://tools.ietf.org/html/rfc7231#section-6.5.5>`_.
The response contains the *Allow* HTTP Header indicating the supported HTTP method for such URL `<https://tools.ietf.org/html/rfc7231#section-7.4.1>`_;
* **409 Conflict** the request could not be completed due to a conflict with the current state of the target resource (e.g. the name of the resource already exists) `<https://tools.ietf.org/html/rfc7231#section-6.5.8>`_;
* **500 Internal Server Error** indicate a server failure `<https://tools.ietf.org/html/rfc7231#section-6.6.1>`_.
2022-09-16 18:01:30 +02:00
You can find a complete list of HTTP Status at `<https://httpstatuses.com/>`_
2022-09-15 16:51:17 +02:00
2022-09-16 18:01:30 +02:00
If you get a *500 Internal Server Error*, please report it in the `gCube ticketing system `<https://support.d4science.org>`_.
2022-09-15 16:51:17 +02:00
Please use this checklist before reporting an error:
2022-09-16 18:01:30 +02:00
* Replicate the request;
* The failure could be temporal due to network error, server issue and many other temporal issues. For this reason, please retry the request after a certain amount of time before reporting the issue;
2022-09-15 16:51:17 +02:00
* indicate how to replicate the error;
2022-09-16 18:01:30 +02:00
* indicate the time when the error occurred (this simplifies identifying the issue).
2022-09-15 16:51:17 +02:00
2022-09-21 11:24:15 +02:00
============
2022-09-15 16:51:17 +02:00
HTTP Methods
2022-09-21 11:24:15 +02:00
============
2022-09-16 18:01:30 +02:00
gCat is a pure RESTful service. It uses standard HTTP Methods to perform a listing of collections and CRUD (Create Read Update Delete) operations on instances.
2022-09-15 16:51:17 +02:00
.. table::
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Operation | HTTP Method | URL | Success HTTP Status | Safe | Idempotent |
+==============+=============+========================================+=====================+========+============+
2022-09-16 16:21:36 +02:00
| Supported | OPTIONS | /{COLLECTION} | 204 No Content | Y | Y |
| HTTP Methods | | | [#allow]_ | | |
2022-09-15 16:51:17 +02:00
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| List | GET | /{COLLECTION} | 200 OK | Y | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Count | GET | /{COLLECTION}?count=true | 200 OK | Y | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Exists | HEAD | /{COLLECTION} | 204 No Content | Y | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Create | POST | /{COLLECTION} | 201 Created | N | N |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
2022-09-16 16:21:36 +02:00
| Supported | OPTIONS | /{COLLECTION}/{INSTANCE_ID} | 204 No Content | Y | Y |
| HTTP Methods | | | [#allow]_ | | |
2022-09-15 16:51:17 +02:00
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Exist | HEAD | /{COLLECTION}/{INSTANCE_ID} | 204 No Content | Y | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Read | GET | /{COLLECTION}/{INSTANCE_ID} | 200 OK | Y | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Update | PUT | /{COLLECTION}/{INSTANCE_ID} | 200 OK | N | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
| Patch | PATCH | /{COLLECTION}/{INSTANCE_ID} | 200 OK | N | Y |
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
2022-09-16 16:21:36 +02:00
| Delete | DELETE | /{COLLECTION}/{INSTANCE_ID} | 204 No Content | N | N [#del]_ |
2022-09-15 16:51:17 +02:00
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
2022-09-16 16:21:36 +02:00
| Purge | DELETE | /{COLLECTION}/{INSTANCE_ID}?purge=true | 204 No Content | N | N [#del]_ |
2022-09-15 16:51:17 +02:00
+ +-------------+----------------------------------------+---------------------+--------+------------+
2022-09-16 16:21:36 +02:00
| | PURGE | /{COLLECTION}/{INSTANCE_ID} | 204 No Content | N | N [#del]_ |
2022-09-15 16:51:17 +02:00
+--------------+-------------+----------------------------------------+---------------------+--------+------------+
2022-09-16 16:21:36 +02:00
.. [#allow] Supported HTTP Methods in **Allow** HTTP Header
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
.. [#del] DELETE has been defined as idempotent.
*Allamaraju* [#Allamaraju]_ argues that DELETE idempotency should be accomplished client-side.
2022-09-16 18:01:30 +02:00
The server should inform the client if the delete operation succeeded because the resource was really deleted or it was not found, i.e., **404 Not Found** error is suggested instead of **204 No Content**.
2022-09-16 16:21:36 +02:00
The latter situation should be treated as idempotent by the client.
We share the same vision. For this reason, gCat does not provide server-side idempotency for DELETE and PURGE operations.
.. [#Allamaraju] Allamaraju S. RESTful Web Services Cookbook: Solutions for Improving Scalability and Simplicity . OReilly. first ed. 2010
2022-09-21 11:24:15 +02:00
2022-09-16 16:21:36 +02:00
About URL
2022-09-21 11:24:15 +02:00
*********
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
The presented URL uses the following convention:
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
* **{COLLECTION}** is the plural name of the entity type;
2022-09-16 18:01:30 +02:00
* **{INSTANCE_ID}** is an identification that enables univocally identifying the instance in the collection.
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
About Safety and Idempotency properties
2022-09-21 11:24:15 +02:00
***************************************
2022-09-15 16:51:17 +02:00
* A method is *Safe* if it does not produce any side effects.
"This does not prevent an implementation from including behaviour that is potentially harmful, that is not entirely read-only, or that causes side effects while invoking a safe method"
`<https://tools.ietf.org/html/rfc7231#section-4.2.1>`_;
* A method is *Idempotent* if the same operation repeated multiple times has the same side effect than using it one time.
"repeating the request will have the same intended effect, even if the original request succeeded, though the response might differ"
`<https://tools.ietf.org/html/rfc7231#section-4.2.2>`_.
You can find more information about HTTP Methods at `<https://restfulapi.net/http-methods/>`_
2022-09-21 11:24:15 +02:00
Uncommon HTTP Methods
*********************
2022-09-15 16:51:17 +02:00
2022-09-16 18:01:30 +02:00
* PATCH method allows to perform a differential update (i.e. an update which provides only the differences and not the whole new representation);
* PURGE method is not a standard but is widely used in service which requires this action
2022-09-15 16:51:17 +02:00
(e.g. `Varnish <https://varnish-cache.org/docs/3.0/tutorial/purging.html>`_, `Squid <https://wiki.squid-cache.org/SquidFaq/OperatingSquid#How_can_I_purge_an_object_from_my_cache.3F>`_).
2022-09-16 18:01:30 +02:00
gCat provides support for this method, but to support a wider range of clients, it also provides the Purge action via *DELETE* with the additional get parameter ``purge=true``.
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
2022-09-21 11:24:15 +02:00
============
2022-09-16 18:01:30 +02:00
Content-Type
2022-09-21 11:24:15 +02:00
============
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
Any request must contain an indication of the interesting content type.
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
The client must specify the **Accept** HTTP Header for any operation returning a result.
2022-09-13 19:00:30 +02:00
.. code-block:: rest
Accept: application/json
For any operation sending content to the service, it is necessary to specify the **Content-Type** HTTP Header.
.. code-block:: rest
Content-Type: application/json
2022-09-16 18:01:30 +02:00
The service accepts and returns only JSON objects.
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
`Profile Collection <../api-docs/resource\_Profile.html>`_ instead can be manipulated in XML only.
2022-09-13 19:00:30 +02:00
2022-09-21 11:24:15 +02:00
===========
2022-09-15 16:51:17 +02:00
Collections
2022-09-21 11:24:15 +02:00
===========
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
The following collections are available to any user.
Catalogue-Editor or above can invoke Non-safe methods only.
2022-09-13 19:00:30 +02:00
2022-09-16 16:21:36 +02:00
* `Item Collection <../api-docs/resource_Item.html>`_;
2022-09-15 16:51:17 +02:00
2022-09-16 16:21:36 +02:00
* `Resource Collection <../api-docs/resource_Resource.html>`_;
* `Profile Collection <../api-docs/resource_Profile.html>`_;
* `Namespace Collection <../api-docs/resource_Namespace.html>`_;
* `License Collection <../api-docs/resource_License.html>`_;
2022-09-16 18:00:38 +02:00
* `Trash Collection <../api-docs/resource_Trash.html>`_;
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
The following collections are available for Catalogue-Admins or above only:
2022-09-13 19:00:30 +02:00
2022-09-16 16:21:36 +02:00
* `Group Collection <../api-docs/resource_Group.html>`_;
* `Organization Collection <../api-docs/resource_Organization.html>`_;
* `User Collection <../api-docs/resource_User.html>`_;
* `Configuration Collection <../api-docs/resource_Configuration.html>`_.
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
An overview of the available collections is available at `<../api-docs/index.html>`_;
2022-09-13 19:00:30 +02:00
2022-09-21 11:24:15 +02:00
=====
2022-09-15 16:51:17 +02:00
Roles
2022-09-21 11:24:15 +02:00
=====
2022-09-13 19:00:30 +02:00
2022-09-16 18:01:30 +02:00
Any user has one or more roles in the catalogue.
The VRE Manager can only assign roles.
2022-09-16 18:00:38 +02:00
2022-09-15 16:51:17 +02:00
The catalogue uses the following hierarchic roles:
2022-09-13 19:00:30 +02:00
2022-09-16 16:35:17 +02:00
Catalogue-Member:
2022-09-16 18:01:30 +02:00
A user with such a role is mainly capable of listing and reading items;
2022-09-16 16:35:17 +02:00
Catalogue-Editor:
2022-09-19 15:17:55 +02:00
A user with such a role is capable of managing the items he/she creates and capable of using other safe APIs;
2022-09-16 16:35:17 +02:00
Catalogue-Admin:
2022-09-16 18:01:30 +02:00
A user with such a role is capable of administrating many aspects of the catalogue;
2022-09-16 16:35:17 +02:00
Catalogue-Manager:
2022-09-20 17:59:14 +02:00
A user with such a role can use all the APIs exposed by the service except item moderation APIs (e.g. approve, reject, ...).
2022-09-16 16:35:17 +02:00
2022-09-13 19:00:30 +02:00
2022-09-15 16:51:17 +02:00
Another role that is not in the role hierarchy:
2022-09-13 19:00:30 +02:00
2022-09-16 16:35:17 +02:00
Catalogue-Moderator:
2022-09-16 18:01:30 +02:00
A user with such a role is capable of invoking the item moderation APIs.
2022-09-13 19:00:30 +02:00
2022-09-15 16:51:17 +02:00
.. TIP::
2022-09-16 16:35:17 +02:00
Please note that not all catalogues are moderated.
2022-09-21 11:24:15 +02:00
********************
2022-09-16 16:35:17 +02:00
Moderated Catalogues
2022-09-21 11:24:15 +02:00
********************
2022-09-16 16:35:17 +02:00
Any catalogues can be declared as moderated.
2022-09-20 15:26:20 +02:00
This means that, a Catalogue-Moderator must approve any submitted items to make them available to the other users of the catalogue.
2022-09-16 16:35:17 +02:00
2022-09-16 18:00:38 +02:00
In a moderated catalogue, an item can be in the following states:
pending:
2022-09-20 15:26:20 +02:00
The item published by any allowed author (a Catalogue-Editor or above) but not available to the other users of the catalogue.
A Catalogue-Moderator has to approve or reject it;
2022-09-16 18:00:38 +02:00
approved:
2022-09-16 18:01:30 +02:00
A Catalogue-Moderator has approved the item published by any allowed users;
2022-09-16 18:00:38 +02:00
rejected:
2022-09-16 18:01:30 +02:00
A Catalogue-Moderator has rejected the item published by any allowed users.
2022-09-16 18:00:38 +02:00
2022-09-16 18:01:30 +02:00
The following are the moderation operations that an allowed user can perform on an item.
To present the moderation operations, we use the following convention:
2022-09-16 18:00:38 +02:00
2022-09-16 18:01:30 +02:00
``initial_state`` ---**operation** (*User/Role performing the operation*)---> ``final_state``
2022-09-16 18:00:38 +02:00
2022-09-16 18:01:30 +02:00
``initial_state`` can be ``none``, meaning the item does not exist.
2022-09-16 18:00:38 +02:00
2022-09-19 17:51:59 +02:00
The following are the allowed moderation operation on an item
2022-09-16 18:00:38 +02:00
``none`` ---**create** (*Author*)---> ``pending``
``pending`` ---**reject** (*Catalogue-Moderator*)---> ``rejected``
``pending`` ---**approve** (*Catalogue-Moderator*)---> ``approved``
2022-09-19 15:17:55 +02:00
``rejected`` ---**update** (*Author*)---> ``pending``
2022-09-16 18:00:38 +02:00
``approved`` ---**update** (*Author*)---> ``pending``
2022-09-19 17:51:59 +02:00
**TO DO: EXPLAIN WHICH USER CAN INVOKE AN UPDATE ON ITEM IN MODERATION**
2022-09-16 18:01:30 +02:00
In a moderated catalogue, both the Catalogue-Moderators and the item author can send messages to
discuss the approval process of the item. The messages are related to a specific item.
Any Catalogue-Moderators receive a message sent by an Author.
The author receives a message sent by a Catalogue-Moderator as well as the other Catalogue-Moderators (if any).
2022-09-16 18:00:38 +02:00
2022-09-16 18:01:30 +02:00
Messages can be sent both with an action which changes the status of the item or as explicit action which does not change the status of the item:
2022-09-16 18:00:38 +02:00
``pending`` ---**message** (*Author OR Catalogue-Moderator*)---> ``pending``
``rejected`` ---**message** (*Author OR Catalogue-Moderator*)---> ``rejected``
``approved`` ---**message** (*Author OR Catalogue-Moderator*)---> ``approved``
2022-09-16 16:35:17 +02:00
2022-09-20 15:26:20 +02:00
The following table summarize the allowed/forbidden operations depending on: the role of the user and the state of the item.
.. table::
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| Operation | Item State | Roles |
+ + +----------------------+--------------------------+------------------------------------------+-------------------+
| | | Catalogue Moderator | Catalogue Admin/Manager | Catalogue Editor | Catalogue Member |
+=====================================+=============+======================+==========================+==========================================+===================+
2022-09-20 17:59:14 +02:00
| List | Yes all states | Yes all states | Yes only approved - All states if Author | Yes only approved |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Count | Yes all states | Yes all states | Yes only approved - All states if Author | Yes only approved |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Create | 403 Forbidden | Yes -> Pending | Yes -> Pending | 403 Forbidden |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Read | Yes all states | Yes all states | Yes only approved - All states if Author | Yes only approved |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| Update | Pending | Yes -> Pending | 403 Forbidden | Yes if Author -> Pending | 403 Forbidden |
+ +-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| | Rejected | Yes -> Pending | 403 Forbidden | Yes if Author -> Pending | 403 Forbidden |
+ +-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| | Approved | 403 Forbidden | Yes -> Approved | Yes if Author -> Pending | 403 Forbidden |
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| Delete/Purge | Pending | Yes | Yes | Yes if Author | 403 Forbidden |
+ +-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| | Rejected | Yes | Yes | Yes if Author | 403 Forbidden |
+ +-------------+----------------------+--------------------------+------------------------------------------+-------------------+
| | Approved | 403 Forbidden | Yes | Yes if Author | 403 Forbidden |
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Approve a pending item | Yes | 403 Forbidden | 403 Forbidden | 403 Forbidden |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Reject a pending item | Yes | 403 Forbidden | 403 Forbidden | 403 Forbidden |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
| Message about an item | Yes | Yes if Author | Yes if Author | 403 Forbidden |
2022-09-20 15:26:20 +02:00
+-------------------------------------+-------------+----------------------+--------------------------+------------------------------------------+-------------------+
2022-09-20 17:59:14 +02:00
The Moderation process has associated notification to authors and Catalogue-Moderators.
Please note that the user who has acted is not self-notified, e.g.
approve operation made by a Catalogue-Moderator notifies the item author and the other Catalogue-Moderators of the VRE.
The following table summarises the addressee of the notification for any action.
.. table::
+------------+-------------------------+-----------------+
| Operation | Notified user/role |
+ +-------------------------+-----------------+
| | Catalogue-Moderators | Author |
+============+===========================================+
| Create | Yes | No |
+------------+-------------------------------------------+
| Update | Yes | Yes |
+------------+-------------------------------------------+
| Approve | Yes + Social Post if enabled for the VRE |
+------------+-------------------------------------------+
| Reject | Yes | Yes |
+------------+-------------------------------------------+
| Message | Yes | Yes |
+------------+-------------------------------------------+
2022-09-21 11:24:15 +02:00
***********
2022-09-13 19:00:30 +02:00
Java Client
2022-09-21 11:24:15 +02:00
***********
2022-09-13 17:28:21 +02:00
2022-09-13 19:00:30 +02:00
We provide the following Java Client out-of-the-box.
2022-09-13 17:28:21 +02:00
.. TIP::
2022-09-16 18:01:30 +02:00
If you're coding in Java, it is recommended that you use this Java Client.
2022-09-13 17:28:21 +02:00
**Maven Coordinates**
.. code:: xml
<groupId>org.gcube.data-catalogue</groupId>
<artifactId>gcat-client</artifactId>
2022-09-13 17:36:05 +02:00
<version>[2.2.0, 3.0.0-SNAPSHOT)</version>
2022-09-13 17:28:21 +02:00
**Methods Result**
2022-09-16 18:01:30 +02:00
The service exposes `its methods <../api-docs/index.html>`_ using a standard naming approach. Moreover, they accept (in the case of HTTP POST/PUT methods) JSON objects.
2022-09-13 17:28:21 +02:00
.. IMPORTANT::
The result of all methods is always a JSON object as per below:
.. code:: javascript
{
2022-09-13 17:44:31 +02:00
....
2022-09-13 17:28:21 +02:00
}
*Inputs are automatically validated before the request is served.*
**Usage examples**
- Example 1
.. code:: java
import org.gcube.gcat.client.Item;
// count item number
2022-09-13 17:59:58 +02:00
Item item = new Item();
int count = item.count();
2022-09-13 17:28:21 +02:00
...
2022-09-13 17:59:58 +02:00