Collection of reusable docker images
Go to file
Sandro La Bruzzo 22c5f18a0d Merge branch 'main' of code-repo.d4science.org:D-Net/docker-images 2026-04-28 13:50:04 +02:00
airflow Bump airflow to 3.1.8 2026-03-30 15:00:09 +02:00
collection-worker updated dependencies 2026-04-28 13:48:48 +02:00
fos-spark Add pip modules to spark-fos package 2026-03-30 15:00:35 +02:00
hive Addded README files 2026-02-23 14:34:25 +01:00
jupyter-notebook Addded README files 2026-02-23 14:34:25 +01:00
monitor-aggregator updated dependencies 2026-04-28 13:48:48 +02:00
spark-image updated dependencies 2026-04-28 13:48:48 +02:00
spark-monitor Addded README files 2026-02-23 14:34:25 +01:00
spark-operator Addded README files 2026-02-23 14:34:25 +01:00
LICENSE Initial commit 2025-11-12 15:49:41 +01:00
README.md Addded README files 2026-02-23 14:34:25 +01:00

README.md

OpenAIRE Graph Pipeline — Docker Images

This repository contains Docker images used in the OpenAIRE graph pipeline, an infrastructure that collects, transforms, and aggregates scholarly metadata from research data providers worldwide.

The images fall into two categories:

  • Adaptations of existing open-source projects — upstream images extended with the connectors, drivers, or plugins required by the OpenAIRE pipeline.
  • Specific OpenAIRE projects — purpose-built services (spark-monitor, monitor-aggregator) that implement pipeline-specific business logic.

All images are published as multi-arch builds (linux/amd64 + linux/arm64) to the internal Harbor registry (harbor.d4science.org/scilab/).


Building

Each subdirectory contains a Dockerfile with the exact docker buildx command as a comment at the top, e.g.:

docker buildx build \
  --tag harbor.d4science.org/scilab/<image>:<tag> \
  --platform linux/arm64,linux/amd64 \
  --push .

Refer to each subfolder's README.md for version-specific build instructions.