|
|
||
|---|---|---|
| airflow | ||
| collection-worker | ||
| fos-spark | ||
| hive | ||
| jupyter-notebook | ||
| monitor-aggregator | ||
| spark-image | ||
| spark-monitor | ||
| spark-operator | ||
| LICENSE | ||
| README.md | ||
README.md
OpenAIRE Graph Pipeline — Docker Images
This repository contains Docker images used in the OpenAIRE graph pipeline, an infrastructure that collects, transforms, and aggregates scholarly metadata from research data providers worldwide.
The images fall into two categories:
- Adaptations of existing open-source projects — upstream images extended with the connectors, drivers, or plugins required by the OpenAIRE pipeline.
- Specific OpenAIRE projects — purpose-built services (
spark-monitor,monitor-aggregator) that implement pipeline-specific business logic.
All images are published as multi-arch builds (linux/amd64 + linux/arm64) to the internal Harbor registry (harbor.d4science.org/scilab/).
Building
Each subdirectory contains a Dockerfile with the exact docker buildx command as a comment at the top, e.g.:
docker buildx build \
--tag harbor.d4science.org/scilab/<image>:<tag> \
--platform linux/arm64,linux/amd64 \
--push .
Refer to each subfolder's README.md for version-specific build instructions.