forked from D-Net/dnet-hadoop
Lampros Smyrnaios
8d0ed7d414
- Update the "uoa-validator-engine2" dependency. - Update the "installProject.sh" script to account for potential conflict with previous builds. - Add documentation. |
||
---|---|---|
.. | ||
src/main | ||
README.md | ||
installProject.sh | ||
pom.xml | ||
runOozieWorkflow.sh |
README.md
Continuous Validation
This module is responsible for deploying an Oozie Workflow (on the desired cluster), which executes a Spark action.
This action takes the HDFS-path of a directory of parquet files containing metadata records, and applies the validation process on all of them, in parallel. Then it outputs the results, in json-format, in the given directory.
The validation process is powered by the uoa-validator-engine2 software.
Install and run
Run the ./installProject.sh script and then the ./runOozieWorkflow.sh script.
[...]