The Controller app of the PDF Aggregation Service.
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Go to file
Lampros Smyrnaios a01e11eef0 When all the data is processed, increase the number of "max-attempts" to retry some very old records, in the next requests. 2 years ago
gradle/wrapper - Workaround a bug of Impala-JDBC-Driver, when creating insert-prepared-statements. 2 years ago
scripts Initial commit of UrlsController. 3 years ago
src/main When all the data is processed, increase the number of "max-attempts" to retry some very old records, in the next requests. 2 years ago
README.md - Implement the "getAndUploadFullTexts" functionality. In order to access the S3-ObjectStore from one trusted place, the Controller will request the files from the workers and upload them on S3. Afterwards, the workers will delete those files from their local storage. Previously, each worker uploaded its own files. 2 years ago
build.gradle - Change the repository for the Impala JDBC Driver, as the previous one had networking issues. 2 years ago
installAndRun.sh Fix not prioritizing the gradle version defined inside the "installAndRun.sh" script. 2 years ago
settings.gradle - Add the "isControllerAlive"-endpoint. 3 years ago

README.md

UrlsController

This is the Controller's Application.
It receives requests coming from the workers , constructs an assignments-list with data received from a database and returns the list to the workers.
Then it receives the "WorkerReports" and writes them into the database.
The database used is the Impala .
[...]

To install and run the application, run git clone. Then, provide a file "S3_minIO_credentials.txt", inside the working directory.
In the "S3_minIO_credentials.txt" file, you should provide the endpoint, the accessKey, the secretKey, the region and the bucket, in that order, separated by comma.
Afterwards, execute the installAndRun.sh script.
If you want to just run the app, then run the script with the argument "1": ./installAndRun.sh 1.