You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
Lampros Smyrnaios 9096137008 Update documentation. 1 month ago
gradle/wrapper - Change the fileNames' structure in the S3-ObjectStore. 2 months ago
src/main Update documentation. 1 month ago
.gitignore springified project 4 months ago
Dockerfile - Allow the user to build, push and run the App in Docker, straight though the "installAndRun.sh" script. 4 months ago
README.md Update the README.md 3 months ago
build.gradle - Remove the obsolete "parenthesis" and "increasing duplicate-num" from the full-texts' names, before sending them to the S3-Object-Store. They now end with the "file-hash", so it is guaranteed that they will be unique. The Worker continues to produce the previous kind of names, without any disturbance. 1 month ago
installAndRun.sh - Change the fileNames' structure in the S3-ObjectStore. 2 months ago
settings.gradle - Add the "isControllerAlive"-endpoint. 8 months ago

README.md

UrlsController

The Controller's Application receives requests coming from the Workers , constructs an assignments-list with data received from a database and returns the list to the workers.
Then, it receives the "WorkerReports", it requests the full-texts from the workers, in batches, and uploads them on the S3-Object-Store. Finally, it writes the related reports, along with the updated file-locations into the database.
The database used is the Impala .

To install and run the application:

  • Run git clone and then cd UrlsController.
  • Provide the S3 Object Store related configurations, inside the src/main/resources/application.properties file.
  • Execute the installAndRun.sh script which builds and runs the app.
    If you want to just run the app, then run the script with the argument "1": ./installAndRun.sh 1.
    If you want to build and run the app on a docker container, then run the script with the argument "0" followed by the argument "1": ./installAndRun.sh 0 1.