Oozie workflow for cleancontext #216

Merged
claudio.atzori merged 7 commits from cleancontext into beta 2022-04-22 15:46:40 +02:00

This PR extends the Oozie workflow used for cleaning with a final step that removes unwanted contexts from the results. It also adds a parameter to the cleaning workflow, **shouldCleanContext**, which enables the context cleaning when set to true.
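
As a rough illustration only: one common way to gate an Oozie step on a boolean parameter is a decision node. The fragment below is a sketch under that assumption; the node names `should_clean_context`, `clean_context` and `End` are hypothetical, and the workflow in this PR may be wired differently.

```xml
<!-- Sketch: run the context cleaning only when shouldCleanContext is true. -->
<!-- Node names are hypothetical and not taken from the PR. -->
<decision name="should_clean_context">
    <switch>
        <!-- wf:conf returns the parameter as a string -->
        <case to="clean_context">${wf:conf('shouldCleanContext') eq 'true'}</case>
        <!-- otherwise skip the cleaning step and finish -->
        <default to="End"/>
    </switch>
</decision>
```
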
claudio.atzori was assigned by miriam.baglioni 2022-04-22 11:42:56 +02:00
miriam.baglioni added 4 commits 2022-04-22 11:42:58 +02:00
claudio.atzori reviewed 2022-04-22 11:46:50 +02:00
@@ -16,0 +18,4 @@
<description>true if the context have to be cleaned</description>
</property>
<property>
<name>contextId</name>

It would be better to include a description for this parameter, explaining its purpose and, if it is possible to include multiple contextIds, how they should be formatted.
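
For illustration, a property carrying the kind of description requested here could look like the fragment below. The description wording is a guess based on the PR summary, not the text that was eventually committed; the `sobigdata` value is the one shown in the diff.

```xml
<property>
    <name>contextId</name>
    <value>sobigdata</value>
    <!-- hypothetical wording: the actual semantics are defined by the job implementation -->
    <description>identifier of the context to remove from the results</description>
</property>
```
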
Author
Member

This is just the first naive implementation of the context cleaning. I have no idea what it will look like once it is done properly.

It might be the first naive implementation, but looking at the Oozie workflow, it is not obvious what role a parameter plays when it is not accompanied by any description.
Author
Member

extended
claudio.atzori reviewed 2022-04-22 11:47:22 +02:00
@@ -16,0 +22,4 @@
<value>sobigdata</value>
</property>
<property>
<name>verifyParam</name>

Missing description.
Author
Member

The same holds as for the previous comment. Anyway, if you think it is important to have the descriptions, I will add them.

I think it is. Again: this PR might just provide a first implementation, but the business logic around these two parameters exists only in your head. To understand their role I'd need to open the actual job implementation and reverse engineer it. I would appreciate it if you could add a description.
claudio.atzori requested changes 2022-04-22 11:49:28 +02:00
claudio.atzori left a comment
Owner

Minor changes, please check the comments inline.
@@ -1,16 +1,13 @@

At first glance, this class doesn't seem to include any significant change. If it was not changed, please revert to its original formatting. Otherwise the diff just creates noise.

This comment is outdated. It seems you did not issue `git pull` before introducing these further changes, so you did not get the reformatted file CleanContextSparkJob.java.
claudio.atzori marked this conversation as resolved
claudio.atzori added 1 commit 2022-04-22 11:49:38 +02:00
miriam.baglioni added 2 commits 2022-04-22 15:41:50 +02:00
claudio.atzori merged commit 81242538e6 into beta 2022-04-22 15:46:40 +02:00
claudio.atzori deleted branch cleancontext 2022-04-22 15:46:45 +02:00