[UsageCount] Usage count per result split by datasource #318
3 Participants
Reference: D-Net/dnet-hadoop#318
This PR extends the code that ingests the Usage Count at the result level. It splits each result's count by the datasource contributing to it. For each indicator, one unit is specified per contributing datasource: the key element of the unit holds the datasource identifier, and the value element holds the count for that datasource.
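The grouping described above can be illustrated with a small, self-contained sketch. It is not the PR's actual ingestion code: the function name `build_measures`, the `id` field name, the string-typed `value`, and all the counts are assumptions made for illustration only. Only the `unit`, `key`, and `value` names come from the description.

```python
import json
from collections import defaultdict


def build_measures(rows):
    """Group (indicator, datasource_id, count) rows of ONE result into measures.

    Hypothetical helper: each indicator becomes one measure, and each
    contributing datasource becomes one unit with its id in `key` and
    its count in `value`.
    """
    units_by_indicator = defaultdict(list)
    for indicator, datasource_id, count in rows:
        # Assumption: the value is stored as a string in the unit.
        units_by_indicator[indicator].append(
            {"key": datasource_id, "value": str(count)}
        )
    # `id` as the indicator field name is an assumption.
    return [
        {"id": indicator, "unit": units}
        for indicator, units in units_by_indicator.items()
    ]


# Invented counts for a result whose usage comes from fake1, fake2 and fake3.
rows = [
    ("downloads", "fake1", 10),
    ("downloads", "fake2", 5),
    ("downloads", "fake3", 2),
    ("views", "fake1", 100),
    ("views", "fake2", 50),
    ("views", "fake3", 20),
]

measures = build_measures(rows)
print(json.dumps(measures, indent=2))
```

Each indicator ends up with exactly one unit per datasource, so a consumer can either show the per-datasource breakdown or sum the values to recover the total.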
So given that the downloads and views for R1 come from three different data sources fake1, fake2, and fake3 as
we will get for R1 an element measures as
The JSON above is the serialization of the internal model. Our idea is to produce the following XML snippet out of it, to be included in the result-level information on the Solr records.
Would this serialisation be OK for the portal presentation requirements? The information we are missing here is the datasource name, but the same information is available in both the collectedfrom and hostedby elements.
Hello! The pull request seems very nice! I will plan to adjust the parsing code on the portal side. Just please keep me posted when the update is available.
Hi again @miriam.baglioni and @claudio.atzori! I was just checking this again and I am wondering if it is possible to also include the data source name together with the data source identifier for display purposes.
Hi Konstantina! The solution that Miriam and I proposed was based on the assumption that we would not alter the model from how it is currently defined. So we cannot include another field at the same level as the key element to store the datasource name, but we can agree on something dirtier, e.g. combining the datasource id and its name in the key field, concatenating the two strings with a separator character/sequence in between. Would this be acceptable for you?
Hi Claudio! Apologies for the late response. It completely slipped my attention.
Yes, combining the datasource id and its name in the key field sounds fine. Maybe we could use the same format/hack we are applying for some of the refine filters: id||name.
What do you think?
Hi @konstantina.galouni, OK, I will use || as the separator.
Great! Thank you very much, Miriam!
Title changed from "WIP: [UsageCount] Usage count per result split by datasource" to "[UsageCount] Usage count per result split by datasource"
The code was extended to also include the name of the datasource, not only its identifier, as requested.
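For the portal side, a minimal sketch of composing and splitting such a key may help, assuming the id||name format agreed in the thread. The helper names `make_key` and `split_key` and the sample datasource name are hypothetical, and the fallback for keys without a separator is an assumption rather than confirmed behaviour.

```python
SEPARATOR = "||"  # separator agreed in the discussion


def make_key(datasource_id, datasource_name):
    """Combine a datasource id and its name into a single key string."""
    return f"{datasource_id}{SEPARATOR}{datasource_name}"


def split_key(key):
    """Split a key back into (datasource_id, datasource_name).

    Splits on the FIRST separator only, so a name that itself contains
    "||" is preserved. Assumption: a key without the separator carries
    only the id, in which case the name is returned as None.
    """
    datasource_id, sep, datasource_name = key.partition(SEPARATOR)
    return datasource_id, (datasource_name if sep else None)


key = make_key("fake1", "Fake Repository One")  # invented name
print(key)
print(split_key(key))
print(split_key("fake1"))
```

Splitting on the first occurrence only works as long as datasource identifiers never contain the separator themselves.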