forked from D-Net/openaire-graph-docs
Update 'docs/data-provision/enrichment/classified.md'
This commit is contained in:
parent
8fda5c81cf
commit
8f9184146c
|
@ -7,10 +7,10 @@ sidebar_position: 5
|
||||||
|
|
||||||
| Property | Description |
|
| Property | Description |
|
||||||
| --- | --- |
|
| --- | --- |
|
||||||
| Short description | Classifiers |
|
| Short description | A document classification algorithm that employs analysis of free text stemming from the abstracts of the publications. The purpose of applying a document classification module is to assign a scientific text to one or more predefined content classes. |
|
||||||
| Authority | ATHENA Research Center, Greece |
|
| Authority | ATHENA Research Center, Greece |
|
||||||
| Licence | CC-BY/CC-0 |
|
| Licence | CC-BY/CC-0 |
|
||||||
| Algorithmic details | The algorithm classifies publication's fulltexts using a Bayesian classifier and weighted terms according to an offline training phase. The training has been done using the following taxonomies: arxiv, dcc, acm |
|
| Algorithmic details | The algorithm classifies publication's fulltexts using a Bayesian classifier and weighted terms according to an offline training phase. The training has been done using the following taxonomies: arXiv, MeSH (Medical Subject Headings), ACM, and DDC (Dewey Decimal Classification, or Dewey Decimal System). |
|
||||||
| Parameters | Publication's identifier and fulltext |
|
| Parameters | Publication's identifier and fulltext |
|
||||||
| Limitations | N/A |
|
| Limitations | N/A |
|
||||||
| Code repository | https://github.com/openaire/iis/tree/master/iis-wf/iis-wf-referenceextraction/src/main/resources/eu/dnetlib/iis/wf/referenceextraction |
|
| Code repository | https://github.com/openaire/iis/tree/master/iis-wf/iis-wf-referenceextraction/src/main/resources/eu/dnetlib/iis/wf/referenceextraction |
|
||||||
|
|
Loading…
Reference in New Issue