amercader
0d79252a09
Add command for reindex all harvest sources
2013-01-22 16:43:25 +00:00
amercader
6c861afe39
Update template with new harvest source status
2013-01-22 16:37:31 +00:00
amercader
9ba6e8f3b3
[ #5 ] Add error summary to harvest_job_dictize
...
It will return the counts for the 20 most common errors for that
particular job. These will available when calling harvest_job_show.
Also refactor the harvest source status object to just call
harvest_job_dictize on the 'last_job' key, as it has all the
interesting fields anyway.
2013-01-22 13:13:24 +00:00
amercader
30c9eedf5f
Improve harvest source status creation
...
Use report_status field to improve speed, remove unnecessary fields.
2013-01-17 15:43:45 +00:00
amercader
bfce5185f0
[ #4 ] Add db_to_form_schema_options to harvest plugin to avoid validation on show
2013-01-16 17:45:33 +00:00
amercader
2ab10afcf9
[ #4 ] Fix typo in auth functions
2013-01-16 12:56:58 +00:00
amercader
2f4cd3a4b0
[ #4 ] Fix logic functions importer
2013-01-15 19:29:17 +00:00
amercader
2bb669af21
[ #4 ] Add owner_org field to schema and form
...
This should store the owner organization id.
Also added the errors box on the form.
2013-01-10 12:23:01 +00:00
kindly
acb17ff3b0
capture errors more cleanly
2013-01-10 10:48:48 +00:00
amercader
e49dd94b34
[ #4 ] Remove authorization functions for the publisher profile
...
The different profiles will be now configured via the harvest source
datasets on CKAN core, so they are no longer needed.
2013-01-09 17:35:47 +00:00
amercader
288e1429a6
[ #4 ] Remove the loading of different authorization profiles
...
The different profiles will be now configured via the harvest source
datasets on CKAN core, so it is no longer needed.
Also simplify IActions and IAuthFunction hook calls.
2013-01-09 17:32:05 +00:00
amercader
058dcad435
[ #4 ] Minor change on the state field to fix a bug on harvest_source_show
2013-01-09 17:31:30 +00:00
amercader
a866445023
[ #4 ] Refactor authorization functions
...
The authorization functions have been refactored to take into account
both the new organizaton based authorization on CKAN core and the
harvest source datasets.
Basically at the source level, authorization checks are forwarded to the
relevant package auth function (package_create, package_update, etc.)
wich will check for organizations membership, sysadmin, etc.
Also we only use functions available on the plugins toolkit whenever
possible.
2013-01-09 17:26:48 +00:00
amercader
1342463f8a
Merge branch '2.0-dataset-sources' into 4-new-auth-for-2.0
...
Conflicts:
ckanext/harvest/logic/action/get.py
2013-01-09 11:09:34 +00:00
amercader
6b23082010
Move logic from setup_template_variables to helper functions
2013-01-09 11:07:44 +00:00
kindly
7b6beb1470
fix wrong authorization logic
2012-12-24 22:34:37 +00:00
kindly
01dfda59b6
Merge branch 'release-v2.0' into 4-new-auth-for-2.0
2012-12-24 12:46:56 +00:00
kindly
36389e7ce0
make sure gather phase finishes job if there is a severe error
2012-12-24 12:21:21 +00:00
amercader
43950aa4ff
Merge branch 'release-v2.0' into 4-new-auth-for-2.0
...
Conflicts:
ckanext/harvest/logic/action/get.py
ckanext/harvest/tests/test_queue.py
2012-12-20 16:38:57 +00:00
amercader
fdac761fba
Merge branch 'release-v2.0' into 2.0-dataset-sources
...
Conflicts:
ckanext/harvest/logic/action/get.py
ckanext/harvest/tests/test_queue.py
2012-12-20 16:16:30 +00:00
amercader
19cd80b264
[ #4 ] Fixes on the auth layer against the new core auth
...
Thanks @locusf for the original patch
2012-12-20 16:09:26 +00:00
amercader
510e2d3725
Fix pager links in harvest source page
2012-12-19 17:27:05 +00:00
kindly
b940baacc0
make statistics use new report_field
2012-12-18 02:39:14 +00:00
kindly
6b42d96fe0
add report_status field
2012-12-17 23:50:26 +00:00
kindly
596b9bb475
fix auth to use new sysadmin flag
2012-12-17 23:46:43 +00:00
amercader
478326922b
Fix tests
...
* Adapt test_queue to harvest source datasets
* Don't use the same mock harvester on different datasets as it messes
the tests up
* Skip auth tests for the time being
2012-12-14 14:52:19 +00:00
amercader
6df525a377
Reindex the harvest source dataset after finishing jobs
...
This ensures that the status details shown on the harvest sources search
page is up to date (as it is loaded from the indexed data_dict)
2012-12-14 14:27:55 +00:00
amercader
c1b0415cb6
Merge branch 'release-v2.0' into 2.0-dataset-sources
...
Conflicts:
ckanext/harvest/model/__init__.py
2012-12-13 18:33:59 +00:00
amercader
d57e73458a
Make harvest object - package FK deferrable
...
Allows eg to add the harvest object id to the package dict before
indexing.
2012-12-13 18:21:40 +00:00
amercader
b424ba1cea
Add flag to avoid returning all objects when getting a job
2012-12-13 18:20:49 +00:00
amercader
0dde483992
Set job status to Finished when actually finishing it
...
Until now, harvest jobs were set to Finished just after sending all
objects to the fetch stage. Now every time the run command is run, jobs
are set to Running, and all previous Running jobs are checked to see if
all harvest objects have a state of Complete or Error. Only then the job
is flagged as Finished.
2012-12-13 18:19:22 +00:00
amercader
81c3881a1a
Add active field to source dict
2012-12-13 18:00:07 +00:00
amercader
37efb3b978
Set harvest object state depending on the output of import_stage
...
Either to COMPLETE or ERROR, depending on whether it returns True or
False.
2012-12-13 14:30:13 +00:00
amercader
4da64a84ae
Add more elements to the harvest sources page (still provisional)
2012-12-12 18:49:38 +00:00
amercader
e0f3d47cb9
Add extra information to the harvest source page
...
The status object gives extra information about the source and there is
a helper function to build the dataset list for this particular source.
TODO: Pager still needs fixing.
2012-12-12 11:54:50 +00:00
amercader
b567e562f4
Add after_show extension point
...
We hook into the package_show extension point in order to:
1. For harvest_source type datasets, add extra information about the
source, jobs, etc (calling harvest_source_show_status)
2. For normal datasets, check if they were harvested, and if so, add a
reference to the harvest object and harvest source.
2012-12-12 11:49:55 +00:00
amercader
2557636994
Update endpoints to receive the context object
2012-12-12 11:47:57 +00:00
amercader
8e1621731b
Move harvest source status function as a logic function
...
The status dict is added automatically to harvest source packages.
Note that the actual queries still need to be updated as they proabably
won't scale.
2012-12-12 11:45:13 +00:00
amercader
b0407bb2ac
Update harvest_source_show logic function
2012-12-11 12:49:05 +00:00
amercader
fcbe6aa6de
Script for creating harvest source datasets on old versions
...
The way we check whether datasets need to be created might need to be
improved.
2012-12-05 18:54:28 +00:00
amercader
22ec9cb5af
Fix old controller import
2012-12-05 18:53:35 +00:00
amercader
697933f8d0
Add custom harvest source read page (provisional)
2012-12-05 15:47:02 +00:00
amercader
2dba7fbf78
Add custom harvest sources search page
2012-12-05 14:51:20 +00:00
amercader
a605564a41
Fix links to harvest sources page
2012-12-05 13:01:56 +00:00
amercader
d77bf255b4
Finish up create and edit forms, including breadcrumbs, links, etc
2012-11-30 18:53:13 +00:00
amercader
9d83322591
Fix config validator and add tests
2012-11-30 17:02:06 +00:00
amercader
803b228d1c
Update harvest source create and update logic functions
...
`harvest_source_create` and `harvest_source_update` now call
`package_create` and `package_update` respectively, making sure to
define a 'harvest_source' type. The returned dict uses the db_to_form
schema.
2012-11-30 14:11:24 +00:00
amercader
0e0aed0503
Clean up schemas
...
Better naming, remove old ones, ignore __extras field
2012-11-30 13:20:37 +00:00
amercader
875a773f1c
Check if type property is actually there
2012-11-30 11:10:21 +00:00
amercader
7db09fceb0
Various fixes for the harvest source dataset type forms
...
Add a db to form schema to show the fields stored in extras. Validate
the source url on the Package object.
2012-11-29 16:57:20 +00:00
amercader
ab7a379058
Behind the scenes creation and updating of HarvestSource objects
...
Taking advantage of the new after_create/after_update extensions points,
the extension checks if the dataset type is harvest source and creates
or updates the corresponding HarvestSource object. When creating a new
one, it will use the same id as the dataset.
2012-11-29 16:48:44 +00:00
amercader
9d36fd6841
First stub of the new dataset type forms
...
Adds a 'harvest_source' dataset type that mimics the original harvest
source form.
It works against the 3022 branch on CKAN core.
2012-11-29 12:31:48 +00:00
amercader
866fd69730
Do not remove XML declaration and add utf-8 charset to headers
2012-11-20 15:43:39 +00:00
amercader
c52ed3b163
Add line field to object error table
2012-11-20 11:29:58 +00:00
amercader
03fd1884f4
Implement retry times for harvest objects
2012-11-15 18:11:35 +00:00
kindly
202c9d9fcc
use correct queue for gather stage
2012-11-15 14:21:09 +00:00
kindly
c9c1eb4848
use generator to consume
2012-11-15 14:14:55 +00:00
amercader
33d5e09722
Change fetch_callback to proper acknowledge objects
2012-11-15 11:36:06 +00:00
amercader
13357893ad
Fix typo
2012-11-13 14:41:38 +00:00
amercader
54ff0526bb
Return original document if present when requesting an object
2012-11-13 12:06:36 +00:00
amercader
820443d58f
Add cascade option to harvest object extras and errors
2012-11-09 14:52:34 +00:00
kindly
5063626554
make sure state is changed to error on fetch error
2012-11-07 09:53:16 +00:00
kindly
28e5e9137a
add perge queues command
2012-11-07 09:51:25 +00:00
kindly
6db65b5826
made manual default not null
2012-11-05 13:17:32 +00:00
amercader
fdf01c09f2
Fix wrong check for harvest sources
2012-11-01 14:12:45 +00:00
amercader
d598c0707b
Ignore frequency field on the frontend for the time being
2012-11-01 14:12:01 +00:00
amercader
d7f8c9165c
Merge branch 'model_upgrade' into release-v2.0
2012-10-30 18:07:24 +00:00
amercader
d502b925a6
Remove old deprecated tests and some whitespace
2012-10-30 18:07:05 +00:00
amercader
a136cbf202
Fix typos in migration script
2012-10-30 17:52:10 +00:00
amercader
61b99e8eff
Merge branch 'pika' into release-v2.0
2012-10-30 17:31:30 +00:00
amercader
82a498d9fc
Rename function to be implementation independent
2012-10-30 17:13:39 +00:00
kindly
2529a17304
add jobs at certain frequencies
2012-10-29 17:15:02 +00:00
kindly
9fc0ae9937
add next run field
2012-10-26 10:50:35 +01:00
kindly
bc079c6644
model upgrade with tests and migration
2012-10-25 19:01:54 +01:00
kindly
1153c1c5c9
add full queue test and new test harvester
2012-10-24 11:58:00 +01:00
kindly
da125cdcc2
pika now used as queue library
2012-10-24 00:34:32 +01:00
amercader
8233b2ec23
Strip spaces from url when creating or updating a source
2012-08-17 12:25:06 +01:00
amercader
c1f83e0d3e
Strip spaces from url when creating or updating a source
2012-08-17 12:24:41 +01:00
tobes
a17e8208de
Very small text fix
2012-08-16 09:30:04 +01:00
tobes
7e940b497d
Text message minor fix
2012-08-16 09:27:36 +01:00
tobes
b6a32fd23b
Add descriptions for sources
2012-08-16 09:16:34 +01:00
tobes
c984727de5
Minor template tidy
2012-08-16 08:56:25 +01:00
tobes
5b7a9c0855
Flash messages to notices plus translatable
2012-08-16 08:49:35 +01:00
amercader
19ea538097
[#2852,#2853] Reword errors
2012-08-15 18:28:08 +01:00
amercader
7609a93422
Minor css tweaks on the forms
2012-08-15 18:26:36 +01:00
tobes
c6c4f6d098
Remove about text placeholder
2012-08-15 10:40:52 +01:00
tobes
3b8075b670
Only specify autoform items once
2012-08-14 18:01:29 +01:00
tobes
e1c74bdbe6
Fixes for autoform extra_text
2012-08-14 17:56:49 +01:00
tobes
8f6bab104e
Dirty form changes pending cleanup
2012-08-14 17:33:32 +01:00
amercader
1979517706
Widen url field
2012-08-14 12:02:02 +01:00
amercader
a76140650d
Merge branch 'release-v2.0' of github.com:okfn/ckanext-harvest into release-v2.0
2012-08-14 11:40:35 +01:00
amercader
4b68e4c31b
Fix details page template and style
2012-08-14 11:23:56 +01:00
amercader
eb12152089
Fix index page template and style
2012-08-14 11:04:17 +01:00
David Raznick
4b4e5dba62
fix broken show form
2012-08-14 00:44:00 +02:00
tobes
7efca28c22
Template updates
2012-08-10 13:05:54 +01:00
tobes
5557da653f
First draft of new source page
2012-08-10 10:06:37 +01:00
tobes
3feca92d55
First draft of index page
2012-08-10 10:00:02 +01:00
tobes
d8a98fd64a
Move to new plugins model
2012-08-10 09:59:18 +01:00
amercader
a8aebac965
Fix the harvest object show call
2012-08-09 13:38:17 +01:00
amercader
bb5ba43ebb
Allow showing harvest objects by default (on the default auth profile)
2012-08-09 13:37:28 +01:00
amercader
4c562e5f5f
Do not store the object when importing
2012-08-09 11:17:41 +01:00
amercader
e4b3cb440c
Do not use repo.are_tables_created
...
When checking whether the core tables have been alredy created it is
best to use package_table.exists(), as are_tables_created reflects the
tables, causing conflicts with other extensions.
This allows ckanext-harvest and ckanext-spatial to be used together on
ckan 1.8 onwards.
2012-08-09 11:06:05 +01:00
amercader
4d2fdeac57
Allow defining segments of harvest objects to import
...
Useful when importing large number of objects, as it allows
parallelization
2012-08-02 18:41:59 +01:00
amercader
7011efe5dc
Allow not linking to datasets when importing records
...
With the -j flag, harvest objects are not linked to datasets when
importing. This is useful sometimes when importing records for the first
time.
2012-07-30 12:11:55 +01:00
David Read
203bcb053b
Status can have links in it now.
2012-07-23 16:15:11 +01:00
David Read
a61ea06faf
Merge branch 'master' of github.com:okfn/ckanext-harvest
2012-07-19 15:27:04 +01:00
David Read
1a4e43a2a9
Status message added - change config to set the text.
2012-07-19 15:17:50 +01:00
amercader
4d00e665f1
[cli] Speed up run command
2012-06-29 11:32:18 +01:00
David Read
5df2b64dda
Merge branch 'master' of github.com:okfn/ckanext-harvest
2012-06-15 18:38:33 +01:00
David Read
c0a9965b52
Reword warning.
2012-06-15 18:38:22 +01:00
Sean Hammond
528e98120c
[ #2533 ] Fix some imports broken by ckan cleanups
2012-06-15 12:08:35 +02:00
David Read
ccf0cd3da2
Add copious logging to record what happens in harvesting.
2012-06-08 17:09:22 +01:00
David Read
a5fac2ac86
Added logic for getting a harvest source when viewing a dataset.
2012-06-01 17:03:40 +01:00
David Read
5151f4ee23
In publisher auth mode, any member of the group can make the changes. This brings things in line with the general idea that Admins have the power to do this plus authorize other editors/admins.
2012-05-29 15:21:34 +01:00
David Read
017222afd2
Withdrawn language introduced for uklp.
2012-05-28 18:41:03 +01:00
David Read
103eea9d50
Improve layout of sources to span page.
2012-05-28 18:13:34 +01:00
amercader
b1a4cc7721
[ui][xs] Nicer button
2012-05-10 11:53:19 +01:00
amercader
3941b691f4
[ui] Fix style for CKAN 1.7
2012-05-09 16:03:03 +01:00
amercader
f0a09e8299
[logic] Fix small bug in dictization
2012-05-09 15:58:23 +01:00
David Read
718202d886
Logging is now assured.
2012-04-10 20:53:29 +01:00
David Read
00e911a70c
Fix name of the queue logger. Moved imports of ckanext.harvest until after _load_config so that the loggers do not start disabled.
2012-04-10 20:10:17 +01:00
amercader
e797f50a05
[cli] Fix create job command
2012-03-19 17:28:53 +00:00
amercader
38a9a03355
[logic] Fix variables naming
2012-03-19 17:01:20 +00:00
Ian Murray
1145e6ea72
[master][auth/publisher] Check for 'ignore_auth' in harvest_object_show
...
Use case: In ckanext-dgu we want to index the harvest_object.content field. As indexing is done synchronously we need
to provide a way for that harvest_object to be accessed when the current http request is made by a non-sysadmin user.
2012-03-15 18:14:57 +00:00
Ian Murray
7f10418f44
[master][auth] get_obj_object() function was missing
2012-03-15 18:09:44 +00:00
amercader
871eae94b6
[ckan harvester] Fix bug on force all check
2012-03-15 11:31:12 +00:00
amercader
f210455aef
[ckan harvester] Replace title on default extras
2012-03-13 12:38:14 +00:00
amercader
e0bef2ef9c
[base] Minor fix for harvesters without config
2012-03-12 14:46:28 +00:00
amercader
4fe38ec49d
[tests,auth] Add tests for the auth profiles
...
Note that only the tests related to the currently loaded auth
profile will be run.
2012-03-08 16:14:44 +00:00
amercader
4a7007460b
[logic] Fix broken imports
2012-03-07 17:08:17 +00:00
amercader
763f07fcad
[logic,cli] Add session to the context in cli commands
2012-03-07 15:20:49 +00:00
amercader
bf6df2dcd6
Fix merge
2012-03-07 15:04:50 +00:00
amercader
9fcaefe8ff
[ui] Fix source datasets paging
2012-03-07 15:03:33 +00:00
amercader
6cccbb61c9
Bug fix, new job count property had not been updated
2012-03-07 12:10:32 +00:00
amercader
124f3191c8
[ui] Add class to config fields so they can be hidden via CSS
2012-03-07 11:56:18 +00:00
amercader
d9cfc52643
[ui,auth] Aggregate sources by publisher on the sources list
2012-03-07 11:49:12 +00:00
amercader
97b390f3c1
[auth,logic,ui] Handle publishers on the UI
...
Add fields for publishers in the form when using the publihser auth
profile. Some changes related to the source schema.
2012-03-06 16:01:43 +00:00
amercader
aea785701f
[logic,auth] Check that users actually exist
2012-03-06 10:37:31 +00:00
amercader
f0e2521d9b
[logic,auth] Modify checks to ensure users are admins of their publishers
2012-03-06 10:16:27 +00:00
amercader
d98206858d
[plugin,auth] Check on startup if ckan is also using the publisher profile
2012-03-05 17:10:02 +00:00
amercader
2a2397c0ed
[logic,auth] Implement publisher auth profile
...
The publisher profile allows general users to handle harvest sources
based on membership to a certain group (publisher), as opposed to the
default auth profile where only sysadmins can perform any harvesting
task.
To enable it, put this directive in your ini file:
ckan.harvest.auth.profile = publisher
TODO:
* Save publisher id / user id when creating sources
* Show publisher in form and index page
2012-03-02 16:49:39 +00:00
amercader
3b68298bba
[logic,auth] Use the site user for CLI commands auth checks
2012-03-01 12:46:42 +00:00
amercader
a35eb75440
[logic,auth] Add auth logic layer
...
The first version of the auth layer is based on the current policy, i.e.
you need to be sysadmin to perform any action.
TODO: the CLI is still not working.
2012-03-01 12:02:16 +00:00
amercader
c798013752
[logic] Refactor the rest of the logic functions (create,update,delete)
2012-02-29 15:20:35 +00:00
amercader
651474e9f1
[logic] Refactor logic layer to follow CKAN core conventions
...
To make maintenance easier and better support the upcoming auth checks,
the logic layer has been refactored to mimic the structure of the one on
CKAN core: separate actions and dictize functions and logic functions
receive a context.
Only get functions are included in this commit.
2012-02-29 10:59:02 +00:00
amercader
50537a6738
Merge branch 'master' into enh-1726-harvesting-model-update
2012-02-15 12:01:15 +00:00
amercader
e03c2545ca
[ui,logic] Expose source title in the source form
2012-02-15 11:49:59 +00:00
amercader
3489a004ad
[ui] Minor tweak to support older themes
2012-02-14 17:23:17 +00:00
amercader
2990353533
[ui,logic] Expose source state (active/inactive) in the source form
2012-02-14 14:24:32 +00:00
amercader
4d7b8143b9
[lib] Renable unique constraint in url for inactive sources
2012-02-14 11:28:11 +00:00
amercader
9ed152cbea
[ckan harvester] Add support for forcing gathering of all remote packages
2012-02-03 17:54:34 +00:00
amercader
a5cf445fa6
[ #1727 ][lib] Use 'current' field in queries returning harvest objects
2012-02-02 13:20:03 +00:00
amercader
479750da09
[ #1726 ][base harvester] Set current field when importing
2012-02-02 13:18:43 +00:00
amercader
4c81c7c3a7
[ #1726 ][model] Harvest source reference compatibility
...
The 'source' property of harvest objects now comes from the actual
foreign key. For compatibility with old harvesters, an before insert
event listener has beeen added to check if the source id has been set,
and set it automatically from the job if not.
Note that this requires SQLAlchemy 0.7 (ie CKAN 1.5.1)
2012-02-01 12:52:52 +00:00
amercader
004210935a
[model] Avoid unicode warning
2012-02-01 11:10:44 +00:00
amercader
b64d97118c
[ #1726 ][model] Add scripts for populating source_id and current fields
2012-02-01 11:08:41 +00:00
amercader
d1783f5415
[model] Changes in harvest model
...
Added three changes to the harvest model:
* 'title' column in harvest_source table
* 'current' column in harvest_job table
* foreign key from harvest_object to harvest_source
Tables are checked on startup to see if they need to be updated.
TODO: populate current and harvest_source_id fields
2012-01-30 18:38:35 +00:00
amercader
f086e908bc
[model] Clearer table initialization
2012-01-30 17:09:28 +00:00
amercader
a997e45470
[lib] Ignore deleted packages in source stats
2012-01-25 17:47:35 +00:00
amercader
3a489bbb82
[ui] Cleanup sources list and details page
2012-01-24 16:55:47 +00:00
David Read
0f8c607187
[tests]: Another test moved in wholesale from dgu repo.
2012-01-11 10:35:37 +00:00
David Read
81ed69c4da
[tests]: Moved to this repo test code from dgu repo that might be useful or might not. Completely broken, but maybe be worth something.
2012-01-11 10:29:05 +00:00
amercader
a53b79c181
[ui] Show edit and refresh links in source page
2012-01-10 17:24:05 +00:00
amercader
2ad29df5c5
[lib] Fix bug: couldn't delete source conf
2012-01-10 17:15:56 +00:00
amercader
eb646b3385
[ckan harvester] Add support for defining default extras
2012-01-10 17:07:19 +00:00
amercader
ae51093213
[ckan harvester] Ignore __junk field, was causing imports to fail
2012-01-10 14:46:12 +00:00
Adrià Mercader
da469ab08e
[base harvester] Custom tag munge function. TODO: check with flexible tags
2011-11-23 11:05:52 +00:00
Adrià Mercader
cfaba6e1e8
[ckan harvester] Add support for sending an API key
2011-11-21 17:29:10 +00:00
Adrià Mercader
0ab5c53b47
[ckan harvester] Fix typo
2011-11-18 17:53:01 +00:00
Adrià Mercader
f02ee45aae
[ui] Show config options in harvest source details page
2011-11-18 14:35:46 +00:00
Adrià Mercader
994590531e
[ckan harvester] Support for creating read-only packages
2011-11-18 14:30:10 +00:00
Adrià Mercader
c939d90dbb
[ckan harvester] Support for defining a custom user to do the harvesting
2011-11-18 14:12:30 +00:00
Adrià Mercader
2018d9e513
[ckan harvester] Support for default tags and groups
2011-11-18 13:20:41 +00:00
Adrià Mercader
8ec05bc3e3
Modify import command to avoid problems with the Session
2011-11-15 11:26:24 +00:00
Adrià Mercader
c04d80e27e
Use get_action function instead of directly calling the action functions
2011-10-26 17:26:18 +01:00
David Raznick
31dac7029a
fix to make import stage work on its own
2011-09-28 14:27:28 +01:00
Adrià Mercader
63afd199a9
Add link to source documents from object errors
2011-09-08 10:27:36 +01:00
Adrià Mercader
31c1ea1c21
Separate gather and object errors in reports. Add info about guid and object id in the object ones
2011-09-08 09:58:21 +01:00
Adrià Mercader
c36d9bdd8e
Add new command to create new jobs for all active sources
2011-09-06 18:25:17 +01:00
David Read
dd00e98d9d
[model]: More careful about creating tables, since paster db upgrade loads the envrionment and therefore runs setup() before it does the migrations, and therefore in this instance we do not want to create the db tables.
2011-08-10 16:25:57 +01:00
Adrià Mercader
7927329536
Make harvesters work with latest ckan release
2011-07-29 11:31:03 +01:00
Adrià Mercader
cabbb4922d
Use API version defined in config if present
2011-07-18 17:35:32 +01:00
Adrià Mercader
c867660e7d
Add docs to base harvester functions
2011-07-18 17:35:03 +01:00
Adrià Mercader
54de6759fe
Fix bug with empty config
2011-06-28 15:04:40 +01:00
Adrià Mercader
c80e68a12f
Ensure the correct configuration is used on each stage
2011-06-14 15:59:13 +01:00
Adrià Mercader
3125bb1514
Add a check to ensure sources with no packages are reharvested
2011-06-14 12:59:48 +01:00
Adrià Mercader
2ac9885150
Page packages in the harvest source details page
2011-06-14 10:27:48 +01:00
Friedrich Lindenberg
0d9d1f8096
reduce number of queries for harvest index to a less insane number. still heavy.
2011-06-13 17:36:35 +02:00
Adrià Mercader
ef04ce1774
Add support for config options in CLI
2011-06-13 15:56:19 +01:00
Friedrich Lindenberg
13f2fb3b96
use hasattr for config validation
2011-06-09 11:35:58 +02:00
Adrià Mercader
7b61fb62bf
Ignore missing config values
2011-06-07 15:32:46 +01:00
Adrià Mercader
98bfd50f47
Load config in the CKAN harvester
2011-06-07 13:35:11 +01:00
Adrià Mercader
6e75d362e3
Add a simple way for harvesters to store configuration options. If form_config_interface is Text on the info dictionary, the configuration field will be enabled in the form. Harvesters can also provide a validate_config method.
2011-06-07 12:07:53 +01:00
Adrià Mercader
ca6af0249a
Reverting previous changeset, as it conflicts with dgu_form_api
2011-06-07 11:58:35 +01:00
Friedrich Lindenberg
f9c0ee37aa
spacing in template paths
2011-06-06 10:16:34 +02:00
Friedrich Lindenberg
89934b8538
[harvesters] factor out a base harvester for use in generic harvesting apps
2011-06-02 12:07:07 +02:00
David Raznick
79fd966573
add database setup at configure time
2011-05-31 18:06:26 +01:00
David Raznick
264b606c48
take tables out of global scope at import time
2011-05-31 18:02:07 +01:00
Adrià Mercader
2b98080266
[merge] from new-forms, as forms refactoring has been merged in core
2011-05-20 13:50:15 +01:00
Adrià Mercader
235d822458
[ckan harvester] Request only packages modified since last harvest job. Also support older versions which do not include 'metadata_modified'
2011-05-17 17:26:42 +01:00
Adrià Mercader
565eaf3d0a
Add a new info method to the harvester interface so implementations can provide details. Use this to build the WUI form
2011-05-13 18:39:36 +01:00
Adrià Mercader
fecee82b1a
Minor enhancements in the WUI
2011-05-13 17:02:18 +01:00
Adrià Mercader
b3a88070e3
[forms] Adapt CLI commands to changes in lib
2011-05-13 16:00:36 +01:00
Adrià Mercader
bbe459527f
[forms] Major refactoring of the harvest forms. Forms no longer use the DGU form
...
API, and are handled similarly to the new ones on CKAN core (logic, schema,
validators...). The UI is also more consistent with the CKAN one.
2011-05-13 14:17:58 +01:00
Adrià Mercader
26cdc1089d
Change date definitions in Harvest Objects. reference_date -> metadata_modified_date, created -> gathered
2011-05-11 17:07:05 +01:00
Adrià Mercader
e1080e349e
Set a flag to force harvesters to import objects
2011-05-10 17:11:12 +01:00
Adrià Mercader
e320d0588f
Add command to reimport existing harvest objects
2011-05-10 16:06:57 +01:00
Adrià Mercader
f7c6854a1d
Save reference date in Harvest Objects when harvesting CKAN instances
2011-05-10 12:57:57 +01:00
Adrià Mercader
329ca2dd29
Add a reference date to the Harvest Objects. This must be set during the harvest
...
process.
2011-05-10 11:05:44 +01:00
Adrià Mercader
c697bc3350
Normalize https ports too ( #736 )
2011-05-09 18:47:30 +01:00
Adrià Mercader
5594f22be7
More robust URL checking ( #736 )
2011-05-09 14:03:46 +01:00
Adrià Mercader
0e56c0ab4f
Abort pending jobs when removing sources
2011-05-05 17:13:07 +01:00
Adrià Mercader
e0a1e5752d
Fix redirects in view controller
2011-05-05 16:47:34 +01:00
Adrià Mercader
43453b6938
Show GUID on object errors
2011-04-19 17:16:25 +01:00
David Raznick
c9d43b2e4d
overide default create schema
2011-04-19 15:34:56 +01:00
Adrià Mercader
e3bca3ceee
Add first version of the CKAN harvester [ #985 ]
2011-04-19 14:54:59 +01:00
Adrià Mercader
b5aea2d863
Allow sources with existing URLs if the existing ones are inactive
2011-04-18 17:19:04 +01:00
james
bc8caebc30
[packaging] Moved templates and publics into source
2011-04-18 08:49:25 +00:00
Adrià Mercader
7e75673cff
Add function to edit harvest sources
2011-04-15 18:28:58 +01:00
Adrià Mercader
e3a83ea14d
Source URL not unique
2011-04-15 18:28:38 +01:00
Adrià Mercader
1edf984f1b
Log an error if no harvester could be found for the source
2011-04-15 16:12:47 +01:00
Adrià Mercader
d92b1df8c6
Only send harvesting jobs from active sources to queue
2011-04-15 15:50:32 +01:00
Adrià Mercader
5fb6512d82
Do not output messages when running the 'run' command
2011-04-15 15:35:19 +01:00
David Raznick
9018beeedb
use create_all instead of create
2011-04-15 12:19:24 +01:00
Adrià Mercader
a11480aa8f
[refactoring] Use ckan.harvest for options in the ini file
2011-04-13 12:46:52 +01:00
Adrià Mercader
040e4d6956
[refactoring] Add a command to create the necessary tables in the database
2011-04-13 12:39:53 +01:00
Adrià Mercader
312e9b8209
[refactoring] Update view controller and templates
2011-04-12 16:15:14 +01:00
Adrià Mercader
aaf7885f43
[refactoring] Return default values if provided in getters. Also return dates as strings
2011-04-12 16:13:38 +01:00
Adrià Mercader
a234d6ff6a
[refactoring] Remove unneeded interface
2011-04-12 10:11:02 +01:00
Adrià Mercader
842bea8433
[refactoring] Remove all geo related functions as they now have it own extension ckanext-spatial
2011-04-11 19:19:09 +01:00
Adrià Mercader
280dce1626
[refactoring] Add status info to harvest sources. Updated dict functions and UI.
2011-04-11 16:30:56 +01:00
Adrià Mercader
e2faa14b8d
[refactoring] Save gathering starting and finishing time
2011-04-11 16:29:47 +01:00
Adrià Mercader
3c505693ff
[refactoring] Do not delete sources, just inactivate them. Also don't delete jobs.
2011-04-08 17:07:19 +01:00
Adrià Mercader
2588352bc5
[refactoring] Simplify model relations
2011-04-08 16:48:29 +01:00
Adrià Mercader
9465d59d3a
[refactoring] Remove old harvesting controller. This functionality now lives on ckanext-inspire
2011-04-08 15:54:33 +01:00
Adrià Mercader
b38fc57ec5
[refactoring] Changes in the queue code and the IHarvester interface"
2011-04-07 16:59:11 +01:00
Adrià Mercader
90ae9d27db
[refactoring] Tweaks in the model and a setup function to create the tables
2011-04-07 16:57:36 +01:00
Adrià Mercader
56d5acc867
[refactoring] Add code to handle queuing and the new IHarvester interface. Add
...
new commands in the CLI to start the queue consumers and fire the harvesting
process.
2011-04-06 12:45:00 +01:00
Adrià Mercader
4023bb7222
[refactoring] Use the common functions in the web interface.
...
Not yet implemented in create and edit harvest source as they use the DGU forms API.
Also TODO, think of what report info is needed in the listing and details page.
2011-04-05 13:39:23 +01:00
Adrià Mercader
3d32a18802
[refactoring] Return dictionaries from the common functions, and use them in the CLI"
2011-04-05 12:55:58 +01:00
Adrià Mercader
dca4ff328f
[refactoring] Add missing column in error table
2011-04-05 12:53:36 +01:00
Adrià Mercader
a4d93e4ad3
[refactoring] Add a reference to objects from jobs
2011-04-05 12:36:27 +01:00
Adrià Mercader
e819a68f21
[refactoring] Move common functions to lib. Adapt the CLI to use these common functions.
2011-04-05 11:53:39 +01:00
Adrià Mercader
ce86cfde1b
[refactoring] Clean up the model definitions
2011-04-05 11:51:59 +01:00
Adrià Mercader
d3f06b7b4f
[refactoring] Changes in model
2011-03-30 16:55:42 +01:00
Adrià Mercader
0396838e36
[refactoring] First version of the new model
2011-03-30 14:42:59 +01:00
Adrià Mercader
d35f86f559
Stop harvesting job if the package extent could not be saved. Still shows an ugly SA exception
2011-03-29 17:23:49 +01:00
Adrià Mercader
95623b0f7a
Set format as WMS if the harvested document refers to a service
2011-03-29 16:24:14 +01:00
james gardner
e855d2ad9a
Moving the extension points code to package as a .deb file
2011-03-28 15:52:43 +01:00
Adrià Mercader
6a877666bd
Use the URL of a WMS type resource instead of the package URL
2011-03-28 09:40:17 +01:00
David Read
d6e35bc5cb
[controllers,tests]: Added requirement to be sysadmin for all controller operations. Provided test for this and some useful test infrastrure.
2011-03-25 17:01:26 +00:00
Adrià Mercader
61e23196bf
Better handling of unauthorized access
2011-03-23 17:02:02 +00:00
Adrià Mercader
0cf2c7e7ac
Nicer design for the WMS preview
2011-03-23 10:00:39 +00:00
Adrià Mercader
32a180f39c
First version of the WMS viewer. The viewer is based on OpenLayers, which is used to parse the GetCapabilities response and build a map with the layers found.
2011-03-22 17:33:58 +00:00
Adrià Mercader
57631a8931
Use the projection from harvested documents. Add a configure option to define the database projection
2011-03-18 17:42:43 +00:00
Adrià Mercader
35537bbbcb
Update help in CLI
2011-03-18 15:48:08 +00:00
Adrià Mercader
4036858ac9
Add a CLI command to create or update the geometries for package extents
2011-03-18 15:44:40 +00:00
Adrià Mercader
b83443af4a
Create geometry when harvesting documents with geographic extent
2011-03-18 13:40:49 +00:00
Adrià Mercader
51102c9330
Add spatial search based on PostGIS for packages with geographic extent
2011-03-18 12:05:16 +00:00
Adrià Mercader
ee186376bd
Display an error if the API call fails
2011-03-15 16:19:13 +00:00
Adrià Mercader
94ec121029
[merge]
2011-03-14 17:17:15 +00:00
Adrià Mercader
2c8eb94344
#1030 Move harvesting controller to ckanext-harvest
2011-03-14 17:04:41 +00:00
james gardner
898149bb8f
[merge]
2011-03-14 16:06:30 +00:00
james gardner
52b17ffceb
[security] change public file path code to avoid a trailing , and thus adding / as a static file app in the Pylons middleware cascade
2011-03-14 16:05:34 +00:00
Adrià Mercader
c2bfdd16fc
#1030 Move harvesting model to ckanext-harvest
2011-03-14 15:19:58 +00:00
Adrià Mercader
544a88d18d
#1030 Move paster harvester CLI command to ckanext-harvest
2011-03-14 13:34:48 +00:00
Adrià Mercader
dfd23d720c
Add option to edit harvesting sources
2011-03-11 13:42:21 +00:00
Adrià Mercader
696a8dc058
Add option to delete harvesting sources
2011-03-11 12:41:13 +00:00
Adrià Mercader
58164e915d
Fix typo
2011-03-11 12:38:09 +00:00
Adrià Mercader
81569c0f2c
Show error notices for duplicate sources and existing jobs.
2011-03-11 12:35:27 +00:00
Adrià Mercader
70bff06b48
Add refresh link to the UI.
2011-03-10 17:24:23 +00:00
Adrià Mercader
4f2a233cf1
Harvesting jobs creation controller
2011-03-10 16:48:50 +00:00
Adrià Mercader
6cc73aa1f7
Send the authz headers on all requests
2011-03-10 15:32:51 +00:00
Adrià Mercader
ad239b93e8
Add controller to create a harvesting source
...
Before sending a create request to the forms API, an authz header is
added, with the API key defined in the config file
(ckan.harvesting.api_key). Also the data is formatted as the JSON object
expected by the forms API.
2011-03-10 14:02:21 +00:00
Adrià Mercader
572650d26a
Use urllib2 and better error handling
2011-03-10 09:45:16 +00:00
Adrià Mercader
66c65f53f4
First draft of the Harvesting extension
...
This extension will hold all the harvesting code that is now located in
different parts of the ckan core and ckanext.dgu. This revision still
uses the ckanext-dgu forms API.
Controllers and templates have been defined for listing, creating and
showing the details of harvesting sources.
2011-03-09 18:56:55 +00:00