Comments (7)
@montxo5 That was caused by the harvesters not being careful when checking if two requests had the same contents (to check if the remote server supported pagination).
In Madrid's case, there are some real time datasets that got the timestamp updated on each request:
<dct:modified rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-06-11T03:05:31</dct:modified>
Can you update your sources and check if you only get 101 records?
from ckanext-dcat.
Tanks for the reply. Sorry, but I didn't understand what do you mean when you say updating my resources.
You mean reharvesting?
from ckanext-dcat.
I meant doing git pull
to update the ckanext-dcat source and reharvesting.
Let me know how it goes,
from ckanext-dcat.
I've updated the ckanext-dcat with git pull and it's still duplicating datasets.
I've also tried uninstalling and installing dcat, restarting, but also fails.
from ckanext-dcat.
Did you restart the two harvester consumers? ctrl+c
if running them directly on the terminal or sudo supervisorctl restart all
if using Supervisor on production.
from ckanext-dcat.
You were right, I forgot to restart the consumers... Thanks!! Now works perfectly!
from ckanext-dcat.
Glad you got it working! :)
from ckanext-dcat.
Related Issues (20)
- Could not build url for endpoint 'dcat.read_catalog' HOT 5
- Does not install with python 3.10 HOT 1
- dcat:mediaType must be a resource HOT 3
- Already deleted records are to be deleted again
- Backslash? Forward slash HOT 2
- New version for dropped Py2 and CKAN<2.9 support HOT 3
- two many locn:geometry
- do not split keywords HOT 2
- Harvester crashes with missing title HOT 1
- Support for DCAT 3 HOT 2
- Improving Pagination Handling in RDF Harvester's gather_stage
- Google Search Console: contentUrl missing
- [META] DCAT v3 support HOT 2
- Create profile and parser for DCAT-AP 3.0.0
- Create serializer for DCAT-AP 3.0.0
- Create profile and parser for DCAT-US 3.0.0
- Create serializer for DCAT-US 3.0.0
- Create schema file(s) for DCAT-AP 3.0.0
- Create schema file(s) for DCAT-US 3.0.0
- Create config declaration
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ckanext-dcat.