Comments (10)
@JariPekko Thanks, it looks like there is definitely an issue with writing the file queue. Please make sure to create the queue for processing yourself before starting the Level 2 processing.
@davidfrantz There is potential for this to happen in the current version. However, according to the traceback the issue here is that the callback function (called after downloading a scene) isn't getting the url passed on properly.
from force.
I pulled the latest image and used it without changing anything else, and it seems to be working as intended now.
The download is going for 1h now with no error.
A small update on how the process behaved before I pulled the latest davidfrantz/force:latest image:
The downloads did stop completely at some point. The process was still running, but nothing was downloaded for several hours. After aborting manually and starting again, it was always the same pattern:
Download works as intended -> after a short while the error message from above appears but the download is still ongoing -> some time later the download stops
Thanks, and I'll post an update about how it went.
Hi Jari,
the error occurs when trying to add the scene that was just downloaded to the QUEUE file, so you probably don't need to worry about the download itself.
It's not easy to say what the issue is with the information at hand. Does this only happen once? Can you specify which scene it was?
Thanks,
Stefan
Hi Stefan,
thanks for the quick reply. So the download happens correctly but the QUEUE file is not updated correctly?
I just counted the files in the download dir (6129) and the lines of the QUEUE file (5567) in case this is helpful.
I think so far the error happened only once each time I started the process.
I don't know which scene caused it, but I can give you all the scenes I'm trying to download.
Sensor(s): TM, ETM, OLI
Tile(s): 171074,172074,172075,173074,173075,174074,174075,175074,175075,176073,176074,176075,177070,177071,177072,177073,177074,177075,178070,178071,178072,178073,178074,178075,179070,179072,179073,179074,179075,180072,180073
Date range: 1970-01-01 to 2023-01-04
Included months: 1,2,3,4,5,6,7,8,9,10,11,12
Cloud cover: 0% to 70%
20793 Landsat Level 1 scenes matching criteria found
10.97 TB data volume found
5850 product bundles found in output directory, 14943 not downloaded yet.
Remaining download size: 9.78 TB
Downloading: 1%|=> | 102/14909 [18:55<42:35:28, 10.36s/product bundle]
Downloading: 1%|== | 152/14909 [27:11<30:28:13, 7.43s/product bundle]
Could this be a file conflict caused by downloading images in parallel?
I have run several tests and was unfortunately not able to reproduce the issue.
However, the way that the force queue file is created has been reworked to make sure that there aren't conflicts due to parallel access of processes on the same file. Instead of using a callback, we now use multiprocessing.Queue and a dedicated process that listens for results of the other processes and writes the queue file.
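The pattern described above can be sketched roughly as follows. This is only an illustration of a single-writer design built on multiprocessing.Queue, not the actual FORCE code: the function names (writer, download, run), the None sentinel, and the "<path> QUEUED" line format are assumptions made for the example.

```python
import multiprocessing as mp
import os

# Prefer "fork" (POSIX) so the sketch also runs from an interactive session;
# fall back to the platform default elsewhere.
_method = "fork" if "fork" in mp.get_all_start_methods() else None
ctx = mp.get_context(_method)

SENTINEL = None  # hypothetical stop signal for the writer process


def writer(result_queue, queue_path):
    """Dedicated process: the only one that ever opens the queue file."""
    with open(queue_path, "a") as fh:
        while True:
            scene_path = result_queue.get()
            if scene_path is SENTINEL:
                break
            # Assumed line format: "<path> QUEUED"
            fh.write(f"{scene_path} QUEUED\n")
            fh.flush()


def download(scene_id, out_dir, result_queue):
    """Stand-in for a real download; it only reports the finished path."""
    result_queue.put(os.path.join(out_dir, scene_id))


def run(scene_ids, out_dir, queue_path):
    result_queue = ctx.Queue()
    writer_proc = ctx.Process(target=writer, args=(result_queue, queue_path))
    writer_proc.start()

    # One process per scene keeps the sketch short; a real downloader
    # would cap the number of parallel workers.
    workers = [
        ctx.Process(target=download, args=(scene_id, out_dir, result_queue))
        for scene_id in scene_ids
    ]
    for proc in workers:
        proc.start()
    for proc in workers:
        proc.join()

    # All workers are done, so the sentinel is the last item the writer sees.
    result_queue.put(SENTINEL)
    writer_proc.join()
```

Because the download processes only put results on the queue and never touch the file, two of them cannot interleave or clobber each other's writes.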
@JariPekko maybe you can try to pull the latest davidfrantz/force:latest image and let us know if that solves your issue? Thanks!
I'm happy to report that the download went flawlessly and rather quickly. In one day the ~11TB were downloaded.
However, the QUEUE file didn't seem to update at all. I downloaded 20792 scenes (as requested, minus 1), but the QUEUE file had only 5771 lines, the same number it had before I started using the new davidfrantz/force:latest image. The QUEUE file had to be written manually afterwards.
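For anyone who ends up with the same mismatch, a minimal sketch of how the missing entries could be found and appended is shown below. It rests on assumptions that should be checked against your own setup: each QUEUE line is taken to be "<path> QUEUED", each entry directly inside the download directory is taken to be one scene, paths are assumed to contain no whitespace, and the QUEUE file is assumed to live outside the download directory. The function names are hypothetical.

```python
import os


def missing_queue_entries(download_dir, queue_path):
    """Return downloaded scene paths that have no line in the queue file."""
    queued = set()
    if os.path.exists(queue_path):
        with open(queue_path) as fh:
            for line in fh:
                parts = line.split()
                if parts:
                    # Assumed format: the first field is the scene path
                    queued.add(parts[0])

    # Assumption: every entry directly inside download_dir is one scene
    found = sorted(
        os.path.join(download_dir, name) for name in os.listdir(download_dir)
    )
    return [path for path in found if path not in queued]


def append_to_queue(paths, queue_path):
    """Append the given scene paths with the assumed QUEUED status flag."""
    with open(queue_path, "a") as fh:
        for path in paths:
            fh.write(f"{path} QUEUED\n")
```

Calling missing_queue_entries first and inspecting the result before appending gives a chance to spot incomplete downloads that should not be queued yet.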
Thanks a lot for your quick help!!
Thanks for the feedback Jari!
I also noticed that the download speed has improved by orders of magnitude. I hope there have been changes to the infrastructure and it will stay like this now.
Glad to hear the issue is solved! To be honest I'm a bit puzzled that the queue file wasn't updated in your case. This was tested successfully here and I also had someone contact me in private with the same issue who had no issues writing the queue file after the update. Was the file maybe locked by another process by any chance?
I'm new to Linux and may be overlooking something, but I can't think of another process that would have locked the QUEUE file. I stopped the download process (force-level1-landsat search --download) that was running from before the update. The only other command I ran involving the QUEUE file was wc -l queue.txt, to count its lines from time to time.
For testing purposes I just downloaded another scene with a new QUEUE file and new directories. This time the QUEUE file was updated correctly.
Good to hear, thanks.
Leaving the commit for reference and closing this as completed.
Feel free to re-open if needed.