deadsoul / dugu Goto Github PK
View Code? Open in Web Editor NEWFind, remove and avoid duplicates with dugu: The Duplicates Guru
License: GNU General Public License v3.0
Find, remove and avoid duplicates with dugu: The Duplicates Guru
License: GNU General Public License v3.0
[/unsorted/scans] python3 /root/py/dugu/src/dugu.py ## python 3.5
Traceback (most recent call last):
File "/root/py/dugu/src/dugu.py", line 23, in <module>
from dugu.main import DuGuMain
File "/root/py/dugu/src/dugu/main.py", line 78
p(f"Current memory usage is {current / 10 ** 6}MB; Peak was {peak / 10 ** 6}MB")
^
SyntaxError: invalid syntax
[/unsorted/scans] python3.8 /root/py/dugu/src/dugu.py
Minimum required Python version is: 3.8.6 not 3.8.4
Took: 0.015789298922754824 seconds
Current memory usage is 0.049036MB; Peak was 0.04926MB
It's possible I missed it but it looks like dugu read/writes to /tmp/dugu_root and this isn't documented.
I can't see an option to change this behavior (I've not debugged build_path()
)
If a relative path used when cache is generated, the cache won't be available unless the same relative path is being called in the next scan.
for example (first scan):
we're in /home/deadsoul/Documents
, and we would like to scan /home/deadsoul/Pictures
:
dugu scan ../Pictures
if we repeat dugu scan ../Pictures
it should take advantage of the generated cache.
But if we're in /home/deadsoul
and tried to dugu scan Pictures
it will re-generate another cache. since '../Picutres' is not equal to 'Pictures'. which means dugu scan /home/deadsoul/Pictures
should generate another cache.
and the same thing should apply if the scanned target directory is a link.
Suggested solution:
convert the given target-directory to its absolute path.
if there is any broken symlink in DIR1 (source_dir)
dugu
will fail when using the argument precopy
it takes long time because the time is being wasted in copying the whole DIR1
!!
the way precopy
works now is:
DIR1
to DIR1/_UniqueFiles_
DIR2
from DIR1/_UniqueFiles_
by removing themwhile it should be like
DIR1
because we're wasting time by copying all the files, then removing some of them. instead we should only copy unique files
Need to check which counter is correct. is it the one in caches, or the one used to compare the cached one!
commit: caba8a2
It would be great if I could install this package with python3 -m pip install dugu
, and then run it with python3 -m dugu -h
. Please publish this package to the PyPI package index!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.