Comments (2)
The logs are:
blocklib/signature_generator.py ...F [ 16%]
tests/test_blocks_generators.py ... [ 29%]
tests/test_candidate_block_generator.py .. [ 37%]
tests/test_get_config.py .. [ 45%]
tests/test_pprlindex.py .. [ 54%]
tests/test_pprllambdafold.py ... [ 66%]
tests/test_pprlpsig.py .. [ 75%]
tests/test_signature_generator.py ..F. [ 91%]
tests/test_validation.py .. [100%]
=================================== FAILURES ===================================
__________ [doctest] blocklib.signature_generator.generate_by_soundex __________
071 Generate a phonetic encoding of features using soundex.
072
073 >>> sigs = generate_by_soundex(1, ('Joyce', 'Wang', 2134))
UNEXPECTED EXCEPTION: UnicodeDecodeError('ascii', b'@\x8f\x08\xbe\xacU', 1, 2, 'ordinal not in range(128)')
Traceback (most recent call last):
File "/opt/hostedtoolcache/Python/3.6.9/x64/lib/python3.6/doctest.py", line 1330, in __run
compileflags, 1), test.globs)
File "<doctest blocklib.signature_generator.generate_by_soundex[0]>", line 1, in <module>
File "/home/vsts/work/1/s/blocklib/signature_generator.py", line 79, in generate_by_soundex
return soundex(feature)
File "src/fuzzy.pyx", line 230, in fuzzy.Soundex.__call__
UnicodeDecodeError: 'ascii' codec can't decode byte 0x8f in position 1: ordinal not in range(128)
/home/vsts/work/1/s/blocklib/signature_generator.py:73: UnexpectedException
______________________ TestPSig.test_generate_signatures _______________________
self = <test_signature_generator.TestPSig testMethod=test_generate_signatures>
def test_generate_signatures(self):
"""Test a multi-stragegy signatures."""
dtuple = ('Joyce', 'Wang', 2134)
signatures = [
[
{'type': 'feature-value', 'feature_idx': 0},
{'type': 'feature-value', 'feature_idx': 1},
],
[
{'type': 'soundex', 'feature_idx': 0},
{'type': 'soundex', 'feature_idx': 1},
]
]
signatures = generate_signatures(signatures, dtuple)
> assert signatures == {"J2W52", "JoyceWang"}
E AssertionError: assert {'', 'JoyceWang'} == {'J2W52', 'JoyceWang'}
E Extra items in the left set:
E ''
E Extra items in the right set:
E 'J2W52'
E Use -v to get the full diff
from blocklib.
This issue is not yet closed (at least, it was not fixed in master). It is supposed to be resolved by branch #13, where some functionality and the corresponding tests will be removed.
PR #19 ensured that the tests failing in Python 3.5 were the same as in every other Python version, and that they failed for the same reason.
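For reference, the value the failing assertion expects ("J2W52") can be reproduced without the `fuzzy` extension. The following is only a minimal sketch of classic American Soundex in pure Python, assuming the signature is the concatenation of the unpadded codes of the two name features. The helper name `soundex_unpadded` is hypothetical and is not part of blocklib or `fuzzy`.

```python
import string

# Classic Soundex digit groups; vowels, y, h and w carry no digit.
CODES = {}
for digit, letters in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1):
    for ch in letters:
        CODES[ch] = str(digit)


def soundex_unpadded(name, max_len=4):
    """Hypothetical helper: Soundex code without the trailing zero padding."""
    # Keep ASCII letters only, so the input never hits a codec error.
    letters = [c for c in name.lower() if c in string.ascii_lowercase]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous coded letter.
        if code and code != prev:
            result += code
        # h and w do not separate letters that share a code; vowels do.
        if ch not in "hw":
            prev = code
        if len(result) == max_len:
            break
    return result


dtuple = ("Joyce", "Wang", 2134)
signature = soundex_unpadded(dtuple[0]) + soundex_unpadded(dtuple[1])
print(signature)  # → J2W52
```

This does not address the `UnicodeDecodeError` raised inside `fuzzy.Soundex.__call__`; it only shows what the soundex strategy is expected to return for this record.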