Comments (3)
One proposed fix would be to build a single regex out of the various date formats instead of using N separate ones, but I don't know whether Python's re module is powerful enough to report which of several alternatives matched.
from messytables.
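On the question of whether a single regex can report which alternative matched: the standard re module exposes this through named groups and the match object's lastgroup attribute. The snippet below is a minimal sketch of that idea, not messytables code. The format table, the group names and the which_format helper are all made up for illustration, and the patterns only check the shape of the string, not whether it is a valid calendar date.

```python
import re

# Hypothetical format table (not from messytables): group name -> pattern.
DATE_PATTERNS = {
    "iso": r"\d{4}-\d{2}-\d{2}",
    "slash": r"\d{2}/\d{2}/\d{4}",
    "compact": r"\d{8}",
}

# One alternation in which every branch is its own named group.
COMBINED = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in DATE_PATTERNS.items())
)


def which_format(value):
    """Return the name of the branch that matched the whole string, or None."""
    match = COMBINED.fullmatch(value)
    # lastgroup holds the name of the last matched capturing group,
    # which here is the branch of the alternation that succeeded.
    return match.lastgroup if match else None


print(which_format("2012-01-31"))
print(which_format("31/01/2012"))
print(which_format("not a date"))
```

A shape match like this cannot tell day-first from month-first in the slash form, so it would only narrow down which strptime format to try rather than replace the parsing step.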
We don't seem to be using regexes for this. From what I can tell, we're calling datetime.strptime repeatedly with multiple formats until one of them doesn't throw an exception. That sounds less than ideal :)
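For readers who haven't looked at the code, the approach described above looks roughly like the following. This is a sketch under assumptions, not the actual messytables implementation: the format list and the parse_with_fallback name are hypothetical.

```python
from datetime import datetime

# Hypothetical candidate list; the real library's formats may differ.
CANDIDATE_FORMATS = ["%Y-%m-%d", "%d/%m/%Y", "%Y%m%d"]


def parse_with_fallback(value, formats=CANDIDATE_FORMATS):
    """Try each format in order and return (datetime, format) for the first hit.

    Returns (None, None) when no format fits.
    """
    for fmt in formats:
        try:
            return datetime.strptime(value, fmt), fmt
        except ValueError:
            # strptime raises ValueError on a mismatch; move to the next format.
            continue
    return None, None


print(parse_with_fallback("31/01/2012"))
print(parse_with_fallback("not a date"))
```

Every non-matching value pays for one raised and caught exception per format, which is why the cost grows with the number of formats tried.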
I'd recommend using the DateUtilType that you can find here: https://github.com/okfn/messytables/blob/master/messytables/types.py#L124
It's much faster but does not support format guessing.
Related Issues (20)
- Rename types.py to messytypes.py to prevent file name clash on sys.path which gives error: AttributeError: 'module' object has no attribute 'ClassType'
- Boolean type parses values 0 and 1 backwards
- Should DateUtilType use is_date?
- Add a sampling rate to the rowsets
- TypeError("object of type 'float' has no len()",) when calling type_guess
- Detect ods types: boolean, currency, time and percentage. Support repeated column and multi-line text in a cell.
- from six import viewkeys: cannot import name viewkeys
- Application for maintainership
- type guessing a column of zeros (0) fails with exception
- Two fixes in reading ods format: duration correction and multi-text inside a paragraph
- Add support for Flat XML ODF Spreadsheet (.fods)
- Cell starting with a space in ods file is not parsed correctly
- Pass instantiated types to type_guess
- Potential dependency conflicts between messytables and chardet
- Please add LICENSE
- requirements-test.txt should have xlrd==1.2.0 (or >=) for Python 3.8+ tests
- test_attempt_read_encrypted_no_password_xls failure in Python 3.7+
- messytables guesses wrong type for decimal number
- Failure to load with Python 3.10
- headers_make_unique output not always unique