Comments (3)
We removed this operation from the class because it is pretty easy to get
with numpy.unique. You need to ask it for both unique values and the map.
Absolutely important so we should have the example.
On Sep 14, 2015 3:06 PM, "davidwagner" [email protected] wrote:
Goal: Given a table T and a column C, build a new table that has one row
for each unique value in T.C along with a count of the number of times that
value appears in T.C.I was not able to find any clean way to do this within the Table API.
Should this be doable using Tables, without leaving Table space and going
back to arrays and raw Python?Here is the solution I came up with:
from collections import Counter
c = Counter(origtbl['column_label'])
t = Table.from_rows(c.items(), ['column_label', 'count'])Not so terrible if you know Python idioms, but also probably not so
discoverable for students. Should there be an API in Table that's
accessible to students that allows performing this kind of task? Or some
suitable generalized primitive, which is enough to solve this problem?—
Reply to this email directly or view it on GitHub
#68.
from datascience.
Cool. Thank you. Note to future self: np.unique(x, return_counts=True)
from datascience.
Here's the example
T = Table([['A','A','B','C','A']],['letters'])
labels, counts = np.unique(T['letters'],return_counts=True)
R = Table(list(counts),list(labels))
from datascience.
Related Issues (20)
- make test doesn't work HOT 1
- Make Plots Compatible with %matplotlib notebook HOT 2
- Deprecation warning for np.int in _vertical_x()
- Actions for running tests are erroring out HOT 3
- Docs are not getting automatically built HOT 2
- v0.17.5 not pip installable HOT 2
- `Table.scatter` method doesn't recognize `sizes` argument HOT 2
- Getting warning from numpy when creating pivot table HOT 4
- Getting warning from numpy when creating histogram HOT 5
- getting version in setup.py HOT 5
- We use Python f-strings HOT 3
- Possible bug with Marker copy HOT 2
- Remove `Table.empty` from `tables.py` HOT 1
- bokeh required for tests to pass HOT 2
- 0.17.6 release not on pypi or conda HOT 5
- Do we need to check for `collections.abc.Sequence` in `utils. is_non_string_iterable`? HOT 2
- test_date_formatter_format_value() should call tzset() HOT 1
- Please restore: column_labels in 0.17.5 versus 0.17.6 HOT 6
- Error Handling issue. HOT 1
- plots don't work with datascience module HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datascience.