Comments (6)
Please provide some examples
from bga-payroll.
yeah, here's a spreadsheet of all the exact duplicates and the number of times they appear in the source data. i generated it with the command tail +2 raw/2017_payroll.csv | sort --field-separator=',' -k2 -k6 -k5 -k4 -k3 | uniq -c | grep -v '^ * 1 ' > duplicates.csv
and added the header by hand in the spreadsheet.
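For reference, roughly the same count can be sketched in Python. This is a minimal sketch, not the command used above: it assumes the file has a single header row, and `count_exact_duplicates` and the sample data are hypothetical. It compares whole parsed rows, so quoted fields that contain commas are handled by the `csv` module.

```python
import csv
import io
from collections import Counter


def count_exact_duplicates(csv_file):
    """Return {row: count} for every row that appears more than once."""
    reader = csv.reader(csv_file)
    next(reader, None)  # skip the header row (assumed to be present)
    counts = Counter(tuple(row) for row in reader)
    return {row: n for row, n in counts.items() if n > 1}


# Made-up sample data, for illustration only.
sample = io.StringIO(
    "name,department,salary\n"
    "John Smith,Police,50000\n"
    "John Smith,Police,50000\n"
    "Jane Doe,Fire,60000\n"
)
duplicates = count_exact_duplicates(sample)
```

To run it against a real file, pass an open file handle instead of the `io.StringIO` sample.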
From Jared:
If all of the fields are the same, the duplicates could be eliminated.
To avoid collapsing salaries that should not be collapsed, we should not remove exact duplicates automatically. I have advised Jared that we will treat exact duplicates as different people. Perhaps we should flag records that look exactly the same in the upload wizard.
related to #55.
we may want to flag exact dupes within a given year to the user. it could be a case of three john smiths in the cpd from the same recruiting class, making the same pay, but it could be an error.
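One way the flagging idea could look, as a sketch only: every record is kept, and each one is paired with a boolean that says whether an identical record exists in the same year's data. `flag_exact_duplicates` is a hypothetical helper, not part of the upload wizard, and the rows are invented.

```python
from collections import Counter


def flag_exact_duplicates(rows):
    """Pair each row with True if an identical row also appears in `rows`."""
    counts = Counter(tuple(row) for row in rows)
    # Nothing is removed: duplicates are only marked for the user to review.
    return [(row, counts[tuple(row)] > 1) for row in rows]


# Invented rows standing in for one year's upload.
rows_2017 = [
    ["John Smith", "Police", "50000"],
    ["John Smith", "Police", "50000"],
    ["Jane Doe", "Fire", "60000"],
]
flagged = flag_exact_duplicates(rows_2017)
```

Because the function only annotates, three real John Smiths with the same pay would all survive, and the user decides whether the match is an error.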