Comments (10)
purpose -
to filter the input csv data as per the provided arguments
filter types
- invalid data
- all words
- specific topic
- specific alphabet (which will be initial of the english word)
- TBD (more can be added)
about invalid data filter
the invalid data filter is to be run always before running any other filter, as it eliminates those data elements which have insufficient or invalid data
-
insufficient data - any of the english or marathi word is missing
-
invalid data - if english word contains non english characters (this can be thought of later, and is low priority right now), same for marathi word.
IMP
- make separate functions for each filter type
- as per the passed argument of the "filter type" call the relavant function
steps
- take passed csv data and filter type as argument
- generate a truncated csv data structure as per the target filter
- call the specific filter function here
- return this truncated data to the calling function
from marathi-shabd.
Hey so when you say "insufficient data" do you mean missing English or Marathi words exclusively, or does it also include missing examples and tags?
from marathi-shabd.
only the main 2 words. 1 en and 1 mr.
from marathi-shabd.
Thanks. Also could you explain what the "all words" filter is supposed to do?
from marathi-shabd.
Thanks. Also could you explain what the "all words" filter is supposed to do?
"All words" basically means no filtering (other than the invalid/insufficient data, of course).
from marathi-shabd.
So I would just call the invalid/insufficient data scripts when the filter type is "All words"?
from marathi-shabd.
Yes. Pretty much.
from marathi-shabd.
Also, the filter by topic function will require the topic as an argument. Do you want me to add an optional topic argument to the main filter function?
from marathi-shabd.
Yes you can do it in whichever way that makes the functions easy to use and also reusable.
What I've written in the gen-out.py file is just a basic example.
You can fill in the details and missing gaps.
from marathi-shabd.
pending issues from PR #32
among this priority ones are the # 1 and # 3
from marathi-shabd.
Related Issues (20)
- search site - hitting "enter" key should do same as pressing the "search" button HOT 1
- Markdown Word block update HOT 3
- remove header row from csv when used for markdown file generation HOT 3
- delete the "filtered.csv" file after it is no longer necessary
- topics parser HOT 7
- remove python cache folder __pycache__ HOT 5
- topic specific md files do not contain words which have multiple tags HOT 1
- extract topics list in topics-list.md HOT 7
- each browse md file must have a its file name in its heading HOT 3
- Translate and add Marathi text to the website HOT 1
- खगोलशास्त्राचे शब्द जोडणे HOT 1
- क्रिकेटचे शब्द जोडणे HOT 2
- Create a form for users to submit NEW words HOT 3
- Update readme in all folders HOT 1
- Add Marathi text to index.MD
- Form for users to submit missing Marathi words for EXISTING English words HOT 11
- Word info graphic creator HOT 3
- संख्या
- temporary form to add missing/incorrect words
- जाहिरात पत्रकात इंस्टाग्राम खात्याचे नाव सुधारणे HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from marathi-shabd.