ajdamico / lodown
locally download and prepare publicly-available microdata
# only download 2011
meps_cat <- get_catalog( "meps" , output_dir = "C:/My Directory/MEPS" )
lodown( "meps" , subset( meps_cat , year == 2011 ) )
i can't get to this ftp site from either a US or an Ivory Coast IP address :/ if it's only administrative data, is it big enough to require MonetDB? (2 million+ records?) thanks
@ajdamico, when I run
dbGetQuery( mdb_src$con , "SELECT RIGHT( cast( dtobito as text ) , 4 ) as ano , COUNT(*) from geral_cid10 GROUP BY ano order by ano" )
it returns:
     ano      L5
1   +004       1
2   +006  615147
3   +007 1467989
4   1996  908883
5   1997  903516
6   1998  931895
7   1999  938658
8   2000     496
9   2001  961492
10  2002  982807
11  2003 1002340
12  2004 1024073
13  2005 1006827
14  2006 1031691
15  2007 1047824
16  2008 1077007
17  2009 1103088
18  2011 1170498
19  2012 1181166
20  2013 1210474
21  2014 1227039
The first 3 lines and the year 2000 are wrong, as you can see at http://tabnet.datasus.gov.br/cgi/tabcgi.exe?sim/cnv/obt10uf.def when you select Linha: Ano do óbito (year of death) and all years.
Can you tell me why?
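One way to chase this down (a sketch, reusing the connection and table from the query above) is to look at the raw dtobito values behind the suspicious groups, to see whether they are malformed dates or numeric values printed in scientific notation:

```r
# inspect the raw dtobito values that produce the bad "ano" groups
# (sketch: assumes the same mdb_src connection and geral_cid10 table as above)
dbGetQuery( mdb_src$con ,
	"SELECT CAST( dtobito AS TEXT ) AS raw_dtobito , COUNT(*) AS n
	FROM geral_cid10
	WHERE RIGHT( CAST( dtobito AS TEXT ) , 4 ) IN ( '+004' , '+006' , '+007' )
	GROUP BY raw_dtobito
	ORDER BY n DESC
	LIMIT 20" )
```

If those rows turn out to be valid dates stored in a different format (or doubles rendered like 1e+006), the original query's RIGHT(..., 4) extraction would misclassify them, which could also explain the implausibly small year-2000 count.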
just in an asdfree book, or elsewhere?
* checking loading without being on the library search path ... OK
* checking dependencies in R code ... NOTE
There are ::: calls to the package's namespace in its code. A package
almost never needs to use ::: for its own objects:
'recursive_ftp_scrape'
* checking S3 generic/method consistency ... OK
* checking replacement functions ... OK
* checking foreign function calls ... OK
* checking R code for possible problems ... NOTE
get_catalog_mtps: no visible binding for global variable 'URLdecode'
lodown_mtps: no visible global function definition for 'read.csv2'
Undefined global functions or variables:
URLdecode read.csv2
Consider adding
importFrom("utils", "URLdecode", "read.csv2")
to your NAMESPACE file.
* checking Rd files ... OK
* checking Rd metadata ... OK
* checking Rd line widths ... OK
* checking Rd cross-references ... OK
* checking for missing documentation entries ... OK
* checking for code/documentation mismatches ... OK
* checking Rd \usage sections ... OK
* checking Rd contents ... OK
* checking for unstated dependencies in examples ... OK
* checking examples ... OK
* DONE
Status: 2 NOTEs
See
'C:/Users/anthonyd/Documents/GitHub/lodown.Rcheck/00check.log'
for details.
R CMD check results
0 errors | 0 warnings | 2 notes
R CMD check succeeded
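Since the package is roxygenized (see the note about re-roxygenizing below), the fix the second NOTE suggests can be expressed as a roxygen tag rather than a hand-edited NAMESPACE file (a sketch):

```r
# in any package source file, e.g. R/lodown.R, then re-run roxygen2;
# this generates importFrom("utils", "URLdecode", "read.csv2") in NAMESPACE
#' @importFrom utils URLdecode read.csv2
NULL
```

The first NOTE is fixed separately, by calling recursive_ftp_scrape() directly instead of lodown:::recursive_ftp_scrape(), since a package never needs ::: for its own objects.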
it should be another column in the data frame that get_catalog_* makes. that way, it defaults to something sensible, but users can change it the same way they can change other output names. you will also need to add this to the dir.create() line within lodown.R so the directory gets built. make sense? thanks
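A minimal sketch of that pattern (the column name output_filename and the file layout are assumptions for illustration, not the actual lodown schema):

```r
# sketch: inside a get_catalog_*() function, add a default output path column
# ('output_filename' is a hypothetical column name)
catalog$output_filename <-
	paste0( output_dir , "/" , catalog$year , " main.rds" )

# sketch: inside lodown.R, build the directory before any file is written
dir.create(
	dirname( catalog$output_filename[ i ] ) ,
	showWarnings = FALSE ,
	recursive = TRUE
)
```

Because it is just another catalog column, users can overwrite it after get_catalog() and before lodown(), the same way they already redirect other output names.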
capture the error and add the note about how to increase disk paging
verify platform independent
https://www.cdc.gov/nchs/slaits/cshcn.htm
looks very similar in structure to nsch
when you add datasets, make sure you also add lines in lodown.R and re-roxygenize. each new function pair needs one line in the first block and two lines in the second block. i don't have this hooked up to travis, so make sure you test-build in rstudio.. thanks bud
this forces you to ignore results on a case-by-case basis. read_fwf cannot be trusted to work on future datasets
the code in lodown_sim and lodown_sinasc looks nearly identical. is there a good reason to keep them separate?
if they stay separate, please remove the redundant code and replace it with datasus-wide custom functions. see lodown_icpsr and get_catalog_icpsr for an example of how to structure this. thanks
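One possible factoring (a sketch; the shared helper's name and signature are hypothetical, mirroring the lodown_icpsr / get_catalog_icpsr structure mentioned above):

```r
# sketch: datasus-wide helper holding the download/import logic
# that lodown_sim and lodown_sinasc currently duplicate
lodown_datasus <- function( data_name , catalog , ... ){
	# ...shared DATASUS download + import code goes here...
}

# the per-dataset functions become thin wrappers
lodown_sim <- function( data_name , catalog , ... )
	lodown_datasus( data_name , catalog , ... )

lodown_sinasc <- function( data_name , catalog , ... )
	lodown_datasus( data_name , catalog , ... )
```

Any genuinely dataset-specific behavior can then live in the wrapper, with everything else in one place.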
Please feel free to assign me to the issue if you can!
Poking @joelgombin, just in case.
loop through file listings like unzipped_files and import with haven/SAScii as much as possible. there is too much redundant code as it is
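One possible shape for that loop (a sketch; unzipped_files and the extension-based dispatch are assumptions):

```r
# sketch: walk the extracted files and import each with haven where possible
library(haven)

for( this_file in unzipped_files ){

	if( grepl( "\\.sas7bdat$" , this_file , ignore.case = TRUE ) ){

		x <- haven::read_sas( this_file )

	} else if( grepl( "\\.dta$" , this_file , ignore.case = TRUE ) ){

		x <- haven::read_dta( this_file )

	} else next
	# ...save or stack `x` here, falling back to SAScii for
	# fixed-width files that ship with a sas import script...
}
```

Centralizing the dispatch this way would replace the near-identical import blocks scattered across the per-dataset functions.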
similarly, should nrow( surveydesign ) be added?
that lots of MB will be downloaded automatically. anyone with a pay-per-GB connection should use this package with caution
unclear if/where this is worth the effort.. some current datasets are likely easy to add a design to the auto-output
probably everything on the continent, country, and regional pages, like http://download.geofabrik.de/central-america/cuba.html
@guilhermejacob i'll do this later
include some trigger in catalog creation that tells users to file a github issue for a manual check when a data website changes in an unpredictable way.. add to template.R as well
hi, in this commit 60388c4 i removed the db_tablename from the catalog because it is not used. should it be? is the escola table the main table here, or should there be some other merge happening automatically?
in the same commit, i added the case_count column to the catalog using the current year's escola table. if that's the main table for this dataset, then i think that's what we want? what do you think?