shiqiyang2022 / template
This project forked from gentzkow/template_archive
@ShiqiYang2022: Let me know if you see any issues here, or come across any problems. Thanks for testing! Instructions are below:
1. Clone the template repository, and switch to the branch 85_ditab_format.
2. Set up the .config_user.yaml file and initialize git lfs as usual.
3. Build the conda environment. I had to add a couple of Python packages for the automated export to PDF to compile, so you may need to delete and rebuild the environment if you have an older version stored locally:
       conda env remove -n template
       conda config --set channel_priority strict
       conda env create -f setup/conda_env.yaml
4. Initialize and update the submodule as usual. Then, navigate to ~/lib/gslab_make and switch to the branch template85_ditab_format.
5. Execute python run_all.py from root. This will populate the placeholder scalars when running the analysis module, and produce the formatted PDF tables when running paper_slides.
6. You should see the populated PDFs with the proper formatting now live in ~/paper_slides/output. They are the files prefixed with gs_.
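As a quick sanity check after the last step, the short Python sketch below lists the PDFs that carry the gs_ prefix. It assumes that ~/paper_slides/output refers to paper_slides/output under the repository root and that the script is run from that root; the helper name find_formatted_pdfs is hypothetical and not part of the template.

```python
from pathlib import Path


def find_formatted_pdfs(output_dir, prefix="gs_"):
    """Return the sorted names of PDF files in output_dir that start with prefix."""
    out = Path(output_dir)
    return sorted(p.name for p in out.glob(f"{prefix}*.pdf") if p.is_file())


if __name__ == "__main__":
    # Assumption: the script is run from the repository root.
    pdfs = find_formatted_pdfs(Path("paper_slides") / "output")
    if pdfs:
        print("Formatted tables found:")
        for name in pdfs:
            print("  " + name)
    else:
        print("No gs_-prefixed PDFs found; check that run_all.py completed.")
```

An empty result most likely means that run_all.py did not finish the paper_slides module.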
A few notes:
- [...] run_all.py).
- After running run_all.py for the first time to populate the outputs from the ~/analysis module, you can make desired formatting edits to the Excel sheets in the ~/paper_slides/skeleton folder, and (assuming the link references are also properly updated) the formatting will update if you run only ~/paper_slides.
Thanks chiefs in advance for helping me revise the issue wording ❤️.
Follow gentzkow#84 (comment). Add a detailed workflow to the instructions.
In this issue I try to create a repository locally, sync it to Overleaf, make edits, and push back.
Attempt at the proposal gentzkow#84 (comment) in gentzkow#84.
Step 9 for Practice task on RA Manual (Sherlock Extension).
Update: tag jms nano
I conducted rounds of testing (thanks snd jc) on Sherlock to determine the resources available, and I implemented job runs with different ordering structures and parallel execution strategies. I am therefore posting my notes and proposing a restructured job submission approach.
- [...] gentzkow group account.
- [...] gentzkow group is gentzkow and hns (Humanities and Sciences) nodes. We can also access normal (public) nodes, but the average job queue time on those nodes is [...] sshare score (a metric in SLURM's fair-share scheduling) preventing us from using normal nodes.
- gentzkow nodes have in total [...]. hns nodes have 20 CPUs with 128 GB RAM, 24 CPUs with 192 GB RAM, 32 CPUs with 256 GB RAM, and 128 CPUs with 1024 GB RAM.
In the last full run before NBER submission, we output in total [...] RCNL, [...] RCL, and [...] L estimations. Per my investigation, on average one RCNL estimation costs [...], RCL [...], and L [...]. The total amount of hours*CPU for conducting the full run is roughly [...].
If we run jobs at full capacity, ideally each CPU would have been busy for approximately [...].
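The hours*CPU arithmetic above can be sketched in Python as follows. The estimation counts, per-estimation hours, and CPU count in the example are hypothetical placeholders, not the figures from the actual run, and the sketch assumes each estimation occupies exactly one CPU for its whole duration.

```python
def total_cpu_hours(counts, hours_per_estimation):
    """Sum of (number of estimations) x (average hours per estimation) over model types."""
    return sum(counts[model] * hours_per_estimation[model] for model in counts)


def ideal_hours_per_cpu(cpu_hours, n_cpus):
    """Busy time per CPU if the work were spread perfectly evenly over n_cpus."""
    return cpu_hours / n_cpus


if __name__ == "__main__":
    # Hypothetical placeholder values, for illustration only.
    counts = {"RCNL": 10, "RCL": 20, "L": 30}
    hours = {"RCNL": 4.0, "RCL": 1.0, "L": 0.5}
    n_cpus = 25

    total = total_cpu_hours(counts, hours)
    print("total CPU-hours:", total)
    print("ideal hours per CPU:", ideal_hours_per_cpu(total, n_cpus))
```

The second figure is a lower bound on wall-clock time: queueing, uneven job lengths, and idle CPUs all push the real number above it.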
My proposed solution is to combine 4 jobs from the previous submit_jobs.py into 1 new job. For each new job, we run the 4 previous jobs in parallel using the parpool() function, and assign 1 CPU to each job (so 4 CPUs for each new job). The strengths of this new approach are:
- [...] RCNL job, [...] RCL and [...] L,
- [...] RCNL job, [...] RCL and [...] L.
In previous runs, we didn't fully utilize the CPU capacities (using [...]
For this proposal, the user submits all [...]
I tested the parallelized job submission structure on [...] RCNL estimations of [...]
The only issue that might be worth considering is that the .out file (the job's run log) might get scrambled because of the parallelized run, but I think we can always add a "light run" version of the estimation (say, 5 simulations) that writes the .out file in a non-parallelized way.
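The batching in this proposal (4 former jobs per new job, 1 CPU each) can be sketched in Python roughly as below. This is not the actual submit_jobs.py: the function names, the job labels, the partition string "gentzkow,hns", and the 8G memory-per-CPU value are illustrative assumptions, and the MATLAB call that opens the parpool() workers is deliberately left out.

```python
def batch_jobs(jobs, batch_size=4):
    """Split a list of job specs into consecutive batches of at most batch_size."""
    return [jobs[i:i + batch_size] for i in range(0, len(jobs), batch_size)]


def sbatch_header(job_name, n_workers, mem_per_cpu_gb=8, partition="gentzkow,hns"):
    """Build SLURM directives requesting one CPU per parallel worker.

    The partition names and the memory figure are assumptions for illustration.
    """
    return "\n".join([
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --partition={partition}",
        "#SBATCH --ntasks=1",
        f"#SBATCH --cpus-per-task={n_workers}",
        f"#SBATCH --mem-per-cpu={mem_per_cpu_gb}G",
    ])


if __name__ == "__main__":
    # Hypothetical job labels; the real list would come from submit_jobs.py.
    jobs = [f"RCNL_{i}" for i in range(10)]
    for k, batch in enumerate(batch_jobs(jobs)):
        print(sbatch_header(f"batch_{k}", n_workers=len(batch)))
        print("# estimations in this batch:", ", ".join(batch))
        print()
```

Requesting len(batch) CPUs rather than a fixed 4 keeps the final, possibly shorter, batch from reserving idle CPUs.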
- [...] RCNL jobs took more than 9 hours to output one estimation. This is due to the use of different combinations and different tolerance levels; by investigating and specifying those, we can improve the job submission efficiency.
- [...] RCNL job. Previously we requested 20 GB per RCNL estimation, but on Sherlock most nodes have a ratio of 8 GB of memory per CPU. @ NB, do you think 20 GB is necessary for RCNL, or can we cut the requested memory a bit?
Step 5 for Practice task on RA Manual (Basic functionality).
In this issue I try to link the related branch to another repository, to work around the problem that Overleaf only links to master.
Attempt at proposal (1) in the template issue.
Post weekly updates.
Step 3 for Practice task on RA Manual (Basic functionality).
Follow gentzkow#84 (comment).