melbournebioinformatics / analytics Goto Github PK
View Code? Open in Web Editor NEWUsage data analytics
Usage data analytics
Currently, rows slurm data are "dependent", the might be multiple job steps for a single job. Need to collapse dependent rows into a single independent job description.
e.g.
1234.bash
1234.0
1234.1
...
convert to
1234
Generate a distribution graphs for each data field. Comment on each one, report interesting findings.
Need README, licence etc
Review project titles and descriptions to identify main project categories
[guess at example categories, Molecular dynamics, Fluid dynamics, data pipeline, class/training, system administration, ...]
Get all job (sacct), project and disk data
There are columns (data fields) that are "uninformative" in the sense that they might be filled with "nan" or has the same value throughout (0 information content). Should write code that does such checking. If a column is uninformative for all dataset, we should workout why and remove it from downstream analysis.
Fields such as wait time, total memory request/used, job frequency, etc could be derived from the data.
Want candidate list for discussion
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.