comses / citation Goto Github PK
View Code? Open in Web Editor NEWCitation Management and Deduplication for Django
Home Page: http://citation.readthedocs.io/en/latest/
License: GNU General Public License v3.0
Citation Management and Deduplication for Django
Home Page: http://citation.readthedocs.io/en/latest/
License: GNU General Public License v3.0
investigate http://opencitations.net/ corpus for citation lookups when deduping or filling out sparse citation metadata
right now there is no easy way to change author order in a publication
Currently date published information is stored in the database as text and parsed at run time to use in elasticsearch indices as dates. We cannot store the data as a date field because quite often month and day information is absent. With date information across three fields it will be much simpler to filter by publication year from the database.
log the orphans that were deleted and counts
A few guidelines:
all
queries for every publicationvalues_list(flat=True)
instead of generating a tuple that you indexget_attribute_values
and generate_boolean_list
with list comprehensionshttps://developer.clarivate.com/apis
store the entire object in the audit log before any destructive operation
Come up with a taxonomy of terms:
journal
personal / project URL
Version Control Repository: github / bitbucket
Digital Repository: osf, figshare, zenodo, comses
add schema support + automatic classifier of Publications that assigns the field if it's not already set
extract data_export.py
from the management command using Calvin's example data file as the target schema
add D3 visualizations for real-time interactive stats and scheduled static image generation
potential design: scheduled processes write from primary data to redis or other cache, D3 + VueJS frontend on the wagtail comses.net site dashboard to interact with the redis cache
We can migrate the data from the date_published_text field of the publication to the year_published field
remove bibtexparser spam from console logs
for a given publication, have some external lookup for number of citations referencing that pub
Perform code review with Dhruvil and identify areas to improve.
The main goal is to design a coherent API + data structures + algorithms for generating and accessing the visualization data.
Things to note:
Data inconsistencies should be noted on a publication so that a entry can be reviewed
(right now data import problems are just saved in a pickle file to inspect later)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.