Comments (18)

julien-c avatar julien-c commented on August 18, 2024 13

Ok we're now officially working on this 🔥

No ETA yet but it seems to be not-super-hard to do =)

On a related subject, we can add a "How to cite" button to those model or dataset repos whose authors have generated a DOI. Supporting the full Citation File Format (https://citation-file-format.github.io/about/) is maybe a bit overkill for now, so I was thinking of just generating a BibTeX snippet for the repos that have a DOI. Any other citation format we should support? WDYT?
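To make the "BibTeX snippet for repos that have a DOI" idea concrete, here is a minimal sketch. It is only an illustration, not how the Hub does it: the `RepoCitation` fields, the `@misc` entry type, the citation-key scheme, and the sample DOI and URL are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass
class RepoCitation:
    """Hypothetical metadata a repo page might hold once a DOI exists."""
    authors: list[str]
    title: str
    year: int
    doi: str
    url: str
    publisher: str = "Hugging Face"


def to_bibtex(meta: RepoCitation) -> str:
    """Render a @misc BibTeX entry for a repo that has a DOI."""
    # Citation key (assumed scheme): last word of the first author's
    # name plus the year, lowercased, alphanumerics only.
    last_name = meta.authors[0].split()[-1]
    key = "".join(ch for ch in last_name.lower() if ch.isalnum()) + str(meta.year)
    fields = {
        "author": " and ".join(meta.authors),
        "title": meta.title,
        "year": str(meta.year),
        "publisher": meta.publisher,
        "doi": meta.doi,
        "url": meta.url,
    }
    body = ",\n".join(f"  {name} = {{{value}}}" for name, value in fields.items())
    return f"@misc{{{key},\n{body}\n}}"


if __name__ == "__main__":
    # Placeholder values, not a real DOI or repo.
    print(to_bibtex(RepoCitation(
        authors=["Jane Doe", "John Roe"],
        title="Example Model",
        year=2022,
        doi="10.12345/example",
        url="https://huggingface.co/jane/example",
    )))
```

Other formats (APA, RIS) could be rendered from the same metadata object, which is one reason to keep the fields separate from the output format.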

from hub-docs.

julien-c avatar julien-c commented on August 18, 2024 7

Hi everyone!! We just launched DOI generation on the HuggingFace Hub, thanks to DataCite, @Kakulukian @sashavor @cakiki and @Ahleroy 🔥

Here's a blogpost about the feature: https://huggingface.co/blog/introducing-doi

Please try assigning DOIs to some of your repo(s) on the Hub and post any feedback here! Thank you 🤗

julien-c avatar julien-c commented on August 18, 2024 4

(also gotta say that I love your README.md on your github @cakiki, and I am absolutely going to rip it off 😂)

julien-c avatar julien-c commented on August 18, 2024 3

Yes I think we should do that.

I've reached out to DataCite and a few other potential registrants – I will follow up here when I know more.

julien-c avatar julien-c commented on August 18, 2024 3

maybe we could try again to see how to provide DOIs directly now. (I feel like syncing to Zenodo is a bit less elegant and depending on dataset size not sure how well it will work)

Will try to sync up with the https://datacite.org/ team in the coming weeks

cakiki avatar cakiki commented on August 18, 2024 3

+1 for BibTeX! As @BramVanroy said, one can convert BibTeX to just about anything that people care about.

I personally quite like the citation UX of both the ACL Anthology and that of Semantic Scholar.

yoshitomo-matsubara avatar yoshitomo-matsubara commented on August 18, 2024 3

I'm so glad to hear the update! 🙏
+1 for bibtex and +1 for citation UX like those @cakiki mentioned :)

julien-c avatar julien-c commented on August 18, 2024 2

@cakiki would you want to participate in a call with them?

Ahleroy avatar Ahleroy commented on August 18, 2024 2

+1 for Bibtex.
Zbib.org is a great example of easy-to-use ref export. They offer Copy / Download buttons and RIS / Bibtex exports.
https://zbib.org/

@julien-c I would be happy to have a chat to discuss how I can contribute to this feature.

Alix

julien-c avatar julien-c commented on August 18, 2024 1

BTW (because this subject came up again recently), I remember trying to ping some DOI registrants last year and it looked very bureaucratically complex, TBH

Maybe at some point someone wants to give this another shot, but be aware this will probably be a long endeavour :)

davanstrien avatar davanstrien commented on August 18, 2024 1

> BTW (because this subject came up again recently), i remember trying to ping some DOI registrants last year and it looked very bureaucratically complex, TBH
>
> Maybe at some point someone wants to give this another shot, but be aware this will probably be a long endeavour :)

One possible stop-gap solution (mainly applicable to models) could be to create a GitHub Action that, in response to a webhook (or on a schedule), downloads a snapshot of a repository and pushes it to a Zenodo repository via their API.

This would require manual setup for those wanting to use it but would be a route to getting a citable and versioned DOI for their model and has the added benefit of creating a 'preservation' copy of the model. This extra copy is also quite desirable for some communities.

This would be a little bit hands-on for people to set up but could give a sense of how many people want this kind of feature. Full integration with Zenodo, like the one GitHub has, would be great, but I think this would be much more involved to establish.

I have it on my todo list to set up something similar to push models from https://huggingface.co/BritishLibraryLabs to https://bl.iro.bl.uk/. I will ping this thread when I get around to that in case it is helpful for other people.

cakiki avatar cakiki commented on August 18, 2024 1

@julien-c I would love to; thank you for including me!

Another thing to try would be to reach out to Kaggle and ask about their experience. They also had it requested by users before they added the DOI feature to datasets.

yoshitomo-matsubara avatar yoshitomo-matsubara commented on August 18, 2024 1

@julien-c @cakiki
I'm here just to say thank you all (including those involved in this thread) for revisiting this issue.
Providing DOIs directly (upon request? like Kaggle does) sounds like a great idea!

BramVanroy avatar BramVanroy commented on August 18, 2024 1

This is awesome! Very glad to hear it. Will it take commit/tag into consideration, or just one DOI per repo for now? (Which is also already awesome!)

BibTeX should be a good start (there are plenty of online BibTeX-to-X converters). However, I do notice that GitHub seems to provide both BibTeX and APA. E.g., on the transformers repo, when you click on "Cite this repository" in the sidebar:

[Screenshot: cite this repo]

This is also described in the GitHub docs.

cakiki avatar cakiki commented on August 18, 2024

You are more than welcome to it! 😃
You could also experiment with nicer layouts (https://rich.readthedocs.io/en/latest/layout.html#creating-layouts)
(@willmcgugan did most of the heavy lifting on this)

cakiki avatar cakiki commented on August 18, 2024

Fantastic idea @davanstrien! It would still require some organization and negotiation (Zenodo limits uploads to 50GB, I believe), but I'm sure it would be substantially less bureaucratic hassle than dealing with registrars.

yoshitomo-matsubara avatar yoshitomo-matsubara commented on August 18, 2024

It's great news!
Many thanks to the team for making this happen!

cakiki avatar cakiki commented on August 18, 2024

Thank you everyone; really happy this is now a feature! 🤗
