Code Monkey home page Code Monkey logo

Comments (4)

DingoEatingFuzz avatar DingoEatingFuzz commented on August 22, 2024

I responded to this in the sagemaker PR: hackoregon/hackoregon-aws-infrastructure#61 (comment)

from 2019hackordatasciencetemplate.

karenng-civicsoftware avatar karenng-civicsoftware commented on August 22, 2024

@DingoEatingFuzz being able to list the data files will allow better collaboration between the different teams. The desired workflow is to do

$ aws s3 ls s3://hacko-data-archive

An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied

I do not think this will be computationally intensive operation. Why wouldn't you want to allow access?

from 2019hackordatasciencetemplate.

DingoEatingFuzz avatar DingoEatingFuzz commented on August 22, 2024

It is not computationally intensive, and I don't mind if data scientists are given access tokens to use with the aws cli tool, the issue is around security.

Even though the S3 bucket is public, you'd have to know the URL you want in order to download anything. If I make the ListObjects operation available to the public, then anyone could crawl through the bucket and clone the whole thing.

I would like to eventually open this up, but not before doing an audit and making sure there is truly nothing in there that we would want exposed.

from 2019hackordatasciencetemplate.

karenng-civicsoftware avatar karenng-civicsoftware commented on August 22, 2024

resolution: @karenng-civicsoftware will gather a list of emails of the data scientists needing AWS accounts to list the files on the S3 buckets. Then share it with Michael to create the accounts.

from 2019hackordatasciencetemplate.

Related Issues (18)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.