
devcenter's People

Contributors

glynnbird, jenschlot, vabarbosa


devcenter's Issues

Curation search and filter

Hard to find a piece of content that you've already cataloged. Need to be able to sort / search through cataloged content.

Now you can only get a list of what's cataloged -- you have to use browser search on the page to step through Live or Provisional list of links. Right now the full list is CDS and DSX combined. You need to know a keyword in the title to find a link.

When viewing items in 'live', 'provisional' and 'deleted', provide a way to filter the view on the 'namespace' field

I propose we add a search facility into the DevCenter. A search box that allows free-text searches and a list of the facets (type/technology/namespace) that allows the user to make the list of content smaller with a few clicks.
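As a rough illustration of the kind of filtering being asked for, here is a minimal client-side sketch. It assumes each cataloged item is a plain object with `title`, `namespace` and `status` fields; only `namespace` is named in this issue, the other field names and the function itself are hypothetical. A real implementation would more likely push this down to the database's search index.

```javascript
// Hypothetical helper: narrow a list of cataloged items by namespace,
// status and a free-text match on the title. Field names other than
// `namespace` are assumptions made for this sketch.
function filterDocs(docs, query) {
  const text = (query.text || '').trim().toLowerCase();
  return docs.filter(function (doc) {
    // Facet-style filters: only applied when a value was supplied.
    if (query.namespace && doc.namespace !== query.namespace) return false;
    if (query.status && doc.status !== query.status) return false;
    // Free-text filter: case-insensitive substring match on the title.
    if (text && !String(doc.title || '').toLowerCase().includes(text)) return false;
    return true;
  });
}
```

For example, `filterDocs(items, { namespace: 'DSX', status: 'live', text: 'notebook' })` would return only the live DSX items whose title mentions "notebook".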

Taxonomy value additions and changes

Could you please make the following changes? Thanks

Languages:

  • change BigSQL to Big SQL

Technologies:
add

  • Brunel
  • Object Storage
  • Pandas

Topic:
add

  • Communication
  • Data Exploration
  • Deep Learning
  • Finance
  • Geospatial Analytics
  • Neural Networks
  • Packages
  • Probabilities
  • Theory
  • Time Series
  • Use Case

Always include a default title if none present when adding a new record

Problem: There have been instances where new records added to the tool (perhaps via the /curate option in Slack) have not had any titles. This is a problem because the title is the only way to access a record in devCenter Uploader, so an untitled record cannot be opened.

Solution: Ensure that some text - like "Missing title" - is included if there is no title pulled in.
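A minimal sketch of the proposed fallback, assuming records are plain objects with a `title` field. The function name and the idea of applying it just before the record is saved are assumptions for illustration, not the tool's actual code.

```javascript
// Placeholder text suggested in this issue.
const DEFAULT_TITLE = 'Missing title';

// Hypothetical helper: return a copy of the record that always has a
// non-empty title, so it stays reachable from the title-based UI.
function ensureTitle(doc) {
  const title = typeof doc.title === 'string' ? doc.title.trim() : '';
  return Object.assign({}, doc, { title: title || DEFAULT_TITLE });
}
```

Whitespace-only titles are treated as missing here, since they would be just as hard to click on as an empty one.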

Show source of the content (slack/RSS/etc)

We get information fed into the database from different sources, such as slack, RSS, etc. We'd like to be able to tell what the source is so that we can prioritize accordingly, for example:

  • the slack /curate command
  • RSS
  • manual entry (could be default)

Schema additions

Languages (add):

  • DML
  • BigSQL

(We don't call Node.js a language; we mark things as JavaScript instead.)

Technologies (add):

  • DB2
  • Hadoop
  • SPSS

Topic (add):

  • Data Shaping
  • Data Modeling
  • Text Analytics
  • Predictive Analytics
  • Prescriptive Analytics
  • Monte Carlo
  • Simulation

Type (add):

  • API
  • Blog
  • Book
  • Course
  • Data set
  • Demo
  • Docs
  • Event
  • Forum
  • Notebook
  • Paper
  • Podcast
  • Presentation
  • Project
  • SDK
  • Sample

Set up curated docs for future publishing

We want to be able to curate a set of items at one time, but then set them up for future publishing so that we establish a good cadence for content that goes live.

Search returns no results when searching on keywords in titles

Is there information on what is allowed in the search field? When I search on keywords in titles, I get no results. For example, I've tried searching for "Working with notebooks in IBM Data Science Experience" both with and without quotes, and just using one word, or subset of words, but I get no results.

Thanks.

Lifecycle governance: flag live items after a time period for possible removal

When content is initially curated, apply a flag/expiration date by default. Allow this date to be manually edited. Here's the list of possible date values:

https://ibm.app.box.com/notes/71982223665 (see table) Let me know if you can't reach the box note.

When the date is reached the items are flagged and shown in a view where a curator can come in and review them for possible deletion or reset the expiration date to another in the future.
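The review view could be driven by a check along these lines. This is only a sketch: it assumes each record carries a `status` field and a numeric `expires` timestamp in milliseconds, both of which are hypothetical names, and it leaves the actual default periods to the table in the box note.

```javascript
// Hypothetical helper: pick out live items whose expiration date has
// been reached, so a curator can delete them or reset the date.
// `status` and `expires` (ms since epoch) are assumed field names.
function findExpired(docs, now) {
  return docs.filter(function (doc) {
    return doc.status === 'live' &&
      typeof doc.expires === 'number' &&
      doc.expires <= now;
  });
}

// Hypothetical helper: push the expiration date into the future.
function resetExpiry(doc, newExpires) {
  return Object.assign({}, doc, { expires: newExpires });
}
```

Items without an `expires` value are never flagged in this sketch; whether they should be is a policy decision for the curators.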

Access to record from URL link in search view

Hi,

One of the feeds, brunelvis.org, is not working correctly - the title field is not being filled. Since that is the only way to access the record from the tool, I can't access these items. I've looked at a couple of the URLs, and viewing the page source, there appears to be a title tag so I'm not sure why these items are not complete.

In any case, would it be possible to allow access to a record from the URL? This would allow access to incomplete records.

Thanks.

Needed by April 7th: Taxonomy updates

Along with items in #30, we'd like the following additions for GA of IBM Streaming Analytics with support for DSX python notebooks:

Technologies:
Add: IBM Streams

Topic:
Add: Streaming Analytics
Add: Streaming Pipelines

New "technologies" & "topic" values required

Please add the following new values under technologies:

  • Data Refinery - I think that this replaces Data Connect, but there are 4 items that use Data Connect, so I don't know if we should rename Data Connect to Data Refinery, or just add Data Refinery
  • Data Catalog
  • DSX

The following should be modified:

  • DB2: change to Db2
  • dashDB: change to Db2 Warehouse on Cloud
    Note: Folks managing CDS content might want to weigh in on this, but is it possible to run a query to apply these changes to existing entries?

(FYI @gfilla @deirdrelongo)

Avoid spidering PDF URLs - causes crash or incomplete record

Problem: I managed to crash the devCenter Uploader trying to add a link to a PDF with the 'Create New Document' tab. (Using the 'Create provisional document' tab has a different result in that it creates a record, but doesn't populate the title, so I can't access that record via the UI. Issue #28 opened for that.)

The problem is related to the fact that the tool tries to crawl (spider) the PDF, but there is no data. Every time a document is edited with a blank body, it tries to fetch the content again.

Glynn suggested that the URL could be 'pre-fetched' in order to get its content-type and if it's not text/html just skip the crawler.
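Glynn's suggestion could look roughly like the sketch below. It assumes a runtime with a global `fetch` (modern browsers, or Node.js 18 and later) and uses a HEAD request as the "pre-fetch"; the function names are hypothetical and this is not the tool's existing crawler code.

```javascript
// Hypothetical helper: decide from a Content-Type header value whether
// the spider should run. Only text/html is treated as crawlable.
function isCrawlable(contentType) {
  if (typeof contentType !== 'string') return false;
  // Drop parameters such as "; charset=utf-8" before comparing.
  const mediaType = contentType.split(';')[0].trim().toLowerCase();
  return mediaType === 'text/html';
}

// Hypothetical pre-fetch: ask the server for headers only, then check
// the content type. Any network failure is treated as "do not crawl".
async function shouldSpider(url) {
  try {
    const response = await fetch(url, { method: 'HEAD' });
    return isCrawlable(response.headers.get('content-type'));
  } catch (err) {
    return false;
  }
}
```

Some servers do not answer HEAD requests correctly, so a production version might need to fall back to a GET and abandon the body early.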

Workaround suggested: for a PDF URL, make sure you supply a title and put some words in the body field when creating a new record with the 'Create New Document' tab. This avoids the attempt to fetch the content again.

New fields: event date, and location

For DSX, we plan on starting to make use of the Event type. For this, we need two new fields:

  1. Event date: text field to capture a range like "June 21-25, 2017"
  2. Event location: text field to capture text like "Toronto, Canada"

Note that we could make use of the fields for other types, like course, webinar, etc - right now, I work around this for Webinar by adding the date in the title...

Thanks @glynnbird :)

Alphabetise the facets

The DataSciX folks would like the facets ordered alphabetically.

This isn't something we can do at the Cloudant end, but can easily be achieved at the front-end by simply sorting the array before it is rendered on the web page.

> var x = ['b','a','c'];
> x.sort()
[ 'a', 'b', 'c' ]

No work required.
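One caveat for the front-end: the default `sort()` compares strings by UTF-16 code unit, so capitalised values come before lower-case ones. If the facet values mix cases, a case-insensitive comparison may be closer to what is wanted. A minimal sketch, assuming the facet values arrive as an array of strings (the function name is hypothetical):

```javascript
// Hypothetical helper: return a new, alphabetically ordered copy of the
// facet values, ignoring case differences when comparing.
function sortFacetValues(values) {
  return values.slice().sort(function (a, b) {
    return a.localeCompare(b, undefined, { sensitivity: 'base' });
  });
}
```

`slice()` is used so that the original array is left untouched, unlike a bare `sort()`, which sorts in place.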

Allow user to create a view with columns of their choosing

Allow user to create a view similar to the Search view, but with columns of their choosing.

For example, I want to see all the live entries for DATASCIX and, along with title and url, I would like to display the imageurl field to identify missing ones.

  1. Allow user to choose columns to be displayed
  2. Allow columns to be sorted
  3. Have ability to select the record from a row displayed

Crawl additional format types

PDF / ebook in particular.

I know we discussed this, but can't remember where we ended up. If we ended up with -- can't do it, then please reject.

Add auto-generated "ts" element

Each document should have an element "ts" which is a numeric timestamp indicating when it was last touched. This could then be used for sorting.
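A minimal sketch of what this could look like, assuming "numeric timestamp" means milliseconds since the Unix epoch (as returned by `Date.now()`) and that the stamp is applied just before each write. Both function names are hypothetical.

```javascript
// Hypothetical helper: return a copy of the document stamped with the
// time it was last touched. `now` can be passed in to make testing easy.
function touch(doc, now) {
  const ts = typeof now === 'number' ? now : Date.now();
  return Object.assign({}, doc, { ts: ts });
}

// Comparator for Array.prototype.sort: most recently touched first.
// Documents without a "ts" are treated as the oldest.
function byMostRecent(a, b) {
  return (b.ts || 0) - (a.ts || 0);
}
```

With this in place, `docs.sort(byMostRecent)` would list the most recently edited documents first.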

Level value is not sticky

We can set Level (beginner, intermediate, advanced) for a content item, and it's written to the database, but when we go back to make the item live the Level is blank -- it's not being pulled from the db.

Slack integration - reply back quickly

When Slack calls our /slack endpoint it needs a speedy reply otherwise the user gets a timeout error message.

@bradnoble proposed that we get DevCenter to reply quickly to say it has acknowledged the request, then do the spidering and database writing in the background.
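The acknowledge-first pattern could be sketched as follows. The handler is framework-agnostic: `sendReply` stands in for whatever writes the HTTP response to Slack and `processUrl` for the spidering and database write. Both are hypothetical callbacks supplied by the caller, not existing DevCenter functions.

```javascript
// Hypothetical handler: acknowledge the request first, then do the slow
// work after the current call has returned.
function handleCurateCommand(url, sendReply, processUrl) {
  // Reply straight away so Slack does not report a timeout.
  sendReply({ text: 'Got it, processing ' + url });

  // Defer the spidering and database write to a later turn of the
  // event loop; failures are logged because the reply has already gone.
  setImmediate(function () {
    Promise.resolve()
      .then(function () { return processUrl(url); })
      .catch(function (err) {
        console.error('curate failed for', url, err);
      });
  });
}
```

Because the reply is sent before the work starts, any failure has to be reported some other way, for example by a follow-up message to the user.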

Add "format" facet

Add a "format" facet to indicate the document's format (as opposed to document type).
Initial values:

  • HTML
  • PDF
  • Podcast
  • Video
  • Webcast

Need type=DSX Blog

blog is an existing value for the type field.

We would like to change that to DSX Blog. As far as I can tell, we (DSX) requested that value and there are currently no docs in the db that use the Blog value.

Thanks
