The devcenter from ibm-watson-data-lab

Make updated_at record the date a doc went live

If a document is marked "provisional" but then is marked "live", it's updated_at field should be set to the current date.

New Topic needed in time for Think

Sorry Glenn... can you add one more?

Topic

Watson

Curation search and filter

Hard to find a piece of content that you've already cataloged. Need to be able to sort / search through cataloged content.

Now you can only get a list of what's cataloged -- you have to use browser search on the page to step through Live or Provisional list of links. Right now the full list is CDS and DSX combined. You need to know a keyword in the title to find a link.

When viewing items in 'live', 'provisional' and 'deleted', provide a way to filter the view on the 'namespace' field

I propose we add a search facility into the DevCenter. A search box that allows free-text searches and a list of the facets (type/technology/namespace) that allows the user to make the list of content smaller with a few clicks.

New technology values needed for renaming by THINK

Please add the following values:

Technologies:

Watson
Watson Knowledge Catalog

(I will open another issue to remove DSX and other affected names from renaming.)

Taxonomy value additions and changes

Could you please make the following changes? Thanks

Languages:

change BigSQL to Big SQL

Technologies:
add

Brunel
Object Storage
Pandas

Topic:
add

Always include a default title if none present when adding a new record

Problem: There have been instances where new records added in to the tool (perhaps via the /curate option in slack) have not had any titles. This causes a problem in accessing that record in devCenter Uploader as the only way to access the record is from the title.

Solution: Ensure that some text - like "Missing title" - is included if there is no title pulled in.

Show source of the content (slack/RSS/etc)

We get information fed into the database from different sources, such as slack, RSS, etc. We'd like to be able to tell what the source is so that we can prioritize accordingly, for example:

the slack /curate command
RSS
manual entry (could be default)

Format facet: change HTML to Web page

Hi Glynn, sorry, a change on format. Could you please change HTML to 'Web page'? Thanks!

Schema additions

Languages (add):

DML
BigSQL

(we don't call Node.js a language we mark things as JavaScript instead)

Technologies (add):

DB2
Hadoop
SPSS

Topic (add):

Data Shaping
Data Modeling
Text Analytics
Predictive Analytics
Prescriptive Analytics
Monte Carlo
Simulation

Type (add)

```
API
```
```
Blog
```
```
Book
```
```
Course
```
```
Data set
```
```
Demo
```
```
Docs
```
```
Event
```
```
Forum
```
```
Notebook
```
```
Paper
```
```
Podcast
```
```
Presentation
```
```
Project
```
```
SDK
```
```
Sample
```

Set up curated docs for future publishing

We want to be able to curate a set of items at one time, but then set them up for future publishing so that we establish a good cadence for content that goes live.

Search returns no results when searching on keywords in titles

Is there information on what is allowed in the search field? When I search on keywords in titles, I get no results. For example, I've tried searching for "Working with notebooks in IBM Data Science Experience" both with and without quotes, and just using one word, or subset of words, but I get no results.

Thanks.

Lifecycle governance: flag live items after a time period for possible removal

When content is initially curated, apply a flag/expiration date by default. Allow this date to be manually edited. Here's the list of possible date values:

https://ibm.app.box.com/notes/71982223665 (see table) Let me know if you can't reach the box note.

When the date is reached the items are flagged and shown in a view where a curator can come in and review them for possible deletion or reset the expiration date to another in the future.

Access to record from URL link in search view

Hi,

One of the feeds, brunelvis.org, is not working correctly - the title field is not being filled. Since that is the only way to access the record from the tool, I can't access these items. I've looked at a couple of the URLs, and viewing the page source, there appears to be a title tag so I'm not sure why these items are not complete.

In any case, would it be possible to allow access to a record from the URL? This would allow access to incomplete records.

Thanks.

Spidered articles truncate article titles at non-text characters like ":" and "-"

Also worth noting this happens for en dashes (–) and em dashes (—). Ain't no thing, but it would save me some fidgeting with the "name" and "full_name" fields, which I find myself regularly modifying because of incomplete titles.

Filtering on Deleted or Provisional returns no resutls

Filtering on Deleted or Provisional status in the new search functionality returns no results.

Search view changes: Allow sort by date and name in Search view, include URL

In the Search view, we would like to:

be able to sort by date (asc/desc) and by name to help manage the content, by clicking on 'Date' and 'Name' column headings.
have the URL included in the 'live' and 'deleted' views as it is for the 'provisional' view.

Thanks.

Needed by April 7th: Taxonomy updates

Along with items in #30, we'd like the following additions for GA of IBM Streaming Analytics with support for DSX python notebooks:

Technologies:
Add: IBM Streams

Topic:
Add: Streaming Analytics
Add: Streaming Pipelines

New "technologies" & "topic" values required

Please add the following new values under technologies:

Data Refinery - I think that this replaces Data Connect, but there are 4 items that use Data Connect, so I don't know if we should rename Data Connect to Data Refinery, or just add Data Refinery
Data Catalog
DSX

The following should be modified:

DB2: change to Db2
dashDB: change to Db2 Warehouse on Cloud
Note: Folks managing CDS content might want to weight in on this, but is it possible to run a query to apply these changes to existing entries?

(FYI @gfilla @deirdrelongo)

Export data to .csv

Include the ability to export data to .csv, filtered on namespace.

Avoid spidering PDF URLs - causes crash or incomplete record

Problem: I managed to crash the devCenter Uploader trying to add a link to a PDF with the 'Create New Document' tab. (Using the 'Create provisional document' tab has a different result in that it creates a record, but doesn't populate the title, so I can't access that record via the UI. Issue #28 opened for that.)

The problem is related to the fact that the tool tries to crawl (spider) for PDF, but there is no data. Every a document is edited with a blank body, it tries to fetch the content again.

Glynn suggested that the URL could be 'pre-fetched' in order to get its content-type and if it's not text/html just skip the crawler.

Workaround suggested: for PDF URL, ensure that you have a title, and put some words in the body field when creating a new record with 'Create new document tab' - this would avoid the attempt to fetch again.

New fields: event date, and location

For DSX, we plan on starting to make use of the Event type. For this, we need two new fields:

Event date: text field to capture a range like "June 21-25, 2017"
Event location: text field to capture text like "Toronto, Canada"

Note that we could make use of the fields for other types, like course, webinar, etc - right now, I work around this for Webinar by adding the date in the title...

Thanks @glynnbird :)

Curate command does not work in IBM WDP Slack team

@glynnbird The /curate command is working fine if used in the IBM Watson Cloud Platform Slack team, but not in the IBM Watson Data Platform team. Is there an easy way to fix that? Thanks.

Alphabetise the facets

The DataSciX folks would like the facets ordering alphabetically.

This isn't something we can do at the Cloudant end, but can easily be achieved at the front-end by simply sorting the array before it is rendered on the web page.

> var x = ['b','a','c'];
> x.sort()
[ 'a', 'b', 'c' ]

No work required.

Allow user to create a view with columns of their choosing

Allow user to create a view similar to the Search view, but with columns of their choosing.

For example, I want to see all the live entries for DATASCIX and along with title and url, I would like to display the imageurl field to identify missing one.

Allow user to choose columns to be displayed
Allow columns to be sorted
Have ability to select the record from a row displayed

Change language "BigSQL" to "Big SQL"

Should be a space between Big and SQL.

crawl additional format types

PDF / ebook in particular.

I know we discussed this, but can't remember where we ended up. If we ended up with -- can't do it, then please reject.

Add auto-generated "ts" element

Each document should have an element "ts" which is a numeric timestamp indicating when it was last touched. This could then be used for sorting.

Select multiple pieces of content to delete

This is a feature request - would like to select multiple pieces of content and delete them without needing to go into each one and changing the status.

Please delete Machine Learning for Designers https://www.oreilly.com/learning/machine-learning-for-designers

Hi,

I can't modify or delete the following item:
Machine Learning for Designers
https://www.oreilly.com/learning/machine-learning-for-designers

Can you please move it to deleted status?

Thanks.

HTML
PDF
Podcast
Video
Webcast

Need type=DSX Blog

blog is an existing value for the type field.

We would like to change that to DSX Blog. As far as I can tell, we (DSX) requested that value and there are currently no docs in the db that use the Blog value.

Thanks

ibm-watson-data-lab / devcenter Goto Github PK

devcenter's People

Contributors

Watchers

Forkers

devcenter's Issues

Languages (add):

Technologies (add):

Topic (add):

Type (add)

Recommend Projects

Recommend Topics

Recommend Org