dailybruin / kerckhoff
Flatpage/static site manager for the Daily Bruin (proxy for Google Drive)
Home Page: https://kerckhoff.dailybruin.com
When a package is deleted, it should not delete the corresponding Google Drive folder.
Also, pressing the delete button should ask for confirmation before deleting the package.
If a fetch fails on the frontend, the package model is left with processing=True, which makes it look like the package is perpetually fetching. Reset the processing field on a failed request (timeout, bad request, etc.).
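A minimal sketch of one fix, using a try/finally so the flag is cleared no matter how the fetch ends. The `Package` class and `fetch_package` helper here are simplified stand-ins, not the actual Kerckhoff models:

```python
class Package:
    """Stand-in for the real package model (assumption, for illustration)."""
    def __init__(self):
        self.processing = False

def fetch_package(package, do_fetch):
    """Run a fetch, guaranteeing the processing flag is cleared on failure."""
    package.processing = True
    try:
        do_fetch(package)
    finally:
        # Runs on success *and* on timeout/bad-request exceptions, so the UI
        # never shows a permanently "fetching" package.
        package.processing = False
```

In the real Django model the `finally` block would also need to save the field back to the database.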
Tried setting up Kerckhoff for the first time and got this error after running docker-compose up:
I noticed that webpack-stats.json is in both .gitignore and .dockerignore. I'm not familiar with it, but I assume it is supposed to be autogenerated by webpack. Why, then, was it not autogenerated on my machine during setup? My guess is that something looks for webpack-stats.json before webpack runs.
For this flatpage, External Sites needed to upload a video to S3. They requested a way for Kerckhoff to upload videos to S3, similar to how it does for images.
The one difference is that the video should be "streamable". I believe S3 has an option called "publicly playable" which should allow video streaming, but I haven't looked into this.
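As far as I know there is no literal "publicly playable" switch in S3; serving the object with the correct `Content-Type` (plus public read access) is usually what lets browsers stream it. A hedged sketch of building the upload arguments for boto3's `upload_file` — bucket and key names below are placeholders:

```python
def video_upload_args(filename):
    """Build the ExtraArgs for a browser-streamable S3 video upload (sketch).

    Assumption: setting the right Content-Type and public-read ACL is enough
    for progressive playback in the browser.
    """
    content_types = {
        ".mp4": "video/mp4",
        ".webm": "video/webm",
        ".mov": "video/quicktime",
    }
    ext = filename[filename.rfind("."):].lower()
    return {
        "ContentType": content_types.get(ext, "application/octet-stream"),
        "ACL": "public-read",
    }

# Hypothetical usage with boto3:
# boto3.client("s3").upload_file(
#     "clip.mp4", "kerckhoff-media", "videos/clip.mp4",
#     ExtraArgs=video_upload_args("clip.mp4"))
```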
YAML is hard to write for non-technical people, and is prone to parsing errors that are hard to debug and track down.
Short-term solution: use AML to parse frontmatter.
Long-term solution: enforce and extend metadata schemas at the PackageSet level.
Thanks @yyc for the suggestion!
Right now, assets are downloaded before we calculate an MD5 to determine whether they have changed, which is not cheap on bandwidth. One idea is to create a .kerckhoff-meta file in the Google Drive folder to keep track of this; the API also offers some interesting metadata extensions to make this easier.
It turns out Google Drive stores last-edited values; we should use those.
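The Drive v3 API exposes a `modifiedTime` field (an RFC 3339 UTC string, e.g. via `files().get(fileId=..., fields="modifiedTime")`), so a cheap pre-download check could look like this sketch; the function name and caching scheme are assumptions:

```python
def needs_refetch(drive_modified_time, cached_modified_time):
    """Decide whether to re-download an asset based on Drive's modifiedTime.

    Both arguments are RFC 3339 timestamp strings as returned by the
    Drive v3 API; in UTC form they compare correctly as plain strings.
    A missing cached value means we have never fetched the asset.
    """
    if cached_modified_time is None:
        return True
    return drive_modified_time > cached_modified_time
```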
Also, fill article.aml with some example AML. You can find some good example AML in the package repo dev.
When Sarthak uploaded a GIF to Kerckhoff, it did not play.
https://drive.google.com/file/d/1IEH2hYZWveh3P-ru7NuIToVUMklluk7Y/view
turned into a static image (screenshot missing from this report).
Sarthak (the 2020-21 External Sites editor) said that there is a setting in AWS.
For the new PR #89: update the footer to only have relevant links.
Currently, images are all kept in memory, so if you process a lot of images at the same time, the server crashes. Refactor this so that only a few images are kept in memory at once and the rest are written to disk, so we won't encounter out-of-memory issues.
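One simple approach, sketched below with the standard library: buffer each image through `tempfile.SpooledTemporaryFile`, which keeps small files in memory but automatically spills to disk past a size threshold. The function name and 5 MB threshold are assumptions, not the existing pipeline:

```python
import shutil
import tempfile

def buffer_image(stream, max_in_memory=5 * 1024 * 1024):
    """Copy an incoming image stream into a buffer that spills to disk
    once it exceeds max_in_memory bytes (sketch, not the real pipeline)."""
    buf = tempfile.SpooledTemporaryFile(max_size=max_in_memory)
    shutil.copyfileobj(stream, buf)
    buf.seek(0)  # rewind so downstream code can read from the start
    return buf
```

Bounding how many such buffers are open at once (e.g. with a semaphore or a worker pool) would cap total memory regardless of how many uploads arrive together.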
When generating a "Drive folder ID" from a "Drive folder URL", the current parser does not properly handle the ?open or ?share sections of the Google sharing URL, and they end up in the ID.
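A minimal sketch of a parser that cannot leak query parameters into the ID, because `urlparse` separates the query string before the ID is extracted (function name is an assumption):

```python
import re
from urllib.parse import urlparse

def drive_folder_id(url):
    """Extract the folder ID from a Drive sharing URL, dropping any
    query string (?usp=sharing, ?open, etc.) so it never taints the ID."""
    path = urlparse(url).path  # query and fragment are discarded here
    match = re.search(r"/folders/([^/]+)", path)
    return match.group(1) if match else None
```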
The current library we're using for parsing AML doesn't handle a number of edge cases properly. The ArchieML spec is actually rather straightforward, and it might be helpful to write a parser ourselves using something like https://github.com/dabeaz/sly or PLY.
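To give a sense of scale, here is a toy sketch of just the flat key:value part of ArchieML in plain Python. A real replacement parser (hand-written or built with sly/ply) would also need scopes (`{...}`), arrays (`[...]`), multi-line values, and `:end` markers:

```python
import re

def parse_flat_aml(text):
    """Toy parser for flat ArchieML key:value lines only (illustration).

    Lines that do not look like `key: value` are ignored, mirroring
    ArchieML's tolerance of free-form text between data lines.
    """
    result = {}
    for line in text.splitlines():
        match = re.match(r"^\s*([a-zA-Z0-9_.-]+)\s*:\s*(.*)$", line)
        if match:
            result[match.group(1)] = match.group(2).strip()
    return result
```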
This is cool and I want to contribute, but I don't know how 😢. Can you add a README with details on how to get this set up on my computer?
The build hook should be able to call some API endpoint whenever the AML changes.
The purpose is to rebuild static site pages whenever the data changes. As of now, the static sites fetch from Kerckhoff every time a user loads them. If we instead build the static sites with the data baked in, the frontend would not have to fetch at load time, so pages would load faster and be more efficient overall. And even if Kerckhoff crashes, the static site would still work.
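A sketch of the hook's client side, constructing the POST that would notify a static-site builder when a package's AML changes. The endpoint URL, payload shape, and function name are all assumptions:

```python
import json
import urllib.request

BUILD_HOOK_URL = "https://example.com/build-hook"  # placeholder endpoint

def build_hook_request(package_slug, url=BUILD_HOOK_URL):
    """Build the POST request that tells the static-site builder to rebuild.

    The {"package": slug} payload is a hypothetical shape; a real hook
    (e.g. a Netlify build hook) might take no body at all.
    """
    body = json.dumps({"package": package_slug}).encode()
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Sending it would be: urllib.request.urlopen(build_hook_request("my-package"))
```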
This is something experimental we were considering for design.dailybruin.com.
Currently, print statements are used, but their output won't show up in the Kubernetes logs. We should replace print statements with proper logger calls (e.g. logger.info).
Example: https://drive.google.com/drive/u/4/folders/1mta9hfI-HmzWQSJJOTlqJ1NofDIXetDi
In Google Drive, the file IMG_7274.JPG is present, and according to the metadata in article.md it is the cover image. However, in https://kerckhoff.dailybruin.com/api/packages/prime-old/still-breathing-still-building/ , the entry is not present in images.s3.
The old AML parser has some weird bugs with certain edge cases.
Tianyu made a better AML parser; we have integrated it into the new Kerckhoff but not the old one. Here is a link to the code: dailybruin/kerckhoff-server#23
If you log into kerckhoff.dailybruin.com, you will see "internal server error".
What shows up on the user side: they put a photo in Google Drive, but it never shows up on kerckhoff.dailybruin.com.
In the Rancher logs, these lines are printed, even for the photos that aren't shown as uploaded on kerckhoff.dailybruin.com:
4/28/2021 7:21:46 PM2021-04-29 02:21:46,031 kerckhoff INFO BakurMadini.jpg has not been modified since last fetch.
4/28/2021 7:21:46 PM2021-04-29 02:21:46,031 kerckhoff INFO CarlKing.jpg has not been modified since last fetch.
4/28/2021 7:21:46 PM2021-04-29 02:21:46,034 kerckhoff INFO BrandonMcLelland.jpg has not been modified since last fetch.
4/28/2021 7:21:46 PM2021-04-29 02:21:46,034 kerckhoff INFO AngelinaQuint.jpg has not been modified since last fetch.
4/28/2021 7:21:46 PM2021-04-29 02:21:46,034 kerckhoff INFO AngelinaQuint.jpg has not been modified since last fetch.
4/28/2021 7:21:46 PM2021-04-29 02:21:46,035 kerckhoff INFO ArtharvaKulkarni.jpg has not been modified since last fetch.
Right now, the files in the Kerckhoff Google Drive are owned by whoever created the package. However, when that person leaves, their media email gets deleted, so these files are lost. Creating the files with an admin email would prevent this issue.
Currently, if a lot of packaging requests are sent to Kerckhoff, peak CPU usage exceeds the Kubernetes resource limit (as happened last night), which causes worker timeouts.
We could add a task scheduler so that large packaging jobs, such as those for PRIME, are done asynchronously; this would improve our availability.
#82 is also needed: currently, if a fetch fails, we cannot easily re-fetch it.
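The scheduler idea can be sketched with an in-process job queue: requests enqueue packaging work, and a single worker drains it so only one heavy job runs at a time. A production version would use a real task queue (e.g. Celery with retries) rather than this stdlib toy; all names here are assumptions:

```python
import queue

# Single shared job queue; one worker drains it so only one heavy
# packaging job runs at a time, keeping peak CPU off the web process.
tasks = queue.Queue()

def enqueue_fetch(job):
    """Queue a packaging job instead of running it in the request cycle."""
    tasks.put(job)

def worker():
    """Run queued jobs one at a time; a None sentinel shuts the worker down."""
    while True:
        job = tasks.get()
        if job is None:
            break
        job()
```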
Some packages take very long to fetch, making it seem like the fetch has failed even though it is still in progress. Add a progress bar, or some indicator of fetch progress (e.g. "fetching image 1 of 4"), on the frontend so that users know not to refresh the page.
In the same vein as #82 (issues with slow/failing fetches)