notatallshaw / arandabot
A YouTube-to-Reddit bot based on the YouTube API v3
It's a little unreliable; instead, keep a list of the video IDs that have already been posted.
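A minimal sketch of keeping that list on disk. The filename posted_videos.json and both helper names are assumptions for illustration, not something the bot actually uses:

```python
import json
import os

POSTED_FILE = "posted_videos.json"  # hypothetical filename

def load_posted_ids():
    """Load the set of video IDs the bot has already posted."""
    if os.path.exists(POSTED_FILE):
        with open(POSTED_FILE) as f:
            return set(json.load(f))
    return set()

def mark_posted(video_id, posted_ids):
    """Record a newly posted video so it is never posted twice."""
    posted_ids.add(video_id)
    with open(POSTED_FILE, "w") as f:
        json.dump(sorted(posted_ids), f)
```

Writing the whole file on every post is simple and fine at this scale; a database would only matter with thousands of channels.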
Things to consider:
I think sometimes the Google API classes just stop responding, as opposed to throwing an exception. The idea would be to put the calls into a separate thread / subprocess and kill it after a set amount of time if nothing happens.
Maybe "nothing happens" could be measured via a queue which is populated by the loops in the method.
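The two notes above can be sketched together: run the call in a worker that reports through a queue, and give up if the queue stays empty. The helper name is hypothetical; note that a plain thread cannot be forcibly killed in Python, so it is abandoned here, whereas a multiprocessing.Process could be terminate()d.

```python
import queue
import threading

def call_with_timeout(func, args=(), timeout=30):
    """Run an API call in a worker thread and give up if it hangs.

    The worker pushes its result onto a queue; if nothing arrives within
    `timeout` seconds we treat the call as "stopped responding". The
    daemon flag lets the interpreter exit despite the abandoned thread.
    """
    results = queue.Queue()
    worker = threading.Thread(target=lambda: results.put(func(*args)),
                              daemon=True)
    worker.start()
    try:
        return results.get(timeout=timeout)
    except queue.Empty:
        return None  # caller should back off / re-initialise the API client
```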
I've been running the bot for a while and have come across all these exceptions.
lots of logging
config goes somewhere in settings
Either push the URL into the clipboard or write it to a file the user can open. Or use the functionality to open the browser directly, if that can be made to work properly.
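A sketch of that fallback chain using the stdlib webbrowser module. The `opener` and `fallback_file` parameters are assumptions for illustration; a clipboard push (e.g. via pyperclip) would slot in as another fallback.

```python
import webbrowser

def present_auth_url(url, opener=webbrowser.open, fallback_file="auth_url.txt"):
    """Open the OAuth URL in a browser, else write it to a file the user can open.

    webbrowser.open returns False when no browser could be launched,
    which is when we fall back to the file.
    """
    if opener(url):
        return "browser"
    with open(fallback_file, "w") as f:
        f.write(url + "\n")
    return fallback_file
```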
Once logging is added, add an option to keep updating it to show the script is still working.
Allow API v2 (if it really hasn't been decommissioned), API v3, and RSS feeds to all run in parallel to get data in the fastest possible way.
A user's subscriptions may be updated while the script is running. There should be a way to check whether this has happened.
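One simple way to detect this is to diff two snapshots of the subscribed channel IDs; the helper name is an assumption:

```python
def subscription_changes(before, after):
    """Compare two snapshots of subscribed channel IDs taken during a run.

    Returns (added, removed) so the bot can start watching new channels
    and drop unsubscribed ones mid-run.
    """
    before, after = set(before), set(after)
    return sorted(after - before), sorted(before - after)
```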
Currently got an error like this and not sure where it came from:
Some unexpected exception occured in arandabot backing off for 5 mins and trying again:
'charmap' codec can't encode character u'\u2019' in position 38: character maps to <undefined>
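That "'charmap' codec can't encode" error comes from printing characters like u'\u2019' to a console whose codec can't represent them. One way to avoid the crash is to replace unencodable characters before printing; cp437 below is just an example console codec, and the helper name is an assumption:

```python
def safe_console_text(text, encoding="cp437"):
    """Replace characters the console codec can't encode, instead of crashing.

    errors="replace" substitutes "?" for anything outside the codec,
    so logging never raises UnicodeEncodeError.
    """
    return text.encode(encoding, errors="replace").decode(encoding)
```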
Currently dependencies are noted in the readme.md of this project; the more 'pythonic' and maintainable way to do this would be to pipe the requirements into a requirements.txt file.
You generate this by running the following command:
pip freeze > requirements.txt
Then commit the requirements file to your repo.
Then anyone can install all your dependencies by running:
pip install -r requirements.txt
Add this to the settings.
Message looks like this:
<HttpError 403 when requesting https://www.googleapis.com/youtube/v3/search?channelId=UCBa659QWEk1AI4Tg--mrJ2A&maxResults=50&safeSearch=none&part=snippet&alt=json&type=video&order=date returned "Quota Exceeded">
Back off for some time when this happens, and print to the screen why the bot is backing off.
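A sketch of that backoff loop. It assumes the raised error's message contains "Quota Exceeded", as the HttpError 403 above does; the function name, `base_delay`, and the doubling schedule are all assumptions:

```python
import time

def execute_with_backoff(request_fn, base_delay=300, max_retries=5,
                         sleep=time.sleep):
    """Retry a YouTube request, backing off (and saying why) on quota errors.

    Non-quota errors propagate immediately; the delay doubles each retry.
    `sleep` is injectable so tests don't actually wait.
    """
    for attempt in range(max_retries):
        try:
            return request_fn()
        except Exception as err:
            if "Quota Exceeded" not in str(err):
                raise  # not a quota problem: let it propagate
            delay = base_delay * 2 ** attempt
            print("Quota exceeded, backing off for %d seconds before retrying"
                  % delay)
            sleep(delay)
    raise RuntimeError("gave up after %d quota retries" % max_retries)
```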
Will need to deprecate password login to Reddit in favor of OAuth 2.0.
Make it much easier to set up for the first time by adding an interactive walkthrough script, allowing the user to put in their username, password, and channel configuration details, with the script handling saving them to JSON.
Low Priority
Noticed an issue where the script just isn't pulling some subscribed videos when it has been looping for a long time; needs investigating.
This would add more resiliency, as it seems less flaky than OAuth.
This will be used to populate all the default values and will be kept in the git repository. If settings.json is not in the directory of the bot, it will be created based off settings.template.json and user input from various questions.
settings.json will be added to the .gitignore file so people's secret details (such as their Reddit username/password) aren't accidentally uploaded.
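The first-run flow above can be sketched as follows. The two question prompts and the settings keys are illustrative assumptions; the real walkthrough would ask for channel configuration too:

```python
import json
import os

SETTINGS_FILE = "settings.json"            # git-ignored, holds secrets
TEMPLATE_FILE = "settings.template.json"   # kept in the repository

def load_settings(prompt=input):
    """Load settings.json, creating it from the template on first run.

    Defaults come from the template; secrets come from interactive
    questions and are only ever written to the git-ignored file.
    """
    if not os.path.exists(SETTINGS_FILE):
        with open(TEMPLATE_FILE) as f:
            settings = json.load(f)
        settings["reddit_username"] = prompt("Reddit username: ")
        settings["reddit_password"] = prompt("Reddit password: ")
        with open(SETTINGS_FILE, "w") as f:
            json.dump(settings, f, indent=2)
    with open(SETTINGS_FILE) as f:
        return json.load(f)
```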
Use the new API v3 method: https://developers.google.com/youtube/v3/docs/commentThreads
Can use the activities list and set home=true:
https://developers.google.com/youtube/v3/docs/activities/list
I don't 100% trust this will be updated with all channel content, but it might be worth merging this feed into the requests in case it's more performant in general.
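If that feed were merged in, extracting video IDs from an activities.list response would look something like this sketch (only "upload" activities carry a video ID, under contentDetails.upload.videoId per the API v3 activities resource; the helper name is an assumption):

```python
def upload_video_ids(activities_response):
    """Pull video IDs out of a YouTube API v3 activities.list response.

    Skips non-upload activity types (likes, playlist items, etc.),
    which have different contentDetails shapes.
    """
    return [
        item["contentDetails"]["upload"]["videoId"]
        for item in activities_response.get("items", [])
        if item.get("snippet", {}).get("type") == "upload"
    ]
```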
Password based authentication is being removed:
https://www.reddit.com/comments/2ujhkr/
Add a timer or an input capture to allow the user time to read instructions.
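Both ideas can be combined: wait for Enter, but fall through after a timeout. A sketch with an assumed helper name:

```python
import threading

def wait_for_user(message, timeout=30):
    """Show instructions, then continue on Enter or after `timeout` seconds.

    The stdin reader runs in a daemon thread so a non-interactive
    environment (closed stdin) can't hang the bot.
    """
    print(message)
    done = threading.Event()

    def reader():
        try:
            input("Press Enter to continue... ")
        except EOFError:
            pass  # no interactive stdin; fall through to the timer
        done.set()

    threading.Thread(target=reader, daemon=True).start()
    done.wait(timeout)
```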
As per this SO answer, it is apparently much faster to use the search functionality than playlist lookup: http://stackoverflow.com/questions/29204271/are-upload-playlists-on-youtube-api-v3-purpsofully-slow-to-be-updated/29441811#29441811
Add more output capabilities, twitter?
Affects both main and search branch:
Traceback (most recent call last):
  File "main.py", line 19, in <module>
    main()
  File "main.py", line 16, in main
    arandabot.arandabot(settings=settings)
  File "C:\Users\Damian\Documents\arandabot - prod - search\arandabot.py", line 46, in arandabot
    yt.getNewestVideos()
  File "C:\Users\Damian\Documents\arandabot - prod - search\ytvideos.py", line 251, in getNewestVideos
    self.youtube = self.initilize_youtube(self.set)
  File "C:\Users\Damian\Documents\arandabot - prod - search\ytvideos.py", line 120, in initilize_youtube
    http=credentials.authorize(httplib2.Http()))
  File "C:\Python27\lib\site-packages\oauth2client\util.py", line 132, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "C:\Python27\lib\site-packages\apiclient\discovery.py", line 192, in build
    resp, content = http.request(requested_url)
  File "C:\Python27\lib\site-packages\oauth2client\util.py", line 132, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "C:\Python27\lib\site-packages\oauth2client\client.py", line 490, in new_request
    redirections, connection_type)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1593, in request
    (response, content) = self._request(conn, authority, uri, request_uri, method, body, headers, redirections, cachekey)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1335, in _request
    (response, content) = self._conn_request(conn, request_uri, method, body, headers)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1263, in _conn_request
    raise ServerNotFoundError("Unable to find the server at %s" % conn.host)
httplib2.ServerNotFoundError: Unable to find the server at www.googleapis.com
Add lots of unit tests to help stop regressions.
Traceback (most recent call last):
  File "main.py", line 19, in <module>
    main()
  File "main.py", line 16, in main
    arandabot.arandabot(settings=settings)
  File "C:\Users\Damian\Documents\arandabot - prod\arandabot.py", line 46, in arandabot
    yt.getNewestVideos()
  File "C:\Users\Damian\Documents\arandabot - prod\ytvideos.py", line 246, in getNewestVideos
    self.youtube = self.initilize_youtube(self.set)
  File "C:\Users\Damian\Documents\arandabot - prod\ytvideos.py", line 119, in initilize_youtube
    http=credentials.authorize(httplib2.Http()))
  File "C:\Python27\lib\site-packages\oauth2client\util.py", line 132, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "C:\Python27\lib\site-packages\apiclient\discovery.py", line 192, in build
    resp, content = http.request(requested_url)
  File "C:\Python27\lib\site-packages\oauth2client\util.py", line 132, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "C:\Python27\lib\site-packages\oauth2client\client.py", line 490, in new_request
    redirections, connection_type)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1593, in request
    (response, content) = self._request(conn, authority, uri, request_uri, method, body, headers, redirections, cachekey)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1335, in _request
    (response, content) = self._conn_request(conn, request_uri, method, body, headers)
  File "C:\Python27\lib\site-packages\httplib2\__init__.py", line 1291, in _conn_request
    response = conn.getresponse()
  File "C:\Python27\lib\httplib.py", line 1061, in getresponse
    raise ResponseNotReady()
httplib.ResponseNotReady
Where the settings file is, where the client_secrets file is, where to write the OAuth file to, etc.
Currently the assumption is that a channel name or ID will always exist; this could cause the bot to get caught in a long loop trying to resolve a non-existent name.
Add a post: true / post: false flag for each channel in the JSON, allowing the user not just to manually add channels but also to blacklist certain channels from the subscription list.
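A sketch of that JSON shape and the filter it enables. The entry layout and the second channel ID are hypothetical (the first is the one from the quota-error URL above); an absent flag defaults to posting so existing configs keep working:

```python
# Hypothetical settings shape: each channel entry carries a "post" flag.
channels = {
    "UCBa659QWEk1AI4Tg--mrJ2A": {"name": "some channel", "post": True},
    "UCblacklistedChannel0000": {"name": "noisy channel", "post": False},
}

def channels_to_post(channels):
    """Keep only channels that opt in; an absent flag defaults to posting."""
    return sorted(cid for cid, cfg in channels.items()
                  if cfg.get("post", True))
```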
Some channels don't have user names, e.g. https://www.youtube.com/channel/UCaVwpbqM8dkhQvbL8XileAA/featured
And make sure it's using the newest version of the code
A common client_secrets.json will be kept in the git repository. It will be used unless there exists a user client_secret*.json in the bot directory.
The code will ignore the provided settings when using the common client_secrets.json, and will limit calling YouTube to once every 60 seconds. There will be no obfuscation; if some user chooses to break it, that is up to them.
Once the number of subscribed channels has gotten very large, it is very slow to loop around getting all the user upload playlists. Should add an option to multi-thread this step.
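A sketch of that multi-threaded step using the stdlib thread pool. `fetch_one(channel_id)` stands in for the bot's existing per-channel lookup (an assumption); threads suit this step because it is network-bound, not CPU-bound:

```python
from concurrent.futures import ThreadPoolExecutor

def fetch_upload_playlists(channel_ids, fetch_one, max_workers=8):
    """Look up every channel's uploads playlist in parallel.

    pool.map preserves input order, so zipping back against
    channel_ids gives a correct id -> playlist mapping.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(zip(channel_ids, pool.map(fetch_one, channel_ids)))
```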
Enable universal flag for python
To see if there is a file named in the style that you download from Google, and if there's just one, try that.
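That lookup can be sketched with a glob for the naming style Google's downloads use (client_secret*.json); the helper name and the fallback to the repo's common file are assumptions:

```python
import glob

def find_client_secrets(common="client_secrets.json"):
    """Prefer a single user-supplied client_secret*.json over the common file.

    If zero or several user files match, fall back to the common
    client_secrets.json kept in the repository.
    """
    user_files = [p for p in sorted(glob.glob("client_secret*.json"))
                  if p != common]
    if len(user_files) == 1:
        return user_files[0]
    return common
```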