Comments (5)
If anyone is having the same issue, I've found how to fix that! Just change the following code from the comments scraper:
def request_until_succeed(url):
    req = Request(url)
    success = False
    while success is False:
        try:
            response = urlopen(req)
            if response.getcode() == 200:
                success = True
        except Exception as e:
            print(e)
            time.sleep(5)
            print("Error for URL {}: {}".format(url, datetime.datetime.now()))
            print("Retrying.")
    return response.read()
To this one (I've added .decode('utf-8') before returning the value):
def request_until_succeed(url):
    req = Request(url)
    success = False
    while success is False:
        try:
            response = urlopen(req)
            if response.getcode() == 200:
                success = True
        except Exception as e:
            print(e)
            time.sleep(5)
            print("Error for URL {}: {}".format(url, datetime.datetime.now()))
            print("Retrying.")
    # Decode the raw bytes so callers receive text rather than bytes
    return response.read().decode('utf-8')
Now it's working fine here, but I don't know if it's reliable for everyone, so I'm not going to submit a pull request with this fix.
from facebook-page-post-scraper.
The script does encoding/decoding shenanigans in order to be compatible with both Python 2 and 3. I will have to check if that solution will work for Python 2.
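One way to sidestep the Python 2/3 difference might be to decode only when the value is actually a byte string. The sketch below is an assumption about how that could look, not the script's current code; `to_text` is a hypothetical helper name, and the JSON body is made up to stand in for `response.read()`.

```python
import json


def to_text(data, encoding="utf-8"):
    """Return `data` as text, decoding it only if it is a byte string.

    Hypothetical helper, not part of the scraper. On Python 3,
    urlopen(...).read() returns bytes; on Python 2 it returns str,
    which is the same type as bytes there, so both get decoded.
    """
    if isinstance(data, bytes):
        return data.decode(encoding)
    return data  # already text (unicode on Python 2, str on Python 3)


# Made-up response body, standing in for response.read()
raw = b'{"data": [{"message": "ol\xc3\xa1"}]}'
parsed = json.loads(to_text(raw))
```

Because the helper passes text through unchanged, calling it twice on the same value is harmless, which may matter if other parts of the script already decode.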
Thanks for the fast reply, @minimaxir !
Guys, again I have an issue with paging. Cannot figure out why it is happening. Can you help me? Thanks!
`---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
in ()
176
177 if __name__ == '__main__':
--> 178 scrapeFacebookPageFeedStatus(group_id, access_token)
in scrapeFacebookPageFeedStatus(group_id, access_token)
160 if 'paging' in statuses:
161 next_url = statuses['paging']['next']
--> 162 until = re.search('until=([0-9]*?)(&|$)', next_url).group(1)
163 if until is None:
164 return None
AttributeError: 'NoneType' object has no attribute 'group'`
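The traceback shows `re.search` returning `None`, which happens when `next_url` contains no `until=` parameter. The `until is None` check on line 163 runs too late, because `.group(1)` has already raised by then. A minimal sketch of a guard follows, assuming the caller should simply stop paging in that case; `extract_until` is a hypothetical helper name and the URLs are made up for illustration.

```python
import re


def extract_until(next_url):
    """Return the `until` value from a paging URL, or None if absent."""
    match = re.search(r'until=([0-9]*?)(&|$)', next_url)
    if match is None:
        # No `until=` parameter in the URL: report that instead of raising
        return None
    return match.group(1)


# Made-up example URLs, not real Graph API responses
with_until = "https://graph.facebook.com/123/feed?limit=100&until=1490000000&x=1"
without_until = "https://graph.facebook.com/123/feed?limit=100&after=abc"

until_value = extract_until(with_until)
missing_value = extract_until(without_until)
```

With this helper, the existing `if until is None: return None` branch would be reached as intended instead of failing with the `AttributeError`.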
@paladini thanks worked for me