Comments (6)
Hi,
Sys.setlocale("LC_ALL","Arabic")
works for me.
from rcrawler.
We have fixed the issue now encoding will reflect on saved HTML files, but before start crawling you need to run :
Sys.setlocale("LC_CTYPE","Arabic_Saudi Arabia.1256")
Update will be released in the next few days you will be notified
from rcrawler.
update version 0.1.9 is released, enjoy!
subscribe to our mailing list to stay updated http://eepurl.com/dMv_7s
from rcrawler.
If I use this list(DATA, localeToCharset('UTF-8'))
, the Arabic text is rendered correctly in R Studio, but still the crawled HTML files are showing unicode characters. Any idea how to save the crawled HTML files in regular Arabic text?
from rcrawler.
Thanks @salimk and look forward to the update.
from rcrawler.
thanks so much.
from rcrawler.
Related Issues (20)
- Rcrawler is only saving internal HTML pages
- IP shuffeling services integration
- ContenScraper speed bump optional / configurable
- Avoid big websites
- It can only return 30 links using Rcrawler()
- Extract emails from domain
- how to run Javascript tag before extracting data from each page HOT 2
- Id in INDEX data.frame and of html-files don't match
- Rcrawler() and LinkExtractor() do not collect 'external urls' from HTML>footer
- Extracting urls from a list of urls HOT 2
- Obeying robots.txt seems to work only for some links
- Depricated Function HOT 1
- Issue with site confirmation page HOT 1
- ContentScraper store in between results
- Rcrawler fail cluster setup R v4.0.0
- Use_Proxy for Contentscraper
- Localhost port
- makeCluster type parameter cannot be changed
- Downloads HOT 1
- Error in preparing browser process HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rcrawler.