Comments (3)
Please provide the capture options you use (copy from Capture as > Advanced).
from webscrapbook.
Please see the information you requested below. I'm using the default options.
If I disable the adblock filters, accept the cookies manually, and enable adblock again, the cookie notice no longer appears when I make a new capture of this site. This is not ideal, though, especially since the same thing may occur with other sites.
{
  "tasks": [
    {
      "comment": "",
      "tabId": 299064096,
      "title": "Daily briefing: Visual clutter skews our time perception",
      "url": "https://www.nature.com/articles/d41586-024-01202-5"
    }
  ],
  "bookId": null,
  "parentId": "root",
  "index": null,
  "mode": "",
  "delay": null,
  "options": {
    "capture.applet": "blank",
    "capture.audio": "save",
    "capture.backupForRecapture": true,
    "capture.base": "blank",
    "capture.canvas": "save",
    "capture.contentSecurityPolicy": "remove",
    "capture.deleteErasedOnCapture": true,
    "capture.deleteErasedOnSave": false,
    "capture.downLink.doc.delay": null,
    "capture.downLink.doc.depth": null,
    "capture.downLink.doc.mode": "source",
    "capture.downLink.doc.urlFilter": "",
    "capture.downLink.file.extFilter": "###image\n#bmp, gif, ico, jpg, jpeg, jpe, jp2, png, tif, tiff, svg\n###audio\n#aac, ape, flac, mid, midi, mp3, ogg, oga, ra, ram, rm, rmx, wav, wma\n###video\n#avc, avi, flv, mkv, mov, mpg, mpeg, mp4, wmv\n###archive\n#zip, rar, jar, bz2, gz, tar, rpm, 7z, 7zip, xz, jar, xpi, lzh, lha, lzma\n#/z[0-9]{2}|r[0-9]{2}/\n###document\n#pdf, doc, docx, xls, xlsx, ppt, pptx, odt, ods, odp, odg, odf, rtf, txt, csv\n###executable\n#exe, msi, dmg, bin, xpi, iso\n###any non-web-page\n#/(?!$|html?|xht(ml)?|php|py|pl|aspx?|cgi|jsp)(.*)/i",
    "capture.downLink.file.mode": "none",
    "capture.downLink.urlExtra": "",
    "capture.downLink.urlFilter": "###skip common logout URL\n/[/=]logout\\b/i",
    "capture.downloadRetryCount": 3,
    "capture.downloadRetryDelay": 1000,
    "capture.downloadWorkers": 4,
    "capture.embed": "blank",
    "capture.favicon": "save",
    "capture.faviconAttrs": "",
    "capture.font": "save-used",
    "capture.formStatus": "keep",
    "capture.frame": "save",
    "capture.frameRename": true,
    "capture.helpers": "",
    "capture.helpersEnabled": false,
    "capture.image": "save",
    "capture.imageBackground": "save-used",
    "capture.insertInfoBar": false,
    "capture.linkUnsavedUri": false,
    "capture.mergeCssResources": true,
    "capture.noscript": "save",
    "capture.object": "blank",
    "capture.ping": "blank",
    "capture.prefetch": "remove",
    "capture.preload": "remove",
    "capture.prettyPrint": false,
    "capture.recordDocumentMeta": true,
    "capture.recordRewrites": false,
    "capture.referrerPolicy": "",
    "capture.referrerSpoofSource": false,
    "capture.remoteTabDelay": null,
    "capture.removeHidden": "none",
    "capture.resourceSizeLimit": null,
    "capture.rewriteCss": "url",
    "capture.saveAs": "folder",
    "capture.saveAsciiFilename": false,
    "capture.saveDataUriAsFile": true,
    "capture.saveDataUriAsSrcdoc": true,
    "capture.saveFileAsHtml": false,
    "capture.saveFilename": "%id%",
    "capture.saveFilenameMaxLenUtf16": 120,
    "capture.saveFilenameMaxLenUtf8": 240,
    "capture.saveFolder": "WebScrapBook/data",
    "capture.saveOverwrite": false,
    "capture.saveResourcesSequentially": false,
    "capture.saveTo": "folder",
    "capture.script": "remove",
    "capture.serverUploadRetryCount": 3,
    "capture.serverUploadRetryDelay": 2000,
    "capture.serverUploadWorkers": 4,
    "capture.shadowDom": "save",
    "capture.style": "save",
    "capture.styleInline": "save",
    "capture.video": "save",
    "capture.zipCompressLevel": null
  }
}
As the known issues page has documented: "Internal stylesheets of another browser extension cannot be captured. This could cause an issue like ads hidden by an ad-blocker extension be still visible in the captured page."
The ad blocker only hides the cookie warnings rather than removing them: it injects an extension stylesheet, which WSB can never capture. That's why the warnings always show up in the capture unless you have actually clicked accept manually.
To solve the issue, you have to do the clicking manually or switch to another browser extension that actually removes the cookie warnings. Another approach is to set up a capture helper to do it, but that is largely site-dependent and would thus be somewhat impractical.
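For reference, a capture helper for a single site might look roughly like the sketch below. It assumes the helper format as I recall it from the capture helpers documentation (a JSON array of objects with `name`, `pattern`, and `commands`), so check it against the docs before relying on it. The selector `.cookie-banner` is a hypothetical placeholder, not the real selector for this site; inspect the page to find the actual one. Note that the options posted above have `capture.helpersEnabled` set to `false`, so helpers would also need to be enabled for this to take effect.

```json
[
  {
    "name": "remove cookie notice (hypothetical selector)",
    "pattern": "/^https?:\\/\\/www\\.nature\\.com\\//",
    "commands": [
      ["remove", ".cookie-banner"]
    ]
  }
]
```

Each site typically uses its own markup for the notice, so a rule like this has to be written per site, which is why this approach does not scale well.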