Comments (8)
If you want the element itself, you have to do h1 = sel.xpath(".//div/h1")[0]. You use extract() or extract_first() when you're only interested in the extracted content.
from parsel.
OK, but I get a parsel.selector.Selector:
>>> h1 = sel.xpath(".//div/h1")[0]
>>> type(h1)
<class 'parsel.selector.Selector'> # but not lxml.html.HtmlElement
From the selector, use the root attribute to get the lxml element.
From the selector, use the root attribute to get the lxml element.
Thanks, this made my day 👍 (maybe add this to the docs?)
Could be a nice addition to the docs. I'd rather have a decision on #40 first, but I'd review a docs PR if you want to add something. :)
Yes, at first I get <class 'lxml.etree._Element'> instead of <class 'lxml.html.HtmlElement'>.
Maybe change this line:
from lxml import etree, html
...
def create_root_node(text, parser_cls, base_url=None):
    ...
    return html.fromstring(body, base_url=base_url)
That is, call the fromstring function from the lxml.html package instead of the one from lxml.etree.
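The difference between the two entry points can be seen with lxml alone. This is only an illustrative sketch with made-up markup, not Parsel's actual create_root_node code.

```python
from lxml import etree, html

# Hypothetical sample markup, used only for illustration.
markup = "<html><body><div><h1>Title</h1></div></body></html>"

# The generic etree HTML parser yields plain etree elements.
plain_root = etree.fromstring(markup, parser=etree.HTMLParser())

# lxml.html.fromstring yields HtmlElement instances, which add
# HTML-specific helpers such as text_content().
html_root = html.fromstring(markup)

print(type(plain_root))
print(type(html_root))
print(html_root.text_content())
```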
@mega7star see #40
Using Parsel from master, you're now able to get lxml.html.HtmlElement, as we've switched to html.HTMLParser in #63.