I've get a resultset by python BeautifulSoup, but I don't know how to fetch the NavigableString inside them

Question

Welcome To Ask or Share your Answers For Others

I've get a resultset by python BeautifulSoup, but I don't know how to fetch the NavigableString inside them

posted Jan 27, 2021 in Technique[技术] by 深蓝 (71.8m points)

I've get a resultset by python BeautifulSoup, but I don't know how to fetch the NavigableString inside them

html_text = driver.page_source
soup = BeautifulSoup(html_text, "html.parser")
get_details = soup.find_all('li', attrs={"class":"news"})
# get_details is an aggregation of results fetched by BeautifulSoup find_all() method

one instance of the resultset is as below:

<li class="news">blah blah blah what i want blah blah blah  <a href="/graphic/graphicInfoData/000002230030421305">View details</a></li>

What I want is the "blah blah blah what i want blah blah blah", the so-called Navigable string in BeautifulSoup. But I can not use .string attribute to a list, even when I use the print(get_details[0].string), the result is None, why?

by the way , as a comparison, below code works!

print(get_details[0].a.string)
>>> print(get_details[0].li.string)
    Traceback (most recent call last):
    File "<pyshell#57>", line 1, in <module>
    print(get_details[0].li.string)
    AttributeError: 'NoneType' object has no attribute 'string'

Any thoughts will be highly appreciated!

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-01-27T04:33:29+0000

Use .get_text() instead of .string:

print(get_details[0].a.get_text())

Output: View details

print(get_details[0].get_text())

Output: blah blah blah what i want blah blah blah View details

Be aware, that get_details[0].get_text() will get all the text of the li.

Following will only get the first part:

get_details[0].contents[0].strip()

Output: blah blah blah what i want blah blah blah

Categories

I've get a resultset by python BeautifulSoup, but I don't know how to fetch the NavigableString inside them

I've get a resultset by python BeautifulSoup, but I don't know how to fetch the NavigableString inside them

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags