This repository was archived by the owner on Oct 17, 2022. It is now read-only.

Fixed bugs in HTML parsing #27

Open
TheTechromancer wants to merge 2 commits into byt3bl33d3r:master from TheTechromancer:master


@TheTechromancer

This PR addresses two bugs:

  1. The Google and Bing lxml parsers no longer find any matches. The syntax is still correct, so the cause is unknown. Resolved by switching to BeautifulSoup, which still uses lxml behind the scenes.
  2. When the LinkedIn name parser fails to split on '|', it throws an exception and the entire page of results is lost. Resolved by handling that AttributeError.
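
For reviewers, the second fix can be pictured roughly as follows. This is only a minimal sketch, not the code in this PR: `extract_name` and `extract_names` are hypothetical names, and it assumes the AttributeError arises because a result's title is missing (`None`), so that calling `.split('|')` on it fails.

```python
def extract_name(title):
    """Return the part of a result title before the first '|', stripped."""
    # Assumption: title is None when no title text was found for a result.
    # None has no .split method, so this line raises AttributeError.
    return title.split("|")[0].strip()


def extract_names(titles):
    """Collect names from one page of titles, skipping unparseable results."""
    names = []
    for title in titles:
        try:
            names.append(extract_name(title))
        except AttributeError:
            # Drop only this result; the rest of the page is kept.
            continue
    return names


# Hypothetical page of result titles, one of which is missing.
page = ["Jane Doe | LinkedIn", None, "John Smith - Engineer | LinkedIn"]
names = extract_names(page)
print(names)
```

Catching the exception inside the loop, rather than around it, is what keeps one bad result from discarding the whole page.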

Cheers

