Skip to content

Commit 4021f6e

Browse files
authored
Merge pull request #41 from VikParuchuri/dev
Dev
2 parents a002c7f + 4a5b8aa commit 4021f6e

2 files changed

Lines changed: 3 additions & 3 deletions

File tree

pdftext/pdf/pages.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -107,8 +107,8 @@ def span_break():
107107
span_break()
108108
continue
109109

110-
# we also break on hyphenation
111-
if span['text'].endswith("\x02"):
110+
# we break on hyphenation or newline
111+
if span['text'].endswith("\x02") or span['text'].endswith("\n"):
112112
span_break()
113113
continue
114114

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
[tool.poetry]
22
name = "pdftext"
3-
version = "0.6.1"
3+
version = "0.6.2"
44
description = "Extract structured text from pdfs quickly"
55
authors = ["Vik Paruchuri <vik.paruchuri@gmail.com>"]
66
license = "Apache-2.0"

0 commit comments

Comments
 (0)